I’m a Senior Data Engineer with 8+ years of experience building scalable data pipelines and analytics solutions across healthcare, finance, and retail domains. Currently at Elevance Health, I focus on developing PySpark pipelines, implementing real-time data streaming with AWS Kinesis, and managing cloud data infrastructure in compliance with HIPAA standards. I’m passionate about turning complex data into actionable insights and delivering high-quality, performance-optimized solutions that drive business value.
At JPMorgan Chase, I worked on modernizing legacy financial data pipelines into a cloud-based data lake architecture. Using PySpark, Apache Airflow, and AWS (S3, Redshift, Lambda), I built efficient, scalable ETL pipelines that processed transactional data for compliance and risk reporting. My work ensured timely delivery of regulatory reports (CCAR, Basel III) and improved data quality across finance and risk domains. I also partnered with data governance teams to implement data lineage tracking, enabling auditability and transparency in financial operations.
Key ContributionsAt Elevance Health, I led the design and development of large-scale data engineering solutions focused on healthcare claims and electronic health records (EHR). Using PySpark and AWS services (Glue, S3, Kinesis, Lambda), I built end-to-end ETL pipelines capable of handling millions of records daily, ensuring HIPAA compliance and data integrity. I also contributed to the migration of on-prem systems to Snowflake, optimizing cost and performance for real-time analytics. The project enabled timely insights for care management teams, significantly reducing data latency and supporting better member outcomes.
Key ContributionsAt Swiggy, I designed and maintained real-time data pipelines using Apache Kafka, Spark Streaming, and Hive to handle high-velocity order, delivery, and user behavior data. My work enabled live tracking, dynamic pricing, and restaurant ETA calculations across multiple cities. I also contributed to building a data quality monitoring framework, ensuring consistency in downstream analytics used by marketing and operations teams. This project enhanced Swiggy’s ability to deliver personalized experiences and optimize delivery logistics.
Key ContributionsAt HomeGoods, I built robust ETL pipelines to support retail sales analytics, integrating data from POS systems, supply chain sources, and vendor feeds. Using SQL, Informatica, and Teradata, I automated the data ingestion and transformation processes, significantly reducing manual effort and increasing reporting accuracy. I also supported inventory forecasting models by delivering clean, aggregated datasets to data scientists, leading to more accurate replenishment strategies and reduced stockouts across stores.
Key Contributions