Sarthak Madan — Data Engineer
Data Engineer with 3+ years of experience building scalable, cloud-native data pipelines using PySpark, Airflow, and AWS. I specialize in transforming legacy ETL workflows into automated, high-performance architectures that reliably process 100M+-record datasets with minimal latency.I’ve delivered major efficiency wins—cutting ETL runtimes by 90%, reducing cloud spend through EMR/S3 optimization, and improving data quality with validation frameworks that prevent 35–40% of downstream issues.My strengths include workflow orchestration, cost-efficient cloud design, performance tuning, data quality engineering, and end-to-end pipeline ownership. I enjoy solving complex data problems, modernizing systems, and building reliable pipelines that scale.
Stackforce AI infers this person is a Data Engineer specializing in cloud-native data pipeline development.
Location: Delhi, India
Experience: 3 yrs 8 mos
Skills
- Apache Airflow
- Aws
- Data Quality Engineering
- Pyspark
- Etl
- Data Analysis
- Data Engineering
Career Highlights
- Reduced ETL runtimes by 90% using PySpark.
- Implemented cost-saving strategies, lowering cloud spend by 25%.
- Enhanced data quality, preventing 40% of downstream issues.
Work Experience
Precisely
Data Engineer 2 (8 mos)
Data Engineer 1 (1 yr)
Associate Software Engineer (2 yrs)
Associate Software Engineer Intern (4 mos)
National Centre For Medium Range Weather ForecastingÊ(Ncmrwf)
Summer Internship (1 mo)
Indian School of Business
Intern (1 mo)
Decathlon Sports India
Summer Intern (1 mo)
Education
Msc.Geoinformatics at TERI School of Advanced Studies
Bachelor of Arts - BA at Shivaji College, Delhi University