SANKALP JAIN — Data Engineer
Senior Data Engineer with over 3.5 years of experience in designing and implementing scalable data pipelines, distributed systems, and automated workflows. Skilled in Python, SQL, Airflow, Azure, Generative AI, and PySpark, with hands-on expertise in ETL/ELT processes, data warehousing, and CI/CD practices. Certified as an Azure and Databricks Data Engineer Associate, with a strong focus on performance optimization and delivering production-grade data engineering solutions. Experienced in applying Generative AI for data enrichment, intelligent automation, and building LLM-integrated workflows. At Optum (UnitedHealth Group), I’ve led high-impact initiatives such as: Modernizing legacy COBOL infrastructure to Teradata and Python, improving processing efficiency by 30%.Engineering distributed pipelines across 16 upstream sources, reducing data processing time by 40%. Developing financial modules that enhanced reserve prediction accuracy by 30% and reduced manual intervention by 50%. Creating reusable onboarding frameworks and data quality systems that ensured 99% accuracy across critical reporting tables. My expertise spans ETL/ELT, data warehousing, CI/CD pipelines, and real-time streaming with Kafka. I’ve also architected GenAI-powered platforms for ETL logic extraction and error resolution, delivering up to 60% triage time reduction and projected cost savings of $2.3M. Certified as an Azure Data Engineer and Databricks Data Engineer Associate, I’m passionate about leveraging cutting-edge technologies—including LLMs, RAG pipelines, and prompt engineering—to solve complex data challenges responsibly and efficiently. Beyond engineering, I’ve contributed to healthcare data solutions across financial, clinical, provider, and pharmacy domains. My research on COVID-19 outbreak prediction using machine learning reflects my commitment to applying data science for real-world impact. I hold a Bachelor of Technology in Electronics & Telecommunications Engineering from K.J. Somaiya College of Engineering, where I built a strong foundation in data communication and signal processing. My early internships in web development helped shape my full-stack perspective and problem-solving mindset.
Stackforce AI infers this person is a Data Engineering expert in Healthcare and Web Development sectors.
Location: Bengaluru, Karnataka, India
Experience: 4 yrs 8 mos
Skills
- Data Engineering
- Cloud Computing
- Etl
- Web Development
- Data Science
Career Highlights
- Led migration to modern data infrastructure, boosting efficiency by 30%.
- Developed GenAI-powered platforms, reducing triage time by 60%.
- Achieved 99.93% accuracy in COVID-19 prediction using machine learning.
Work Experience
Optum
Senior Data Engineer (1 yr 4 mos)
Data Engineer (2 yrs 6 mos)
Dbug Technicals
Web Development Intern (2 mos)
Learnation
Web Development Intern (1 mo)
KJ Somaiya College of Engineering, Vidyavihar
Machine Learning Intern (1 mo)
Indian Society for Technical Education (KJSCE)
Joint Technical Head (10 mos)
Education
Bachelor of Technology - BTech at KJ Somaiya College of Engineering, Vidyavihar