SANKALP JAIN

Data Engineer

Bengaluru, Karnataka, India · 4 yrs 8 mos experience
Most Likely To Switch · AI Enabled

Key Highlights

  • Led migration to modern data infrastructure, boosting efficiency by 30%.
  • Developed GenAI-powered platforms, reducing triage time by 60%.
  • Achieved 99.93% accuracy in COVID-19 prediction using machine learning.
Stackforce AI infers this person is a Data Engineering expert in Healthcare and Web Development sectors.

Skills

Core Skills

Data Engineering, Cloud Computing, ETL, Web Development, Data Science

Other Skills

Agentic AI, Airflow, Anime.js, Apache Airflow, Apache Spark, Azure Data Factory, Azure Data Lake Storage, Azure DevOps Services, Azure Event Hubs, Azure Key Vault, Azure OpenAI, Azure SQL, Azure Stream Analytics, Azure Synapse Analytics, Big Data

About

Senior Data Engineer with over 3.5 years of experience in designing and implementing scalable data pipelines, distributed systems, and automated workflows. Skilled in Python, SQL, Airflow, Azure, Generative AI, and PySpark, with hands-on expertise in ETL/ELT processes, data warehousing, and CI/CD practices. Certified as an Azure Data Engineer Associate and Databricks Data Engineer Associate, with a strong focus on performance optimization and delivering production-grade data engineering solutions. Experienced in applying Generative AI for data enrichment, intelligent automation, and building LLM-integrated workflows.

At Optum (UnitedHealth Group), I’ve led high-impact initiatives such as:
  • Modernizing legacy COBOL infrastructure to Teradata and Python, improving processing efficiency by 30%.
  • Engineering distributed pipelines across 16 upstream sources, reducing data processing time by 40%.
  • Developing financial modules that enhanced reserve prediction accuracy by 30% and reduced manual intervention by 50%.
  • Creating reusable onboarding frameworks and data quality systems that ensured 99% accuracy across critical reporting tables.

My expertise also covers real-time streaming with Kafka. I’ve architected GenAI-powered platforms for ETL logic extraction and error resolution, delivering up to 60% triage time reduction and projected cost savings of $2.3M. I’m passionate about leveraging cutting-edge technologies, including LLMs, RAG pipelines, and prompt engineering, to solve complex data challenges responsibly and efficiently.

Beyond engineering, I’ve contributed to healthcare data solutions across financial, clinical, provider, and pharmacy domains. My research on COVID-19 outbreak prediction using machine learning reflects my commitment to applying data science for real-world impact.

I hold a Bachelor of Technology in Electronics & Telecommunications Engineering from K.J. Somaiya College of Engineering, where I built a strong foundation in data communication and signal processing. My early internships in web development helped shape my full-stack perspective and problem-solving mindset.

Experience

Total Experience: 4 yrs 8 mos
Average Tenure: 2 yrs 4 mos
Current Experience: 3 yrs 10 mos

Optum

2 roles

Senior Data Engineer

Promoted

Feb 2025 - Present · 1 yr 4 mos · Bengaluru, Karnataka, India

  • 1. Led RPS migration from COBOL Mainframe to Teradata, Python & Airflow—boosted data efficiency by 30% with zero-issue go-live via AI-driven pipelines and parallel testing.
  • 2. Recognized by leadership (Jayme, Rishi, Anoop, Sundara) for delivering 4 key modules: Unassigned Strings, Triggers, Bulk Uploads, and Schedule Date Simulator.
  • 3. Built distributed pipelines processing petabytes of data across 16 sources, reducing processing time by 40%.
  • 4. Developed Data Quality tool ensuring 99% accuracy across 13 reference tables; reduced manual validation by 50% via Airflow automation.
  • 5. Architected modular, scalable frameworks enabling seamless onboarding of new tables/sources without code changes.
  • 6. Resolved FDF Dashboard security issues, converted Docker images to Golden Images, and led cloud security KT sessions.
  • 7. Selected as GitHub Copilot Super User; featured with CIO Paul Waymouth and showcased in CDDS Townhall for Gen AI innovation.
  • 8. Won 2nd place in CDDS Novathon for Gen AI-based ETL logic extraction tool.
  • 9. Mentored 2 interns on AI-powered troubleshooting assistant; presented at AI Innovation Forum and Innovation Expo.
  • 10. First in Rishi’s Org to complete AI Dojo, now pursuing ML Dojo; supporting peers via AI Dojo Buddy Program.
  • 11. Speaker at CDDS Engineering Days on GenAI in SDLC; received Bravo recognition.
  • 12. Leading FSDB to FDF migration for Student Resource Revenue feed across 6+ data flows (DA, DI, PSGL, RPS, Correction, Reversal).
Python, Airflow, Teradata, Data Quality, Cloud Security, Data Engineering, +1

Data Engineer

Aug 2022 - Feb 2025 · 2 yrs 6 mos · Bengaluru, Karnataka, India

  • 1. Integrated petabytes of financial data from the VBR source into the Unified Data Warehouse, improving data accessibility.
  • 2. Ingested real-time data using a Kafka-based data streaming architecture, reducing data latency by 30%.
  • 3. Used DataStage, Python, SQL, and Airflow to design and implement efficient ETL jobs that extract, transform, and load diverse data from multiple sources into the Unified Data Warehouse, improving data accuracy by 20%.
  • 4. Automated pipeline processes with Python scripts, orchestrated workflows using Airflow, and ran CI/CD with Jenkins and GitHub, enhancing efficiency by 35%.
  • 5. Conducted code reviews, documentation, and knowledge transfer (KT) sessions, mentoring team members on developed modules and new advancements in the data engineering domain, enhancing team capabilities and knowledge retention.
Kafka, DataStage, Python, SQL, Airflow, Data Engineering, +1

Dbug technicals

Web Development Intern

Jan 2021 - Mar 2021 · 2 mos · Mumbai, Maharashtra, India

  • Debug Technicals is a company that helps clients reach their business objectives by studying the business from every angle, with support for branding, marketing, accounting, and development. Its services include website/application development; design solutions covering web design (UI/UX), visual design, and graphic design; digital marketing covering brand strategy, branding, social media marketing, email marketing, and SEO; financial reporting, taxation, and auditing; and other solutions to enhance online business.
  • Guided by Debabrata Dash (API Platforms Engineer @Barclays) and Maulik Tanna (Product Manager @neoXL).
  • https://testwebsitedev.netlify.app/
  • Achievements/Tasks:
  • 1. Involved in designing the user interface of the website.
  • 2. Followed the mobile-first approach while designing the website.
  • 3. Designed the website using HTML5, CSS3, JavaScript, Anime.js, and BrowserStack, which are the core tools for building the frontend of the website.
  • 4. Learned to animate the website using CSS and Anime.js.
  • 5. Learned the use of polyfills from polyfill.io to achieve cross-browser compatibility.
  • 6. Performed search engine optimization (SEO) for the website.
  • 7. Learned how to test the website using BrowserStack.
  • 8. Coordinated well with the frontend team.
HTML5, CSS3, JavaScript, Anime.js, Web Development

Learnation

Web Development Intern

Sep 2020 - Oct 2020 · 1 mo · Mumbai, Maharashtra, India

  • Learnation is a startup where students can work and earn as teachers by teaching juniors and helping them in academics and technical courses. Students who want to learn can avail of the benefits of this service at a nominal price.
  • Guided By: Avanish Batkulia (Learnation Founder)
  • Website: https://learnation2020.herokuapp.com/
  • Achievements/Tasks:
  • 1. Involved in designing the user interface of the website.
  • 2. Followed the mobile-first approach while designing the website.
  • 3. Designed the website using HTML5, CSS3, JavaScript, and Bootstrap, which are the core tools for building the frontend of the website.
  • 4. Learned how to manage, use, and share code using a version control platform like GitHub.
  • 5. Coordinated well with the frontend and backend teams.
HTML5, CSS3, JavaScript, Bootstrap, Web Development

KJ Somaiya College of Engineering, Vidyavihar

Machine Learning Intern

Jun 2020 - Jul 2020 · 1 mo · Mumbai, Maharashtra, India

  • Guided By: Professor Ninad Mahendale Sir and Professor Mahesh Warang Sir
  • Paper: https://link.springer.com/article/10.1007/s40745-020-00314-9
  • Achievements/Tasks:
  • 1. Worked on the research paper "Outbreak prediction of COVID-19 patients for dense and populated countries using machine learning" with my team and professors.
  • 2. Performed data preprocessing and data cleaning.
  • 3. Built various machine learning models like Support Vector Regressor, Bayesian Ridge Polynomial Regressor, Linear Regressor Polynomial, XGBoost Regressor, and Random Forest Regressor to determine the total number of COVID-19 patients admitted on a particular day for different dense and populated countries.
  • 4. Created data visualizations using seaborn, matplotlib, and plotly to present the data more clearly.
  • 5. Worked with Random Search CV and Grid Search CV to get the best set of parameters for our dataset.
  • 6. Achieved the highest accuracy of 99.93% for the prediction of COVID-19 cases on a particular day in India using the above models and a few other models of my teammates.
  • 7. Submitted our research paper for review to SSRN.
  • 8. The research paper written during the internship was published in the Springer Nature journal "Annals of Data Science".
Machine Learning, Data Preprocessing, Data Visualization, Data Science

Indian Society for Technical Education (KJSCE)

Joint Technical Head

Aug 2019 - Jun 2020 · 10 mos · KJSCE

Education

KJ Somaiya College of Engineering, Vidyavihar

Bachelor of Technology - BTech — Telecommunications Engineering

Aug 2018 - Jun 2022
