Mohit R Sudhera

Data Engineer

Gurugram, Haryana, India14 yrs 8 mos experience

Most Likely To SwitchHighly Stable

Key Highlights

Led cloud-native data engineering initiatives in healthcare.
Expert in optimizing data pipelines for real-time analytics.
Proven track record in cross-functional leadership.

Stackforce AI infers this person is a Data Engineering Leader in Healthcare with expertise in cloud-native solutions.

Contact

Skills

Core Skills

Data EngineeringData ArchitectureBig Data AnalyticsBig DataJava

Other Skills

837 EDI x12 data837I837PAd Hoc ReportingAnalytical SkillsApache KafkaApache Spark StreamingAzure DatabricksCore JavaData AnalyticsData ModelingDesign PatternsDistributed SystemsGitHBase

About

Data & Cloud Engineering Leader | 11+ Years Driving Scalable, AI-Ready Platforms in Healthcare I build and lead high-performing engineering teams that turn complex, high-volume data into actionable insight. As a Data Strategy Leader at Optum, I lead, design, standardize, and optimize cloud-native stacks—Spark, Kafka, Snowflake, Azure Databricks—to power real-time analytics, AI/ML pipelines, and low-latency reporting for one of the world’s largest healthcare providers. Key strengths: • End-to-end architecture of resilient, petabyte-scale data platforms • Hands-on expertise across Data Engineering Stack - Databricks, Snowflake, Spark, Kafka, SQL, Scala, Python, Shell Scripting • Proven track record of boosting analytical solutions performance and reducing data-pipeline latency for enterprise-wide users • Cross-functional leadership—partnering with business stakeholders, product owners, and vendors to unlock data value and accelerate innovation • Passion for continuous improvement: mentoring engineers, codifying best practices, and championing a “better-every-day” culture My mission is simple: use data engineering excellence to improve healthcare outcomes. If you’re interested in collaborating on transformational data initiatives—or just want to exchange ideas—let’s connect.

Experience

14 yrs 8 mos

Total Experience

2 yrs 5 mos

Average Tenure

2 yrs 3 mos

Current Experience

Optum

2 roles

Sr. Manager Data Engineering

Mar 2025 – Present · 1 yr 2 mos · Gurugram, Haryana, India

Sr. Manager Architecture

Jan 2024 – Feb 2025 · 1 yr 1 mo · Gurugram, Haryana, India

Unitedhealthcare

2 roles

Principal Data Engineer

Promoted

Jun 2022 – Dec 2023 · 1 yr 6 mos · Richardson, Texas, United States

Prepare end-to-end lifecycle for onboarding existing projects and devise out strategies to identify the scope of optimization, layout delivery plan and estimate cost-benefit from
the solutions developed.
Architect and design curated data layer to standardize reporting solutions across the organization, with the motive to minimize gaps between data structures, operational complexity, and metrics definitions.
Responsible for planning resource strategies for upcoming projects, build roadmap for them, and understand team’s bandwidth.
Create and present value-story updates to UHC E&I Advocacy leadership on ongoing efforts in Data and Cloud Infrastructure and Engineering capabilities.
Partner with digital product managers and deliver robust cloud-based solutions that drive powerful experiences to help business achieve financial empowerment.
Collaborate with and across agile teams to design, develop, test, implement, and support technical solutions in full-stack analytics solutions and technologies.
Conduct reviews with other team members to ensure applications are rigorously designed, elegantly coded, and effectively tuned for performance.
Design business-specific checkpoints to enable data workflows in pro-actively handling potential failures/leaks in data ingestion services.
Leverage company’s proprietary tools to design high-performant distributed systems, which ensure safe and seamless migration of on-premise healthcare data to Azure/Snowflake Cloud environment and outmaneuver any data uncertainties.
Own data environments, integrate with new technologies, and oversee the development of new processes that support teams across the organization.
Layout effective data access policies to ensure strategic data-governance in cloud environment for PHI/PII data.
Analyze operating workspace’s data infrastructure and resourcing needs of the team and forecast budgetary aspect of upcoming year in accordance with the same.

SnowflakeDesign PatternsData ModelingData EngineeringAzure DatabricksData Architecture

Senior Data Engineer

Feb 2021 – Jun 2022 · 1 yr 4 mos · Richardson, Texas, United States

Partner with External Vendors to establish interface for incoming claims, in order to enrich the Fraud Waste and Abuse model network, which facilitates identification of malicious claims with higher rate of accuracy.
Optimize and resolve high-latency issues occurring in Tableau dashboards, with utilizing Snowflake as a data storage and compute system. In the very implementation, minimized response time of the dashboards from 120 seconds to ~15 seconds , with effective implementation of clustering, search-optimization and micro-pruning techniques.
Design and implement data governance policies while ensuring security standards across claims’ data feed, and educate cross functional teams on best practices of accessing restricted PHI/PII data.
Develop scalable data driven pipelines against a given business problem, with the required solution and application features by determining the appropriate programming language and leveraging business, technical, and data requirements.
Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack analytics solutions and technologies.
Analyze data-related system integration challenges and propose appropriate solutions.
Train and mentor lateral hires/new joiners with technical as well as functional knowledge that drives FWA analytics .
Deliver exhaustive knowledge sessions on Apache Kafka to multiple teams, with the purpose of educating associates on employing Kafka in data engineered pipelines, and enable them to align with enterprise-wide adoption of Kafka as a real-time messaging system.

Analytical SkillsApache Spark StreamingSnowflakeApache KafkaData EngineeringShell Scripting+1

Optum

2 roles

Associate Data Engineer

Promoted

Jun 2019 – Feb 2021 · 1 yr 8 mos

Create Spark streaming based Kafka producer and consumer applications, to subscribe to Claims’ inlet network of the organization in order to consume claims x12 feed on real time basis, and deliver analytical results to Claims Pre-processing team and Claims Analytic Engines.
Employ Schema Registry and Maven-Avro compiler to develop end-to-end Kafka based
applications.
Utilize HBase for establishing Kafka’s Offset Management feature in streaming applications.
Employ data monitoring models to calculate key metrics pertaining to data funneling through Data Engineered pipelines.
Perform load/performance testing for Kafka based applications while collaborating with source and target teams.
Review code base developed by fellow team members, and provide feedback highlighting the potential areas where optimization could be needed in Spark driven applications.
Strategically integrate Data Engineered Pipelines to ensure auto-recovery of data leakage, with robust monitoring alerts and quality checks in place.
Always on the lookout for possibilities to enrich, automate tasks, and build reusable components that can be leveraged across multiple use cases and teams.
Mentor newcomers/lateral hires on conventional methodologies and big data technologies, utilized in developing and implementing data driven models.
Devise out project plan along with associated features and user stories for Data Engineering projects, and registering the same in CA Rally.
Create project documentation in Confluence space, outlining overall workflow of the project and enabling support team to monitor and support it whenever needed.
Actively involved in constructing Scala specific spark streaming applications to build and channelize critical data pipelines from formed batches, using Spark DStream objects, into Hive and HBase tables (responsible for accumulating healthcare data).

Data Engineering Analyst

Jan 2017 – Jun 2019 · 2 yrs 5 mos

A member of fraud analytics team who is actively responsible for developing intelligent and fault-tolerant big data applications with the purpose of flagging potential fraudulent claims by analyzing 837 EDI Files, Providers, and NPPES healthcare data in an automated manner.
1. Built MapReduce jobs to form batches of incoming 837 EDI data files.
2. Developed, on demand, Spark - Java/Scala and Unix based reporting applications to fetch
data from Hive and HBase tables to be utilized for the analysis by stakeholders or
leadership teams.
3. Optimized Spark Jobs for the efficient utilization of the cluster.
4. Created Stored Procedures to maintain metadata and Triggers for audit purpose in
MSSQL Server Database.
5. Developed generic microservices to be utilized by Spark applications.
6. Utilized Github as a version control platform for developed applications.
7. Developed application manager services to keep a check on health of dedicated edge
node and applications running on Yarn.
8. Possess efficient business knowledge and understanding of 837 EDI Files, Provider, NPPES
and NDC healthcare data.
9. Understanding in identifying Fraud Waste and Abuse of healthcare claims.

Tata consultancy services

Systems Engineer

Jan 2014 – Jan 2017 · 3 yrs · Mumbai Area, India

1. Developed offline spark jobs to provide provide data to various lines of business across
organization.
2. Optimized Hive and Spark SQL jobs.
3. Wrote Hive UDFs to handle various business scenarios.
4. Built real-time data pipelines using IBM WPS.
5. Developed mappings that perform Extraction, Transformation and Load of source data
into Derived Masters schema using various power center transformations like Source
Qualifier, Look Up, Expression, Router, Stored Procedure and Update Strategy to meet
business logic in the mappings.
6. Carried out unit testing and user acceptance testing for the developed interfaces.
7. Performed Impact Analysis by analyzing CRs received from the application teams.

Apache Spark StreamingHiveMapReduceSqoop837 EDI x12 dataJava+8

Tcy learning solutions (p) ltd.

Centre Coordinator

Jun 2013 – Dec 2013 · 6 mos · Ludhiana Area, India

Was solemnly responsible for administering the Training Centre, IELTS and Personality Development Faculty. Organized various events and did students counselling as an additional work while working in the same organization.

WebSphere Process ServerHiveBig DataInformatica

National informatics centre, govt of india

Information Technology Intern

Jan 2013 – May 2013 · 4 mos · New Delhi Area, India

I completed my internship program at the National Informatics Centre, NIC-NEW DELHI, under the guidance of Mr. Alok Roy, who holds the position of Technical Director at NIC, New Delhi. He is renowned for his exceptional contributions to the Ministry of Information and Communication Technology.
Under his mentorship, I not only acquired a solid foundation in the fundamental programming principles of STRUTS 2.0 but also cultivated valuable skills in collaborating within a corporate team, demonstrating adaptability, and resourcefulness. Throughout my tenure, I actively engaged with STRUTS 2.0 on the Eclipse Platform and successfully developed a Change Management System (CMS) application. This application facilitates the handling of change requests from users within any organization, encompassing various organizational levels, including USER, TEAM LEADER, PROGRAMMER, and TESTER. Furthermore, it incorporates a feedback loop, allowing for revisions and eventual approval by the user after rectification of any errors.

Guru nanak dev engineering college, ludhiana

Student Co-ordinator, Training and Placement Cell

Mar 2011 – Mar 2013 · 2 yrs · Ludhiana Area, India

It brought me immense pride when, following a year of dedicated service in the Training and Placement Cell at Guru Nanak Dev Engineering College, I was selected as the Student Coordinator for my batch (2009-2013). This role proved to be a transformative experience during which I not only honed exceptional qualities of team leadership but also gained valuable insights into fostering students' personal development, equipping them with the skills necessary to excel in interviews with leading multinational corporations.
Throughout my tenure as Student Coordinator, I effectively oversaw numerous placement drives, encompassing prestigious organizations such as TCS, Microsoft, Mahindra & Mahindra, Nestle India, Thermax, Shapoorji Pallonji, ITC Pvt. Ltd., Punjlloyd, and HCL Technologies, to name just a few. This position demanded a high degree of multitasking ability, affording me invaluable exposure and enabling me to establish fruitful campus recruitment partnerships.

Oracle DatabaseHTMLSQLJavaStruts