Pardha Kanumuri

Data Engineer

Kent, Ohio, United States7 yrs 5 mos experience

Key Highlights

Designed scalable ETL pipelines for healthcare data.
Optimized data processing workflows, improving efficiency by 40%.
Developed a digital health platform enhancing patient management.

Stackforce AI infers this person is a Data Engineer specializing in Healthcare and Fintech data solutions.

Contact

Skills

Core Skills

Data EngineeringEtlBig Data AnalyticsSoftware DevelopmentMicroservices

Other Skills

KafkaSpring BootSnowflakePySparkPythonJavaScalaDatabricksCosmos DBAzure Blob StorageGraphQLDockerApache SparkSplunkHive

About

👋🏽 Hello, my name is Pardha Kanumuri. 📚 In the past, I worked as a Data Engineer and Java Developer, successfully delivering projects across the healthcare and financial sectors. My roles involved streamlining data pipelines, developing robust backend services, and optimizing data processing workflows to improve efficiency and reliability. 💻 Currently, I am leveraging my skills in technologies like Spark, Kafka, Spring Boot, and cloud platforms like Azure and Snowflake to design and implement scalable data solutions for UnitedHealth Group. My commitment to excellence and delivering value drives my work. 🚀 As for the future, I aspire to grow as a Senior Data Engineer, collaborating with diverse and innovative teams to develop transformative data solutions. I’m passionate about utilizing my technical expertise to enhance data-driven decision-making and drive business success. 🧘🏽‍♂️⚽💡🐾 In my free time, you can find me meditating, exploring tech innovations, playing cricket, and volunteering for community events. I also love spending time with pets and supporting international students in their journey. My resume highlights my professional and technical expertise, but there’s so much more to me than what’s on paper. I might be an introvert, but I’m always open to stepping out of my comfort zone to connect with new people and embrace diverse perspectives. I’m always open to meaningful conversations, collaborations, and learning opportunities. Feel free to reach out - I’d love to connect! Personal belief: Where there’s a will, there’s a way!

Experience

7 yrs 5 mos

Total Experience

1 yr 10 mos

Average Tenure

Current Experience

Unitedhealth group

Data Engineer

Oct 2025 – Present · 7 mos · United States · Remote

As a Data Engineer at UnitedHealth Group, I'm responsible for designing ETL pipelines for ingesting clinical data from more than 30 sources, including hospitals, internal systems, and third-party providers. Our goal was to ensure data accuracy, improve processing efficiency, and maintain regulatory compliance in a fast-paced healthcare environment. Working closely with data analysts, software engineers, and healthcare professionals, I helped design, develop, and optimize key data pipelines, focusing on scalability, security, and real-time processing.
Highlights:
Processed 30+ data sources by building Kafka + Spring Boot pipelines for FHIR & HL7 data, improving clinical data efficiency by 40%.
Optimized 5+ TB of healthcare data, reducing processing time by 25% with Snowflake and PySpark workflows.
Secured 1M+ patient records by implementing RBAC in Snowflake, ensuring 100% HIPAA, and SOC 2 compliance.
Built a Spring Boot application deployed on Azure Kubernetes Service to transform FHIR data into the ECDH model, validating and storing it in Azure Cosmos DB with 99.9% data accuracy while processing over 1 million records daily.
Developed efficient PySpark code in Databricks, optimizing data storage across Azure Cosmos DB and Snowflake environments, reducing processing time by 25% and effectively managing over 5 terabytes of large datasets.
Implemented Snowflake procedures, Snow pipes, and stages to ingest data from Azure Blob Storage, improving data ingestion efficiency by 35%, and configured streams for incremental data loading, reducing processing time by 30%.
Skills: Kafka | Spring Boot | Snowflake | PySpark | Python | Java | Scala | Databricks | Cosmos DB | Azure Blob Storage | GraphQL | Docker | Apache Spark | Splunk | Hive | Git | Microsoft Azure | Hadoop | Data Pipelines | Analytical Skills | Jenkins | REST APIs

KafkaSpring BootSnowflakePySparkPythonJava+18

Kent state university

Graduate Research Assistant

Aug 2023 – Dec 2024 · 1 yr 4 mos

Helped professors and students with Big Data, Cloud Computing, and Database coursework ensuring smooth lab sessions and projects data pipelines and managing databases using SQL, Python and Spark improving data analysis for research and student performance tracking.
Conducted review sessions and created tutorials/documentation on data modeling, normalization, and governance best practices, improving comprehension for diverse student groups.
Assisted in configuring AWS (S3, EC2) and Azure (ADF, Functions, AKS) environments for coursework, giving students exposure to cloud-native data engineering.
Conducted review sessions and created tutorials/documentation on data modeling, normalization, and governance best practices, improving comprehension for diverse student groups.

Optum

Data Engineer

Apr 2022 – Aug 2023 · 1 yr 4 mos · Hyderabad · Hybrid

As a Data Engineer at Optum (UnitedHealth Group), I became part of a team responsible for building and optimizing the Enterprise Clinical Data Hub (ECDH), a critical platform designed to streamline clinical data ingestion from 30+ sources, including hospitals, internal systems, and third-party providers. Our goal was to ensure data accuracy, improve processing efficiency, and maintain regulatory compliance in a fast-paced healthcare environment. Working closely with data analysts, software engineers, and healthcare professionals, I helped design, develop, and optimize key data pipelines, focusing on scalability, security, and real-time processing.
Highlights:
Collaborated with cross-functional teams including stakeholders, Business Analysts and QA to design and deliver end-to-end data pipelines, resulting in 30% faster deployment cycles.
Designed and implemented cloud-native pipelines in Databricks (PySpark, Delta Lake) to ingest HL7v2, FHIR, and CCDA datasets into Snowflake and reducing data preparation time by 35%.
Implemented Snowflake RBAC (Role-Based Access Control) to enforce fine-grained data access policies, ensuring data security and compliance with HIPAA/GDPR requirements.
Integrated Spark pipelines with Kafka and Cosmos DB, ensuring seamless ingestion of streaming ADT, Lab, ORU, ODX and RSP messages which directly impacted UHG’s health dashboards used by 5K+ clinicians nation-wide.
Reduced manual data collation time by 200 hours per month by building the ECDH (Enterprise Clinical Data Hub) application using Kafka, Java and Spring Boot to ingest data from 30+ FHIR/HL7 sources.
Wrote robust SQL queries and Python logic to validate and reconcile inconsistencies between NoSQL collections and normalized SQL datasets, reducing data integrity issues by 80%..
Reduced data processing time by 35% by implementing Snowflake procedures, Snowpipes, and streams for auto-mated incremental loading that cuts approximately 80 hours/month of manual effort.

JavaScriptMavenData EngineeringETL

Dxc technology

Data Engineer

Jun 2019 – Apr 2022 · 2 yrs 10 mos · Chennai · Hybrid

Joining DXC Technology, I became part of a dynamic team responsible for transforming data into actionable insights for SmartOps, an advanced platform driving Predictive and Preventive Maintenance at Stellantis. My role involved designing scalable data pipelines, enhancing real-time processing, and improving system efficiency to support mission-critical operations across 70+ business units. I worked closely with engineers, analysts, and system architects, ensuring seamless data flow, optimized query performance, and high availability for complex datasets. The challenge wasn’t just managing huge volumes of data, but making it faster, more accessible, and insightful for decision-makers.
Highlights:
Engineered batch processing pipelines using Spark (Scala) for the SmartOps Platform in an Agile environment, im-proving resource management and allocation across 70+ operational units.
Migrated legacy EHR data to AWS Data Lake using Delta Lake for ACID compliance, utilizing AWS Glue to catalog and prepare over 2 million patient records for secure downstream access, improving query performance by 30%.
Processed structured (Parquet, Avro, CSV) and semi-structured (JSON, XML) data formats that optimizes storage and schema design which reduced storage costs by 20% and improved data retrieval speed by 40%.
Managed Jenkins CI/CD pipelines, achieving a 95% deployment success rate and ensuring consistent production reliability.
Skills: Apache Spark | Apache Kafka | Scala | Hive | Hadoop | Snowflake | Ni-Fi | Microsoft Azure | AWS | HDFS | REST APIs | SQL | Databricks | Data Pipelines | Java Development | Big Data Analytics | Jenkins | Git | Data Loading

Apache SparkApache KafkaScalaHiveHadoopSnowflake+14

Aditya birla idea payments bank limited

Software Engineer

Jul 2017 – Jun 2019 · 1 yr 11 mos · India · On-site

Contributed to the development of a unified digital health platform for managing patient records, hospital operations, and insurance claim workflows across the Fortis hospital network. The project focused on automating key clinical and administrative tasks including patient onboarding, diagnostics, discharge summaries, and reimbursement processes, while maintaining data security and healthcare interoperability standards.
Highlights:
Contributed to the design & development of digital payments banking platform, which supported 10000+ accounts and helped achieve a 200% increase in user retention.
Implemented secure REST APIs with input validation, centralized exception handling and secured user access by implementing the OAuth2 authentication protocol in compliance with security standards.
Built and deployed user authentication and onboarding microservices using Spring Boot, reducing login failures by 80% and improving onboarding success by 50%.
Integrated NSDL and Aadhaar-based e-KYC services using X.509 certificate parsing, enabling document-free patient verification and reducing onboarding time by 60%.
Streamlined CI/CD pipelines using Git, Jenkins, and Docker, cutting release time by 60%.
Skills: Spring Boot| Java | AWS Lambda| AWS S3| PostgreSQL| Jenkins| X.509 Certificates| PDF Generation| HIS Integration| REST APIs| Microservices Architecture