Jitendra Shah

Engineering Manager

Bengaluru, Karnataka, India7 yrs 1 mo experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Architected a Modern Lakehouse on AWS.
  • Built frameworks for Reverse ETL and data governance.
  • AWS Community Builder 2022.
Stackforce AI infers this person is a Data Engineer specializing in Healthcare data solutions with expertise in cloud architecture.

Contact

Skills

Core Skills

Data EngineeringData ArchitectureData Analysis

Other Skills

AWSAWS EMRAWS GlueAWS LambdaAirflowAmazon AthenaAmazon EC2Amazon Elastic MapReduce (EMR)Amazon RedshiftAmazon Relational Database Service (RDS)Amazon S3ApacheApache FlinkApache HudiApache Kafka

About

Hello and welcome to my profile. I appreciate your time and interest. Let me give you a quick overview of who I am and what I do. I am a Data Engineer with a B.Tech degree in Computer Science from SMU and an AWS Certified Data Analytics - Specialty credential. I have over five years of experience in the health tech industry, working with various cloud platforms and open-source software to develop and implement end-to-end data pipeline architectures. Currently, I am a Data Engineer III at Connect and Heal - CNH Care, a leading digital health platform that connects patients and providers. In my current role, I have architected a cutting-edge Modern Lakehouse leveraging AWS cloud infrastructure and Apache HUDI, empowering seamless support for analytics and data-driven use cases. I have also ensured robust data governance and data quality standards were integrated throughout the data platform design process, safeguarding data integrity and enhancing overall reliability. Additionally, I have built frameworks to support Reverse ETL use cases for application and data export features, and designed schema modeling in our Lakehouse to streamline analytics, minimizing query complexity and slashing data scanning costs through intelligent dataset linking. My passion lies in exploring and learning new technologies and trends, and applying them to solve real-world problems. I am always eager to take on new challenges and collaborate with other professionals in the field. I was also an AWS Community Builder 2022, a program that recognizes and supports AWS enthusiasts who share their knowledge and expertise with others. I enjoy contributing to the AWS community through blogs, webinars, and events. Thank you for visiting my profile. I hope you find it informative and engaging. If you have any questions or comments, please feel free to reach out to me. I look forward to hearing from you.

Experience

7 yrs 1 mo
Total Experience
2 yrs 6 mos
Average Tenure
1 yr 11 mos
Current Experience

Halodoc

Engineering Manager - Data

Jul 2024Present · 1 yr 11 mos · India · Hybrid

Connect and heal - cnh care

Data Engineer III

Jul 2022Jun 2024 · 1 yr 11 mos · Bengaluru, Karnataka, India

  • Built 0-1 and 1-N Data Platform Journey at CNH.
  • 1) Architected a cutting-edge Modern Lakehouse leveraging AWS cloud infrastructure and Apache Hudi, empowering seamless support for analytics and data-driven use cases.
  • 2) Ensured robust data governance and data quality standards were integrated throughout the data platform design process, safeguarding data integrity and enhancing overall reliability
  • 3) Build framework to support Reverse ETL usecase for application and data export features.
  • 4) Designed schema modeling in our Lakehouse to streamline analytics, minimizing query complexity and slashing data scanning costs through intelligent dataset linking.
AWSApache HudiData GovernanceData QualityReverse ETLSchema Modeling+2

Halodoc

3 roles

Data Engineer II

Promoted

Jan 2021Jun 2022 · 1 yr 5 mos

  • 1) Built Lakehouse architecture using Apache HUDI and AWS EMR.
  • 2) Built datawarehous using schema modelling - star schema in Redshift.
  • 3) Migrated ~800 mysql tables to Lakehouse
  • 4) Developed clickstream data pipeline for Braze and Amplitude.
  • 5) Created frameworks to onboard new data source or data assets in each layer of the platform.
  • Ingestion Framework
  • Extraction Framework
  • Processing Framework
  • ETL Framework
Apache HudiAWS EMRData WarehouseSchema ModelingData MigrationData Pipeline+2

Data Engineer I

Jul 2019Dec 2020 · 1 yr 5 mos

  • 1) Developed streaming data pipeline using Kafka, Flink, Elasticsearch and Kibana.
  • 2) Setup Flink and Elasticsearch in HA mode in ec2 instance.
  • 3) Did PoC for ETL using AWS Glue.
  • 4) Migrated traditional Pentaho jobs to Airflow.
  • 5) Optimized Redshift performance
  • Reduced storage by 40% by applying correct compression technique and introducing archival policy.
  • Reduced CPU leader node usage from 80% to 25% by reviewing and applying correct sortkey and distkey and optimizing the queries.
  • 6) Wrote many custom Airflow plugins to abstract the complex logic from Airflow dags.
  • 7) Hosted Airflow in Ec2 cluster in HA mode.
  • 8) Built clickstream data pipeline for Clevertap, Mixpanel and Appsflyer for Analytical teams.
KafkaFlinkElasticsearchKibanaAWS GlueAirflow+2

Data Analyst Intern

Jan 2019Jun 2019 · 5 mos

  • 1) Predicted the Quality of consultations using the ML algorithm. Developed a model using Gradient Boosting Classification (GBC) with an accuracy of ~75%.
  • 2) Automated the cohort analysis graph that gives insights to product manager about the retention rate of users across the services.
  • 3) Analysed the ERX issued datasets and provided the insights for the conversion rate of the Tele consultation to Medicine Delivery.
Machine LearningCohort AnalysisData Analysis

Education

Sikkim Manipal Institute of Technology - SMU

B.Tech — Computer Science

Jan 2015Jan 2019

Stackforce found 100+ more professionals with Data Engineering & Data Architecture

Explore similar profiles based on matching skills and experience