Prakash Choudhary

DevOps Engineer

Noida, Uttar Pradesh, India · 11 yrs 6 mos experience
Most Likely To Switch · Highly Stable

Key Highlights

  • Led migration to Apache Iceberg for GDPR compliance.
  • Achieved significant cost reductions in data processing.
  • Expert in building scalable data solutions.
Stackforce AI infers this person is a Data Engineering expert specializing in Big Data and Cloud solutions.

Skills

Core Skills

Apache Iceberg · AWS · Databricks · Apache Airflow

Other Skills

PySpark · Trino · Terraform · Amazon Web Services (AWS) · Delta table · Amazon QuickSight · Spark · Azure Databricks · Big Data · Apache Hudi · Python (Programming Language) · Pandas · Power BI · MongoDB · Data Analysis

About

I currently lead Batch ETL at Delhivery as a Lead Data Engineer, with 10+ years of experience. My work spans designing, developing, and managing Lakehouse pipelines, Data Lakes, Data Mesh architectures, and microservices. I work across a diverse Big Data stack, including Spark, Kafka, Airflow, Terraform, Iceberg, and Databricks, and I have deep expertise in the AWS Cloud, working extensively with services such as S3, Glue, SQS, and Lambda. I also have significant experience with distributed databases, including TiDB, MySQL, PostgreSQL, and MongoDB, which lets me build scalable, robust, and cost-effective data products that drive business value at Delhivery.

Key Achievements:

  • Real-Time Data Solutions: Built Kafka-based pipelines for ingestion and transformation, using Delta Lake, AWS Glue, Terraform, and Databricks to enable real-time decision-making.
  • Cluster Optimization: Automated PrestoSQL/Trino cluster scaling with AWS Lambda, reducing scaling time by 85%.

Technical Proficiencies:

  • Programming: Python, SQL (Expert)
  • Big Data Tools: Apache Kafka, PrestoSQL, PySpark, Hadoop, Hive, Airflow
  • Databases: MySQL, PostgreSQL, MongoDB
  • Cloud Services: AWS (S3, Glue, Lambda, Athena, EC2, RDS, CloudWatch, SNS)
  • Infrastructure as Code: Terraform
  • Version Control: Git, GitHub
  • ETL and Data Pipelines: Designing efficient ETL workflows and real-time streaming pipelines

I am passionate about optimizing workflows, improving data accessibility, and building innovative solutions that drive business outcomes.

Experience

Total Experience: 11 yrs 6 mos
Average Tenure: 5 yrs 9 mos
Current Experience: 10 yrs

Delhivery

5 roles

Lead Data Engineer

Promoted

May 2025 – Present · 1 yr 1 mo

  • Led the migration of 150+ legacy Hive Parquet tables to Apache Iceberg, enhancing GDPR/DPDPA compliance.
  • Achieved a 15% reduction in overall S3 costs and a 5% improvement in query performance.
  • Optimized Spark processes by redesigning Delta Upsert, resulting in a 50% reduction in daily costs.
PySpark · Apache Iceberg · Apache Airflow · Trino · Terraform · Amazon Web Services (AWS) · +3

Sr Data Engineer

Promoted

Oct 2022 – May 2025 · 2 yrs 7 mos

  • Delta Lakehouse: Designed and implemented the Optimized Package Waybill Lakehouse solution, reducing batch run time by 40% and costs by 50%.
  • Spark Batch Job to Spark Batch Streaming Migration: Migrated an Apache Airflow-based Databricks workflow that listed S3 files to process in batch and ran 15 Delta table pipelines to upsert the latest data. The new setup uses Spark Batch Streaming and Schema Registry Sync, which allow better job management and schema evolution.
Azure Databricks · Big Data · PySpark · Amazon QuickSight · Apache Airflow · Delta table · +3

Data Engineer

Promoted

Jan 2020 – Oct 2022 · 2 yrs 9 mos

Python (Programming Language) · Azure Databricks · PySpark · Amazon QuickSight · Trino · Apache Airflow

Senior Reporting Analyst

Mar 2019 – Jan 2020 · 10 mos

Data Quality Analyst

Jun 2016 – Mar 2019 · 2 yrs 9 mos

Mindtree

Data Analyst

Dec 2014 – Jun 2016 · 1 yr 6 mos · Bangalore Area, India

Ashapurna Infotech, Jodhpur

Android Developer

Jun 2014 – Oct 2014 · 4 mos · Jodhpur, India

  • Worked as an Android developer.

Education

MECRC, Jodhpur

Bachelor of Technology (B.Tech.) — Computer Science

Jan 2010 – Jan 2014
