Ishan Bhagria

Software Engineer

Ludhiana, Punjab, India4 yrs 10 mos experience

Key Highlights

  • Reduced data infrastructure costs by up to 70%.
  • Built a robust Scala-based Central Data Platform.
  • Onboarded over 1500 datasets for analytics.
Stackforce AI infers this person is a Data Engineer specializing in building scalable data platforms and ETL solutions.

Contact

Skills

Core Skills

Data EngineeringEtl

Other Skills

AirflowAlgorithmsApache IcebergArduino programmingAzureAzure Data FactoryAzure Data LakeAzure DatabricksC (Programming Language)C++ClickHouseCloud ComputingCloud Computing IaaSCommunicationCryptography

About

I’m a Data Engineer with hands-on experience building large-scale data platforms and robust ETL pipelines at PhysicsWallah and Shell. Currently, I’m part of the core data team at PhysicsWallah where I work on a Scala-based Central Data Platform using Apache Iceberg, Trino, and Kafka, enabling analytics, business dashboards, and AI/ML solutions. I’ve worked on optimising DAG scheduling in Airflow, migrated terabytes of data from legacy systems, and helped reduce data infra costs by up to 70%. My toolkit includes PySpark, SQL, DBT, Airflow, and Azure – and I thrive in fast-paced environments where data meets impact.

Experience

4 yrs 10 mos
Total Experience
1 yr 3 mos
Average Tenure
11 mos
Current Experience

Blinkit

Senior Data Engineer

Jul 2025Present · 11 mos · Gurugram, Haryana, India

Pw (physicswallah)

SDE-1

Oct 2024Jul 2025 · 9 mos · Remote

  • Central Data Platform Development: Contributed to building PhysicsWallah’s Scala-based Central Data Platform (CDP), built on Iceberg table format, hosted on Trino (Presto) structured using Medallion Architecture. Integrated Debezium for real-time CDC streams into Kafka from various data sources such as MongoDB.
  • Large-Scale Data Onboarding: Onboarded 1500+ datasets from APIs, gsheets, Kafka topics, and MongoDB into the CDP, enabling decommissioning of multiple costly warehouses and reducing storage/query cost by up to 70%.
  • AI & Business Enablement: Delivered high-quality, analytics-ready datasets that powered sales dashboards and AI/ML models such as AI Guru.
  • DBT Model Engineering: Designed and developed DBT models on top of CDP tables to support key business intelligence use cases.
  • Airflow Enhancements: Introduced YAML-based task-level configuration and dataset-triggered DAG scheduling with inter-DAG dependencies, reducing DAG run and report generation time by 50%.
  • Data Migration & Transformation: Executed historical data migration from Redshift to CDP to replicate legacy systems. Transformed unstructured MongoDB data using PySpark and migrated to ClickHouse, driving actionable business insights.
ScalaApache IcebergTrinoKafkaPySparkDBT+5

Shell

Data Engineer

Aug 2023Oct 2024 · 1 yr 2 mos · Bengaluru, Karnataka, India · Hybrid

  • Extensively worked on Azure cloud platform to create a datalake having data for numerous LOBs.
  • Experience in performing ELT on streaming data using Azure Data Factory.
  • Hands on experience in Spark Core and SparkSQL-Transformation and actions.
  • Hands on experience with Databricks, Spark, Azure Cloud and its services
  • Developed the ETL pipeline to migrate and ingest batch data for various business units in data warehouse from their legacy platforms.
  • Developed and delivered datamart and views related to KPI for business use case.
  • Collaborated with internal stakeholders, identifying and gathering analytical requirements for customer, product and project's needs.
AzureAzure Data FactorySparkDatabricksData EngineeringETL

Chegg india

MANAGED NETWORK EXPERT

Aug 2020Aug 2022 · 2 yrs · India

  • MNE,Other maths

Education

Thapar Institute of Engineering & Technology

Bachelor of Technology - BTech — electronics and computer engineering

Jan 2019Jan 2023

sacred heart convent school

senior secondary — Mathematics

Jan 2006Jan 2019

Stackforce found 100+ more professionals with Data Engineering & Etl

Explore similar profiles based on matching skills and experience