Nishant Panwar

Director of Engineering

Gurugram, Haryana, India · 11 yrs 1 mo experience

Key Highlights

Architected a 70TB+ Hudi data lakehouse platform.
Saved $40,000/month through AWS cost optimization.
Led migration of data to BigQuery, enhancing analytics.

Stackforce AI infers this person is a Data Engineering expert in cloud-based solutions and data architecture.

Skills

Core Skills

Data Architecture, Amazon Web Services (AWS), Apache Kafka, Data Pipelines, Apache Spark Streaming, Data Warehousing, AWS Glue, Big Data, Hadoop, ETL, Spark

Other Skills

AWS CloudFormation, AWS Identity and Access Management (AWS IAM), AWS Lambda, AWS SageMaker, AWS Step Functions, Airflow, Amazon Athena, Amazon DynamoDB, Amazon EC2, Amazon Elastic MapReduce (EMR), Amazon Kinesis, Amazon Redshift, Amazon Relational Database Service (RDS), Amazon S3, Amazon Simple Notification Service (SNS)

About

Over 9.5 years of experience in data engineering, building data infrastructure, data platforms, data lakes, data lakehouses, data warehouses, and data pipelines. Architected a scalable 70TB+ Hudi data lakehouse platform that processes 50K messages per second and achieves 30-second query response, enabling seamless data access, improved governance, and analytics across the organisation. Led migration of existing data to BigQuery, saving ~$30,000 per month.

Experience

Total Experience: 11 yrs 1 mo

Average Tenure: 1 yr 11 mos

Current Experience: 1 yr 2 mos

Nielsen

Engineering Manager

Apr 2025 – Present · 1 yr 2 mos · Gurugram, Haryana, India · Hybrid

For over 100 years, Nielsen has been a global leader in audience measurement, data and analytics, shaping the future of media. It measures behaviour across all channels and platforms to discover what audiences love, and it empowers clients with trusted intelligence that fuels action.
As an Engineering Manager at Nielsen, I lead the design, development, and delivery of large-scale data lake and data platform solutions. I focus on architectural leadership, strategic planning, and technical guidance, balancing engineering priorities with business objectives.
Key achievements:
Delivered strategic cost optimisation of $700K/year.
Leveraged AI to reduce development and debugging effort by 40%.
Provide architectural blueprints and technical leadership to the engineering team.
Evaluate and recommend tools, technologies, and processes to ensure a high-quality product platform.
Led the design and development of large-scale data lake and data platform solutions at Nielsen.
Focused on architectural leadership and strategic planning to align engineering priorities with business objectives.
Successfully migrated Talend-based ETL jobs to AWS EMR-PySpark, achieving an 80% reduction in runtime.

Data Architecture, Spark, Python (Programming Language), Amazon Web Services (AWS), Delta Lake, Apache Kafka, +6

Ecom Express Limited

Lead Data Engineer - Data Platform

Sep 2023 – Apr 2025 · 1 yr 7 mos · Gurugram, Haryana, India · Hybrid

Led the platform data engineering team; responsible for architecting a scalable 70TB+ Hudi data lakehouse platform that achieves 30-second query response, enabling seamless data access, improved governance, and analytics across the organisation
> PySpark, EMR, MSK, Debezium, Python, SQL, EC2, Apache Hudi, Docker, Prometheus, Lambda, Grafana, Athena and Airflow
Engineered a Change Data Capture (CDC) architecture with Debezium 2.7 and Kafka Connect clusters for real-time CDC ingestion, improving scalability, reducing operational overhead by 99%, and cutting costs associated with MySQL queries by 40%
Led migration of Shipments multi data to BigQuery, saving ~$3,000 per month
> MySQL, CDC, EC2, Debezium, Python, Kafka, BigQuery
Led AWS cost optimization initiatives, saving the organization ~$40,000/month
Created Athena workgroups and optimized queries to enable secure, cost-effective data access and analytics across multiple teams
Designed, coordinated, and executed pilots, prototypes, and POCs (e.g., Flink, Iceberg, Databricks, BigQuery, Snowflake, Apache Pinot, DMS, Redshift); documented and shared technical best practices
Developed a centralized automated alerting and monitoring system using Grafana and Prometheus
Responsible for hiring and mentoring a strong engineering team across the organisation while setting technical direction across backend, cloud infrastructure, security, quality, and DevOps
Design, Architecture, Code Reviews, Cloud Infra, Team Management, Hiring, Mentoring
Built CI/CD pipelines, standardizing deployments and cutting code deployment time by 30%
Maintained internal forks of Apache Hudi 0.14.1 and fixed multiple open Hudi issues in lower versions for production hotfixes
Implemented a highly scalable, completely serverless architecture for CDC data ingestion for multi-cloud use cases

Systems Design, Apache Kafka, Amazon Relational Database Service (RDS), Azure Data Factory, Problem Solving, Stakeholder Management, +21

Lowe's India

Senior Data Engineer

Oct 2021 – Sep 2023 · 1 yr 11 mos · Bengaluru, Karnataka, India · Hybrid

Built a Spark Streaming data pipeline to ingest real-time pricing data and generate real-time analytical insights
Migrated the existing application from Hive to Spark 3.0, reducing processing time to ~35%
Developed an automated data quality framework to ensure data quality across all data sources, improving accuracy
Developed 10+ Airflow DAGs for pipeline automation, scheduling, and monitoring, reducing maintenance tasks by 60%
Automated the alerting system for job monitoring using Airflow, shell scripting, Python, and Slack, saving 1.5 hrs/day per developer
Performed deep dives into massive data sets to answer key business questions, creating data pipelines and Spark jobs for analysis

Big Data, Stakeholder Management, DevOps, Data Pipelines, Technical Requirements, Airflow, +6

GE Healthcare

Senior Data Engineer

Aug 2019 – Oct 2021 · 2 yrs 2 mos · Bangalore

▪ Project: Migration of the on-premise GE Healthcare Services Enterprise Data Lake to a cloud-based solution on AWS.
Built "One Data Platform", where terabytes of data are sourced from multiple systems across various LOBs into a data lake built on the cloud (S3).
Architected and implemented a reusable data quality framework for the Central DataLake on S3, enabling efficient and consistent data quality management.
Engineered a versatile data compaction framework that effectively reduced data overhead, achieving an exceptional 70% decrease in data fat from incoming sources.
Designed and developed an automation framework that streamlined routine development tasks, yielding a remarkable 4-hour reduction in daily effort per engineer.
Provided mentorship and guidance to a team of 4+ members, empowering them to implement optimized solutions and successfully navigate technical challenges.
Developed data pipelines from scratch; optimized data aggregation from 30+ independent sources and automated the ETL development process.
Led the automation initiative for onboarding new data pipelines, resulting in ~ 80% reduction in manual efforts.
Implemented an SNS notification system that includes comprehensive error information for Data Pipeline activities.

Data Warehousing, AWS Glue, AWS SageMaker, Amazon Redshift, Amazon EC2, Amazon DynamoDB, +7

Carelon Global Solutions India

Data Engineer

Jul 2018 – Aug 2019 · 1 yr 1 mo · Bengaluru Area, India · On-site

▪ Designed ELT pipelines for clinical visit reports and surveys.
Built a generic, optimized pipeline for highly critical and confidential clinical survey data.
Developed HQL scripts to create external tables and analyze incoming and intermediate data for analytics applications in Hive.
Used partitioning and bucketing in Hive and designed both managed and external tables to optimize performance.
Optimized Spark jobs using techniques such as broadcasting, executor tuning, and persisting.
Wrote configurable automation scripts in Python to test the code and the migration.
Documented and shared best practices for Spark optimization within the team and organization.

Big Data, Unix, Business Intelligence (BI), Hive, Sqoop, Online Transaction Processing (OLTP), +14

Accenture

2 roles

Software Engineer

Promoted

Jan 2016 – Jul 2018 · 2 yrs 6 mos · Bengaluru Area, India

▪ Tech Modernization for State Farm Insurance (Client)
Worked on Hadoop migration project from Informatica and Oracle to Cloudera distribution.
Developed pipelines to handle data of 1.5 TB/day from ingestion to reporting layer using Hadoop, Spark, and Shell script.
Redesigned and refactored project architecture and Spark Scala ETL code, bringing down costs by 70%
Performed performance tuning in Spark, SQL, and Sqoop, resulting in a 60% response time reduction
Gained experience in stakeholder interaction, requirements gathering, data analysis, design document creation, solutions, performance tuning, and enhancements.

Big Data, Unix, Informatica, Hive, HDFS, Online Transaction Processing (OLTP), +10

Associate Software Engineer

May 2015 – Jun 2016 · 1 yr 1 mo · Bengaluru Area, India

Developed tables, views, and materialized views using SQL
Automated the ETL processes for data ingestion and transformation using Spark, Scala, and Hive.
Developed and maintained technical documentation from source to target load process.
Involved in project estimation and planning.

Unix, Hive, HDFS, Oracle Database, Computer Science, Shell Scripting, +9