Gaurav .

Data Engineer

Greater Delhi, India5 yrs 6 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Achieved 80% reduction in pipeline costs.
  • Mentored over 5 junior engineers.
  • Ensured 99.5%+ SLA compliance across 200M+ records.
Stackforce AI infers this person is a Data Engineering specialist in SaaS environments, focusing on cost optimization and performance enhancement.

Contact

Skills

Core Skills

Data EngineeringEtl

Other Skills

PySparkSynapseSnowflakeAzure Data FactoryPython (Programming Language)Engineering SupportStatistical Data AnalysisMicrosoft ExcelSQL Server Integration Services (SSIS)Oracle DatabaseActive DirectoryDockerData Build Tool (DBT)PostgreSQLDatabricks Products

About

Senior Data Engineer | 3.5+ Years | Cloud Data Platform Architecture Specialist I solve expensive data problems: • Reduced pipeline infrastructure costs from $10K/month to $2K/month (80% reduction) • Decreased query execution time from 45 minutes to 15 seconds (97% improvement) • Achieved 99.5%+ SLA compliance across 200M+ daily records • Mentored 5+ junior engineers on production system design What I Build in Production: • Data pipelines processing 200M+ records/day (Snowflake, Azure, PySpark) • ETL/ELT architectures that survive production, not just pass code review • Real-time analytics platforms serving 50+ internal teams • Data quality frameworks with automated anomaly detection • Cost-optimized cloud infrastructure with 40%+ savings • Medallion architecture (Bronze→Silver→Gold) implementations using dbt Technical Foundation: Languages: Python (PySpark), SQL (query optimization specialist), dbt Platforms: Snowflake, Azure Data Factory, Databricks, Google BigQuery Architecture: Data warehouse design (Star schema, Snowflake), medallion patterns Infrastructure: Git, CI/CD (Azure DevOps), Docker, Kubernetes concepts Tools: Apache Spark, Azure Synapse, Power BI, dbt, Delta Lake Current Focus: • Building scalable data platforms at TCS • Mentoring engineers in data engineering best practices • Creating technical content (1,570 followers, 8,500+ monthly views) • Interview prep guides for ADF & Data Engineering roles Certifications: ✓ Microsoft Certified: Fabric Analytics Engineer Associate (2025) ✓ Power BI Data Analyst Associate (2025) Open to: Data Engineer / Lead Data Engineer roles Location: Gurugram, Haryana | Flexible for relocation Contact: k.gaurav653@gmail.com

Experience

5 yrs 6 mos
Total Experience
2 yrs 10 mos
Average Tenure
3 yrs 10 mos
Current Experience

Tata consultancy services

3 roles

Data Engineer

Promoted

May 2024Present · 2 yrs · On-site

  • 🔷 Designed and built 10+ end-to-end ETL pipelines using Snowflake, Azure Data Factory, and PySpark, processing 200M+ records daily from APIs, databases, and streaming sources
  • 🔷Optimized data pipelines to reduce infrastructure costs by ~80% through Spark tuning (partitioning, broadcast joins, skew handling), reducing cluster cost from $10K/month to $2K/month
  • 🔷Improved pipeline performance significantly:
  • 🔷 Reduced long-running Spark jobs from 45 minutes to seconds
  • 🔷 Optimized joins (18 min → ~90 sec) using broadcast strategies
  • 🔷 Implemented predicate pushdown to reduce data scan by 90%
  • 🔷Built a data quality framework ensuring 99.5%+ SLA compliance, including validation checks and quarantine handling for bad data
  • 🔷Led and mentored junior data engineers, guiding them on production-grade system design and best practices
PySparkSynapseData EngineeringETL

Software Engineer

Promoted

May 2023Apr 2024 · 11 mos · On-site

  • 🔷Designed and developed ETL pipelines to process financial data (Accounts Receivable, General Ledger, Accounts Payable)
  • 🔷Performed data extraction, transformation, and loading (ETL) using SQL and scripting techniques
  • 🔷Ensured data quality, validation, and consistency across multiple business systems
  • 🔷Optimized SQL queries to improve data processing performance and reporting efficiency
  • 🔷Built and maintained data models and reporting datasets for business dashboards
  • 🔷Collaborated with business teams to translate requirements into data-driven solutions
  • 🔷Monitored batch jobs and handled data pipeline failures, debugging, and issue resolution
  • 🔷Worked on Oracle EBS datasets and financial data workflows for enterprise reporting.
Data EngineeringPython (Programming Language)ETL

System Engineer

May 2022Apr 2023 · 11 mos · On-site

  • 🔷 Developed and maintained ETL pipelines for financial systems (AR, GL, AP)
  • 🔷 Worked with SQL and data processing workflows
  • 🔷 Built dashboards and reports for business insights
  • 🔷 Ensured data accuracy, validation, and consistency
Engineering SupportData Engineering

Telecrats india

Data Analyst

Sep 2020Jul 2022 · 1 yr 10 mos · Raipur, Chhattisgarh, India · Remote

  • 🔷Monitored and supported data pipelines and ETL workflows, ensuring smooth daily operations and timely data availability
  • 🔷Handled incident management and production support, including root cause analysis (RCA) and issue resolution
  • 🔷Performed data validation and quality checks to ensure accuracy and consistency across datasets
  • 🔷Assisted in troubleshooting pipeline failures, debugging SQL queries, and resolving data discrepancies
  • 🔷Worked with Azure services (Logic Apps, Service Bus, API Management, Event Hub) to support data integrations
  • 🔷Collaborated with cross-functional teams to ensure data reliability and system stability
  • 🔷Automated routine monitoring and support tasks using scripts, improving operational efficiency
  • 🔷Gained exposure to ETL processes, data workflows, and cloud-based data platforms
Statistical Data AnalysisMicrosoft Excel

Education

Amity University

Bachelor in computer application — Information Technology

Jun 2017Jun 2020

Stackforce found 100+ more professionals with Data Engineering & Etl

Explore similar profiles based on matching skills and experience