Himanshu Tripathi

Backend Engineer

Pune, Maharashtra, India6 yrs 4 mos experience

Highly Stable

Key Highlights

Expert in building scalable data pipelines on Azure.
Proven track record in optimizing ETL workflows.
Strong collaboration with business teams for data solutions.

Stackforce AI infers this person is a Data Engineer specializing in SaaS data solutions.

Contact

Skills

Core Skills

Data EngineeringEtlData IntegrationData Automation

Other Skills

ADLS Gen2Azure Data FactoryAzure DevOpsAzure SynapseData StructuresDatabricksEnglishHindiHive SQLHubSpotInformaticaNeo4jOraclePySparkPython

About

I’m a Data Engineer specializing in building scalable, high-performance data pipelines on Microsoft Azure. With hands-on experience across Azure Databricks, Data Factory, Synapse, and ADLS, I transform raw enterprise data into clean, analytics-ready datasets that drive business decisions. Over the past few years, I’ve worked on optimizing complex ETL workflows, integrating multi-system data sources (SAP, HubSpot, Oracle, PostgreSQL), and automating end-to-end data operations using CI/CD and Terraform. My focus is always on efficiency, reliability, and clarity—turning technical complexity into simple, trusted data systems. I’m passionate about designing systems that don’t just “work” but perform at scale — with reduced latency, improved reliability, and measurable business impact.

Experience

6 yrs 4 mos

Total Experience

3 yrs 9 mos

Average Tenure

2 yrs 6 mos

Current Experience

Atlas copco

Data Engineer

Dec 2023 – Present · 2 yrs 6 mos · India · Hybrid

Designing and optimizing large-scale data pipelines on Microsoft Azure to deliver clean, reliable, and high-performance data for analytics and reporting.
Key Responsibilities & Impact:
Built and maintained end-to-end ETL pipelines using Azure Data Factory, Databricks (PySpark), and Synapse Analytics.
Optimized performance through partitioning, caching, and parallel processing, reducing runtime and improving reliability.
Developed data models (fact & dimension tables) and implemented SCD Type 2 for historical data accuracy.
Integrated and transformed data from SAP, HubSpot, Oracle, and SQL Server into a unified data layer.
Automated error handling, monitoring, and CI/CD using ADF triggers, Terraform, and Azure DevOps.
Processed large datasets in Databricks to enable business insights on finance, sales, and inventory.
Partnered with business teams to translate requirements into scalable, production-ready data solutions.
✅ Tech Stack:
Azure Data Factory · Databricks · PySpark · SQL · Azure Synapse · ADLS Gen2 · Terraform · Azure DevOps · Python · Hive SQL

Azure Data FactoryDatabricksPySparkSQLAzure SynapseADLS Gen2+6

Capgemini

2 roles

Associate Consultant

Promoted

Jul 2022 – Dec 2023 · 1 yr 5 mos · India

Designing and optimizing Azure-based data solutions to deliver scalable, reliable, and analytics-ready data for enterprise decision-making.
Key Contributions & Impact:
Designed and maintained end-to-end ETL pipelines using Azure Data Factory, Databricks (PySpark), and Synapse Analytics.
Leveraged Databricks, PySpark, and HiveSQL to process and analyze large-scale sales and operational data from Oracle, SAP, and SQL Server.
Utilized Azure Data Lake Storage Gen2 for scalable, secure data storage and improved downstream analytics performance.
Built data models and transformations that enabled accurate business estimations, revenue forecasting, and performance insights.
Developed and automated KPI tracking frameworks using ADF and Databricks to support business dashboards and reporting.
Implemented Terraform-based infrastructure automation and CI/CD pipelines via Azure DevOps for reliable deployments.
Built real-time JIRA API extractors for dynamic operational dashboarding and improved project visibility.
Collaborated with cross-functional teams to translate complex business logic into scalable, production-grade data pipelines.
✅ Tech Stack:
Azure Data Factory · Azure Databricks · PySpark · SQL · Azure Synapse · ADLS Gen2 · Terraform · Azure DevOps · Python · Hive SQL · Talend · Tableau