Himanshu Kumar — Data Engineer
Results-driven Data Engineer with 6+ years of experience building and maintaining large-scale ETL pipelines, cloud data migrations, and big data processing. Hands-on expertise with Azure Blob Storage, AWS Glue, Lambda, Step Functions, Redshift, S3, and Databricks. Led a 74+ PB data migration project for a Microsoft client, moving billions of files from Huawei Cloud to Azure using AzCopy. Developed automated file-level validation using PySpark on Databricks to ensure data and metadata consistency at scale. Proficient in PySpark, Python, SQL, and shell scripting for data transformation, automation, and performance tuning. Experienced with Kafka, Hive, HBase, and orchestration tools such as Airflow and IICS. Built serverless ETL frameworks and high-throughput pipelines processing 10+ TB of data daily. Strong foundation in data warehousing, data quality, and distributed computing. Adept at delivering reliable, scalable, and cost-effective data solutions aligned with business needs.
Stackforce AI infers this person is a Data Engineer specializing in cloud data solutions and big data processing.
Location: Bengaluru, Karnataka, India
Experience: 7 yrs 2 mos
Skills
- Data Engineering
- ETL Pipelines
- Data Migration
- Data Validation
Career Highlights
- Over 6 years of data engineering experience.
- Expert in building scalable ETL pipelines.
- Led a major cloud data migration project.
Work Experience
Cognizant
Associate (3 yrs 10 mos)
Lumen Technologies
Software Engineer (3 yrs 4 mos)
Education
Bachelor of Engineering (BE), J N N College of Engineering, Shimoga