Himanshu Sagar

Data Engineer

Bengaluru, Karnataka, India8 yrs 8 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in big data technologies and data lakehouse architecture.
  • Proven track record in leading complex data migration initiatives.
  • Strong background in Java and Scala for data processing.
Stackforce AI infers this person is a Big Data Engineer with expertise in cloud-based data architectures and regulatory technology.

Contact

Skills

Core Skills

Data MigrationData Lakehouse ArchitectureData Pipeline EngineeringData IntegrationCloud ArchitectureData Processing

Other Skills

Big DataApache SparkJavaOracle CloudHadoopSparkScalaCApache IcebergTrinoArtificial Intelligence (AI)GitOCIAzure FunctionsProduct Innovation

About

Experienced in big data technologies such as sqoop, hive, apache spark . Excellent technical skills with a strong background in Java Programming Language and Scala. Highly motivated to learn new technologies while developing innovative solutions for real life problems. Have demonstrated an ability to work well under pressure. Effective team player looking forward to contribute significantly. Enthusiastic big data developer and eager to enhance critical problems solving ability to solve the real world big data problem.

Experience

8 yrs 8 mos
Total Experience
2 yrs
Average Tenure
5 mos
Current Experience

Regnology

Senior Data Engineer

Jan 2026Present · 5 mos · Remote

  • Driving the modernization of enterprise data infrastructure within the regulatory technology sector. Responsible for leading complex data migration initiatives, transitioning legacy relational databases into a high-performance, scalable Data Lakehouse architecture to support massive regulatory reporting volumes.
  • Key Responsibilities & Impact:
  • Enterprise Data Migration: Leading the end-to-end migration of legacy RDBMS (Oracle) workloads to a modern Data Lakehouse environment, ensuring zero data loss, exact traceability, and strict adherence to regulatory compliance standards.
  • Pipeline Engineering: Designing, developing, and deploying highly scalable ETL/ELT data pipelines using Apache Spark and Java to reliably process and transform complex, large-scale financial datasets.
  • Lakehouse Architecture: Implementing Apache Iceberg as the core open table format to enable ACID transactions, time travel, and safe schema evolution on distributed storage, drastically improving data reliability.
  • High-Performance Analytics: Integrating Trino as the distributed SQL query engine to federate queries across the data lake, optimizing read performance and significantly reducing query latency for downstream reporting consumers.
  • Data Modeling & Optimization: Collaborating with stakeholders to design scalable data models, tune complex Spark jobs for memory and compute efficiency, and enforce rigorous data quality checks throughout the migration lifecycle.
Big DataApache SparkData MigrationData Lakehouse Architecture

Oracle

Senior Member of Technical Staff

May 2024Dec 2025 · 1 yr 7 mos · Bengaluru · Hybrid

  • Senior Member of Technical Staff at Oracle with extensive experience in big data technologies such as
  • Apache Spark, Java, Iceberg, Delta Lake, Azure, and Oracle Cloud. Designed and developed a scalable
  • SDK enabling seamless data transfer from Autonomous Data Warehouse (ADW) to various sinks,
  • including Azure IDL. Proven ability to build efficient, high-performance data pipelines and integration
  • solutions, contributing to improved data accessibility and system interoperability across cloud platforms.
  • Skilled in optimizing large-scale data workflows and cloud-based architecture.
  • Roles & Responsiblities:
  • Handling DataShare SDK as a primary POC
  • Spearheaded the design and implementation of various sinks such as Iceberg, Delta, Azure, IDL.
  • Designed and lead the reverse ETL project for NetSuite.
  • Scaled SDK to handle data load more than 1 billion rows
  • Lead the design and implementation of integration of OCI gen AI into our DataShare SDK for better
  • use experience.
Apache SparkJavaData IntegrationCloud Architecture

Apple

2 roles

Software Engineer

Sep 2021May 2024 · 2 yrs 8 mos

  • Apple maps via thoughtgenesis
HadoopApache Spark

Senior Data Engineer

Sep 2021May 2024 · 2 yrs 8 mos

  • Worked and Contributed on apple maps by Apple.I had been a part of Rx team for over 2 years and contributing immensely to the apple maps while working on different teams such as drive coding, WILC and territories. Wrote and delivered feature requests and cleanup jobs to make product more efficient and accurate.
  • Roles & Responsiblities:
  • Handling Territory and Water component as a primary POC.
  • Spearheaded the design and implementation of Spark Scala jobs for processing and analyzing geospatial data from external vendors.
  • Develop and implement data quality checks to ensure data accuracy, completeness, and consistency.
  • Implemented new features and enhancements, contributing to the continuous improvement of Apple Maps functionality and user experience. (features such as Adding population in territory, cleanup jobs, speed updater etc.)
  • Collaborated with data scientists, analysts, and other stakeholders to understand business requirements and provide technical solutions aligned with Apple Maps goals.
  • Tools Used: Spark, Scala, Hadoop, Linux, AWS

Cisco

Software Engineer

Aug 2018Oct 2021 · 3 yrs 2 mos · Bengaluru Area, India

CHadoop

Indian institute of technology, roorkee

Teaching Assistant

Jul 2017Jun 2018 · 11 mos · Roorkee, Uttarakhand, India · On-site

Education

Indian Institute of Technology, Roorkee

Master of Technology — Computer Science

Jan 2016Jan 2018

Shri Govindram Seksaria Institute of Technology & Science, 23,Park Road, Indore

Bachelor of Engineering - BE — Information Technology

Jan 2012Jan 2016

Stackforce found 100+ more professionals with Data Migration & Data Lakehouse Architecture

Explore similar profiles based on matching skills and experience

Himanshu Sagar - Data Engineer | Stackforce