Himanshu Sagar

Data Engineer

Bengaluru, Karnataka, India8 yrs 8 mos experience

AI EnabledAI ML Practitioner

Key Highlights

Expert in big data technologies and data lakehouse architecture.
Proven track record in leading complex data migration initiatives.
Strong background in Java and Scala for data processing.

Stackforce AI infers this person is a Big Data Engineer with expertise in cloud-based data architectures and regulatory technology.

Contact

Skills

Core Skills

Data MigrationData Lakehouse ArchitectureData Pipeline EngineeringData IntegrationCloud ArchitectureData Processing

Other Skills

Big DataApache SparkJavaOracle CloudHadoopSparkScalaCApache IcebergTrinoArtificial Intelligence (AI)GitOCIAzure FunctionsProduct Innovation

About

Experienced in big data technologies such as sqoop, hive, apache spark . Excellent technical skills with a strong background in Java Programming Language and Scala. Highly motivated to learn new technologies while developing innovative solutions for real life problems. Have demonstrated an ability to work well under pressure. Effective team player looking forward to contribute significantly. Enthusiastic big data developer and eager to enhance critical problems solving ability to solve the real world big data problem.

Experience

8 yrs 8 mos

Total Experience

2 yrs

Average Tenure

5 mos

Current Experience

Regnology

Senior Data Engineer

Jan 2026 – Present · 5 mos · Remote

Driving the modernization of enterprise data infrastructure within the regulatory technology sector. Responsible for leading complex data migration initiatives, transitioning legacy relational databases into a high-performance, scalable Data Lakehouse architecture to support massive regulatory reporting volumes.
Key Responsibilities & Impact:
Enterprise Data Migration: Leading the end-to-end migration of legacy RDBMS (Oracle) workloads to a modern Data Lakehouse environment, ensuring zero data loss, exact traceability, and strict adherence to regulatory compliance standards.
Pipeline Engineering: Designing, developing, and deploying highly scalable ETL/ELT data pipelines using Apache Spark and Java to reliably process and transform complex, large-scale financial datasets.
Lakehouse Architecture: Implementing Apache Iceberg as the core open table format to enable ACID transactions, time travel, and safe schema evolution on distributed storage, drastically improving data reliability.
High-Performance Analytics: Integrating Trino as the distributed SQL query engine to federate queries across the data lake, optimizing read performance and significantly reducing query latency for downstream reporting consumers.
Data Modeling & Optimization: Collaborating with stakeholders to design scalable data models, tune complex Spark jobs for memory and compute efficiency, and enforce rigorous data quality checks throughout the migration lifecycle.

Big DataApache SparkData MigrationData Lakehouse Architecture

Oracle

Senior Member of Technical Staff

May 2024 – Dec 2025 · 1 yr 7 mos · Bengaluru · Hybrid

Senior Member of Technical Staff at Oracle with extensive experience in big data technologies such as
Apache Spark, Java, Iceberg, Delta Lake, Azure, and Oracle Cloud. Designed and developed a scalable
SDK enabling seamless data transfer from Autonomous Data Warehouse (ADW) to various sinks,
including Azure IDL. Proven ability to build efficient, high-performance data pipelines and integration
solutions, contributing to improved data accessibility and system interoperability across cloud platforms.
Skilled in optimizing large-scale data workflows and cloud-based architecture.
Roles & Responsiblities:
Handling DataShare SDK as a primary POC
Spearheaded the design and implementation of various sinks such as Iceberg, Delta, Azure, IDL.
Designed and lead the reverse ETL project for NetSuite.
Scaled SDK to handle data load more than 1 billion rows
Lead the design and implementation of integration of OCI gen AI into our DataShare SDK for better
use experience.

Apache SparkJavaData IntegrationCloud Architecture

Apple

2 roles

Software Engineer

Sep 2021 – May 2024 · 2 yrs 8 mos

Apple maps via thoughtgenesis

HadoopApache Spark

Senior Data Engineer

Sep 2021 – May 2024 · 2 yrs 8 mos

Worked and Contributed on apple maps by Apple.I had been a part of Rx team for over 2 years and contributing immensely to the apple maps while working on different teams such as drive coding, WILC and territories. Wrote and delivered feature requests and cleanup jobs to make product more efficient and accurate.
Roles & Responsiblities:
Handling Territory and Water component as a primary POC.
Spearheaded the design and implementation of Spark Scala jobs for processing and analyzing geospatial data from external vendors.
Develop and implement data quality checks to ensure data accuracy, completeness, and consistency.
Implemented new features and enhancements, contributing to the continuous improvement of Apple Maps functionality and user experience. (features such as Adding population in territory, cleanup jobs, speed updater etc.)
Collaborated with data scientists, analysts, and other stakeholders to understand business requirements and provide technical solutions aligned with Apple Maps goals.
Tools Used: Spark, Scala, Hadoop, Linux, AWS