Hemraj Singh

Data Engineer

Bengaluru, Karnataka, India10 yrs 9 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Led a team to transform pricing data platform.
  • Accelerated development velocity by 40% using AI strategies.
  • Mentored engineers and drove strategic planning.
Stackforce AI infers this person is a Data Engineering expert in E-commerce with strong cloud and analytics capabilities.

Contact

Skills

Core Skills

Data EngineeringCloud ComputingEtl DevelopmentBusiness Intelligence

Other Skills

Apache AirflowInfrastructure as code (IaC)GenAIMedallion ArchitectureApache SparkETLData AnalysisSQLShell ScriptingData VisualizationQuicksightPythonInformaticaTechnical Project LeadershipContinuous Integration and Continuous Delivery (CI/CD)

About

Technical Lead with 11 years of experience in high-scale data engineering at Amazon, currently serving as the Site Lead for a team of 6 Data Engineers. Specialized in analyzing business needs, driving AI-augmented development cycles and CDK-based platform strategies that have accelerated development velocity by 40%. Proven expertise in mentoring engineers, engineering management, strategic planning, data engineering, data modeling and big data analytics. Specialties: Domain Knowledge: Data Modeling, Data Warehousing, Data Lake, Data Lakehouse, Analytics, Medallion Architecture, Data Governance & Security,Big Data, Apache Hadoop, Apache Spark, Distributed Computing Cloud: AWS, Infrastructure as Code (IaC), S3, Lambda, Kinesis Stream, Athena, Glue, EMR, EC2, Quicksight Languages: SQL, Python, SparkSQL, PySpark, Pandas, Numpy, Shell Scripting, PL/SQL Database: Amazon Redshift, Oracle, DynamoDB, MySQL, PostgreSQL Business Skills: Engineering Strategy, Team management, Mentoring, Cost Optimization, Sprint Planning, Reporting, Agentic AI Development

Experience

10 yrs 9 mos
Total Experience
5 yrs 4 mos
Average Tenure
6 yrs 9 mos
Current Experience

Amazon

3 roles

Senior Data Engineer

Promoted

Apr 2024Present · 2 yrs 1 mo · On-site

  • Leading data engineering team to transform pricing data platform into GenAI enabled lake warehouse with medallion architecture.
Apache AirflowInfrastructure as code (IaC)Data EngineeringCloud Computing

Data Engineer II

Jan 2022Jun 2024 · 2 yrs 5 mos · On-site

  • Orchestrated the design of 6 core pricing ETL pipelines and a Unified Price Competitiveness platform processing 2.6 TB of daily data ; provided business leaders with a 360-degree view of retail and 3P price health, enabling root-cause analysis for un-competitive customer experience of 0.8% glance views.
  • Designed Real-Time Data pipeline with sub-minute latency for flagged prices to empower bulk auditing framework that audited 24.6K flagged offers per month effectively saving 28 person worth of bandwidth.
  • Designed ”Sandbox Table Clean-up” utility to drop ad-hoc tables created 90 days ago if there is no exception raised by users post weekly notification. This automated utility helped us save 598 TB(11% of total disk space) annually.
  • Introduced 1-click ML model training using ML Ops to prevent low-price errors flowing to the Amazon Website.
Apache SparkData EngineeringETL Development

Business Intelligence Engineer II

Jul 2019Dec 2021 · 2 yrs 5 mos · On-site

  • Extracted key business insights metrics using Data Science and Machine Learning methodologies (Numpy, Pandas, Scikit-learn, Seaborn etc).
  • Created multiple dashboards using Quicksight, BI reporting tool to showcase metrics across Home Page, Navigation, Search and Personalization experience.
  • Created 60+ ETL data pipeline to extract data from disparate sources (S3, Redshift, CSV, Parquet, Andes, Excel) to transform and load into Data Mart/Data-warehouse.
  • Scripting Language - Python.
  • Working experience of AWS technologies (AWS SageMaker, S3, Amazon Redshift, RDS ,DynamoDB, AWS Glue, Lambda, Quicksight, Andes).
Data EngineeringData VisualizationBusiness Intelligence

Amdocs

Data Engineer

Jun 2015Jun 2019 · 4 yrs · Pune Area, India

  • Amdocs
  • SingTel-Optus-ODS | ETL Developer | Jan'2017-Jun'2019
  •  Development, maintenance of mapping, workflows & sessions using Informatica power center (Lookup, Joiner, Aggregator, Router, Rank, Update Strategy, XML Generator) to transform raw data from disparate sources (files, RDBMS, XML’s).
  •  Developed critical “Sales Commissioning” Extracts on Ordering Management system using SQL.
  •  Experience in Agile Methodology.
  •  Well versed in Function, Stored Procedures, Cursors, Collection and Dynamic SQL to load large and complex datasets. Ex: Usage, billing.
  •  Developed dashboards and reports using IBM Cognos Analytics 11, BI tool.
  •  Created Unix/shell scripts for maintaining code repository & audit control.
  •  Knowledge of Hadoop Ecosystem (Hive, HDFS, Kafka, Presto), Tableau, Excel, NoSQL database (HBase, MongoDB).
  • Amdocs
  • TEF Chile-ODS | ETL Developer | Aug'2016-Jan'2017
  •  Developed Informatica mappings and scheduled them to extract data from live application (CRM, ABP, OMS & TC), transform and load into ODS for operational reporting.
  •  Implemented SCD1, SCD3 using Informatica power center for customer address.
  •  Job optimization using partitioning, indexing, materialized views including schema creation.
  •  Created detail design document and implemented Data Mart for OMS system.
  •  Took complete ownership and lead the successful SIT/UAT completion until production deployment/support for the complete drop.
  •  Experience of working on defect management tool- HP-ALM quality Center 11.
  •  Experienced in Control-M 8, a batch scheduling and automation tool.
  • Amdocs
  • TEF Peru-ODS | ETL Developer | Jun'2015-Aug'2016
  •  Developed and implemented IBM Infosphere DataStage jobs.
  •  Worked on source connection, target connection and workflow creation through IBM Change Data Capture (CDC) for continuous data replication.
  •  Deep knowledge of Data Warehousing concepts, Data mining, Dimensional modeling, star & snowflake schema, and partitioning.
  •  Worked on SVN (source control tool).
Data EngineeringData VisualizationETL Development

Education

BIET Jhansi

Bachelor of Technology (B.Tech.) — Information Technology

Jan 2011Jan 2015

Stackforce found 100+ more professionals with Data Engineering & Cloud Computing

Explore similar profiles based on matching skills and experience