Ajay Dubey

Product Manager

Delhi, Delhi, India17 yrs 1 mo experience
Most Likely To SwitchHighly Stable

Key Highlights

  • 15+ years of experience in data and AI platforms.
  • Expert in architecting scalable data solutions.
  • Proven leadership in managing high-performing engineering teams.
Stackforce AI infers this person is a Data Engineering expert with a focus on scalable cloud solutions.

Contact

Skills

Core Skills

Data EngineeringProject DeliveryBig DataCloud Computing

Other Skills

AWSAgentic AIAirflowApache IcebergApache SparkAzureC#CDHCosmos DBData GovernanceData PipelineData QualityDatadogDistributed SystemsDocker

About

With 15+ years of hands-on experience, I have spearheaded the design, architecture, and development of cutting-edge data & AI platforms for both batch and real-time needs. I have consistently led initiatives that modernize data ecosystems and drive business-critical insights at scale. 💡 Currently, I oversee multiple high-performing engineering teams responsible for building scalable & Intelligent data & AI Platforms,Enabling capabilities like Iceberg migration, data governance, data quality, and observability frameworks. I champion best practices in system design and delivery across distributed environments. Key Achievements: • Architected and developed a configuration-driven, real-time processing platform to handle high-throughput batch and streaming workloads. • Building Multi Agent based System to improve Efficiency of system and reducing TAT • Architected and developed a flexible in-house OLAP engine, drawing inspiration from open source OLAP cubes to support analytical use cases. • Architected a configuration-driven, real-time fraud management and Anomaly Detection system • Led globally distributed engineering teams and ensured timely, high-quality product releases through disciplined SDLC execution. Technical Skills: • Languages: Java, Scala, C#, Python • Big Data Ecosystems: Apache Spark, Flink, Hadoop HDFS, Presto, MapReduce, Hive, Sqoop, Zookeeper, Hue, Kudu, Yarn • AI Landscape: Gen AI, Agentic AI, Multi Agent, RAG based systems • NoSQL Databases: MongoDB, DocumentDB, Redis, Cosmos DB • Databases: SQL Server, Sybase, PostgreSQL • In-Memory Databases: H2 • Streaming Technologies: Kafka, Spark Streaming, Flink • Query Engines: Hive, Trino, Waggledance, Presto • Distributions: CDH, Open Source, EMR • Message Queues: Kafka, RabbitMQ, ZeroMQ • Microservices: Spring Boot, Web API, gRPC, REST API, Docker, Kubernetes • Cloud Providers: Azure, AWS • Logging/Metrics/Observability: Splunk, Datadog, Elasticsearch • Orchestration: Oozie, Airflow • Reporting Tools: Power BI, Tableau • Data Pipelining: NiFi • Containers: Docker, Kubernetes • Miscellaneous: Performance Improvements, System Design, Distributed Systems, Highly Scalable Systems • CI/CD: Jenkins, Spinnaker,GitHub Actions Motivated by building systems that are not just robust and scalable, but also elegant in design. I excel in dynamic, fast-paced environments where my leadership and technical depth foster innovation and deliver measurable business outcomes. I continuously strive for operational excellence and embrace a mindset of learning, mentorship, and strategic impact.

Experience

17 yrs 1 mo
Total Experience
2 yrs 1 mo
Average Tenure
3 yrs 6 mos
Current Experience

Expedia group

Sr Manager/Architect Data Engineering

Dec 2022 – Present · 3 yrs 6 mos · Gurugram, Haryana, India

Apache IcebergProject DeliveryStakeholder ManagementData QualityData GovernanceTrino+1

Airtel

Sr Technical Lead Big data

May 2021 – Dec 2022 · 1 yr 7 mos

Apache IcebergProject DeliveryStakeholder ManagementData QualityData GovernanceTrino+1

Macquarie group

Manager Big data

Oct 2017 – May 2021 · 3 yrs 7 mos · Gurgaon, India

  • Big Data and Cloud Computing SME within the Data Analytics Team.
  • In this role, I was responsible for -
  • Implementing PoCs for Big Data.
  • Contribution to Big Data and application development Knowledge Base.
  • Expanding the team size.
  • Creating the architecture for data lake solution on AWS Cloud (using services like Glue, Data pipeline, Lambda, Kinesis, EMR, RedShift, Dynamo DB and S3, etc.).
  • Designing and implementing an custom ingestion framework based on Spark
Big DataCloud Computing

Publicis sapient

Senior Associate Big Data

Aug 2016 – Oct 2017 · 1 yr 2 mos · Gurgaon, Haryana, India

  • Involved in gathering business requirement, performing data mapping for different user stories in agile mode.
  • Involved in automation of routine test cases resulting in cost reduction and increased efficiency.
  • Implemented Bigdata solutions using Scala, Hive, Spark in IntelliJ IDE.
  • Worked on Hive partition and bucketing concept and created external and internal tables.
  • Solved multiple performance issues/optimizations in Hive and Spark.
  • Led team of 4 associates.
Big DataData Engineering

Unitedhealth group

Data Engineer

Oct 2014 – Jul 2016 · 1 yr 9 mos · Gurgaon, India

  • Responsibilities includes design and implementation of various Big Data platform
  • components like (Batch Processing, Live Stream Processing) .Continuous focus on
  • Scaling, Fault Tolerance, Performance and Availability of System.
  • Design and Implemented Data Access Layer, which can connect to various data
  • sources and uses advanced caching techniques to provide fast responses to
  • real time SQL queries using Big Data Technologies
Big DataData Engineering

Dxc technology

Software Engineer

Aug 2011 – Sep 2014 · 3 yrs 1 mo

Team computers

Software Developer

May 2010 – Sep 2011 · 1 yr 4 mos · Delhi Area, India

Softcreations

Software Engineer

Mar 2009 – May 2010 · 1 yr 2 mos

Education

Netaji Subhas University of Technology, East Campus

B.tech. — Electronice & communication

Jan 2004 – Jan 2008

Kala niketan sr secondary

Bachelor of Technology (B.Tech.)

Jan 2001 – Jan 2003

Stackforce found 100+ more professionals with Data Engineering & Project Delivery

Explore similar profiles based on matching skills and experience