Harekrushna Mishra

Product Manager

Pune, Maharashtra, India14 yrs 2 mos experience
Highly Stable

Key Highlights

  • 14 years of experience in data engineering
  • Expertise in building scalable data pipelines
  • Led development of Audience360 CDP product
Stackforce AI infers this person is a Data Engineering expert in MarTech with extensive experience in cloud and big data technologies.

Contact

Skills

Core Skills

CloudBig DataData Quality ManagementData EngineeringEtl DevelopmentData Warehousing

Other Skills

AWSGCPDatabricksApache AirflowPySparkEMRGlueS3RedshiftLambdaAthenaIcebergPythonSQLHive

About

Learning enthusiast and hands-on technologist with over 14 years of experience designing and building scalable, fault-tolerant data pipelines across cloud and on-prem environments. As a Lead Data Engineer at Data Axle, I lead the design and development of advanced data and marketing solutions for Customer Data Platform (CDP) clients. My work focuses on large-scale data infrastructure, advanced analytics, and fully automated data processing pipelines. I’ve architected data lake solutions on AWS and Databricks, built robust feature engineering pipelines, and developed comprehensive data quality frameworks to ensure reliability and scalability. Currently, I lead a team responsible for building **Audience360**—a dynamic, fully automated CDP product that processes customer data at scale for marketing campaign management and real-time dashboarding. I specialize in modern data engineering using AWS, GCP, Apache Spark, and Databricks, with strong expertise in both batch and near real-time processing. I’ve delivered end-to-end data platforms that are optimized, cost-effective, and closely aligned with evolving business goals. Core Skills: - Cloud: AWS , GCP - Big Data: Apache Spark, Spark Streaming, Databricks, Hive - Orchestration: Apache Airflow - Programming: Python, Advanced SQL, Data Structures & Algorithms - DevOps: GitHub, CI/CD - Other: Data Modeling, Performance Optimization -GenAI tools - RAG, LLM, Vector DB

Experience

14 yrs 2 mos
Total Experience
2 yrs 4 mos
Average Tenure
2 yrs 4 mos
Current Experience

Data axle

Tech Lead Data engineer

Feb 2024Present · 2 yrs 4 mos

  • Building enterprise Customer Data Platform processing 50M+ customer records daily, enabling marketing campaigns for both offline and digital channels.
  • Migrated AWS, GCP compute and analytics layers to Databricks creating cloud-agnostic architecture. Redesigned EMR pipeline architecture reducing processing time from 5 hours to 45 minutes which is 87% improvement.
  • Enhanced pipeline performance by implementing Z-ordering data layout optimization and binpack file compaction strategies on Apache Iceberg tables, reducing query execution time by 65% and storage overhead by 30%.
  • Migrated PySpark jobs from GCP BigQuery to Dataproc, achieving $60,000 annual cost savings.
  • Technologies: AWS, EMR, Glue, S3, Redshift, Lambda, Athena, GCP, Dataproc, Bigquery, PySpark, Databricks, Iceberg, Airflow
AWSGCPDatabricksApache AirflowPySparkEMR+8

Citi

Lead data engineer

Mar 2021Jan 2024 · 2 yrs 10 mos

  • Worked on Data Quality Management framework with an objective to validate FRB business rules over source contract datasets. It is a generic reusable framework built on PySpark to generates data for 20+ different product’s report results and summary.
  • Engineered performance optimization for critical PySpark data pipeline, reducing processing time from 8 hours to 1.5 hours (81% improvement), accelerating business reporting cycles and improving operational efficiency for downstream analytics teams.
  • Technologies: Python, SQL, Pyspark, Hive, Pandas, Unix, AWS, S3, EMR, Athena
PythonSQLPysparkHivePandasUnix+6

Barclays

Senior Data Engineer

Nov 2017Mar 2021 · 3 yrs 4 mos · Pune, Maharashtra, India

  • Worked on Safire Daily Viewer which Consolidates business source data and top side adjusted data related to accounts, products, bankers between the automated source data mart and the manual adjustments through UI.
  • Crafted a Big Data based solution using Hadoop Ecosystems(HDFS,YARN), Spark and Python for organized structure data. Integrated hive tables with QlikView , designed and developed data models and backend queries for presenting data
  • Technologies: Python, SQL, Spark, Hive, QlikView, Data Modeling
PythonSQLSparkHiveQlikViewData Modeling+2

Cognizant

Associate Project

Sep 2015Nov 2017 · 2 yrs 2 mos

  • Worked on GRU ETL Build is an UBS in house reconciliation tool of various GLOBAL,SWISS application’s report files.
  • The GRU Build team analyzed report files through the ETL process. ETL build played an important role in data validation, building mappings in Informatica PowerCenter by applying the transformations necessary as per the specification and generating workflows.
  • Technologies: SQL, Informatica, Unix, Dimensional Data Modeling
SQLInformaticaUnixDimensional Data ModelingETL Development

Tech mahindra

informatica developer

Nov 2014Aug 2015 · 9 mos

  • Developed the data warehouse for HR and QMG process.
  • This project deals with Collecting, modeling, and Loading Personal details of employees and maintaining the history of Promotions, project details through ETL operations to load data into Fact and Dimensions of Data Warehouse. Maintained the history through SCD type 2.
  • Technologies: SQL, Informatica, Dimensional Data Modeling
SQLInformaticaDimensional Data ModelingData Warehousing

Syntel

Software Developer

Feb 2012Nov 2014 · 2 yrs 9 mos

  • The primary objective of this project is to capture the records of employees based on 4 conditions. Tracking and updating the user accounts of users who are no longer with Amex, transferring within GMS users transferring out of GMS and updating new hires.
  • Implemented new ETL Integrations with better development practice and gained more efficiency.
  • Technologies: SQL, Informatica
SQLInformatica

Education

Biju Patnaik University of Technology, Odisha

Bachelor of Technology (B.Tech.) — electronics and instrumentation engineering

Jan 2007Jan 2011

E.Co.railway mixed higher secondary school

Stackforce found 100+ more professionals with Cloud & Big Data

Explore similar profiles based on matching skills and experience