Mohit Manna

Data Engineer

Gurugram, Haryana, India5 yrs 5 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Expert in building real-time data pipelines using Spark and Kafka.
  • Proficient in integrating complex data systems for customer engagement.
  • Strong background in data engineering with Azure and Databricks.
Stackforce AI infers this person is a Data Engineering expert in SaaS environments, focusing on real-time data integration and analytics.

Contact

Skills

Core Skills

PythonSparkSqlDbtDatabricksAirflowData EngineeringBig Data

Other Skills

Python (Programming Language)Java Database Connectivity (JDBC)DatabasesSpark-StreamingKafkaAzure Data FactoryAzure Synapse AnalyticsApache AirflowData LakeMicrosoft AzureArgoPowerPointProgrammingMachine LearningData Mining

About

Data Engineer at To The New

Experience

5 yrs 5 mos
Total Experience
2 yrs 8 mos
Average Tenure
2 yrs 9 mos
Current Experience

Expedia group

Data Engineer III (Contract)

Aug 2025Present · 9 mos · Gurugram, Haryana, India · Hybrid

  • Payroll: EPAM
  • FDS

Epam systems

Data Engineer

Aug 2023Present · 2 yrs 9 mos · Gurugram, Haryana, India · Remote

Python (Programming Language)SparkPython

Atlassian

Data Engineer

Aug 2023Jun 2025 · 1 yr 10 mos · Gurugram, Haryana, India · Remote

  • Payroll Organization : EPAM Systems
  • CSS Data Engineering
  • Creating Jobs in Databricks and Airflow to provide support for BI and Data Science teams
SQLDBT

To the new

2 roles

Data Engineer

Dec 2020Aug 2023 · 2 yrs 8 mos

  • Integrating transactional data, MDM data and customer data to create KPIs to track shopping behavior of customers and pushing it into Braze for customer engagement
  • Integrating receipt scanning data with existing KPIs
  • Building real-time pipelines using Spark-Streaming and Kafka to to reduce latency
  • Loading data into Oneview system from data warehouse
  • Created auditing and alert framework in Airflow to track performance of KPI jobs and dags at each level of processing. With dashboard in Redash
  • Optimizing data pipelines to reduce run time and save data point costing on Braze API
  • End-to-end data solution using Airflow and Spark to ingest data in Vertica Data Warehouse from multiple data sources such as APIs, BigQuery, S3, Relational databases and SFTP etc.
Java Database Connectivity (JDBC)DatabasesData EngineeringBig Data

Trainee

Feb 2020Nov 2020 · 9 mos

  • Create end to end robust data pipeline in Airflow to do data ingestion via Azure Data Factory, Azure Synapse Analytics and Data Quality rules written using Deequ library and analysis in Azure HD Insight for clickstream data
Python (Programming Language)DatabricksData Engineering

Education

Centre for Development of Advanced Computing (C-DAC)

Master of Computer Applications - MCA — Computer Science

Jan 2017Jan 2020

Aryabhatta Knowledge University, Patna

BCA

Jan 2014Jan 2017

Stackforce found 100+ more professionals with Python & Spark

Explore similar profiles based on matching skills and experience