Sumit Kumar

Data Engineer

India4 yrs 10 mos experience
AI ML PractitionerAI Enabled

Key Highlights

  • Led development of scalable data platforms at Cocoblu.
  • Improved data reliability with custom CDC pipelines at Gen.
  • Designed innovative solutions for real-time data challenges.
Stackforce AI infers this person is a Data Engineering expert in Fintech and Retail sectors.

Contact

Skills

Core Skills

Data EngineeringData PlatformsPythonData Science

Other Skills

AI-driven initiativesPython (Programming Language)Data ArchitectsAWSKafkaSpring BootFastAPIETL frameworksCloud data platformsMachine LearningData ArchitectureApache KafkaExtract, Transform, Load (ETL)Google Cloud Platform (GCP)Google BigQuery

About

Data Engineering Lead with 4.5+ years of experience building scalable data platforms, real-time pipelines, and analytics systems across fintech and retail domains. Currently working at Cocoblu, focusing on developing data warehouse and data lake solutions to enable large-scale business analytics, automation, and AI-driven initiatives. Previously worked at Gen Digital and PayPal, where I built CDC pipelines, event-driven microservices, and cross-cloud ETL frameworks that improved data reliability, scalability, and processing efficiency. My core interests lie in designing robust data platforms, solving complex data challenges, and leveraging AI to drive real business impact.

Experience

4 yrs 10 mos
Total Experience
2 yrs 3 mos
Average Tenure
4 mos
Current Experience

Cocoblu retail

Senior Manager – DSIT (Data Science & IT)

Feb 2026Present · 4 mos · Bengaluru, Karnataka, India

  • Developing data pipelines and scalable data platforms for retail analytics, automation, and reporting. Contributing to data engineering and AI-driven initiatives including RAG and automation use cases.
Data EngineeringData PlatformsAI-driven initiatives

Gen

Data Engineer (Gen AI & ML)

Dec 2024Jan 2026 · 1 yr 1 mo · Chennai · Hybrid

  • Worked on real-time data platforms and financial wellness initiatives on AWS.
  • Built custom Kafka Connect SMT for Debezium CDC pipelines, improving data reliability.
  • Developed Spring Boot services for transaction normalization and data validation.
  • Designed event-driven Python microservices using ECS, Kafka, and DynamoDB.
  • Built financial insights services using FastAPI with intelligent anomaly detection.
Python (Programming Language)Data ArchitectsAWSKafkaSpring BootFastAPI+2

Tata consultancy services

Data Engineer (ML)

Jul 2021Dec 2024 · 3 yrs 5 mos · Bangalore · Hybrid

  • Worked on large-scale data engineering solutions for PayPal, focusing on ETL frameworks, cloud data platforms, and analytics systems.
  • Built Python-based data migration framework across AWS, GCP, and BigQuery, reducing migration time by 20%.
  • Optimized entity resolution framework using LSH and APSS, improving linkage accuracy by 35%.
  • Developed reporting pipelines and backend integrations, reducing report generation time by 25%.
Data EngineeringPythonETL frameworksCloud data platforms

National institute of technology , patna

Data Science Research Intern

May 2020Jun 2020 · 1 mo · Patna, Bihar, India · On-site

  • Built a real-time forest fire detection system using Python, ML, and fuzzy logic to predict fire likelihood and severity.
Data SciencePython

Education

National Institute of Technology , Patna

Bachelor of Technology — Computer Science

Jan 2017Jan 2021

Jawahar Navodaya Vidyalaya - JNV

Senior Secondary — Science

Jan 2014Jan 2016

Jawahar Navodaya Vidyalaya - JNV

Higher Secondary

Jan 2009Jan 2014

Stackforce found 100+ more professionals with Data Engineering & Data Platforms

Explore similar profiles based on matching skills and experience