Manu Jain

Data Engineer

San Francisco, California, United States4 yrs 5 mos experience
AI ML PractitionerHighly Stable

Key Highlights

  • 3+ years of experience in Data Engineering and Machine Learning.
  • Proven expertise in deploying large-scale machine learning models.
  • Strong background in Azure Cloud and Databricks technologies.
Stackforce AI infers this person is a Data Engineer with a strong focus on Machine Learning and Big Data technologies.

Contact

Skills

Core Skills

Machine LearningData EngineeringData ScienceNetworkingData Center Management

Other Skills

PythonSQLPySparkDatabricksAzure MLTensorFlowScikit-learnMLOpsAPI DevelopmentCI/CDAzure DatabricksAzure SynapseData FactoryU-SQLAzure Cloud

About

Data Engineer and Machine Learning Enthusiast with 3+ years of experience deploying large-scale machine learning models and designing ETL pipelines. Proven expertise in PySpark, SQL, Databricks, and Azure Cloud, driving business impact in gaming, marketing, and consumer analytics. As a graduate student at San Jose State University specializing in Data Science, I aim to bridge the gap between engineering and AI-driven decision-making. ✅ Graduating in May 2026 and Open to Full Time opportunities in Data Engineering, Big Data, AI/ML domains and Software Engineering Domains.

Experience

Tata consultancy services

2 roles

Data Scientist | Machine Learning Engineer | Software Engineer (Microsoft - Team Xbox via TCS)

Promoted

Oct 2022Aug 2024 · 1 yr 10 mos

  • 💡 Tech Stack: Python, SQL, PySpark, Databricks, Azure ML, TensorFlow, Scikit-learn, MLOps, API Development, CI/CD
  • Developed & deployed 10+ machine learning models in Azure Databricks (Python, PySpark), increasing customer behavior prediction accuracy by 15%.
  • Built scalable ML pipelines & data preprocessing frameworks using Azure Synapse, Data Factory, and PySpark, reducing model training time by 40%.
  • Designed & implemented REST APIs to integrate ML models into production systems, ensuring real-time inferencing and automated model deployment.
  • Optimized ML model scoring workflows, reducing scoring time by 48% and cloud compute costs by 25%.
  • Engineered automated feature engineering pipelines, streamlining ETL processes for training datasets and improving data quality by 20%.
  • Deployed CI/CD workflows for ML models using Azure ML, Azure DevOps, and Git, ensuring seamless model retraining & versioning.
  • Collaborated with data engineers & software developers to build scalable AI-driven applications, integrating ML insights into enterprise solutions.
  • Led R&D for AI-based automation & visualization tools, enhancing engineering workflows by 25%.
PythonSQLPySparkDatabricksAzure MLTensorFlow+6

Data Engineer | Software Engineer (Microsoft - Team Xbox via TCS)

Sep 2020Oct 2022 · 2 yrs 1 mo

  • 💡 Tech Stack: Python, SQL, PySpark, U-SQL, Databricks, Azure Cloud, Airflow, ELK Stack, CI/CD, Software Development
  • Developed & maintained scalable data pipelines using Python, PySpark, SQL, and U-SQL, processing 50TB+ of data per month on Azure Cloud (Databricks, Data Factory, Synapse Analytics).
  • Optimized ETL workflows by implementing incremental data loading, reducing data processing time by 50% and saving 2,000+ compute hours/month.
  • Designed & deployed REST APIs to enable seamless data exchange between cloud applications and ML models.
  • Implemented CI/CD pipelines (Azure DevOps, Git) for automated deployment and version control of data engineering workflows.
  • Developed logging & alerting systems using ELK stack (Elasticsearch, Logstash, Kibana), enhancing system reliability and incident response.
  • Contributed to Research & Development of automation and visualization tools, optimizing engineering workflows and reducing manual effort by 25%.
  • Collaborated cross-functionally with software engineers, data scientists, and cloud engineers to build high-performance, scalable applications and backend solutions.
PythonSQLPySparkU-SQLDatabricksAzure Cloud+4

Microsoft

Data Science Engineer

Sep 2020Aug 2024 · 3 yrs 11 mos

Data ScienceApache SparkDatabricks ProductsData Lakes

Mpsedc - state it center

2 roles

Network Engineer

Jun 2019Jun 2019 · 0 mo · Greater Bhopal Area · On-site

  • Team/Project: Madhya Pradesh State Electronics Development Corporation (MPSEDC) :
  • Role: Networking Engineer Trainee
  • Got Introduced to diverse networking techniques employed by the state government in their project SWAN (State Wide Area Network).
  • Collaborated in brainstorming sessions to providing networking solutions for various government departments across the state, fostering e-governance.
Computer NetworkingNetwork ServicesNetworking

Data Center Engineer

May 2018Jun 2018 · 1 mo · Bhopal · On-site

  • Team/Project: State Data Center :
  • Role: Data Center Trainee
  • Gained exposure to varied data storage, backup, and recovery methodologies within the MP State Data Center. Supported the delivery of G2G, G2B, and G2C services, aligning with stakeholders' needs through its Cloud Adoption.

Mozilla

Member - Mozilla Club Coherent

Jan 2018Jan 2019 · 1 yr · Bhopal, Madhya Pradesh, India

  • Team: Mozilla Club Coherent
  • As part of the state university's student club, Mozilla Club organised event on Browser Add-ons for students interested in the computer science domain. 150+ participants were introduced to the world of browser add-ons development by being part of the workshop
Data Backup SolutionsData Center OperationsData Center ArchitectureData ManagementData Center VirtualizationData Center Management

Internshala

Internshala Student Partner 8.0

Aug 2017Feb 2018 · 6 mos

Tedx

Operations at TEDxRGPV

Aug 2017Aug 2017 · 0 mo · Bhopal, Madhya Pradesh, India

  • Was part of organising commitee of TEDx RGPV, 2017 which happened to be the First TEDx of the state organised at the state technical universtity - RGPV. With 9 prominent speakers and cultural events, the event was a hit with full house getting coverage across the state.
  • https://www.tedxrgpv.com
  • https://www.ted.com/tedx/events/23705

E-cell rgpv

Media and Public Relations

Jan 2016Jan 2020 · 4 yrs · Bhopal Area, India

  • As a part of Media and Public Relations department
  • Ensured national and state level coverage of our events by preparing & providing press notes/releases in English & Hindi to newspapers and digital media for publishing.
  • Organised 10+ events in association with Microsoft, Apple, Google, Oppo etc.
  • Worked on the flagship event - Imprenditore 3.0 and collaborated with team size of 15+ members and 70+ volunteeers to manage footfall of 1000+ participants
  • Organised Internship Fair, a first of its kind for colleges in state. Got more than 30 startups on board to provide internships to more than 150 students.
  • https://ecellrgpv.com/alumni

Education

San José State University

Master of Science - MS — Computer Software Engineering

Aug 2024May 2026

University Institute of Technology, RGPV

Bachelor of Engineering - BE — Computer Science and Engineering

Burn Hall School

HSC — Science & Information Practices

Burn Hall School

SSC

Stackforce found 100+ more professionals with Machine Learning & Data Engineering

Explore similar profiles based on matching skills and experience

Manu Jain - Data Engineer | Stackforce