Shubham Modi

Lead ML Engineer

Santa Clara, California, United States · 12 yrs 3 mos experience
Most Likely To Switch · AI ML Practitioner

Key Highlights

  • Led a team to improve LLMs by over 50%.
  • Developed scalable AI systems at major tech companies.
  • Bridged research and business impact effectively.
Stackforce AI infers this person is a Machine Learning Leader with expertise in AI and E-commerce.

Skills

Core Skills

Large Language Models (LLM), Generative AI

Other Skills

Algorithms, C, C++, CUDA, Computer Science, Data Structures, Eclipse, Evaluation protocols, HTML, Java, JavaScript, Leadership, Linux, Machine Learning, Product Development

About

Staff Machine Learning Leader with 12+ years of experience driving innovation in LLMs, NLP, and multimodal AI. Proven track record at Meta, Walmart, and Altisource in delivering scalable AI systems, accelerating product adoption, and leading global teams across the US, UK, and India. Adept at bridging cutting-edge research and real-world business impact through technical strategy, cross-functional leadership, and hands-on expertise in advanced ML systems.

Experience

Meta

3 roles

Staff Research Scientist

Promoted

Mar 2024 - Present · 2 yrs

  • Leading a team of 8 engineers to fine-tune Llama 3/4 models across 12+ languages, improving conversational and cultural quality by more than 50% in international markets and growing MAU from 1M to 30M (and still rising).
  • Designed robust evaluation protocols and metrics that reduced error rates by 40% in multilingual conversational AI.
  • Designed and built the full post-training pipeline (SFT, DPO, reward modeling, RJS, and OPPO stages), which significantly improved Llama 4's response quality.
  • Translated strategic business goals into actionable machine learning initiatives through alignment with cross-functional stakeholders and leadership.
Large Language Models (LLM), RLHF, Generative AI, Product Development, Leadership

Lead Machine Learning Generalist

Promoted

Mar 2021 - May 2024 · 3 yrs 2 mos

  • Directed machine learning initiatives for Workplace by Meta, launching AI-powered features such as automatic meeting summarization (BART model) and headline generation (in-house system) that boosted user satisfaction and adoption.
  • Developed and deployed ML systems to detect and prevent scraping activity on FB and IG, reducing data leakage by 30%. Worked with auditors to provide evidence of the work, reducing Meta's exposure to future fines.
  • Developed and executed a cross-functional strategy to balance anti-scraping measures with user experience, leading organizational alignment with Growth teams to drive sustainable business impact.

Machine Learning Generalist

Jun 2019 - Jun 2022 · 3 yrs

  • Developed and deployed deep learning solutions (BERT, neural networks) that surfaced 100K+ work-related Facebook groups, powering targeted outreach with a >90% conversion rate.
  • Collaborated with global teams to integrate ML systems into production, significantly increasing product engagement.
  • Mentored 5+ junior engineers, enhancing organizational ML capabilities.

Walmart Labs

Senior Data Scientist

Dec 2017 - May 2019 · 1 yr 5 mos · Bangalore

  • Developed an end-to-end ML framework for e-commerce product matching
  • Integrated multiple machine learning models (text, image, and others) into a single system
  • Used deep learning models such as CNN, LSTM, and Bi-LSTM-CRF for product matching
  • Led the project from requirements gathering to deployment with a 2-person team
  • Collaborated with multiple teams on product adaptation and deployment
  • Bridged the gap between business and technology by presenting the business impact of the project
  • Trained other team members to grow in the machine learning domain

Altisource Labs

2 roles

Senior Data Scientist

Promoted

Apr 2016 - Dec 2017 · 1 yr 8 mos

  • Developed scalable document classification engines using XGBoost and Random Forest to categorize documents across 300+ classes, outperforming commercial solutions.
  • Built a fully open-source document processing pipeline leveraging R, Tesseract OCR, an in-house classification system, and automated deployment, replacing subscription-based tools and generating multimillion-dollar annual savings.

Data Scientist

Dec 2014 - Apr 2016 · 1 yr 4 mos

  • Worked on classifying mortgage documents into 300+ classes and detecting document boundaries. Used Random Forest to classify documents with above 85% accuracy and a modified Hidden Markov Model to separate documents with above 84% accuracy on a very large dataset.

Technosoft Corporation

Data Scientist

Mar 2014 - Aug 2014 · 5 mos · Bangalore

  • Worked on various challenging projects, including converting SAS to R, analyzing EEG data and predicting seizures, and conversation mining

Texas Instruments

2 roles

Software Engineer

Jul 2013 - Mar 2014 · 8 mos · Bangalore

  • Joined the video drivers team. Worked closely with the hardware TI produces and built knowledge of how video works. Made the code MISRA C compliant. Fixed major bugs in the drivers. Added interrupts to resolve an overflow issue.

Intern

May 2012 - Jul 2012 · 2 mos · Bangalore (India)

  • Completed a project on buffer management during AV playback. Decoupled the use of the 2D tiler and 1D ion buffers for processing and display.

Education

Indian Institute of Technology, Roorkee

Bachelor's degree — Computer Science

Jan 2009 - Jan 2013
