S

Sshubam Verma

Machine Learning Engineer

Bengaluru, Karnataka, India1 yr 4 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Pioneered first benchmark for Indic LLM evaluation.
  • Engineered scalable data generation pipelines.
  • Developed deep learning solutions for real-time applications.
Stackforce AI infers this person is a Machine Learning Engineer specializing in Natural Language Processing and AI solutions.

Contact

Skills

Core Skills

Large Language Models (llm)Distributed ComputingData ScienceNatural Language Processing (nlp)Automatic Speech Recognition (asr)Data MiningComputer VisionDeep LearningMachine Learning

Other Skills

Microsoft AzureGoogle Cloud Platform (GCP)FastAPIagentic systemsdockerResearch and Development (R&D)SQLPyTorchWeb DevelopmentData ScrapingFlaskSeleniumImage ProcessingPython (Programming Language)TensorFlow

About

Machine Learning Engineer at Sarvam AI, working on building sovereign foundation models for India. I design and deploy scalable data and ML pipelines for large language models, multilingual NLP, and evaluation systems, with a strong focus on reliability, efficiency, and production readiness. Previously at AI4Bharat (IIT Madras), I worked on large-scale Indic benchmarks and multilingual systems, with research published at NAACL 2025 and EMNLP 2024 (Outstanding Paper Award). I enjoy operating at the intersection of research and engineering, turning complex ideas into robust systems that scale!

Experience

Sarvam

2 roles

Machine Learning Engineer

Aug 2025Present · 7 mos · On-site

  • Building Sovereign AI for India!
Large Language Models (LLM)Distributed ComputingMicrosoft AzureGoogle Cloud Platform (GCP)Data ScienceFastAPI+2

Research Fellow

May 2025Jul 2025 · 2 mos · On-site

Ai4bhārat

Associate Researcher

Jul 2024Apr 2025 · 9 mos · Chennai, Tamil Nadu, India · On-site

  • Pioneered MILU, the first comprehensive benchmark for evaluating Large Language Models (LLMs) on authentic Indic contextual understanding
  • Architected robust synthetic data generation pipelines for collecting high-quality audio data grounded in accurate Indian cultural and linguistic contexts
  • Engineered a scalable, distributed translation infrastructure on Google Cloud Platform, processing millions of tokens to support the development of IndicTrans3 (Sarvam-M)
  • Implemented advanced monitoring systems and automated job allocation services for the internal GPU cluster, optimizing resource utilization and computational efficiency
  • Evaluation methodology and application development for IndicTrans3 and coordinated annotation teams to ensure quality assurance and consistent performance metrics
Research and Development (R&D)Large Language Models (LLM)SQLDistributed ComputingMicrosoft AzureGoogle Cloud Platform (GCP)+4

Indian institute of technology, madras

Research Intern

Feb 2023Jun 2024 · 1 yr 4 mos · Chennai, Tamil Nadu, India · On-site

  • Worked on domain adaptation of Automatic Speech Recognition (ASR) systems using Class language models, training language models, evaluating and tuning hyperparameters of ASR models, generating and filtering data for training language models.
  • Designed comprehensive end-to-end data scraping and filtering pipelines for rigorous evaluation of LLM capabilities and performance metrics
  • Developed and successfully deployed critical internal data collection platforms, including the inaugural version of Anudesh and specialized annotation tools for comparative LLM output analysis for research projects.
Natural Language Processing (NLP)Data MiningPyTorchAutomatic Speech Recognition (ASR)Research and Development (R&D)Large Language Models (LLM)+6

Scaler

Data Science Intern

Aug 2022Nov 2022 · 3 mos

  • Expertly developed cutting-edge scripts and assessments, elevating Computer Vision standards.
  • Coded and explained CV algorithms from scratch, such as CNN, through animations and relatable analogies
  • Implementing and decoding state-of-the-art models like MobileNet, ResNet, EfficientNet, etc.
  • Received 5/5 learner rating and appreciation from HOD Data Science.
Image ProcessingPython (Programming Language)TensorFlowDeep LearningNumPyComputer Vision+1

Interviewbit

Data Science Intern

Aug 2022Nov 2022 · 3 mos

Indian institute of technology, delhi

Machine Learning Intern

Jun 2022Jul 2022 · 1 mo · Delhi, India

  • Worked as a Summer ML Intern at the IITD AIA Foundation for Smart Manufacturing at IIT Delhi.
  • Key Roles :
  • Developing an end-to-end deep learning pipeline to identify power grid fault using voltage sensor data
  • Researching and implementing state-of-the-art model architectures
  • Deploying the Deep learning pipeline in Flask
  • Hosting the Web App on Cloud to perform real time inference based on sensor data
  • Optimizing the big data pipeline to minimize memory usage on training pipelines
  • Delivered the deployed and hosted deep learning pipeline with ~85% accuracy
HerokuPython (Programming Language)PyTorchTensorFlowtime seriesDeep Learning+2

Iitd-aia foundation for smart manufacturing

Machine Learning Intern

Jun 2022Jul 2022 · 1 mo · Delhi, India

Education

Maharaja Agrasen Institute Of Technology, Delhi

Bachelor of Technology - BTech — Computer Science

Jan 2020Jan 2024

DAV Public School

Jan 2016Jan 2020

Stackforce found 100+ more professionals with Large Language Models (llm) & Distributed Computing

Explore similar profiles based on matching skills and experience