Sshubam Verma

Machine Learning Engineer

Bengaluru, Karnataka, India1 yr 6 mos experience

AI EnabledAI ML Practitioner

Key Highlights

Pioneered first benchmark for Indic LLM evaluation.
Engineered scalable data generation pipelines.
Developed deep learning solutions for real-time applications.

Stackforce AI infers this person is a Machine Learning Engineer specializing in Natural Language Processing and AI solutions.

Contact

verma.sshubam@gmail.com LinkedIn

Skills

Core Skills

Large Language Models (llm)Distributed ComputingData ScienceNatural Language Processing (nlp)Automatic Speech Recognition (asr)Data MiningComputer VisionDeep LearningMachine Learning

Other Skills

Microsoft AzureGoogle Cloud Platform (GCP)FastAPIagentic systemsdockerResearch and Development (R&D)SQLPyTorchWeb DevelopmentData ScrapingFlaskSeleniumImage ProcessingPython (Programming Language)TensorFlow

About

Machine Learning Engineer at Sarvam AI, working on building sovereign foundation models for India. I design and deploy scalable data and ML pipelines for large language models, multilingual NLP, and evaluation systems, with a strong focus on reliability, efficiency, and production readiness. Previously at AI4Bharat (IIT Madras), I worked on large-scale Indic benchmarks and multilingual systems, with research published at NAACL 2025 and EMNLP 2024 (Outstanding Paper Award). I enjoy operating at the intersection of research and engineering, turning complex ideas into robust systems that scale!

Experience

1 yr 6 mos

Total Experience

9 mos

Average Tenure

9 mos

Current Experience

Sarvam

2 roles

Machine Learning Engineer

Aug 2025 – Present · 9 mos · On-site

Building Sovereign AI for India!

Large Language Models (LLM)Distributed ComputingMicrosoft AzureGoogle Cloud Platform (GCP)Data ScienceFastAPI+2

Research Fellow

May 2025 – Jul 2025 · 2 mos · On-site

Ai4bhārat

Associate Researcher

Jul 2024 – Apr 2025 · 9 mos · Chennai, Tamil Nadu, India · On-site

Pioneered MILU, the first comprehensive benchmark for evaluating Large Language Models (LLMs) on authentic Indic contextual understanding
Architected robust synthetic data generation pipelines for collecting high-quality audio data grounded in accurate Indian cultural and linguistic contexts
Engineered a scalable, distributed translation infrastructure on Google Cloud Platform, processing millions of tokens to support the development of IndicTrans3 (Sarvam-M)
Implemented advanced monitoring systems and automated job allocation services for the internal GPU cluster, optimizing resource utilization and computational efficiency
Evaluation methodology and application development for IndicTrans3 and coordinated annotation teams to ensure quality assurance and consistent performance metrics

Research and Development (R&D)Large Language Models (LLM)SQLDistributed ComputingMicrosoft AzureGoogle Cloud Platform (GCP)+4

Indian institute of technology, madras

Research Intern

Feb 2023 – Jun 2024 · 1 yr 4 mos · Chennai, Tamil Nadu, India · On-site

Worked on domain adaptation of Automatic Speech Recognition (ASR) systems using Class language models, training language models, evaluating and tuning hyperparameters of ASR models, generating and filtering data for training language models.
Designed comprehensive end-to-end data scraping and filtering pipelines for rigorous evaluation of LLM capabilities and performance metrics
Developed and successfully deployed critical internal data collection platforms, including the inaugural version of Anudesh and specialized annotation tools for comparative LLM output analysis for research projects.

Natural Language Processing (NLP)Data MiningPyTorchAutomatic Speech Recognition (ASR)Research and Development (R&D)Large Language Models (LLM)+6

Scaler

Data Science Intern

Aug 2022 – Nov 2022 · 3 mos

Expertly developed cutting-edge scripts and assessments, elevating Computer Vision standards.
Coded and explained CV algorithms from scratch, such as CNN, through animations and relatable analogies
Implementing and decoding state-of-the-art models like MobileNet, ResNet, EfficientNet, etc.
Received 5/5 learner rating and appreciation from HOD Data Science.

Image ProcessingPython (Programming Language)TensorFlowDeep LearningNumPyComputer Vision+1

Interviewbit

Data Science Intern

Aug 2022 – Nov 2022 · 3 mos

Indian institute of technology, delhi

Machine Learning Intern

Jun 2022 – Jul 2022 · 1 mo · Delhi, India

Worked as a Summer ML Intern at the IITD AIA Foundation for Smart Manufacturing at IIT Delhi.
Key Roles :
Developing an end-to-end deep learning pipeline to identify power grid fault using voltage sensor data
Researching and implementing state-of-the-art model architectures
Deploying the Deep learning pipeline in Flask
Hosting the Web App on Cloud to perform real time inference based on sensor data
Optimizing the big data pipeline to minimize memory usage on training pipelines
Delivered the deployed and hosted deep learning pipeline with ~85% accuracy