Gaurav Sarkar

DevOps Engineer

Hyderabad, Telangana, India4 yrs 4 mos experience

AI EnabledAI ML Practitioner

Key Highlights

Published multiple research papers on LLM optimization.
Developed innovative frameworks for model training efficiency.
Built a language learning platform leveraging native languages.

Stackforce AI infers this person is a Machine Learning Engineer with a focus on AI research and development.

Contact

Skills

Core Skills

Large Language Models (llm)Machine LearningData AnalysisNatural Language Processing (nlp)

Other Skills

AlgorithmsBERT (Language Model)BlockchainC (Programming Language)C++ContainerizationConvolutional Neural Networks (CNN)Data AnalyticsData PrivacyData ScienceData StructuresDeep LearningDeep video streaming using jetson nanoDockerGenerative AI

About

I am currently working on making fastest diffusion llm model with intel labs. I am pursuing masters at IIIT Hyderabad where my masters thesis is based on how to leverage data in an efficient manner to come up with better models basically efficient training methods. I am building indilingo, where anyone can learn any language using their own mother tongue language. It got selected for eleven labs program, residency program etc..

Experience

4 yrs 4 mos

Total Experience

1 yr 1 mo

Average Tenure

Current Experience

Intel corporation

AI Solution Engineer

Feb 2024 – Aug 2025 · 1 yr 6 mos · Bengaluru, Karnataka, India · On-site

1) Published a research paper on making Diffusion llm faster and accurate using a training free optimization algorithm compared to SOTA fast dllm results with Intel Labs (https://arxiv.org/pdf/2512.07173)
2) Published a research paper on novel activation function called SG-Blend with Intel Labs (https://arxiv.org/abs/2505.23942)
3)Built a Framework for Optimizing any HuggingFace model to get a boost upto 80%
4) Added MT5 model to support on Optimum-Habana Library for Gaudi AI Accelerator.
5) Built End to End Speech 2 Speech Translation Pipeline using Bhashini Models
6) Built Auto train on Gaudi Framework for training models using No Code platform

Generative AILarge Language Models (LLM)Machine Learning

International institute of information technology hyderabad (iiith)

Research Assistant

Aug 2023 – Dec 2023 · 4 mos · Hyderabad, Telangana, India · On-site

Currently My Research Work is On Studying Neural Network's Behaviour on Different Clusters and Developing a Framework for Training Models Efficiently.
@Data Science and Analytics Centre (DSAC)

Openmined

Padawan Fellow

Aug 2023 – Sep 2023 · 1 mo

Perform Data Science on Data that remains in someone else's server using Syft Stack.

Wells fargo

Intern Analyst

Jun 2023 – Aug 2023 · 2 mos · Hyderabad, Telangana, India

Our team's current focus revolves around tackling escalated complaints, aiming to distinguish them from the usual ones. Additionally, we're implementing a categorization system that will organize these complaints into different sections, aligning them with the specific services they pertain to.
In the longer term we aim to streamline the complete complaint resolution process using Large language models (LLM)

Quantitative AnalyticsData Analysis

Multyfi

Quantitative Researcher

Apr 2023 – Jun 2023 · 2 mos · New Delhi, Delhi, India · Remote

Currently working on effective alpha generating strategies
using machine learning and deep learning.
Predicting volatility index of indian market, calculating spread
for each week days. Working on pair trading strategy
(market-neutral), backtested on different stocks.

Hugging face

Hugging Face – Student Ambassador

Jun 2022 – Aug 2023 · 1 yr 2 mos · Remote

As part of the cohort, I will be volunteering to work with the community in their ML democratization efforts. I will contribute to the Machine Learning open-source ecosystem.

Natural Language Processing (NLP)Open-Source DevelopmentTransformer

Scaler

Problem Setter Intern (Data Science)

Jun 2022 – Aug 2022 · 2 mos · Remote

My Responsibilities were to create Article Structure, Data Science Curriculum, Review Content, etc.
My work revolves around topics like Numpy, Pandas, Keras, Matplotlib, NLP, etc.(We Got 501k Monthly Clicks)

Mitacs

Globalink Research Intern

Jun 2022 – Aug 2022 · 2 mos · Montreal, Quebec, Canada

I have been selected for the Mitacs Globalink Research Internship 2022 at McGill University, Canada, a global top 50 university and top 3 in Canada.
I would be working on the project titled "Machine Learning to Predict Temporomandibular Disorders Risk from Genotypes", under Sahir Bhatnagar.

Ford motor company

Data Scientist Intern

May 2022 – Jul 2022 · 2 mos · Chennai, Tamil Nadu, India · Remote

I will be part of the Artificial Intelligence Advancement Center (AIAC) Team and working on NLP based projects.

Google

Google CrowdSource ExploreML Facilitator

Apr 2022 – May 2022 · 1 mo

I took Machine learning session in which I taught them about basics of machine learning like thinking of features, dimension issues, fitting lines and curves etc.
We also covered kaggle courses as assignments and applied skills on real data sets. It was a hybrid session. Over 200 students registered for the event.

Data PrivacyTransformersBERT (Language Model)Natural Language Processing (NLP)

Dataweave

Data Engineer Intern

Oct 2021 – Dec 2021 · 2 mos · Bangalore Urban, Karnataka, India

I was responsible for writing scripts and scraping data from different websites. I learn how to handle requests, regex, SQL.

Iiit-naya raipur

Summer Research Intern [ Machine Learning ]

Apr 2021 – Nov 2021 · 7 mos · India

I was doing research work under the guidance of IIIT NR Professor in Data Privacy. I learned differential privacy and applied it so that every company and country can share data without any security concerns.

Kaggle

Kaggle Dataset Expert

Dec 2020 – May 2022 · 1 yr 5 mos

World rank :- 335 / 29,147

Quantum computing india

Fellowship Scholarship

Nov 2020 – Jan 2021 · 2 mos · India

In this, we learn about quantum computing, quantum physics, the mathematics behind it, and quantum algorithm.
We also implemented an algorithm using IBM’S QISKIT
I selected Quantum Machine Learning for the specialization domain.

Omdena

2 roles

Junior ML Engineer

Aug 2020 – Sep 2020 · 1 mo · United States

To build an economic well-being model. The goal is to train machine models to learn features related to urbanization and to changing agriculture thus giving a better understanding of economic well-being. The project falls under the UN ́s Sustainable Development Goal 8.

Junior ML Engineer

Jul 2020 – Aug 2020 · 1 mo · United States

Project: In this two-month AI Challenge, will work on detecting rooftops via satellite images to identify crucial rooftop features for solar panel installments. The developed solutions could significantly help to facilitate solar energy adoption