Gaurav Sarkar

DevOps Engineer

Hyderabad, Telangana, India · 4 yrs 4 mos experience
AI Enabled · AI ML Practitioner

Key Highlights

  • Published multiple research papers on LLM optimization.
  • Developed innovative frameworks for model training efficiency.
  • Built a language learning platform leveraging native languages.
Stackforce AI infers this person is a Machine Learning Engineer with a focus on AI research and development.

Skills

Core Skills

Large Language Models (LLM), Machine Learning, Data Analysis, Natural Language Processing (NLP)

Other Skills

Algorithms, BERT (Language Model), Blockchain, C (Programming Language), C++, Containerization, Convolutional Neural Networks (CNN), Data Analytics, Data Privacy, Data Science, Data Structures, Deep Learning, Deep video streaming using Jetson Nano, Docker, Generative AI

About

I am currently working with Intel Labs on building the fastest diffusion LLM. I am pursuing a master's at IIIT Hyderabad, where my thesis is on efficient training methods: how to leverage data efficiently to produce better models. I am also building Indilingo, where anyone can learn any language using their own mother tongue. It was selected for the ElevenLabs program, a residency program, and others.

Experience

Intel Corporation

AI Solution Engineer

Feb 2024 - Aug 2025 · 1 yr 6 mos · Bengaluru, Karnataka, India · On-site

  • 1) Published a research paper with Intel Labs on making diffusion LLMs faster and more accurate using a training-free optimization algorithm, compared against SOTA Fast-dLLM results (https://arxiv.org/pdf/2512.07173)
  • 2) Published a research paper with Intel Labs on a novel activation function called SG-Blend (https://arxiv.org/abs/2505.23942)
  • 3) Built a framework for optimizing any Hugging Face model, giving a boost of up to 80%
  • 4) Added mT5 model support to the Optimum-Habana library for the Gaudi AI accelerator
  • 5) Built an end-to-end speech-to-speech translation pipeline using Bhashini models
  • 6) Built an Auto Train on Gaudi framework for training models through a no-code platform
Generative AI, Large Language Models (LLM), Machine Learning

International Institute of Information Technology Hyderabad (IIITH)

Research Assistant

Aug 2023 - Dec 2023 · 4 mos · Hyderabad, Telangana, India · On-site

  • My research work studies neural networks' behaviour on different clusters and develops a framework for training models efficiently.
  • At the Data Science and Analytics Centre (DSAC)

OpenMined

Padawan Fellow

Aug 2023 - Sep 2023 · 1 mo

  • Performed data science on data that remains on someone else's server, using the Syft stack.

Wells Fargo

Intern Analyst

Jun 2023 - Aug 2023 · 2 mos · Hyderabad, Telangana, India

  • Our team's current focus revolves around tackling escalated complaints, aiming to distinguish them from the usual ones. Additionally, we're implementing a categorization system that will organize these complaints into different sections, aligning them with the specific services they pertain to.
  • In the longer term, we aim to streamline the complete complaint resolution process using large language models (LLMs).
Quantitative Analytics, Data Analysis

Multyfi

Quantitative Researcher

Apr 2023 - Jun 2023 · 2 mos · New Delhi, Delhi, India · Remote

  • Currently working on effective alpha-generating strategies using machine learning and deep learning.
  • Predicting the volatility index of the Indian market and calculating the spread for each weekday.
  • Working on a pair trading strategy (market-neutral), backtested on different stocks.

Hugging Face

Hugging Face – Student Ambassador

Jun 2022 - Aug 2023 · 1 yr 2 mos · Remote

  • As part of the cohort, I will be volunteering to work with the community in their ML democratization efforts. I will contribute to the Machine Learning open-source ecosystem.
Natural Language Processing (NLP), Open-Source Development, Transformer

Scaler

Problem Setter Intern (Data Science)

Jun 2022 - Aug 2022 · 2 mos · Remote

  • My responsibilities were to create article structures and the data science curriculum, review content, etc.
  • My work revolved around topics like NumPy, pandas, Keras, Matplotlib, NLP, etc. (we got 501k monthly clicks).

Mitacs

Globalink Research Intern

Jun 2022 - Aug 2022 · 2 mos · Montreal, Quebec, Canada

  • I have been selected for the Mitacs Globalink Research Internship 2022 at McGill University, Canada, a global top 50 university and top 3 in Canada.
  • I would be working on the project titled "Machine Learning to Predict Temporomandibular Disorders Risk from Genotypes", under Sahir Bhatnagar.

Ford Motor Company

Data Scientist Intern

May 2022 - Jul 2022 · 2 mos · Chennai, Tamil Nadu, India · Remote

  • I will be part of the Artificial Intelligence Advancement Center (AIAC) team, working on NLP-based projects.

Google

Google CrowdSource ExploreML Facilitator

Apr 2022 - May 2022 · 1 mo

  • I led a machine learning session in which I taught students the basics of machine learning, such as thinking of features, dimension issues, and fitting lines and curves.
  • We also covered Kaggle courses as assignments and applied the skills to real datasets. It was a hybrid session, and over 200 students registered for the event.
Data Privacy, Transformers, BERT (Language Model), Natural Language Processing (NLP)

DataWeave

Data Engineer Intern

Oct 2021 - Dec 2021 · 2 mos · Bangalore Urban, Karnataka, India

  • I was responsible for writing scripts and scraping data from different websites. I learned how to handle requests, regex, and SQL.

IIIT-Naya Raipur

Summer Research Intern [ Machine Learning ]

Apr 2021 - Nov 2021 · 7 mos · India

  • I did research on data privacy under the guidance of an IIIT NR professor. I learned differential privacy and applied it so that companies and countries can share data without security concerns.

Kaggle

Kaggle Dataset Expert

Dec 2020 - May 2022 · 1 yr 5 mos

  • World rank: 335 / 29,147

Quantum Computing India

Fellowship Scholarship

Nov 2020 - Jan 2021 · 2 mos · India

  • In this fellowship, we learned about quantum computing, quantum physics, the mathematics behind them, and quantum algorithms.
  • We also implemented an algorithm using IBM's Qiskit.
  • I selected Quantum Machine Learning as my specialization domain.

Omdena

2 roles

Junior ML Engineer

Aug 2020 - Sep 2020 · 1 mo · United States

  • Project: Build an economic well-being model. The goal is to train machine learning models to learn features related to urbanization and changing agriculture, giving a better understanding of economic well-being. The project falls under the UN's Sustainable Development Goal 8.

Junior ML Engineer

Jul 2020 - Aug 2020 · 1 mo · United States

  • Project: In this two-month AI challenge, I worked on detecting rooftops in satellite images to identify crucial rooftop features for solar panel installation. The developed solutions could significantly help facilitate solar energy adoption.

Education

International Institute of Information Technology Hyderabad (IIITH)

Master of Science - MS by Research — Computer Science

Aug 2025 - Jul 2027

Dayananda Sagar College of Engineering, BANGALORE

Bachelor of Engineering - BE — Computer Science

Jan 2019 - Jan 2023

Delhi Public School (DPS)

High School — Science Stream
