Shubham Agarwal — AI Researcher

I bring over 10 years of research experience across academia and industry since completing my undergraduate studies, with a strong focus on training frontier models for multimodal and multilingual representation learning. My work has been published at top venues including CVPR, ICLR, TMLR, ACL, EMNLP, and NeurIPS.

Past: ServiceNow Research | MILA | Adobe Research | Google Summer of Code (GSoC) | Xerox Research (XRCE) | IIT Delhi

GScholar: https://scholar.google.com/citations?user=aSMFGScAAAAJ&hl=en

Research Interests: Large Language Models (LLMs) | Cultural LLMs / VLMs | Multimodal LLMs | Foundation Models | Conversational AI | Vision and Language | Deep Learning

Research challenge participation: 007 @ English-to-lowres Multimodal Machine Translation Task '24 | Alana @ Alexa Prize Socialbot Challenge '18 | Pikabot @ Visual Dialog Challenge '18 | NLE @ E2E NLG Challenge '17

Currently training frontier multilingual AI models for Indian languages. My research has mostly focused on visual grounding (symbol grounding) and context modeling (communicative grounding) in multimodal visual conversational agents using Multimodal LLMs. Broadly, I am interested in building machines that can see and talk.

Please visit my homepage (https://shubhamagarwal92.github.io) for more information. Also see my GitHub (handle: shubhamagarwal92) for some of my public repositories.

Some authored blogs demonstrating my research as well as coding style:
1. List of some nice GitHub PRs and my contributions.
2. How-to-do-research: https://medium.com/@shubhamagarwal92/how-to-do-research-a-ph-d-student-narrative-bca8dc2dd39e
3. Code-like-a-pro(-ish), featured by AnalyticsVidhya: https://medium.com/@shubhamagarwal92/code-like-a-pro-ish-right-from-101-tools-from-a-deep-learning-perspective-34d8df1e38e
4. How-to-do-a-literature-review: https://medium.com/@shubhamagarwal92/how-to-do-a-literature-review-research-101-5c5206039c32

Open source project: Sample dashboard for Demographics of India (link: https://shubhamagarwal92.shinyapps.io/shinyapp/)

Please get in touch if you would like to collaborate on an interesting idea.

Stackforce AI infers this person is a leading expert in AI research with a focus on multilingual and multimodal systems.

Location: Bengaluru, Karnataka, India

Experience: 5 yrs 11 mos

Skills

Large Language Models (LLM)
Conversational AI
Computer Vision

Career Highlights

Over 10 years of research experience in AI.
Published work in top venues like CVPR and NeurIPS.
Expertise in multilingual and multimodal representation learning.

Work Experience

Krutrim

Staff Research Scientist (2 yrs 1 mo)

ServiceNow Research

Visiting Researcher (9 mos)

Pulse Labs

Research Scientist (1 yr 11 mos)

Adobe

Research Intern (3 mos)

Google Summer of Code

Google Summer of Code (GSoC) Student (3 mos)

Xerox

Research Intern (5 mos)

TrulyMadly.com

Data Scientist (1 yr 1 mo)

Indian Institute of Technology, Delhi

Undergraduate Teaching Assistant (10 mos)

Télécom Bretagne (ex ENST Bretagne, école nationale des télécommunications de Bretagne)

Research Intern (2 mos)

Oracle Financial Services Software Limited

Project Trainee (2 mos)

IDRBT

Research Intern (2 mos)

Avanti Fellows

Mentor (10 mos)

Education

Postdoctoral Researcher at Mila - Quebec Artificial Intelligence Institute

Integrated Masters (5 years) at Indian Institute of Technology, Delhi

Doctor of Philosophy - PhD at Heriot-Watt University

Master’s Degree at Grenoble INP - UGA

12th Standard at Wood Row School

10th Standard at Bishop Conrad School

Skills

Core Skills

Large Language Models (LLM)
Conversational AI
Computer Vision

Other Skills

Artificial Intelligence (AI)
Natural Language Processing (NLP)
Data Analysis
Machine Learning
Data Mining
Data Structures
Python
Algorithms
Programming
Business Intelligence
PyTorch
Keras
TensorFlow
R
SQL

Experience

Total Experience: 5 yrs 11 mos
Average Tenure: 10 mos

Current Experience

Krutrim

Staff Research Scientist

Apr 2024 – Present · 2 yrs 1 mo · Bengaluru, Karnataka, India · On-site

Building multilingual and multimodal foundation LLMs with a focus on Indian culture and languages. Some of our recently accepted works:
IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs (ICLR'26)
BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages (AAAI'26)
Chitrakshara: A Large Multilingual Multimodal Dataset for Indian languages (VLMs4ALL, CVPR'25)
Pragyaan: Designing and Curating High-Quality Cultural Post-Training Datasets for Indian Languages (MRL, EMNLP'25)
Chitrarth: Bridging Vision and Language for a Billion People (MAR, NeurIPS'24)
Our Multimodal VLM supports 10 Indian languages - extended version at ICASSP'25
Chitranuvad: Adapting Multi-lingual LLMs for Multimodal Translation (WMT, EMNLP'24)
Winner of the English-to-lowres Multimodal Machine Translation task at EMNLP 2024.
3 other works under review at main conferences (available at arXiv and GScholar)

Skills: Artificial Intelligence (AI) | Conversational AI | Large Language Models (LLM) | Natural Language Processing (NLP) | Computer Vision

ServiceNow Research

Visiting Researcher

Jul 2023 – Apr 2024 · 9 mos · Montreal, Quebec, Canada · Hybrid

Affiliated with MILA, University of Montreal (HEC Montreal) and ServiceNow Research
Worked on LLMs for multi-document summarization and multi-modal applications. (Publications at CVPR'25, ICLR'25, TMLR'25)

Pulse Labs

Research Scientist

Jul 2021 – Jun 2023 · 1 yr 11 mos · Remotely · Remote

Worked on a UX platform for acquiring and analysing human behavioural data, including users' engagement with recent technologies such as LLMs.
Integrated LLM-based insights and search into the product, and worked on interactive dashboards.

Adobe

2 roles

Research Intern

Jun 2020 – Sep 2020 · 3 mos · Remotely · Remote

Worked on multi-modal dialog systems.

Research Intern

May 2019 – Aug 2019 · 3 mos · San Francisco Bay Area

Worked on visually grounded open-domain dialog systems.
Resulted in ACL'20 publication:
History for Visual Dialog: Do we really need it?
https://arxiv.org/abs/2005.07493
2 proposals accepted as grants for Adobe Gift Funding.

Google Summer of Code

Google Summer of Code (GSoC) Student

May 2017 – Aug 2017 · 3 mos · France

Worked on the "Scientific Visualization" project for the PEcAn organization
More information and links can be found at
https://medium.com/@shubhamagarwal_91893/google-summer-of-code-2017-pecan-daa2fd11755a

Xerox

Research Intern

Feb 2017 – Jul 2017 · 5 mos · Grenoble Area, France

Worked on my master's thesis at Xerox Research Center Europe (XRCE), Grenoble, France
NLE @ E2E NLG Challenge 2017
http://www.macs.hw.ac.uk/InteractionLab/E2E/#results
(only char based seq2seq model)

TrulyMadly.com

Data Scientist

Jul 2015 – Aug 2016 · 1 yr 1 mo · New Delhi Area, India

Worked on a personalised recommendation system using Random Forest in R (H2O) on a highly imbalanced dataset, using downsampling and upsampling techniques.
Developed interactive dashboards assisting monetisation strategies using Shiny in R.
Identified relevant features and parameters for a reciprocal recommender system.
Conceptualised and built a personalisation module in PHP for the restaurant recommender system "Datelicious", a product offering curated date offers for couples.
Sample Dashboard in Shiny for reference:
https://shubhamagarwal92.shinyapps.io/shinyapp/

Indian Institute of Technology, Delhi

Undergraduate Teaching Assistant

Jul 2014 – May 2015 · 10 mos · New Delhi Area, India

Course: Ordinary Differential Equations
Dept. of Mathematics
Supervisor: Dr. VVK Srinivas
Course: System Design Lab
Dept. of Mathematics
Supervisor: Dr. B. Chandra

Télécom Bretagne (ex ENST Bretagne, école nationale des télécommunications de Bretagne)

Research Intern

May 2014 – Jul 2014 · 2 mos · Brest Area, France

Community Detection in Social Networks:
Analysed different Community Detection algorithms used in Social Network Analysis
Applied the igraph package in R for community detection on standard graph datasets
Prepared detailed analysis of resulting communities based on algorithm used & graph structure

Oracle Financial Services Software Limited

Project Trainee

May 2013 – Jul 2013 · 2 mos · Mumbai Area, India

Business Intelligence and Data Warehousing:
Worked on Oracle's proprietary OBIEE platform to provide Business Intelligence technologies for a client banking organization
Applied dimensional modeling based on a star schema to develop a client-specific data warehouse
Delivered interactive Dashboards & Analyses as part of offered Enterprise Reporting solutions

IDRBT

Research Intern

May 2012 – Jul 2012 · 2 mos · Hyderabad Area, India

Data Mining Tools on Integrated Complaint Management System (ICMS):
Performed suitability analysis & comparison of analytical tools on Integrated Complaint Management System
Studied event correlation based on sensitive network parameters; developed a customized desktop application to efficiently generate performance reports
Analysed periodicity of underlying patterns & events to facilitate capacity planning