Shubham Agarwal

AI Researcher

Bengaluru, Karnataka, India · 5 yrs 11 mos experience
AI Enabled · AI ML Practitioner

Key Highlights

  • Over 10 years of research experience in AI.
  • Published work in top venues like CVPR and NeurIPS.
  • Expertise in multilingual and multimodal representation learning.
Stackforce AI infers this person is a leading expert in AI research with a focus on multilingual and multimodal systems.

Skills

Core Skills

Large Language Models (LLM) · Conversational AI · Computer Vision

Other Skills

Artificial Intelligence (AI) · Natural Language Processing (NLP) · Data Analysis · Machine Learning · Data Mining · Data Structures · Python · Algorithms · Programming · Business Intelligence · PyTorch · Keras · TensorFlow · R · SQL

About

I bring over 10 years of research experience across academia and industry since my undergraduate studies, with a strong focus on training frontier models for multimodal and multilingual representation learning. I have been fortunate to have my work published at top venues such as CVPR, ICLR, TMLR, ACL, EMNLP, and NeurIPS.

Past: ServiceNow Research | MILA | Adobe Research | Google Summer of Code (GSoC) | Xerox Research (XRCE) | IIT Delhi

GScholar: https://scholar.google.com/citations?user=aSMFGScAAAAJ&hl=en

Research Interests: Large Language Models (LLMs) | Cultural LLMs / VLMs | Multimodal LLMs | Foundation Models | Conversational AI | Vision and Language | Deep Learning

Research challenge participation: 007 @ English-to-lowres Multimodal Machine Translation Task'24 | Alana @ Alexa Prize Socialbot Challenge'18 | Pikabot @ Visual Dialog Challenge'18 | NLE @ E2E NLG Challenge'17

I am training frontier multilingual AI models for Indian languages. My research has mostly focused on visual grounding (symbol grounding) and context modeling (communicative grounding) in multimodal visual conversational agents using Multimodal LLMs. Broadly, I am interested in building machines that can see and talk.

Please visit my homepage (https://shubhamagarwal92.github.io) for more information. Also see my GitHub (handle: shubhamagarwal92) for some of my public repositories.

Some authored blogs demonstrating my research as well as coding style:
1. List of some nice GitHub PRs and my contributions.
2. How-to-do-research: https://medium.com/@shubhamagarwal92/how-to-do-research-a-ph-d-student-narrative-bca8dc2dd39e
3. Code-like-a-pro(-ish), featured by AnalyticsVidhya: https://medium.com/@shubhamagarwal92/code-like-a-pro-ish-right-from-101-tools-from-a-deep-learning-perspective-34d8df1e38e
4. How-to-do-a-literature-review: https://medium.com/@shubhamagarwal92/how-to-do-a-literature-review-research-101-5c5206039c32

Open source project: Sample dashboard for Demographics of India (link: https://shubhamagarwal92.shinyapps.io/shinyapp/)

Please get in touch if you would like to collaborate on an interesting idea.

Experience

Total Experience: 5 yrs 11 mos
Average Tenure: 10 mos
Current Experience: --

Krutrim

Staff Research Scientist

Apr 2024 - Present · 2 yrs 1 mo · Bengaluru, Karnataka, India · On-site

  • Building multilingual and multimodal foundation LLMs with a focus on Indian culture and languages. Some of our recently accepted works:
    • IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs (ICLR'26)
    • BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages (AAAI'26)
    • Chitrakshara: A Large Multilingual Multimodal Dataset for Indian languages (VLMs4ALL, CVPR'25)
    • Pragyaan: Designing and Curating High-Quality Cultural Post-Training Datasets for Indian Languages (MRL, EMNLP'25)
    • Chitrarth: Bridging Vision and Language for a Billion People (MAR, NeurIPS'24). Our multimodal VLM supports 10 Indian languages; extended version at ICASSP'25.
    • Chitranuvad: Adapting Multi-lingual LLMs for Multimodal Translation (WMT, EMNLP'24). Winner of the English-to-lowres Multimodal Machine Translation task at EMNLP 2024.
  • 3 other works under review at main conferences (available on arXiv and GScholar)
Artificial Intelligence (AI) · Conversational AI · Large Language Models (LLM) · Natural Language Processing (NLP) · Computer Vision

ServiceNow Research

Visiting Researcher

Jul 2023 - Apr 2024 · 9 mos · Montreal, Quebec, Canada · Hybrid

  • Affiliated with MILA, University of Montreal (HEC Montreal) and ServiceNow Research
  • Worked on LLMs for multi-document summarization and multi-modal applications. (Publications at CVPR'25, ICLR'25, TMLR'25)

Pulse Labs

Research Scientist

Jul 2021 - Jun 2023 · 1 yr 11 mos · Remote

  • Worked on a UX platform for acquiring and analysing human behavioural data on engagement with the latest technologies, such as LLMs.
  • Integrated LLM-based insights and search into the product, and worked on interactive dashboards.

Adobe

2 roles

Research Intern

Jun 2020 - Sep 2020 · 3 mos · Remote

  • Worked on multi-modal dialog systems.

Research Intern

May 2019 - Aug 2019 · 3 mos · San Francisco Bay Area

  • Worked on visually grounded open-domain dialog systems.
  • Resulted in an ACL'20 publication: History for Visual Dialog: Do we really need it? (https://arxiv.org/abs/2005.07493)
  • 2 proposals accepted as grants for Adobe Gift Funding.

Google Summer of Code

Google Summer of Code (GSoC) Student

May 2017 - Aug 2017 · 3 mos · France

  • Worked on the "Scientific Visualization" project for the PEcAn organization
  • More information and links can be found at https://medium.com/@shubhamagarwal_91893/google-summer-of-code-2017-pecan-daa2fd11755a

Xerox

Research Intern

Feb 2017 - Jul 2017 · 5 mos · Grenoble Area, France

  • Worked on my master's thesis at Xerox Research Center Europe (XRCE), Grenoble, France
  • NLE @ E2E NLG Challenge 2017 (only char-based seq2seq model): http://www.macs.hw.ac.uk/InteractionLab/E2E/#results

Trulymadly.com

Data Scientist

Jul 2015 - Aug 2016 · 1 yr 1 mo · New Delhi Area, India

  • Worked on the Personalised Recommendation System using Random Forest in R (H2O) on a highly imbalanced dataset, using downsampling and upsampling techniques.
  • Developed interactive dashboards assisting monetisation strategies using Shiny in R.
  • Identified relevant features & parameters for the Reciprocal Recommender System.
  • Conceptualised and built a personalisation module in PHP for the restaurant recommender system "Datelicious", a product offering curated date offers for couples.
  • Sample dashboard in Shiny for reference: https://shubhamagarwal92.shinyapps.io/shinyapp/

Indian Institute of Technology, Delhi

Undergraduate Teaching Assistant

Jul 2014 - May 2015 · 10 mos · New Delhi Area, India

  • Course: Ordinary Differential Equations · Dept. of Mathematics · Supervisor: Dr. VVK Srinivas
  • Course: System Design Lab · Dept. of Mathematics · Supervisor: Dr. B. Chandra

Télécom Bretagne (ex ENST Bretagne, École Nationale des Télécommunications de Bretagne)

Research Intern

May 2014 - Jul 2014 · 2 mos · Brest Area, France

  • Community Detection in Social Networks:
  • Analysed different Community Detection algorithms used in Social Network Analysis
  • Deployed the igraph package in R for community detection on standard graph datasets
  • Prepared detailed analysis of resulting communities based on algorithm used & graph structure

Oracle Financial Services Software Limited

Project Trainee

May 2013 - Jul 2013 · 2 mos · Mumbai Area, India

  • Business Intelligence and Data Warehousing:
  • Worked on Oracle’s proprietary OBIEE platform to provide Business Intelligence technologies for a client banking organization
  • Deployed Dimensional Modeling based on star schema to develop a client-specific Data Warehouse
  • Delivered interactive Dashboards & Analyses as part of offered Enterprise Reporting solutions

IDRBT

Research Intern

May 2012 - Jul 2012 · 2 mos · Hyderabad Area, India

  • Data Mining Tools on Integrated Complaint Management System (ICMS):
  • Performed suitability analysis & comparison of analytical tools on Integrated Complaint Management System
  • Studied event correlation based on sensitive network parameters; developed a customized desktop application to efficiently generate performance reports
  • Analysed periodicity of underlying patterns & events to facilitate capacity planning

Avanti Fellows

Mentor

Aug 2011 - Jun 2012 · 10 mos

Education

Mila - Quebec Artificial Intelligence Institute

Postdoctoral Researcher — Artificial Intelligence

Jun 2023 - Apr 2024

Indian Institute of Technology, Delhi

Integrated Masters (5 years) — Mathematics and Computing

Jan 2010 - Jan 2015

Heriot-Watt University

Doctor of Philosophy - PhD — Computer Science

Nov 2017 - May 2021

Grenoble INP - UGA

Master’s Degree — Data Science

Jan 2016 - Jan 2017

Wood Row School

12th Standard — PCM

Jan 2010 - Jan 2010

Bishop Conrad School

10th standard — PCM

Jan 2008 - Jan 2008
