Yash J.

AI Researcher

San Francisco, California, United States6 yrs 6 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Published papers at NAACL, CVPR, and NeurIPS
  • Excellence in Research award from IIT Bombay
  • Expertise in building foundation models and AI systems
Stackforce AI infers this person is a Research Scientist specializing in AI and machine learning technologies.

Contact

Skills

Core Skills

Foundation ModelsArtificial Intelligence (ai)Large Language Models (llm)Natural Language Processing (nlp)Deep LearningComputer VisionSpeech Recognition

Other Skills

AlgorithmsAmazon Web Services (AWS)C++Computer ScienceData MiningData ScienceDiffusion modelGenerative AIMachine LearningMixture-of-ExpertsMulti-modalNFCNeuro-Linguistic Programming (NLP)Object DetectionObjective-C

About

I am a research scientist at Essential AI - building open-source foundation models. I graduated from Computer Science at Georgia Tech, and completed my Bachelor's with Honors in Computer Science from IIT Bombay, where I won the Excellence in Research award. I research in training Foundation models and have recently published an oral NAACL paper on Local Prompt Optimization. Previously, I have first-authored CVPR paper on diffusion models (Peekaboo) and a NeurIPS paper on Mixture-of-Experts (look up DAMEX). Webpage: https://yash-jain.com/ Reach out to me at yash.jain3599@gmail.com

Experience

Essential ai

Member of Technical Staff

Jun 2025Present · 9 mos · San Francisco, California, United States

  • Building open-source foundation models.
Scholarly ResearchFoundation modelsArtificial Intelligence (AI)Large Language Models (LLM)

Microsoft

ML Scientist II

Jun 2023Jun 2025 · 2 yrs · Redmond, Washington, United States · On-site

  • Building an automatic prompt authoring system.
  • Working on building next set of Office Copilot features.
Large Language Models (LLM)Computer VisionNatural Language Processing (NLP)Artificial Intelligence (AI)Multi-modalScholarly Research

Amazon usa alexa

Applied Scientist

Aug 2022Dec 2022 · 4 mos · Sunnyvale, California, United States · On-site

  • Led the development and implementation of a novel ML algorithm that improves speech recognition accuracy by 38.45% compared to existing state-of-the-art, using videos as training data.
Speech RecognitionComputer VisionArtificial Intelligence (AI)Multi-modalAmazon Web Services (AWS)

Microsoft

Applied Scientist

May 2022Jul 2022 · 2 mos · Redmond, Washington, United States · On-site

  • Developed a novel pipeline of image difference captioning task for PowerPoint slide data by generating a synthetic dataset in a self-supervised manner, benefitting 4.4. million users.
  • US Patent applied.
Computer ScienceComputer VisionNatural Language Processing (NLP)Artificial Intelligence (AI)Multi-modal

Georgia institute of technology

2 roles

Graduate Teaching Assistant

Jan 2022Mar 2022 · 2 mos · Atlanta, Georgia, United States

  • Graduate Algorithms

Student

Aug 2021May 2023 · 1 yr 9 mos · Atlanta, Georgia, United States

  • Master's Thesis: Analysis of mixture of experts on individual, sparsely-annotated and multi-dataset object detection
Deep LearningComputer VisionArtificial Intelligence (AI)Sparse dataObject DetectionMixture-of-Experts

Nokia bell labs

Research Internship

May 2021Aug 2021 · 3 mos · United Kingdom

  • Formulated a novel framework, Group Supervised Learning (GSL), which utilizes synchronous multi-device unsupervised data, extending the principles of contrastive learning to a group setting.
  • Outperformed supervised and semi-supervised baselines by 0.15 in F-1 score in RealWorld dataset.

Flipkart

Data Science Intern

May 2020Jul 2020 · 2 mos · Bengaluru, Karnataka, India

  • Built a QA system to automatically answer user's product-related queries in natural language

Indian institute of technology, bombay

Researcher at InfoLab

Dec 2019May 2021 · 1 yr 5 mos · Mumbai, Maharashtra, India

  • 1. Developed a novel idea for replacing node features in online social networks by integrating transductive and inductive models (GNNs) for the link prediction tasks. Work published in CIKM, 21
  • 2. Worked on the field of QA systems and Information retrieval with Prof. Soumen Chakrabarti.

Carnegie mellon university

Research Intern

May 2019Jul 2019 · 2 mos · Pittsburgh, Pennsylvania

  • 1. Built a speech recognition system using RFID stretchable Tattoos placed on the user's face. Tattoos would send their stretch data to a RFID reader which after applying NLP models would detect the word all in real-time.
  • 2. Increasing the range of Near-Field Communication (NFC), we developed a sound theoretical approach along with a working demo that the traditional NFC is not safe anymore due to communication range limitation.

Indian institute of technology, bombay

Institute Technical Summer Project

Jun 2018Jul 2018 · 1 mo · Mumbai Metropolitan Region

  • 1. Remodelled a wearable gesture interface by programming colour tracking algorithms that perform functions in correspondence with hand gestures by recognizing the colour markers on fingers
  • 2. Prototyped a remote with air mouse capability which recognizes its relative change in orientation and position in 2D plane using a gyroscope and IP algorithms on a PiCamera

Tata institute of fundamental research

Research Intern

May 2018Dec 2018 · 7 mos · Mumbai Metropolitan Region

  • 1. Devised an alternative approach for educational assessment of multiple choice questions using Discriminant Index. I also devised the algorithm for carrying out the experiments to further prove the working of the unconventional approach.
  • 2. Created robust scripts for parsing nuclear values from raw datasets to standardize nuclear values like spin, energy level etc. for hundreds of known nuclei.

Education

Indian Institute of Technology, Bombay

Bachelor's degree — Computer Science and Engineering

Jan 2017Jan 2021

Georgia Institute of Technology

Master's degree — Computer Science

Aug 2021May 2023

Stackforce found 100+ more professionals with Foundation Models & Artificial Intelligence (ai)

Explore similar profiles based on matching skills and experience