Soumya Chatterjee

AI Researcher

Seattle, Washington, United States4 yrs 4 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Expert in Natural Language Processing and Machine Learning.
  • Strong academic background from Stanford and IIT Bombay.
  • Proven track record in AI research and industry projects.
Stackforce AI infers this person is a Machine Learning Engineer with expertise in AI and Natural Language Processing.

Contact

Skills

Core Skills

Natural Language Processing (nlp)Machine TranslationMl SystemsInformation RetrievalMachine LearningComputer Vision

Other Skills

AlgorithmsApache BeamArtificial Intelligence (AI)BashBazelC (Programming Language)CUDAConvolutional Neural Networks (CNN)Deep LearningGenerative AIGitGoogle Cloud Platform (GCP)JavaJavaScriptKeras

About

I am a MLE at Apple working on Siri and Spotlight Search. I did my master's in Computer Science from Stanford University and my undergraduate studies at IIT Bombay. Previously, I was an MLE Intern at Apple and an AI Resident at Google Research. I have strong industry and research experience in various areas of Natural Language Processing including Question Answering, Information Retrieval and Machine Translation. I am also interested in ML systems.

Experience

Apple

2 roles

Machine Learning Engineer

Jul 2024Present · 1 yr 8 mos · Seattle, Washington, United States

  • Agents for Search

Machine Learning Engineering Intern

Jun 2023Sep 2023 · 3 mos · Cupertino, California, United States · On-site

  • Worked on enabling existing translation models to correctly translate unseen terms without human intervention
  • Designed prompts to generate sentences with the target term and their translations using large language models (LLMs)
  • Used parameter efficient finetuning methods like LoRA to obtain 95+% term translation accuracy without drop in chrF
Large Language Models (LLM)Machine TranslationMLOps

Stanford university

2 roles

Teaching Assistant

Sep 2023Jun 2024 · 9 mos · Stanford, California, United States

  • Teaching Assistant for CS 224N - Natural Language Processing with Deep Learning (Spring '24)
  • Teaching Assistant for CS 224N - Natural Language Processing with Deep Learning (Winter '24)
  • Teaching Assistant for CS 236 - Deep Generative Models (Fall '23)
Natural Language Processing (NLP)Generative AI

Graduate Researcher

Sep 2022Jun 2023 · 9 mos · Stanford, California, United States · On-site

  • Worked on the following projects:
  • 1. Extending FlexFlow, a framework for hardware-aware ML model training to support graph neural network based models for protein conformation detection
  • 2. Worked on a novel information retrieval setting where data comes from different data distributions, some unseen during training. Designed allocating strategies leading to 8 points higher recall
ML SystemsInformation RetrievalCUDA

Google

AI Resident

Jul 2021Sep 2022 · 1 yr 2 mos · Bengaluru, Karnataka, India

  • Worked on the following projects:
  • 1. Efficient knowledge update in language models by disentangling factual knowledge from language semantics
  • 2. Methods for efficient re-use of pretrained policies for unseen tasks in hierarchical reinforcement learning
  • 3. Modeling adjustments in human behavioral policies over time
Machine LearningTensorFlowPythonNatural Language Processing (NLP)

Indian institute of technology, bombay

Teaching Assistant

Aug 2020May 2021 · 9 mos · Mumbai, Maharashtra, India

  • Teaching assistant for Artificial Intelligence and Machine Learning (CS 337) and Linear Algebra (MA 106) courses

Google

Software Engineering Intern

May 2020Jul 2020 · 2 mos · Bengaluru, Karnataka, India

  • Built sample‑efficient models to replicate human behaviour. Used meta-learning to capture population‑level traits and individual variations.
Machine LearningPyTorch

Awl, inc.

Machine Learning Engineering Intern

Dec 2019Jan 2020 · 1 mo · Sapporo, Hokkaido, Japan

  • Developed a system for detecting people in 360° videos using Faster R‑CNN. Improved detection mAP by 20% at a detection frequency of 20 FPS on a single GPU.
Machine LearningPyTorchComputer VisionObject Detection

National chung cheng university

Research Intern

May 2019Jul 2019 · 2 mos · Chiayi County/City, Taiwan

  • Designed a U‑Net model to convert infrared face images to visible spectrum leading to 10% higher night‑time face recognition accuracy.
Machine LearningComputer VisionConvolutional Neural Networks (CNN)

Greyatom school of data science

Data Science Winter Fellow

Nov 2018Dec 2018 · 1 mo · Mumbai, Maharashtra, India

  • Created lectures and mini‑projects on named entity recognition and machine translation for a self‑paced NLP course.
Machine LearningNatural Language Processing (NLP)Machine Translation

Education

Stanford University

Master of Science - MS — Computer Science

Sep 2022Jun 2024

Indian Institute of Technology, Bombay

Bachelor of Technology - BTech — Computer Science

Jan 2017Jan 2021

Pace Junior Science College

Jan 2015Jan 2017

Lilavatibai Podar Sr Secondary School

Jan 2005Jan 2015

Stackforce found 100+ more professionals with Natural Language Processing (nlp) & Machine Translation

Explore similar profiles based on matching skills and experience