K

Kunal Chawla

AI Researcher

Sunnyvale, California, United States8 yrs 8 mos experience

Key Highlights

  • 8+ years of experience in Applied ML.
  • Led multi-modal LLM evaluations at Meta.
  • Published in top conferences with 10k+ citations.
Stackforce AI infers this person is a Machine Learning and Computer Vision expert with extensive experience in AI-driven applications.

Contact

Skills

Core Skills

Machine LearningNatural Language ProcessingComputer Vision

Other Skills

LLMsSynthetic Data GenerationData Pipeline DevelopmentAd Ranking ModelsQuestion AnsweringData SynthesisTransformer ModelsVideo RetrievalDeep LearningEmbedding TechniquesPatent FilingImage RetrievalCaffeTensorFlowPyTorch

About

With leading-author publications in top conferences (EMNLP, ECCV, 10k+ citations), 8+ years of work experience in Applied ML and experience leading and deploying projects, I have expertise and proficiency in improving the capabilities of and unlocking new use cases for large language models(LLMs) and other Machine Learning models in Natural Language Processing and Computer Vision. I am a member of the post-training team at Apple and was previously in the Llama Post-Training team at Meta. My work includes fine-tuning LLMs for improving web-search, coding and agentic tool calling capabilities of the model through synthetic data generation and fine-tuning. Prior to Meta, I contributed to multilingual question-answering models for Bing Search at Microsoft, text-based video retrieval models at Amazon, and image-based product search at Samsung.

Experience

8 yrs 8 mos
Total Experience
2 yrs 7 mos
Average Tenure
11 mos
Current Experience

Apple

Senior ML Applied Scientist

Jun 2025Present · 11 mos · Cupertino, California, United States · On-site

Meta

Machine Learning Engineer

Aug 2022May 2025 · 2 yrs 9 mos · Menlo Park, California, United States · On-site

  • Led evaluations for multi-modal LLMs, conducting 100+ ablations for LLama 4, 3.1, 3.2 and 3.3
  • Crafted synthetic data to improve tool calling, web search & coding performance for LLama 3 405B
  • Developed a shared data pipeline framework and automated 50+ evaluation tasks across 4 teams for Llama
  • Built contextual Ad2Ad ranking model for Instagram Feed/Story and Facebook Feed, increasing Ads revenue by 0.06%
  • Improved Ads ranking models for organic contextual ads based on interacted posts, decreasing eval NE by 0.25

Microsoft

Applied Scientist II

May 2021Jul 2022 · 1 yr 2 mos · Bellevue, Washington, United States · On-site

  • Improved question answering in Bing Search and Edge browser, increasing F1 score by 75% and relevance in human evaluations by 0.7 percentage points.
  • Synthesized 500M+ samples across 15+ languages and trained 4k-context Transformer-based teacher model for above

Amazon

Applied Scientist

Sep 2020Dec 2020 · 3 mos · Seattle, Washington, United States

  • Retrieve videos from text queries using multi-level embeddings from video to represent global, object and motion features.
  • Introduced a novel triplet loss using continuous labels, increasing Recall@10 on Berkeley Deep Drive-X Dataset.

Ibm

Research Software Engineer

Jun 2020Aug 2020 · 2 mos · San Jose, California, United States

  • Deployed a system to detect people who use mobility aids using on-device cameras in autonomous vehicles
  • Filed a patent to find optimal workflow for Deep Learning tasks in edge devices, dynamically choosing cloud server, model and size depending on workload; and achieved mAP@0.5 of 84% on Mobility Aids Dataset with >20 fps

Samsung electronics

2 roles

Research Software Engineer

Promoted

Mar 2017Aug 2019 · 2 yrs 5 mos · Seoul, South Korea

  • Introduced a method for image retrieval by ensembling features from multiple deep learners using attention masks, achieving state-of-the-art results on SOP, CARS and CUB datasets (published in ECCV 2018)
  • Designed and deployed image-based product search for Bixby Vision using Caffe and Tensorflow, optimised for low runtime and memory, currently running on 100 million+ smartphones (Galaxy S8+)
  • Developed knowledge-based question answering chatbot for Samsung Customer Service using BERT
  • Won first prize in Perfect Product Image Recognition Challenge, presented at ACMMM 2018

Software Engineer

Sep 2015Feb 2017 · 1 yr 5 mos · Seoul, South Korea

  • Designed a conditional rule engine and predictive battery statistics module for Tizen Operating System
  • Created apps for abstractive email summarization and contact and calendar event extraction from Emails
  • Built a Swagger Codegen module for Tizen to create client SDKs from OpenAPI specification

Ibm

Research Software Engineer

May 2015Aug 2015 · 3 mos · New Delhi, Delhi, India

  • Research Paper Recommendation
  • Designed an algorithm to recommend research papers to read based on learning aim of a reader.
  • Built a topic dependency graph of 500+ topics, based on Wikipedia page links, paper references and textbook glossary.
  • Used the topic dependencies along with prior knowledge and reading list of the reader for paper reading suggestions.

Xerox

Software Engineer

May 2013Jul 2013 · 2 mos · Bengaluru, Karnataka, India

  • Patented a method to increase accuracy of digitisation of handwritten document using crowdsourcing, and co-developed an Android app.
  • Devised and used metrics for transcriptions similarity, workers’ performance and language models-based likelihood.

Education

Georgia Institute of Technology

Master of Science - MS — Computer Science

Aug 2019May 2021

Indian Institute of Technology, Delhi

Bachelor of Technology - BTech — Computer Science and Engineering

Jul 2011May 2015

Stackforce found 100+ more professionals with Machine Learning & Natural Language Processing

Explore similar profiles based on matching skills and experience