Kunal Chawla

AI Researcher

Sunnyvale, California, United States8 yrs 8 mos experience

Key Highlights

8+ years of experience in Applied ML.
Led multi-modal LLM evaluations at Meta.
Published in top conferences with 10k+ citations.

Stackforce AI infers this person is a Machine Learning and Computer Vision expert with extensive experience in AI-driven applications.

Contact

Skills

Core Skills

Machine LearningNatural Language ProcessingComputer Vision

Other Skills

LLMsSynthetic Data GenerationData Pipeline DevelopmentAd Ranking ModelsQuestion AnsweringData SynthesisTransformer ModelsVideo RetrievalDeep LearningEmbedding TechniquesPatent FilingImage RetrievalCaffeTensorFlowPyTorch

About

With leading-author publications in top conferences (EMNLP, ECCV, 10k+ citations), 8+ years of work experience in Applied ML and experience leading and deploying projects, I have expertise and proficiency in improving the capabilities of and unlocking new use cases for large language models(LLMs) and other Machine Learning models in Natural Language Processing and Computer Vision. I am a member of the post-training team at Apple and was previously in the Llama Post-Training team at Meta. My work includes fine-tuning LLMs for improving web-search, coding and agentic tool calling capabilities of the model through synthetic data generation and fine-tuning. Prior to Meta, I contributed to multilingual question-answering models for Bing Search at Microsoft, text-based video retrieval models at Amazon, and image-based product search at Samsung.

Experience

8 yrs 8 mos

Total Experience

2 yrs 7 mos

Average Tenure

11 mos

Current Experience

Apple

Senior ML Applied Scientist

Jun 2025 – Present · 11 mos · Cupertino, California, United States · On-site

Microsoft

Applied Scientist II

May 2021 – Jul 2022 · 1 yr 2 mos · Bellevue, Washington, United States · On-site

Improved question answering in Bing Search and Edge browser, increasing F1 score by 75% and relevance in human evaluations by 0.7 percentage points.
Synthesized 500M+ samples across 15+ languages and trained 4k-context Transformer-based teacher model for above

Amazon

Applied Scientist

Sep 2020 – Dec 2020 · 3 mos · Seattle, Washington, United States

Retrieve videos from text queries using multi-level embeddings from video to represent global, object and motion features.
Introduced a novel triplet loss using continuous labels, increasing Recall@10 on Berkeley Deep Drive-X Dataset.

Ibm

Research Software Engineer

Jun 2020 – Aug 2020 · 2 mos · San Jose, California, United States

Deployed a system to detect people who use mobility aids using on-device cameras in autonomous vehicles
Filed a patent to find optimal workflow for Deep Learning tasks in edge devices, dynamically choosing cloud server, model and size depending on workload; and achieved mAP@0.5 of 84% on Mobility Aids Dataset with >20 fps

Samsung electronics

2 roles

Research Software Engineer

Promoted

Mar 2017 – Aug 2019 · 2 yrs 5 mos · Seoul, South Korea

Introduced a method for image retrieval by ensembling features from multiple deep learners using attention masks, achieving state-of-the-art results on SOP, CARS and CUB datasets (published in ECCV 2018)
Designed and deployed image-based product search for Bixby Vision using Caffe and Tensorflow, optimised for low runtime and memory, currently running on 100 million+ smartphones (Galaxy S8+)
Developed knowledge-based question answering chatbot for Samsung Customer Service using BERT
Won first prize in Perfect Product Image Recognition Challenge, presented at ACMMM 2018

Software Engineer

Sep 2015 – Feb 2017 · 1 yr 5 mos · Seoul, South Korea

Designed a conditional rule engine and predictive battery statistics module for Tizen Operating System
Created apps for abstractive email summarization and contact and calendar event extraction from Emails
Built a Swagger Codegen module for Tizen to create client SDKs from OpenAPI specification

Ibm

Research Software Engineer

May 2015 – Aug 2015 · 3 mos · New Delhi, Delhi, India

Research Paper Recommendation
Designed an algorithm to recommend research papers to read based on learning aim of a reader.
Built a topic dependency graph of 500+ topics, based on Wikipedia page links, paper references and textbook glossary.
Used the topic dependencies along with prior knowledge and reading list of the reader for paper reading suggestions.

Xerox

Software Engineer

May 2013 – Jul 2013 · 2 mos · Bengaluru, Karnataka, India

Patented a method to increase accuracy of digitisation of handwritten document using crowdsourcing, and co-developed an Android app.
Devised and used metrics for transcriptions similarity, workers’ performance and language models-based likelihood.