Sanyam Agarwal

Senior Software Engineer

San Francisco, California, United States9 yrs 11 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Expert in deep learning and reinforcement learning models.
  • Proven track record in machine learning at Google.
  • Strong background in computer vision and NLP.
Stackforce AI infers this person is a Machine Learning Engineer specializing in AI-driven solutions.

Contact

Skills

Core Skills

Reinforcement LearningLarge Language Models (llm)Generative AiMachine LearningKnowledge Graph Embeddings

Other Skills

GeminiSupervised LearningPrompt EngineeringGenerative Pre-TrainingGoogle GeminiBERT (Language Model)C++CPythonSoftware DevelopmentProgrammingComputer ScienceArtificial IntelligenceWeb DevelopmentAndroid Development

About

Experienced in developing and implementing deep learning and reinforcement learning models in computer vision and natural language processing. Looking for software engineering roles in machine learning. Check my website for resume and details on projects: http://sanyam5.github.io/

Experience

9 yrs 11 mos
Total Experience
1 yr 11 mos
Average Tenure
6 yrs 7 mos
Current Experience

Google

4 roles

Senior Software Engineer

Jun 2025Present · 1 yr

  • Cloud AI > Autorater: Improving the Auto-Rating ability of Gemini's own responses through prompt-engineering, SFT, and RLHF.
GeminiSupervised LearningReinforcement LearningPrompt EngineeringLarge Language Models (LLM)

Senior Software Engineer

Jan 2024Jun 2025 · 1 yr 5 mos

  • AdSpam > ML Platforms: Trained Auto-regressive decoder-only Tabular Foundational Models. Using column embeddings for positional encoding. Large scale pretraining over tabular data enabled easier finetuning and rapid deployment of spam-filters.
Generative Pre-TrainingGenerative AIGoogle GeminiMachine Learning

Senior Software Engineer

Promoted

Nov 2022Jan 2024 · 1 yr 2 mos

  • Search Quality > Knowledge Engine > Labs: Did SFT and RLHF on LLMs like ULM and PaLM2 for recipe recommendation. Google AI overview (Magi) for TV series & Movies vertical: Passage retrieval and structured outputs.
Large Language Models (LLM)Supervised LearningReinforcement LearningGoogle Gemini

Software Engineer

Nov 2019Nov 2022 · 3 yrs

  • Search Quality > Knowledge Engine: Trained dual encoder models, embedding based retrieval, MUM encoder decoder models.
Machine LearningKnowledge Graph EmbeddingsBERT (Language Model)

Georgia institute of technology

Visiting Research Scholar

Sep 2018Jul 2019 · 10 mos · Atlanta, Georgia, USA

  • Advised by Prof. Dhruv Batra and Prof. Devi Parikh. Performing research in the areas of machine learning, reinforcement learning, computer vision, and natural language processing. Developed models for generating navigational instructions for navigational trajectories in Matterport3D (a photo-realistic 3D indoor environment). Developed a hard-attention model, trained via gumbel-softmax straight through estimator. Published at CVPR 2019 (see Publications).
  • Currently fine-tuning instruction generation using reinforcement learning methods such as self-critical sequence training and actor-critic.

Indian institute of science (iisc)

Research Intern

Nov 2017Jul 2018 · 8 mos · Greater Bengaluru Area

  • Advised by Prof. Ambedkar Dukkipati. Developed a decoder-less variant of Skip-Thought Vectors (maps natural language sentences to vectors ) which trained 10 times faster. Worked on generative adversarial networks (GANs) for semi-supervised learning. Developed models that explicitly learned to generate functions which in turn access a knowledge base (KB) for answering questions.

Soroco pvt ltd

Platform Engineer

Jun 2016Nov 2017 · 1 yr 5 mos · Greater Bengaluru Area

  • Led a team which used recurrent neural networks (RNNs) to automatically find easy-to-automate business processes (for e.g. payroll management, settling vendor disputes for top e-commerce websites).
  • Designed and implemented a patent-pending work queue-based distributed systems framework operational at Fortune 500 companies. The framework allowed developing efficient and fault-tolerant automation systems with minimal internal state.

Meta

Software Engineering Intern

May 2015Jul 2015 · 2 mos · Menlo Park, California · On-site

  • Worked with the configuration management team in the Core Engineering Group. Found and fixed bottlenecks in existing architecture. Parallelized configuration checking service by splitting into master and slaves, speeding it up by 1000 times.

Intugine technologies

Software Engineer (part time)

Aug 2013Jan 2014 · 5 mos · Kharagpur Area, India

  • Developed and implemented fast customized algorithms for gesture recognition and integrated them with Windows OS for a smooth experience of playing video games, delivering powerpoint presentations and much more just by hand gestures.

Education

Indian Institute of Technology, Kharagpur

Bachelor of Technology (Honours) — Computer Science

Jan 2012Jan 2016

Stackforce found 100+ more professionals with Reinforcement Learning & Large Language Models (llm)

Explore similar profiles based on matching skills and experience