Sanyam Agarwal

Senior Software Engineer

San Francisco, California, United States9 yrs 11 mos experience

Most Likely To SwitchHighly Stable

Key Highlights

Expert in deep learning and reinforcement learning models.
Proven track record in machine learning at Google.
Strong background in computer vision and NLP.

Stackforce AI infers this person is a Machine Learning Engineer specializing in AI-driven solutions.

Contact

Skills

Core Skills

Reinforcement LearningLarge Language Models (llm)Generative AiMachine LearningKnowledge Graph Embeddings

Other Skills

GeminiSupervised LearningPrompt EngineeringGenerative Pre-TrainingGoogle GeminiBERT (Language Model)C++CPythonSoftware DevelopmentProgrammingComputer ScienceArtificial IntelligenceWeb DevelopmentAndroid Development

About

Experienced in developing and implementing deep learning and reinforcement learning models in computer vision and natural language processing. Looking for software engineering roles in machine learning. Check my website for resume and details on projects: http://sanyam5.github.io/

Experience

9 yrs 11 mos

Total Experience

1 yr 11 mos

Average Tenure

6 yrs 7 mos

Current Experience

Google

4 roles

Senior Software Engineer

Jun 2025 – Present · 1 yr

Cloud AI > Autorater: Improving the Auto-Rating ability of Gemini's own responses through prompt-engineering, SFT, and RLHF.

GeminiSupervised LearningReinforcement LearningPrompt EngineeringLarge Language Models (LLM)

Senior Software Engineer

Jan 2024 – Jun 2025 · 1 yr 5 mos

AdSpam > ML Platforms: Trained Auto-regressive decoder-only Tabular Foundational Models. Using column embeddings for positional encoding. Large scale pretraining over tabular data enabled easier finetuning and rapid deployment of spam-filters.

Generative Pre-TrainingGenerative AIGoogle GeminiMachine Learning

Senior Software Engineer

Promoted

Nov 2022 – Jan 2024 · 1 yr 2 mos

Search Quality > Knowledge Engine > Labs: Did SFT and RLHF on LLMs like ULM and PaLM2 for recipe recommendation. Google AI overview (Magi) for TV series & Movies vertical: Passage retrieval and structured outputs.

Large Language Models (LLM)Supervised LearningReinforcement LearningGoogle Gemini

Software Engineer

Nov 2019 – Nov 2022 · 3 yrs

Search Quality > Knowledge Engine: Trained dual encoder models, embedding based retrieval, MUM encoder decoder models.

Machine LearningKnowledge Graph EmbeddingsBERT (Language Model)

Georgia institute of technology

Visiting Research Scholar

Sep 2018 – Jul 2019 · 10 mos · Atlanta, Georgia, USA

Advised by Prof. Dhruv Batra and Prof. Devi Parikh. Performing research in the areas of machine learning, reinforcement learning, computer vision, and natural language processing. Developed models for generating navigational instructions for navigational trajectories in Matterport3D (a photo-realistic 3D indoor environment). Developed a hard-attention model, trained via gumbel-softmax straight through estimator. Published at CVPR 2019 (see Publications).
Currently fine-tuning instruction generation using reinforcement learning methods such as self-critical sequence training and actor-critic.

Indian institute of science (iisc)

Research Intern

Nov 2017 – Jul 2018 · 8 mos · Greater Bengaluru Area

Advised by Prof. Ambedkar Dukkipati. Developed a decoder-less variant of Skip-Thought Vectors (maps natural language sentences to vectors ) which trained 10 times faster. Worked on generative adversarial networks (GANs) for semi-supervised learning. Developed models that explicitly learned to generate functions which in turn access a knowledge base (KB) for answering questions.

Soroco pvt ltd

Platform Engineer

Jun 2016 – Nov 2017 · 1 yr 5 mos · Greater Bengaluru Area

Led a team which used recurrent neural networks (RNNs) to automatically find easy-to-automate business processes (for e.g. payroll management, settling vendor disputes for top e-commerce websites).
Designed and implemented a patent-pending work queue-based distributed systems framework operational at Fortune 500 companies. The framework allowed developing efficient and fault-tolerant automation systems with minimal internal state.

Intugine technologies

Software Engineer (part time)

Aug 2013 – Jan 2014 · 5 mos · Kharagpur Area, India

Developed and implemented fast customized algorithms for gesture recognition and integrated them with Windows OS for a smooth experience of playing video games, delivering powerpoint presentations and much more just by hand gestures.