Gaurav Mahajan

AI Researcher

New Haven, Connecticut, United States
12 yrs 5 mos experience
Highly Stable

Key Highlights

  • Expert in reinforcement learning and language model training.
  • Published research in top-tier conferences like NeurIPS and ICML.
  • Transitioning from academia to industry with strong technical skills.
Stackforce AI infers this person is a Machine Learning Researcher with a focus on Reinforcement Learning and Language Models.

Skills

Core Skills

Machine Learning, Data Science, Reinforcement Learning, Research, Software Development

Other Skills

Python, Julia (Programming Language), Mathematics, Statistics, Deep Learning, Post-Training LLMs, LaTeX, Learning Theory, TypeScript, Algorithms, Discrepancy Resolution, Leadership Mentoring, Margin Analysis, CSE, Voice & Data Convergence

About

I’m a Postdoctoral Researcher at Yale’s Institute for Foundations of Data Science, where I work with Daniel Spielman. I earned my Ph.D. in Computer Science from UC San Diego (2023), advised by Sanjoy Dasgupta and Shachar Lovett. I’m transitioning from academia to industry and looking for opportunities in post-training for language models.

My research interests include reinforcement learning, post-training for language models, and learning theory, especially understanding the convergence properties of policy gradient methods (e.g., PPO, NPG, TRPO) and designing computationally efficient algorithms for learning language models. My work has appeared in venues such as COLT, ICML, NeurIPS, FOCS, ALT, and AISTATS, and spans topics including computationally efficient algorithms for learning language models (COLT 2023); the theory of policy gradient methods (COLT 2020); computational–statistical gaps in reinforcement learning (COLT 2022); and generalization frameworks for RL (ICML 2021, NeurIPS 2020, COLT 2020, FOCS 2021).

Previously, I was a research intern at Microsoft Research, held visiting positions at the Institute for Advanced Study and the Simons Institute for the Theory of Computing, and was part of the Microsoft team that developed Microsoft PowerApps.

Experience

12 yrs 5 mos
Total Experience
3 yrs 1 mo
Average Tenure
3 yrs 1 mo
Current Experience

Yale University

Postdoctoral Associate

May 2023 - Present · 3 yrs 1 mo

Machine Learning, Python, Julia (Programming Language), Data Science, Mathematics, Statistics, +1

University of California San Diego

Graduate Research Assistant

Sep 2017 - Mar 2023 · 5 yrs 6 mos · Greater San Diego Area

Post-Training LLMs, Research, Reinforcement Learning, LaTeX, Learning Theory

Microsoft

Software Developer

Sep 2014 - Sep 2017 · 3 yrs · Redmond, US

TypeScript, Algorithms, Software Development

Epic

Software Developer

Oct 2013 - Aug 2014 · 10 mos · Madison, Wisconsin Area

École normale supérieure de Cachan

Summer Research Intern

May 2012 - Jul 2012 · 2 mos · France

Sun Microsystems

Summer Intern

Jun 2010 - Aug 2010 · 2 mos · New Delhi Area, India

Education

UC San Diego

Doctor of Philosophy — Computer Science

Jan 2017 - Present

Indian Institute of Technology, Delhi

Integrated Master of Technology — Mathematics and Computing

Jan 2008 - Jan 2013
