Chintu Kumar

Senior Software Engineer

Switzerland · 11 yrs 1 mo experience
Highly Stable

Key Highlights

  • Expert in Reinforcement Learning and Deep Learning.
  • Led successful launches of Gemini models at Google.
  • Developed innovative Natural Language Generation services.
Stackforce AI infers this person is a skilled AI engineer with expertise in reinforcement learning and natural language processing.

Skills

Core Skills

Reinforcement Learning, Deep Learning, Reward Model Training, Natural Language Generation

Other Skills

C, C++, Customer Service, HTML, Java, Management, Microsoft Excel, Microsoft Office, Microsoft Word, Multi-agent Systems, Text editing/repair models

Experience

Google DeepMind

Senior Software Engineer

Oct 2024 - Present · 1 yr 5 mos · On-site

  • Improve tool usage of Gemini models using multi-step RL with verifiable rewards, focusing primarily on Google Search and Workspace tools.
Multi-agent Systems, Deep Learning, Reinforcement Learning

Google

2 roles

Senior Software Engineer

Promoted

Mar 2024 - Oct 2024 · 7 mos

  • RLHF
      • Led one of the Gemini Pro launches, which significantly improved reasoning and code performance and was the first launch to successfully incorporate user feedback into the RL recipe.
      • RL captain for several releases of Gemini free-tier models.
      • Improved the stability of the RL recipe for Gemini basic models.
  • Reward model training
      • Release owner of the reward model for the Bard Advanced launch.
      • Reward model (RM) owner for the Gemma 2 27B launch.
      • Worked with external vendors to collect human-labeled preference data for RM training for the Gemini 1.0 and 1.5 families of models.
Reinforcement Learning, Reward model training

Software Engineer

Mar 2019 - Mar 2024 · 5 yrs

  • Natural Language Generation service that produces natural and fluent output across languages.
  • Infrastructure for integrating with text editing/repair models, significantly reducing the time and effort spent authoring templates.
  • WebAnswers
  • Offered informative punts to the user based on extracted intent, clarifying why a query could not be answered. This improved user satisfaction and led to a reduction in repeated queries.
  • Improved user interaction by offering follow-ups for information-seeking queries, providing a more detailed answer if the user accepts the suggestion.
Natural Language Generation, Text editing/repair models

Ola (ANI Technologies Pvt. Ltd.)

Senior Software Engineer

Mar 2017 - Jan 2019 · 1 yr 10 mos · Bengaluru Area, India · On-site

  • Built a large-scale, real-time streaming platform for dynamic pricing.

Flipkart

2 roles

Software Engineer

Jun 2015 - Mar 2017 · 1 yr 9 mos · Bengaluru Area, India

Software Engineer Intern

Jan 2015 - Jun 2015 · 5 mos · Bengaluru Area, India

Quantium

Internship

Dec 2013 - Jan 2014 · 1 mo · Hyderabad Area, India

Education

Indian Institute Of Information Technology Allahabad

Bachelor of Technology - BTech — Information Technology

Jan 2011 - Jan 2015
