Chintu Kumar

Senior Software Engineer

Switzerland11 yrs 2 mos experience

Highly Stable

Key Highlights

Stackforce AI infers this person is a skilled AI engineer with expertise in reinforcement learning and natural language processing.

Reinforcement LearningDeep LearningReward Model TrainingNatural Language Generation

CC++Customer ServiceHTMLJavaManagementMicrosoft ExcelMicrosoft OfficeMicrosoft WordMulti-agent SystemsText editing/repair models

11 yrs 2 mos

Total Experience

3 yrs 2 mos

Average Tenure

1 yr 7 mos

Current Experience

Oct 2024 – Present · 1 yr 7 mos · On-site

Improve tool usage of Gemini models using multi-step RL with verifiable rewards, focusing primarily on Google Search and Workspace tools.

Multi-agent SystemsDeep LearningReinforcement Learning

2 roles

Promoted

Mar 2024 – Oct 2024 · 7 mos

RLHF
Lead one of Gemini Pro launches significantly improving reasoning/code and the first launch to successfully incorporate user feedback in the RL recipe.
RL captain for several releases of Gemini free tier models.
Improved stability of RL recipe for Gemini basic models.
Reward model training
Release owner of Reward model for Bard advanced launch.
RM owner for Gemma 2 27B launch.
Worked with external vendors on collecting Human labeled preference data for RM training for Gemini 1.0 and 1.5 family of models.

Reinforcement LearningReward model training

Mar 2019 – Mar 2024 · 5 yrs

Natural Language Generation service to produce natural and fluent output across languages.
Infra for integrating with text editing/repair models significantly reducing the time/effort spent authoring templates.
WebAnswers
Offer informative punts to the user based on extracted intent to better clarify why the query could not be answered. This improved user satisfaction and led to reduction in repeated queries.
Improve user interaction by offering follow-up for information seeking queries providing them with a more detailed answer if the user accepts the suggestion.

Natural Language GenerationText editing/repair models