Prankur Rusia — CTO

Working at the intersection of reinforcement learning and large language models, current focus is on post-training: the process of shaping model behavior after pretraining through RL, reward modeling, and evaluation. Currently at Microsoft SuperIntelligence, I design and run RL-based post-training pipelines for GPT models with responsibilities spanning dataset curation, RL reward pipelines, evaluation design, and training at scale. My research interests include reasoning control in LLMs, knowledge distillation via RL, and agentic systems with structured rollout design. Some problems I've worked on: • Controlling chain-of-thought behavior in reasoning models using pure RL, converting between reasoning and chat modes without significant capability loss • Sparse RL-based KD for efficient model compression • Structured agentic harnesses for long horizon LLMs • Reward design for real-world tasks where ground truth is noisy or implicit Before this, I spent time at Qualcomm on edge-deployed vision models and at ISRO as a scientist, where I published work on deep learning for atmospheric correction of satellite imagery. I'm a Kaggle Competitions Expert (top 0.6%, rank ~1100 of 180k+), with multiple medals across NLP, vision, and time-series competitions. I'm broadly interested in the science of making models more capable, controllable, and efficient through post-training and in the open questions that sit at the boundary of RL theory and large-scale empirical practice.

Stackforce AI infers this person is a leading expert in AI and machine learning, specializing in reinforcement learning and large language models.

Location: Bengaluru, Karnataka, India

Experience: 11 yrs 6 mos

Skills

Large Language Models (llm)
Reinforcement Learning
Model Training
Natural Language Processing (nlp)
Statistical Modeling

Career Highlights

Expert in reinforcement learning and large language models.
Kaggle Competitions Expert, top 0.6% rank.
Experience in AI model training and evaluation at scale.

Work Experience

Microsoft AI

Principal Member of Technical Staff (2 mos)

Senior Member of Technical Staff (3 mos)

Microsoft

Senior Applied Scientist (1 yr 7 mos)

Qualcomm

Senior Lead Engineer (2 yrs 3 mos)

Fractal

Senior Data Scientist (1 yr 4 mos)

ISRO - Indian Space Research Organization

Scientist "C" (3 yrs 6 mos)

Bharat Electronics Limited

Deputy Engineer (1 yr 5 mos)

Samsung Electronics

Software Engineer (1 yr 3 mos)

Headroom Learning Strategies

Summer Intern (2 mos)

VenusSoft

Chairman (3 yrs 9 mos)

Education

Bachelor of Technology (B.Tech.) at National Institute of Technology Raipur

Prankur Rusia

CTO

Bengaluru, Karnataka, India11 yrs 6 mos experience

AI EnabledAI ML Practitioner

Key Highlights

Expert in reinforcement learning and large language models.
Kaggle Competitions Expert, top 0.6% rank.
Experience in AI model training and evaluation at scale.

Stackforce AI infers this person is a leading expert in AI and machine learning, specializing in reinforcement learning and large language models.

Contact

Skills

Core Skills

Large Language Models (llm)Reinforcement LearningModel TrainingNatural Language Processing (nlp)Statistical Modeling

Other Skills

GitHub CopilotDataset CurationEvaluation DesignTraining at ScaleRegression ModelsGenerative AIScikit-LearnQuestion AnsweringBERT (Language Model)RedisKubernetesFastAPIQuantum MechanicsLinear AlgebraImage Processing

About

Experience

11 yrs 6 mos

Total Experience

1 yr 10 mos

Average Tenure

2 mos

Current Experience

Microsoft ai

2 roles

Principal Member of Technical Staff

Promoted

Mar 2026 – Present · 2 mos · Bengaluru

Microsoft Super-Intelligence x Coding models for Github Copilot products.

Large Language Models (LLM)Reinforcement LearningGitHub Copilot

Senior Member of Technical Staff

Dec 2025 – Mar 2026 · 3 mos · Bengaluru

Microsoft Super-Intelligence x Coding models for Github Copilot products.

Reinforcement LearningLarge Language Models (LLM)GitHub Copilot

Microsoft

Senior Applied Scientist

May 2024 – Dec 2025 · 1 yr 7 mos · Bengaluru

Design and execute RL-based post-training pipelines for GPT models, spanning reward model training, dataset curation, evaluation design, and distributed training at scale.

Large Language Models (LLM)Reinforcement LearningGitHub Copilot

Qualcomm

Senior Lead Engineer

Mar 2022 – Jun 2024 · 2 yrs 3 mos

Model TrainingLarge Language Models (LLM)Regression ModelsGenerative AIScikit-Learn

Fractal

Senior Data Scientist

Nov 2020 – Mar 2022 · 1 yr 4 mos

Question AnsweringModel TrainingBERT (Language Model)RedisNatural Language Processing (NLP)Scikit-Learn+2

Isro - indian space research organization

Scientist "C"

May 2017 – Nov 2020 · 3 yrs 6 mos

ICRB AIR-3

Model TrainingStatistical ModelingRedisScikit-LearnKubernetes

Bharat electronics limited

Deputy Engineer

Oct 2015 – Mar 2017 · 1 yr 5 mos · Bengaluru · On-site

AIR 6
Worked with Ministry of Defence in Strategic Projects of National Importance.
Details redacted due to national security clearance.

Samsung electronics

Software Engineer

Jul 2014 – Oct 2015 · 1 yr 3 mos

Doing awesome stuff & research!

Scikit-Learn

Headroom learning strategies

Summer Intern

May 2013 – Jul 2013 · 2 mos · Pune, Mumbai

Worked on Line Filtering module using Image Processing techniques.

Venussoft

Chairman

Oct 2008 – Jul 2012 · 3 yrs 9 mos

#VenusSoft
Its often said that computers are here to make our life easy.
We humbly say that we are here to make computers easy.
Our focus is to develop quality applications that are targeted to make user’s life easier.
We try to make this happen by developing applications that
are automated (read, run-it-and-forget-it),
are smart enough,
are flexible enough to be integrated with other applications or the OS to extend the overall functionality & experience,
are simple to use,
are portable
are tiny!