Prankur Rusia

CTO

Bengaluru, Karnataka, India11 yrs 6 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in reinforcement learning and large language models.
  • Kaggle Competitions Expert, top 0.6% rank.
  • Experience in AI model training and evaluation at scale.
Stackforce AI infers this person is a leading expert in AI and machine learning, specializing in reinforcement learning and large language models.

Contact

Skills

Core Skills

Large Language Models (llm)Reinforcement LearningModel TrainingNatural Language Processing (nlp)Statistical Modeling

Other Skills

GitHub CopilotDataset CurationEvaluation DesignTraining at ScaleRegression ModelsGenerative AIScikit-LearnQuestion AnsweringBERT (Language Model)RedisKubernetesFastAPIQuantum MechanicsLinear AlgebraImage Processing

About

Working at the intersection of reinforcement learning and large language models, current focus is on post-training: the process of shaping model behavior after pretraining through RL, reward modeling, and evaluation. Currently at Microsoft SuperIntelligence, I design and run RL-based post-training pipelines for GPT models with responsibilities spanning dataset curation, RL reward pipelines, evaluation design, and training at scale. My research interests include reasoning control in LLMs, knowledge distillation via RL, and agentic systems with structured rollout design. Some problems I've worked on: • Controlling chain-of-thought behavior in reasoning models using pure RL, converting between reasoning and chat modes without significant capability loss • Sparse RL-based KD for efficient model compression • Structured agentic harnesses for long horizon LLMs • Reward design for real-world tasks where ground truth is noisy or implicit Before this, I spent time at Qualcomm on edge-deployed vision models and at ISRO as a scientist, where I published work on deep learning for atmospheric correction of satellite imagery. I'm a Kaggle Competitions Expert (top 0.6%, rank ~1100 of 180k+), with multiple medals across NLP, vision, and time-series competitions. I'm broadly interested in the science of making models more capable, controllable, and efficient through post-training and in the open questions that sit at the boundary of RL theory and large-scale empirical practice.

Experience

11 yrs 6 mos
Total Experience
1 yr 10 mos
Average Tenure
2 mos
Current Experience

Microsoft ai

2 roles

Principal Member of Technical Staff

Promoted

Mar 2026Present · 2 mos · Bengaluru

  • Microsoft Super-Intelligence x Coding models for Github Copilot products.
Large Language Models (LLM)Reinforcement LearningGitHub Copilot

Senior Member of Technical Staff

Dec 2025Mar 2026 · 3 mos · Bengaluru

  • Microsoft Super-Intelligence x Coding models for Github Copilot products.
Reinforcement LearningLarge Language Models (LLM)GitHub Copilot

Microsoft

Senior Applied Scientist

May 2024Dec 2025 · 1 yr 7 mos · Bengaluru

  • Design and execute RL-based post-training pipelines for GPT models, spanning reward model training, dataset curation, evaluation design, and distributed training at scale.
Large Language Models (LLM)Reinforcement LearningGitHub Copilot

Qualcomm

Senior Lead Engineer

Mar 2022Jun 2024 · 2 yrs 3 mos

Model TrainingLarge Language Models (LLM)Regression ModelsGenerative AIScikit-Learn

Fractal

Senior Data Scientist

Nov 2020Mar 2022 · 1 yr 4 mos

Question AnsweringModel TrainingBERT (Language Model)RedisNatural Language Processing (NLP)Scikit-Learn+2

Isro - indian space research organization

Scientist "C"

May 2017Nov 2020 · 3 yrs 6 mos

  • ICRB AIR-3
Model TrainingStatistical ModelingRedisScikit-LearnKubernetes

Bharat electronics limited

Deputy Engineer

Oct 2015Mar 2017 · 1 yr 5 mos · Bengaluru · On-site

  • AIR 6
  • Worked with Ministry of Defence in Strategic Projects of National Importance.
  • Details redacted due to national security clearance.

Samsung electronics

Software Engineer

Jul 2014Oct 2015 · 1 yr 3 mos

  • Doing awesome stuff & research!
Scikit-Learn

Headroom learning strategies

Summer Intern

May 2013Jul 2013 · 2 mos · Pune, Mumbai

  • Worked on Line Filtering module using Image Processing techniques.

Venussoft

Chairman

Oct 2008Jul 2012 · 3 yrs 9 mos

  • #VenusSoft
  • Its often said that computers are here to make our life easy.
  • We humbly say that we are here to make computers easy.
  • Our focus is to develop quality applications that are targeted to make user’s life easier.
  • We try to make this happen by developing applications that
  • are automated (read, run-it-and-forget-it),
  • are smart enough,
  • are flexible enough to be integrated with other applications or the OS to extend the overall functionality & experience,
  • are simple to use,
  • are portable
  • are tiny!

Education

National Institute of Technology Raipur

Bachelor of Technology (B.Tech.) — Computer Science

Jan 2010Jan 2014

Stackforce found 100+ more professionals with Large Language Models (llm) & Reinforcement Learning

Explore similar profiles based on matching skills and experience