Shaheen Nabi — Co-Founder
I study how large language models perform multi-step reasoning and how training and post-training methods can improve their reliability, efficiency, and scalability. My work focuses on the post-training stack for LLMs — supervised fine-tuning (SFT), preference optimization, reinforcement learning methods such as RLVR, and inference-time compute strategies that improve reasoning without requiring larger models. I’m also interested in the interpretability of reasoning models: understanding the internal mechanisms that support multi-step reasoning and diagnosing failures such as shortcut reasoning, reward hacking, and unfaithful chain-of-thought. Currently building and open-sourcing implementations of reasoning-focused training pipelines and contributing to LLM infrastructure and post-training frameworks.
Stackforce AI infers this person is a specialist in AI and EdTech with a focus on reinforcement learning and computer vision.
Location: Bengaluru, Karnataka, India
Experience: 0 mo
Skills
- Reinforcement Learning
- Post-training
- Computer Vision
- Entrepreneurship
Career Highlights
- Expert in reinforcement learning and post-training systems.
- Developed open-source AI solutions for crop detection.
- Founded an edtech platform for AI education.
Work Experience
Self-employed
GitHub (Open Source) (5 mos)
Career Break
Career transition (8 mos)
iNeuron.ai
Data Science Intern (2 mos)
Lasso Pacific Pvt Ltd
Founder (11 mos)
Education
Bachelor of Arts - BA at Indira Gandhi National Open University
1 year course at Ineuron.ai
High School Diploma at Jammu and Kashmir Board of School Education (JKBOSE)