Karthik Abinav Sankararaman — AI Researcher
I do research on frontier model training: mid/post-training, RLHF, reward modeling, tool use, and agentic behavior. My focus is on making large language models more capable, reliable, and aligned with how people actually use them.
At Meta Superintelligence Labs, I lead research on Llama/MetaAI, setting technical direction across data & RL, factuality, model personality & EQ, tool use, and agentic systems. I've developed novel RL algorithms, reward modeling pipelines, and data flywheel systems for continuous model improvement, and worked across teams to translate this research into every Llama release since 2023.
Before moving to frontier models, I developed RL and bandit algorithms deployed across Meta's major product surfaces (ads, recommendations, and content integrity), leading to significant cumulative business impact. This grounded my research in what it means to build systems that work reliably at scale. Alongside this product work, I have published several papers covering its algorithmic aspects.
I hold a PhD from the University of Maryland in sequential decision making and bandit theory. To learn more about me and my research, visit my personal webpage: karthikabinavs.xyz
Stackforce AI infers this person is a leading AI researcher specializing in reinforcement learning and large language models.
Location: San Francisco, California, United States
Experience: 9 yrs 6 mos
Skills
- Reinforcement Learning
- Large Language Models (LLMs)
- Bandit Algorithms
Career Highlights
- Expert in reinforcement learning and large language models.
- Led impactful AI research at Meta Superintelligence Labs.
- Published multiple papers on algorithmic foundations.
Work Experience
Meta
AI Research Scientist, Frontier Model Research (3 yrs 3 mos)
AI Research Scientist, AI for Products (3 yrs 3 mos)
Microsoft Research India
Research (3 mos)
Visiting Researcher (2 mos)
Indian Institute of Science (IISc)
Research (2 mos)
IBM Almaden Research Center
Research (3 mos)
Adobe
Algorithms Research (2 mos)
University of Michigan
Research (1 mo)
Teritree Technologies
Founding Engineer (2 mos)
Early stage startups
Founding Engineer (2 yrs 6 mos)
Education
Doctor of Philosophy (Ph.D.) at University of Maryland
Bachelor of Technology Honours (BTech Hons.) at Indian Institute of Technology, Madras