Anurag Basant — Co-Founder
I design and deploy scalable AI/ML pipelines and microservices using open-source tools, Kubernetes, and cloud solutions. My expertise includes fine-tuning Large Language Models (LLMs) using advanced deep reinforcement learning techniques like Proximal Policy Optimization (PPO), Direct Preference Optimization (DPO), and Reward Modeling. I have hands-on experience implementing DRL in pricing models and adaptive decision-making systems. Additionally, I specialize in training and optimizing deep learning models (computer vision & NLP) and managing high-performance AI clusters for real-world applications.
Stackforce AI infers this person is a Data Science and AI Engineering expert with a focus on scalable solutions.
Location: Bengaluru, Karnataka, India
Experience: 9 yrs 3 mos
Skills
- Large Language Models (llm)
- Kubernetes
- Data Modeling
- Technical Architecture
- Deep Learning
- Natural Language Processing (nlp)
Career Highlights
- Expert in fine-tuning Large Language Models.
- Proficient in deploying scalable AI/ML pipelines.
- Experienced in managing high-performance AI clusters.
Work Experience
PW (PhysicsWallah)
Lead Machine Learning Engineer (1 yr 10 mos)
HealthifyMe
Senior AI Engineer (7 mos)
mlinterview.tech
Founder (8 mos)
Gojek
Data Scientist (1 yr 2 mos)
Freshworks
Data Scientist (3 yrs 2 mos)
Treebo Hotels
Business Analyst (1 yr 10 mos)
Education
Integrated MS at Indian Institute of Technology, Roorkee