Raj Thakur — Senior Software Engineer
As a Machine Learning Engineer, I specialize in fine-tuning LLMs and building high-throughput inference serving platforms for production workloads, with a deep focus on accelerating model performance on specialized ML accelerator hardware. My expertise spans system design, distributed systems, and robust architecture, leveraging C++, Golang, and Java for scalable backends. I am particularly adept at driving performance gains in deep learning workflows, having worked extensively with vLLM, JAX, and PyTorch/XLA to harness the power of AWS ML accelerators (Trainium/Inferentia). My contributions include disaggregated inference using vLLM, and optimizing execution through PJRT integration and direct hardware interaction while developing the AWS Neuron SDK, with the goal of maximizing throughput and efficiency for complex models. I apply high-level design (HLD), low-level design (LLD), and design patterns to deliver high-performance, cost-effective ML solutions.
Stackforce AI infers this person is a Backend-heavy Fullstack Engineer with expertise in Fintech and Machine Learning.
Location: Santa Clara, California, United States
Experience: 11 yrs 4 mos
Skills
- System Design
- Software Development
Career Highlights
- Expert in fine-tuning LLMs for optimal performance.
- Led development of scalable settlement platforms in Fintech.
- Proficient in leveraging AWS ML accelerators for deep learning.
Work Experience
Amazon Web Services (AWS)
Senior Software Development Engineer (1 yr 8 mos)
AWS AI SDE II (1 yr 7 mos)
Software Development Engineer II (2 yrs 6 mos)
Grab
Senior Software Engineer (8 mos)
Software Engineer backend III (1 yr 2 mos)
Yatra Online Private Ltd
Software Engineer (8 mos)
Tata Consultancy Services
Software Engineer (2 yrs)
Freelancer Consultant - Internet and Project Management
Software Development Specialist (1 yr 1 mo)
Education
Bachelor's degree at Jalpaiguri Government Engineering College
Full Stack Web Development at freeCodeCamp
ISC at St. Joans School