Sai Gokhale

Operations Associate

United States2 yrs 6 mos experience
Most Likely To Switch

Key Highlights

  • Focused on efficient and data-efficient ML.
  • Interned at AMD optimizing LLMs for Ryzen AI.
  • Graduate Teaching Assistant at Georgia Tech.
Stackforce AI infers this person is a Machine Learning Engineer with a focus on model optimization and inference efficiency.

Contact

Skills

Core Skills

Model OptimizationInference Efficiency

Other Skills

chunked prefillKV cache quantizationweight-only LLM quantization

About

Hi! I’m currently an MS CS student at Georgia Tech, focusing on efficient and data-efficient ML. I was interning at AMD, working on model optimization for LLMs to improve inference efficiency on Ryzen AI hardware. Previously, I worked as an MTS at Oracle, contributing to their cloud platform. I’d love to connect and chat if you share interests in ML systems and applied research. I’m also exploring full-time opportunities starting May 2026.

Experience

2 yrs 6 mos
Total Experience
1 yr 3 mos
Average Tenure
1 yr 5 mos
Current Experience

Georgia institute of technology

3 roles

Graduate Teaching Assistant

Jan 2026Present · 5 mos

Graduate Student Researcher

Jan 2025Present · 1 yr 5 mos

  • Working on Knowledge Distillation with Vision-Language Models (VLMs) for efficient labeling at DML Lab, Georgia Tech, under the supervision of Dr. Stephen Mussmann.

Graduate Teaching Assistant

Jan 2025May 2025 · 4 mos

  • Course - CSE 6242: Data & Visual Analytics

Amd

SDE AI intern

May 2025Dec 2025 · 7 mos · San Jose, California, United States · On-site

  • Working on model optimization techniques to improve inference efficiency on Ryzen AI NPUs. Implemented and evaluated chunked prefill, KV cache quantization, and weight-only LLM quantization strategies, enabling efficient long-context inference.
model optimizationinference efficiencychunked prefillKV cache quantizationweight-only LLM quantization

Oracle

Member of Technical Staff

Jul 2023Aug 2024 · 1 yr 1 mo · Bengaluru, Karnataka, India

Scaai - symbiosis centre for applied ai

Research Intern

Apr 2023Jul 2023 · 3 mos

Indian space research organisation (isro)

Project Intern

Sep 2022Apr 2023 · 7 mos

Oracle

Project Intern

Jun 2022Jul 2022 · 1 mo · Bengaluru, Karnataka, India

Vconstruct private limited

Project Intern

Mar 2021Jun 2021 · 3 mos · India

Education

Georgia Institute of Technology

Master of Science - MS — Computer Science

Aug 2024Present

MKSSS Cummins College of Engineering for Women

Bachelor of Technology - BTech — Computer Engineering

Jan 2019Jan 2023

Stackforce found 100+ more professionals with Model Optimization & Inference Efficiency

Explore similar profiles based on matching skills and experience