Vivek Nayak

Machine Learning Engineer

San Francisco, California, United States4 yrs 4 mos experience

Key Highlights

  • Expert in optimizing LLM inference for edge devices.
  • Proven track record in building production LLM inference stacks.
  • Strong background in data engineering and large-scale data management.
Stackforce AI infers this person is a Deep Learning Engineer specializing in AI/ML and Data Engineering.

Contact

Skills

Core Skills

Large Language Models (llm)Deep LearningData Engineering

Other Skills

PyTorchPython (Programming Language)HuggingFaceTensorRT-LLMTriton Inference ServergRPCLigerSGLang RuntimetrlLoRAPySparkSnowflakePandasXNNPACKCPP

About

deep learning engineer, interested in optimizing llm inference and improving efficiency for large model training. Resume - https://drive.google.com/drive/folders/13MqtGbwn5LKI8XJeLNbLSmWAOyh_AKJB

Experience

4 yrs 4 mos
Total Experience
1 yr 7 mos
Average Tenure
1 yr 1 mo
Current Experience

Meta

Senior Machine Learning Engineer

May 2025Present · 1 yr 1 mo · San Francisco Bay Area · Hybrid

  • Worked on LLM inference optimization for edge devices, Executorch pt2e quantized export flow and deep learning model optimization.
PyTorchPython (Programming Language)Large Language Models (LLM)Deep Learning

Capital one

Senior Machine Learning Engineer

Jan 2024May 2025 · 1 yr 4 mos · San Francisco Bay Area · Hybrid

  • Worked on deep learning research team, focussed on LLM training and inference.
  • Built production LLM inference stack based on TensorRT-LLM + Triton Inference Server + gRPC client.
  • Trained SOTA Medusa-1 adapter for Llama-3.1 8B, using Liger kernels for training and SGLang Runtime for self-distillation.
  • Trained and deployed Eagle adapters for speculative decoding for Llama-3.1 models. Wrote fused linear soft-target cross entropy Triton kernel for training models on A100s.
PyTorchHuggingFaceLarge Language Models (LLM)Deep Learning

Snowflake

Software Engineer Intern

May 2023Aug 2023 · 3 mos · San Francisco Bay Area · Hybrid

  • Worked on Snowpark client library, which provides pythonic PySpark-like dataframe APIs by translating dataframe manipulations into SnowSQL.
  • Received full time offer.
PySparkSnowflakeData Engineering

New york university

Research Assistant

Jan 2023Dec 2023 · 11 mos · New York City Metropolitan Area

  • Worked on sequence-parallel QLoRA framework to efficiently fine-tune LLMs for long context tasks.
trlLoRALarge Language Models (LLM)Deep Learning

Goldman sachs

Software Engineer

Jul 2020Jun 2022 · 1 yr 11 mos · Bengaluru, Karnataka, India

  • Worked on Data Lake, an internal Big Data product which hosts >20 PBs of firm data.
PySparkPandasData Engineering

Education

New York University

Master of Science - MS — Computer Science

Sep 2022Dec 2023

Birla Institute of Technology and Science, Pilani

BE — Computer Science

Sep 2015May 2020

Stackforce found 100+ more professionals with Large Language Models (llm) & Deep Learning

Explore similar profiles based on matching skills and experience