Vipul Gupta

Co-Founder

Bengaluru, Karnataka, India9 yrs 7 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Ph.D. in EECS from UC Berkeley
  • Experience at top tech companies
  • Expertise in large-scale machine learning
Stackforce AI infers this person is a Machine Learning and AI expert with a strong focus on cloud computing and large-scale systems.

Contact

Skills

Core Skills

Machine LearningLarge Language Models (llm)Generative AiNatural Language Processing (nlp)Distributed ComputingCloud ComputingDeep LearningComputer VisionAlgorithms

Other Skills

FinetuningLangchainPython (Programming Language)SQLAmazon Web Services (AWS)PyTorchRayGraph LearningTensorFlowAnalyticsHigh Performance Computing (HPC)Technical PapersPresentationsC++docker

About

I graduated with a Ph.D. from the EECS department at UC Berkeley, where my research lay at the intersection of machine learning, cloud computing, and statistics. I was mainly motivated by problems with various practical applications. In the past, I have also worked at Bytedance, Facebook, and Apple where I applied my research ideas to solve several problems of practical interest in large-scale machine learning and AI. Before that, I graduated from IIT Kanpur with a Bachelors and Masters in Electrical Engineering.

Experience

9 yrs 7 mos
Total Experience
2 yrs 1 mo
Average Tenure
2 yrs 9 mos
Current Experience

Microsoft

Principal Applied Scientist

May 2025Present · 1 yr · Bengaluru, Karnataka, India · Hybrid

  • Building LLM-based large retrieval models that power Bing search, Bing ads and Microsoft Copilot.
Large Language Models (LLM)Machine LearningCloud Computing

Coinbase

Senior ML Engineer

Aug 2024May 2025 · 9 mos · Bengaluru, Karnataka, India · Remote

  • Innovating at the intersection of ML and Blockchains.

Revsure ai

2 roles

Advisor

Jul 2024Present · 1 yr 10 mos

  • Continuing to work with RevSure in an advisory capacity, where I offer strategic guidance on ML and Generative AI methodologies to drive innovation and enhance the RevSure platform.

Consultant, Generative AI

Jul 2023Jun 2024 · 11 mos

  • Driving Generative AI Innovation at RevSure.AI
FinetuningLangchainLarge Language Models (LLM)Natural Language Processing (NLP)Generative AI

Uptrain

Co-Founder

Sep 2022Jul 2023 · 10 mos · San Francisco, California, United States

  • Built UpTrain, a popular open-source package with more than 2k GitHub stars, to evaluate, test and monitor LLM models. It helps users check their LLM applications' performance on aspects such as correctness, structural integrity, bias, hallucination, etc.
  • Learn more: https://github.com/uptrain-ai/uptrain
Python (Programming Language)Natural Language Processing (NLP)Generative AISQLAmazon Web Services (AWS)Machine Learning

Bytedance

Research Scientist

Jun 2021Nov 2022 · 1 yr 5 mos · Mountain View, California, United States

  • Developing efficient algorithms for training large ML models distributedly on the cloud. Specifically, working on the Ray ecosystem to digest and process large-scale graph data for graph-based learning and recommendations.
PyTorchRayDistributed ComputingNatural Language Processing (NLP)Graph LearningAlgorithms+3

Facebook

Research Engineer

May 2020Dec 2020 · 7 mos · Menlo Park, California, United States

  • Took a summers and a semester off during my Phd to work at Meta (then Facebook).
  • Developed state-of-the-art techniques to improve the training efficiency of deep learning recommender models while working with the ML infrastructure team at Facebook. Further, implemented these research ideas into production models at Facebook to see practical improvements in end-to-end training times.
PyTorchPython (Programming Language)C++Natural Language Processing (NLP)PresentationsMachine Learning+3

Apple

AI Research Intern

May 2019Sep 2019 · 4 mos · Cupertino

  • Worked with the AI Research team at Apple on devising algorithms for large-scale distributed training of Deep Neural Networks with the objective of improving the model performance.
PyTorchdockerPython (Programming Language)Computer VisionPresentationsAlgorithms+1

Microsoft

Visiting Researcher

Dec 2017Jan 2018 · 1 mo · Bangalore

  • Developed efficient schemes for straggler mitigation in Apache REEF for several distributed linear algebra and machine learning algorithms using ideas from information and coding theory.
C++PresentationsMachine LearningAlgorithms

Uc berkeley

Graduate Student Researcher

Aug 2016May 2021 · 4 yrs 9 mos · Berkeley, CA

  • Ph.D. student in the Department of EECS at UC Berkeley, where I collaborated with Profs. Kannan Ramchandran, Thomas Courtade, and Michael Mahoney on developing fast and principled algorithms for machine learning on the cloud. Our schemes were inspired by ideas from applied statistics, optimization, and information theory.
PyTorchAnalyticsDistributed ComputingPython (Programming Language)High Performance Computing (HPC)Natural Language Processing (NLP)+10

Epfl (école polytechnique fédérale de lausanne)

Visiting Researcher

May 2015Jul 2015 · 2 mos · Lausanne Area, Switzerland

  • Compressed EEG signals by selecting a minimum number of components from its Hadamard transform and facilitated an efficient recovery of the input signals with state-of-the-art performance results.

Syracuse university

Visiting Researcher

May 2014Jul 2014 · 2 mos · Syracuse, New York Area

  • Solved the problem of distributed sparse support recovery with 1-bit quantized compressive
  • measurements in the presence of multiple sensors.

Education

University of California, Berkeley

Doctor of Philosophy - PhD — EECS

Aug 2016May 2021

Indian Institute of Technology, Kanpur

B.Tech-M.Tech Dual Degree — Electrical Engineering

Jul 2011Jun 2016

Y Combinator

Technical Entrepreneurship

Jan 2023Apr 2023

Stackforce found 100+ more professionals with Machine Learning & Large Language Models (llm)

Explore similar profiles based on matching skills and experience