Vishal Garimella

AI Researcher

Amherst, Massachusetts, United States4 yrs 5 mos experience

Key Highlights

  • Expert in Natural Language Processing and Deep Learning.
  • Proven track record in optimizing AI models for performance.
  • Experience in building scalable systems for high traffic.
Stackforce AI infers this person is a highly skilled AI/ML engineer with a focus on optimization and scalable systems.

Contact

Skills

Core Skills

Natural Language Processing (nlp)Deep LearningApplied SciencesOptimization

Other Skills

Python (Programming Language)Software Engineering

About

I am interested in understanding systems, passionate about optimizing them, and building real-life products.

Experience

Amazon

Research Internship

Jun 2025Sep 2025 · 3 mos · Seattle, Washington, United States · On-site

  • Evaluated factuality of LLM generations under a noisy-datasource. Developed a
  • shapley-value inspired inference-time algorithm to improve the performance of
  • any LLM on TruthfulQA and MMLU-Pro datasets, improving the performance of
  • 70B model on TruthfulQA by 0.58% and competitive MMLU-Pro performance.
Natural Language Processing (NLP)Python (Programming Language)Deep LearningOptimization

Guidesspace

Applied Scientist

Jun 2023Aug 2024 · 1 yr 2 mos · Remote

  • Developed a mentor-mentee matching recommender system at a
  • startup, leveraging client-side language models. LOR from my manager at Microsoft https://drive.google.com/file/d/1ycZotmIe2Zm1l6fuhtMRqYn3ivrUaft8/view
Natural Language Processing (NLP)Python (Programming Language)Applied Sciences

Microsoft

Software Engineer 2

Feb 2021May 2023 · 2 yrs 3 mos · Bengaluru, Karnataka, India · On-site

  • Compressed BERT/GPT2 styled models for efficient edge-device inference with
  • techniques such as pruning and quantization resulting in 10x latency improvement (1s to 100ms) and 2x less memory requirement (~200MB to 10MB). Benchmarked with optimization frameworks such as NVIDIA's TensorRT.
  • Developed a system handling planet-scale traffic and has been optimized for GPU memory, GPU throughput, and network latency for image super-resolution called DeepEnhance within the Edge browser, using CNNs/Vision transformer architecture.
  • I developed services capable of handling planet-scale traffic to generate user
  • suggestions in the Bing chatbot powered by GPT-4. Designed and maintained the
  • suggestion recommendation system for Bing-ChatGPT integration, and
  • managed production live sites.
Natural Language Processing (NLP)Python (Programming Language)Deep LearningOptimization

Goldman sachs

2 roles

Engineering Analyst

Jan 2020Jan 2021 · 1 yr · Bengaluru, Karnataka, India · Hybrid

  • Built a risk management platform for risk management of structured
  • mortgage products and stress testing with different stress scenarios.
  • Built and benchmarked CVA(Credit Valuation Adjustment) computation using Deep Neural Networks and American Monte Carlo.
Applied SciencesDeep Learning

Internship

May 2019Jul 2019 · 2 mos · Bengaluru, Karnataka, India · On-site

  • Credit valuation adjustment with neural networks and monte-carlo. Design and stress-test pricing systems.
Deep Learning

Education

Indian Institute of Technology, Kharagpur

Bachelor of Technology - BTech — Computer Science

Jan 2016Jan 2020

University of Massachusetts Amherst

Masters of Science — Computer Science

Sep 2024May 2026

Stackforce found 100+ more professionals with Natural Language Processing (nlp) & Deep Learning

Explore similar profiles based on matching skills and experience