Hemant Pugaliya

AI Researcher

San Francisco, California, United States7 yrs experience
AI ML PractitionerAI Enabled

Key Highlights

  • Expert in Large Language Models and AI systems.
  • Proven track record in revenue-generating AI projects.
  • Strong background in machine learning and NLP.
Stackforce AI infers this person is a Machine Learning and AI specialist with a focus on E-commerce and Finance.

Contact

Skills

Core Skills

Large Language Models (llm)Agentic AiDeep Learning

Other Skills

AlgorithmsCC++Data StructuresGitHTMLJavaLinuxMulti-agent SystemsNatural Language Processing (NLP)PyTorchPythonQuestion AnsweringSQLTransformers

Experience

Ema unlimited

Machine Learning Lead

Dec 2024Present · 1 yr 3 mos · Mountain View, California, United States

  • Working on Generative Workflow Engine™ , a horizontal Agentic OS platform which aims to create, configure and train Multi-Agent Mesh (a.k.a AI employees) while collaborating with Human Employees. Read more about the vision at https://www.ema.co/blog/agentic-ai/generative-workflow-engine-building-emas-brain
Large Language Models (LLM)Agentic AIMulti-agent Systems

Amazon

2 roles

Senior Applied Scientist

Feb 2020Dec 2024 · 4 yrs 10 mos · San Francisco Bay Area

  • I was one of the initial members of Store Foundational AI (SFAI-M5) where we trained domain specific model to power multiple vertical applications like search , advertising and catalog management. During these years I worked on pre-training, post-training, distillation, efficient distributed training and inference optimisation to enable LLMs on Web-scale applications. My projects led to $XXX M annual search-attributed sales and $XX M annual revenue boost in Advertising.
TransformersLarge Language Models (LLM)Deep LearningNatural Language Processing (NLP)PyTorchPython

Applied Scientist Intern

May 2019Aug 2019 · 3 mos · San Francisco Bay Area

  • Interned at Amazon Shopping & Discovery team [in Palo Alto], to improve semantic product matching by incorporating product-query purchase graph in deep learning models.
Transformers

Carnegie mellon university - school of computer science - language technologies institute

2 roles

Teaching Assistant

Jan 2019May 2019 · 4 mos

  • Teaching Assistant for the course 11-791 - Design of Intelligent Information Systems (Spring 2019) taught by Professor Eric Nyberg. The aim of the course is to extrapolate the fundamentals of system engineering (requirements, analysis, design, implementation) and project management (teaming, planning, scheduling, tracking) to Machine learning and Artificial Intelligence pipelines. My role was to design and grade the assignments for the course.

Graduate Directed Study Student

Sep 2018May 2019 · 8 mos

  • Working with Prof. Eric Nyberg on Neural Question Answering(QA) Systems and QA agent architectures.
  • Investigating "A Question-Focused Multi-Factor Attention Network for Question Answering"(AMANDA) neural QA model and adapting it to different datasets, to see it's performance. The findings across different models will later be used to summarize their generalization ability.
  • Worked on Scraping and Entity resolution component of Text Graph, which is used to identify good passage candidates for a specific inference type for Question Generation.

Morgan stanley

Senior Associate Technology

Aug 2016Jul 2018 · 1 yr 11 mos · Bangalore

  • Prototyped an unsupervised learning system on mixed-attributes using K-modes and K-prototypes clustering algorithm for analyzing trade breaks
  • and fails, resulting in identification of 5 issues which led to 40% decrease in analyzed break types.
  • Led development of an Information Retrieval system for Firmwide Business Intelligence(BI) Portal, enabling fine-grained search on dashboard components to locate specific business metrics efficiently. Worked along a Summer Intern and a team of under-graduate hires (as a part of training project) to successfully complete the project.

Goldman sachs

Summer Intern

May 2015Jul 2015 · 2 mos · Bangalore

C42 engineering

Intern

May 2014Jul 2014 · 2 mos

  • I added features to BureauBuilder - digital solutions for matchmakers and marriage bureaus.The project was done on Ruby On Rails web-framework .

Education

Carnegie Mellon University

Master of Science - MS — Machine Leaning & Natural Language Processing

National Institute of Technology Calicut

Bachelor of Technology (B.Tech.) — Computer Science and Engineering

Vidyaniketan Public School,Ullal,Bangalore

Stackforce found 100+ more professionals with Large Language Models (llm) & Agentic Ai

Explore similar profiles based on matching skills and experience