J

Jay Gala

AI Researcher

Bengaluru, Karnataka, India4 yrs 7 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Published research at NeurIPS 2023.
  • Experience in optimizing AI workloads for Intel hardware.
  • Active participant and judge in hackathons.
Stackforce AI infers this person is a skilled AI and Machine Learning engineer with a focus on research and development.

Contact

Skills

Core Skills

Machine LearningLarge Language Models (llm)Computer VisionDeep LearningSoftware DevelopmentData Analysis

Other Skills

Artificial Intelligence (AI)C++Data VisualizationGitNLP LibrariesNatural Language Processing (NLP)Neural NetworksOpen-Source SoftwarePyTorchPython (Programming Language)REST APIsTensorFlowVersion Control

About

Jay is a Computer Engineering and MBA graduate. He works at Intel as an AI Software Solutions Engineer where he is optimizing AI Workloads and Frameworks for Intel hardware. He also works with researchers at the University of Surrey and the Indian Institute of Technology Patna. He is interested in AI research and has published multiple research papers including a paper at NeurIPS 2023. He is also an MLH Fellow and regularly participates in hackathons and community events as both, participant and Judge. He is well versed in and has experience in Python, Deep Learning (CV, NLP, Gen AI), PyTorch, TensorFlow, Git, Docker, Kubernetes, Shell Scripting, etc. If you have an interesting opportunity for him, you can reach out at jaygala260@gmail.com or just hit the connect button. He will be happy to connect with you! 😊

Experience

Microsoft

Research Fellow

Aug 2025 – Present Ā· 7 mos Ā· Bengaluru, Karnataka, India Ā· On-site

  • Diffusion Language Models and Efficient AI

Intel corporation

2 roles

AI Software Engineer

Promoted

May 2024 – Aug 2025 Ā· 1 yr 3 mos Ā· Bengaluru, Karnataka, India

  • Model optimization using novel algorithms and techniques
  • Enabled the NextGPT model end-to-end for training and inference
  • Optimizing Llama 2/3/3.1 inference with novel KV Caching, bucketing, etc.
Python (Programming Language)C++PyTorchMachine LearningLarge Language Models (LLM)

Generative AI Engineer

Dec 2023 – Apr 2024 Ā· 4 mos Ā· Bengaluru, Karnataka, India

  • Benchmarking models on Intel's Xeon and Gaudi 2
  • Developed use cases like Text-to-SQL chatbot, Real time ASR, RAG Chatbot, etc.
  • Used OpenVINO, IPEX, etc. to optimize models for performance

Centre for vision, speech and signal processing at university of surrey

Research Intern

Jul 2023 – Dec 2023 Ā· 5 mos

  • Working on remote sensing and generative AI.
  • Publications: 1 NeurIPS W
Python (Programming Language)PyTorchNeural NetworksComputer Vision

Indian institute of technology, patna

Research Intern

May 2023 – Dec 2023 Ā· 7 mos

  • Working on medical AI applications like cancer detection using few shot and meta learning techniques
  • Publications: 1 Pattern Recognition (in submission)
Python (Programming Language)PyTorchDeep LearningNeural NetworksComputer Vision

Mlh fellowship

Prep Fellow

Apr 2023 – Apr 2023 Ā· 0 mo Ā· Mumbai, Maharashtra, India Ā· Remote

Open-Source SoftwarePython (Programming Language)Git

Google

Google ML Bootcamp - Participant

Jul 2022 – Nov 2022 Ā· 4 mos Ā· Remote

  • A four month intensive program focused on deep learning and Tensorflow to build the next generation of problem solvers.
Artificial Intelligence (AI)Deep LearningNeural NetworksNatural Language Processing (NLP)Computer VisionNLP Libraries+1

Conbi

Software Engineer

Jul 2022 – Oct 2022 Ā· 3 mos Ā· Mumbai Metropolitan Region

  • REST APIs and Backend using Python, Flask, Firebase, GCP, etc.
Version ControlSoftware DevelopmentGitREST APIs

Transparent capital

Software Engineer

May 2022 – Jul 2022 Ā· 2 mos Ā· Mumbai, Maharashtra, India

  • Built a blogs section with Admin and User side with CRUD operations
  • Implemented automated email services for new clients using PHP
  • Revamped the company website using Bootstrap and AngularJS
Version ControlSoftware DevelopmentGit

Analytica

Data Scientist

Jan 2022 – Feb 2022 Ā· 1 mo Ā· Mumbai, Maharashtra, India

  • Made predictions using Machine Learning algorithms
  • Analyzed company data to give insights

Mozilla

Open Source Contributor

May 2021 – Aug 2021 Ā· 3 mos Ā· Mumbai, Maharashtra, India

  • Fixed Bugs in the Mozilla Firefox codebase using C++.
  • Reported issues in the fortnightly distributions which were causing build failures
Python (Programming Language)Machine LearningData VisualizationArtificial Intelligence (AI)Software DevelopmentData Analysis

Geeksforgeeks

Intern - Technical Writer

Mar 2021 – May 2021 Ā· 2 mos Ā· India

  • Published 14+ times on the GFG website.
  • Wrote on various technical topics like Python, Web Scraping, Excel, ASCII, TCP IP, Time Complexity analysis, Deep Learning, etc.
Open-Source SoftwareVersion ControlSoftware DevelopmentC++Git

Codechef mpstme chapter

2 roles

Co-Founder

Nov 2020 – May 2022 Ā· 1 yr 6 mos Ā· Mumbai, Maharashtra, India

Head of Media and Outreach

Nov 2020 – May 2022 Ā· 1 yr 6 mos Ā· Mumbai, Maharashtra, India

  • Led a team of 11 people
  • Organized more than 20 coding events like Speaker Sessions, contests, workshops, etc.
  • Built a coding community of more than 300 people

Education

SVKM's Narsee Monjee Institute of Management Studies (NMIMS)

Master of Business Administration - MBA

May 2024 – Present

International Institute of Information Technology Hyderabad (IIITH)

Summer School — Computer Vision and Machine Learning

Jul 2023 – Jul 2023

SVKM's Narsee Monjee Institute of Management Studies (NMIMS)

Bachelor of Technology - BTech — Computer Science

May 2024 – Present

Pace Junior Science College

Higher Secondary

May 2017 – May 2019

Dr. S. Radhakrishnan International School Borivali

secondary school certificate

Jan 2005 – Jan 2017

Stackforce found 100+ more professionals with Machine Learning & Large Language Models (llm)

Explore similar profiles based on matching skills and experience