Keval Morabia

Lead ML Engineer

Ahmedabad, Gujarat, India8 yrs 2 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Expert in optimizing foundational AI models.
  • Key contributor to innovative machine learning libraries.
  • Proven track record in leading technical product demos.
Stackforce AI infers this person is a Deep Learning Engineer with strong expertise in AI model optimization and machine learning applications.

Contact

Skills

Core Skills

Deep LearningArtificial IntelligenceMachine LearningComputer VisionSoftware EngineeringData Analysis

Other Skills

PruningDistillationQuantizationLarge Language Models (LLM)NVIDIA Model OptimizerPyTorchNeural Architecture Search (NAS)Visualization toolsONNXNVIDIA TensorRTQualcomm SNPEApache AirflowTextCNNBERTAWS SageMaker

About

• Staff Deep Learning Engineer at NVIDIA specializing in training and optimizing foundational AI models (following the acquisition of OmniML where I was part of the Founding team) • Former Senior AI Engineer at Bloomberg NYC, with prior roles at Microsoft Research and Amazon Web Services. 🎓 Education: • Master’s in Computer Science, University of Illinois Urbana-Champaign • Bachelor’s in Computer Science, BITS Pilani 🌍 Beyond Tech: An avid traveler who loves curating detailed itineraries for every adventure. Check out my Instagram handle @kevalmorabia

Experience

Nvidia

2 roles

Staff Deep Learning Engineer

Promoted

Mar 2025Present · 1 yr

  • Joined through the acquisition of OmniML.
  • Working on Pruning, Distillation, Quantization algorithms for LLMs.
  • Maintainer of NVIDIA Model Optimizer GitHub
PruningDistillationQuantizationLarge Language Models (LLM)NVIDIA Model OptimizerDeep Learning+1

Senior Deep Learning Engineer

Aug 2022Mar 2025 · 2 yrs 7 mos

Instagram

Content Creator

May 2023Present · 2 yrs 10 mos

  • My passion of creating Travel content and sharing itineraries on Instagram with many reels of over 1Million views

Omniml (acquired by nvidia)

Founding Member - Senior Machine Learning Engineer

Aug 2022Feb 2023 · 6 mos · San Francisco Bay Area

  • Key contributor for Omnimizer PyTorch library for hardware-efficient Neural Architecture Search (NAS) and pruning
  • Improved the torch.fx based tracer to support new PyTorch modules and operators for NAS
  • Built tools for understanding and visualizing module/operator-level latencies of a PyTorch / ONNX model for NVIDIA TensorRT and Qualcomm SNPE edge devices
  • Contributed to the MVP of a job management dashboard to track experiments, visualize ONNX models, and latency insights
  • Lead the technical product demos at executive summits and AI conferences
PyTorchNeural Architecture Search (NAS)Visualization toolsONNXNVIDIA TensorRTQualcomm SNPE+2

Bloomberg lp

Senior Software Engineer - Artificial Intelligence

Feb 2021Jul 2022 · 1 yr 5 mos · New York, United States

  • 1. Self-service ML Model Remediation and Maintenance Pipeline:
  • Built a generalized platform allowing AI teams to setup their self-service model maintenance workflows consisting of automated annotation pipeline setup, model training, evaluation, and deployment using Apache Airflow
  • Significantly reduced KTLO time in remediating issues in deployed models by empowering domain experts to perform model maintenance with minimal engineering support
  • 2. Topic Tagging in Financial Documents
  • Enriched documents like analyst research reports with trends and topics per paragraph using TextCNN and BERT models
  • Setup data pipelines for sampling and cleaning paragraphs using PySpark for collecting annotations
  • 3. Government Contract Recommender system:
  • Setup model training infrastructure on AWS SageMaker using data from Athena for recommending government contracts to Bloomberg Government’s (BGOV) client organizations
  • Created Docker images to deploy trained models to production using a FastApi server
Apache AirflowTextCNNBERTAWS SageMakerDockerFastApi+2

Amazon web services (aws)

Software Engineer

May 2020Aug 2020 · 3 mos · Greater Seattle Area

  • Deployed Java APIs to AWS cloud for providing a preview of tasks to be performed by technicians in Amazon data centers
  • Parsed complex BPMN Workflows extracted from Amazon Dynamo DB to identify expected order of task execution
  • Tested code with Unit, Integration and Load testing with JUnit, Mockito, and TestNG
JavaAWSBPMN WorkflowsJUnitMockitoTestNG+1

University of illinois at urbana-champaign

Research Assistant

Aug 2019Dec 2020 · 1 yr 4 mos · Urbana-Champaign Area

  • Experimented a Visual Attention-based Model in PyTorch for novel Webpage Object Detection formulation
  • Utilized contextual information using visual features of ordered web elements extracted using Resnet18
  • Created largest public labeled dataset of 7.7k product webpage screenshots
  • Achieved 95% accuracy for product Price detection (8.5% above Fast R-CNN) and interpreted Attention Visualizations Attention Visualizations
PyTorchVisual Attention-based ModelResnet18Computer Vision

Microsoft

Researcher

Jan 2019Jul 2019 · 6 mos · Bangalore

  • Implemented Graph Recurrent Neural Networks from scratch in TensorFlow to Learn Embeddings for 300,000 entities in a heterogeneous graph
  • Collaborated with a team of 10 to design a novel Deep Neural Net architecture for recommending messages in MSTeams
  • Outperformed Matrix Factorization Methods like SVD & SVD++ on 2 benchmark rating prediction datasets
  • Achieved comparable or better results than Graph Convolutional Networks (GCNs) on 8 benchmark datasets for node classification, ranking, and rating prediction tasks
TensorFlowGraph Recurrent Neural NetworksDeep Neural Net architectureArtificial Intelligence

Arcesium

Software Engineer

May 2018Jul 2018 · 2 mos · Hyderabad, Telangana, India

  • Worked on Budget Enhancements for Expense Management System by adding back-end services in Java to exclude centers for Budget Allocation Process and comparing budgeted inputs with actual expenses by extrapolating data.
  • Designed database schema for efficiently storing center exclusion rules with dynamic SQL using My Batis, and wrote stored procedures in MS SQL Server for modifying database contents.
  • Made a UI using JavaScript for creating/modifying center exclusions, uploading excel inputs, and showing comparison grid.
  • Wrote about 100 unit test cases in JUnit and Mockito and increased test coverage by 5%
  • Learned Continuous Integration using Jenkins and Software Development Life Cycle process.
  • Finished roadmap project before the expected date of completion.
JavaJavaScriptMyBatisMS SQL ServerSoftware Engineering

Bits pilani, hyderabad campus

Research Assistant

Jan 2018Dec 2018 · 11 mos · Greater Hyderabad Area

  • Analyzed Twitter Stream for Event Detection by leveraging Wikipedia
  • Segmented tweets and hash-tags; applied Jarvis-Patrick clustering; summarized newsworthy events in Python
  • Achieved a precision of 88.12% (absolute improvement of 8%)
Twitter Stream AnalysisPythonData Analysis

The gujarati association - bits pilani hyderabad campus

President

Aug 2017May 2018 · 9 mos · Greater Hyderabad Area

  • Organized ’Dandiya Night’ - a cultural dance event at BITS Pilani, Hyderabad which was attended by over 1000 students for which a fund of 30,000 INR was raised
  • Arranged food stalls of Gujarati Delicacies
  • Conducted several traditional dance workshops

Watconsult

Frontend Developer

May 2017Jul 2017 · 2 mos · Worli, Mumbai, India

  • Worked on the front end of a location tracking device using Google Maps JavaScript API
  • Made a blog website using HTML, CSS, JavaScript, and PHP

Education

University of Illinois Urbana-Champaign

Master's degree — Computer Science

Aug 2019Dec 2020

Birla Institute of Technology and Science, Pilani

Bachelor of Engineering - BE — Computer Science

Aug 2015May 2019

Stackforce found 100+ more professionals with Deep Learning & Artificial Intelligence

Explore similar profiles based on matching skills and experience