Kartik Mehta

CTO

San Francisco, California, United States13 yrs 6 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Over a decade of experience in deep learning.
  • Authored ~10 publications in top-tier conferences.
  • Key contributor to foundational LLMs at Amazon.
Stackforce AI infers this person is a SaaS expert specializing in AI and machine learning solutions.

Contact

Skills

Core Skills

Large Language Models (llm)Deep LearningNatural Language Processing (nlp)Machine LearningPredictive AnalyticsAnalytics

Other Skills

AlgorithmsBig DataBusiness AnalyticsData MiningDecision TreesGenAIGradient Boosting MachineImage ProcessingMatlabNeural NetworksOptimizationPattern RecognitionPythonRRandom Forest

About

With over a decade of experience in deep learning, I specialize in Large Language Model (LLM) training and Generative AI applications. My work spans cutting-edge research and real-world impact—developing and fine-tuning LLMs, deploying scalable AI systems, and solving complex business problems with practical ML solutions. I’ve authored ~10 publications in top-tier conferences (ACL, EMNLP, NAACL) and hold multiple patents. Passionate about bridging research and engineering, I help businesses unlock tangible value through robust model design, LLM optimization, and applied AI innovation. Actively involved in the AI research community as a published author and peer reviewer at top-tier conferences like ACL, EMNLP, and NAACL. Personal website (research): kartikmehta.me

Experience

Amazon

2 roles

Tech Lead

Nov 2021Present · 4 yrs 4 mos · On-site

  • 🔹Amazon Nova – Foundational LLM @ Amazon
  • Key contributor to Amazon Nova, a new generation of foundation LLMs powering enterprise applications via Amazon Bedrock.
  • 📢 Press Release (http://bit.ly/4lglJli) | 📜 Technical Report (https://bit.ly/3G0dbyu)
  • 🔹 Chatbot Experience on Alexa Mobile App
  • Led the launch of an LLM-based chatbot for the Amazon Alexa app, enabling in-depth text interactions and enhancing user engagement for US customers.
  • 🔍 Publications & Patents
  • Published ~10 papers in top-tier NLP conferences and hold multiple patents. Actively reviewing for top tier conferences (ACL, EMNLP, NAACL, NeurIps).
  • 📚 Google Scholar (https://scholar.google.co.in/citations?user=gInh5hIAAAAJ&hl=en)
GenAILarge Language Models (LLM)Deep LearningNatural Language Processing (NLP)Machine LearningPython

ML Tech Lead

Sep 2015Nov 2021 · 6 yrs 2 mos · On-site

  • 🚀 AI & ML Innovations at Scale
  • 🔹 Scalable E-commerce Attribute Extraction
  • Designed and deployed a unified deep learning architecture in Amazon’s production system to extract multiple product attributes (e.g., color, size) across diverse categories (e.g., shirts, phones, cameras). This reduced model redundancy and streamlined development for business units.
  • 📜 Based on our NAACL 2022 publication and patent.
  • 🔹 Chatbot for Amazon Product Pages
  • Led the development of a chatbot that delivers instant answers to shopper queries, eliminating the need to sift through long product descriptions, Q&As, and reviews. Launched at scale to enhance customer experience.
  • 🔹 Developed deep learning models for product question answering, improving accuracy in customer Q&A.
  • 🔹 Built deep learning based ad relevance & click-through rate prediction models for Amazon Sponsored Products, optimizing ad performance.
  • 🔹 Designed ML solutions for duplicate product detection in e-commerce, enhancing catalog quality and reducing redundancy.
Deep LearningNatural Language Processing (NLP)Machine LearningPredictive AnalyticsPython

Pwc india

Predictive Analytics Lead

Jun 2014Sep 2015 · 1 yr 3 mos · Mumbai

  • 🔹 Built a collaborative filtering-based product recommendation system and customer segmentation using latent semantic analysis.
  • 🔹 Developed a propensity-to-respond model for customer acquisition marketing for one of India’s largest retailers.
  • 🔹 Designed an analytics model to optimize store lease areas using demographic data.
  • 🔹 Created a price prediction model leveraging historical prices & macroeconomic factors to aid procurement planning.
Predictive AnalyticsMachine LearningData MiningR

Opera solutions

Senior Analytics Specialist

Sep 2012Jun 2014 · 1 yr 9 mos · Noida Area, India

  • 🔹 Built a text analytics model to predict resolvable service issues over phone, reducing agent visits for a leading US telecom provider.
  • 🔹 Identified profit opportunities for a pizza delivery chain in India by analyzing price elasticity, product bundling, and cross-sell strategies.
  • 🔹 Developed a probability-to-default model using credit bureau data for an auto financing service in the US.
AnalyticsData MiningPredictive Analytics

Education

Indian Institute of Technology, Delhi

Bachelor's + Master's degree

Jan 2007Jan 2012

Stackforce found 100+ more professionals with Large Language Models (llm) & Deep Learning

Explore similar profiles based on matching skills and experience