Ankit Jain

CEO

San Francisco, California, United States · 15 yrs 11 mos experience

Key Highlights

  • Led a 50-member research team at Meta.
  • Co-authored a bestseller on TensorFlow projects.
  • Recognized as a top data scientist under 40.
Stackforce AI infers this person is a leading expert in AI and Machine Learning across multiple industries.

Skills

Core Skills

Generative AI · Large Language Models (LLM) · Leadership · Artificial Intelligence (AI) · Graph Neural Networks

Other Skills

AI Agents · Analytics · Apache Spark · Big Data · Business Strategy · C++ · Data Analysis · Data Mining · Data Modeling · Data Science · Distributed Systems · E-commerce · Engineering Management · Financial Modeling · Finetuning

About

I am an experienced AI researcher and machine learning engineer who has researched and deployed scalable machine learning models across startups and big tech. I have deep applied research experience in NLP, computer vision, and graph neural networks, have shipped many 0-to-1 products, and have publications in NeurIPS, ICML, ICLR, and CVPR. I have worked at startups and large companies, including Uber AI Labs and Bank of America, and was a founding member of the Meta GenAI team. In my current role I lead a 50-person research/engineering team developing multimodal GenAI models (voice, safety, inference, computer vision) for MetaAI and other GenAI products.

#Awards 40 Under 40 Data Scientists 2022 by Analytics India Magazine. Outstanding Leadership Award, Internet 2.0 Conference, InterContinental Dubai.

#Book Author Co-authored the bestselling book "TensorFlow Machine Learning Projects", available on Amazon: http://a.co/d/332CdOL

#Speaker I have been a featured speaker at major conferences, universities, and companies. Prominent ones include UC Berkeley, San Jose State University, IIM Ahmedabad (Best B-School in India), O'Reilly AI, Strata AI, Rework AI, Index IBM, Udacity, Square, and Hitachi.

#Educator I am equally passionate about education and have mentored over 500 students in data science at the following places: General Assembly (San Francisco), Springboard (San Francisco/Bangalore), INSOFE (Bangalore/Hyderabad), Upgrad as Subject Matter Expert (Mumbai), and Acadgild (Bangalore).

Experience

15 yrs 11 mos
Total Experience
2 yrs 4 mos
Average Tenure
5 yrs 9 mos
Current Experience

Meta

5 roles

Senior Research Manager, Generative AI

Promoted

Feb 2025 - Present · 1 yr 4 mos · San Francisco Bay Area

  • Currently:
  • Leading a team for personalization of LLMs to achieve the vision of "Personal SuperIntelligence". The team is heavily involved in post-training alignment through reward modeling, reinforcement learning (PPO, DPO, RLHF, RLVR), RAG-based systems, context engineering, and LLM agents.
  • Previously:
  • Led a 50-member multimodal research/engineering team powering modeling, inference/deployment, and safety. Specifically:
  • Voice modeling (TTS, Llama 4) for MetaAI and AI Studio
  • Safety (media products such as text-to-image, text-to-video, and LLM conversation)
  • Computer vision team post-training image-based foundation models
  • Inference/deployment of image-based foundation models for MetaAI and AI Studio
  • Ping me if interested in a role in any of these areas.
Generative AI · Large Language Models (LLM) · Finetuning · Safety · AI Agents · Post Training · +1

Engineering Manager, Generative AI

Jan 2023 - Feb 2025 · 2 yrs 1 mo · San Francisco Bay Area

  • Engineering manager on Generative AI Safety for Meta’s products. Helped launch Imagine, Imagine Yourself, AI Stickers, and user-generated AIs on Instagram.
  • Leading a team of 25+ engineers/researchers/applied scientists to make our MetaAI products safer.
  • The team works on LLM/LDM finetuning (DPO/RLHF), RAG-based systems, synthetic data generation, pre-training data mitigations, automated red-teaming approaches, and scaled evaluations.
Generative AI · Leadership · Team Building · Large Language Models (LLM) · Artificial Intelligence (AI) · Machine Learning

Engineering Manager, Machine Learning

Promoted

Feb 2022 - Jan 2023 · 11 mos · San Francisco Bay Area

  • 2020-2022
  • Worked on People You May Know (Facebook), Accounts you may follow (Instagram).
  • Additionally, worked on detecting fake accounts for FB/IG.
  • We use advanced graph neural networks, large-scale unsupervised graph clustering, NLP, and computer vision technologies. Our models are used across Facebook and Instagram.
Artificial Intelligence (AI) · Graph Neural Networks · Recommender Systems · Engineering Management · Generative AI · Machine Learning

Staff Research Scientist

Jan 2022 - Mar 2022 · 2 mos · San Francisco Bay Area

  • Machine Learning @ Facebook

Senior Software Engineer

Aug 2020 - Jan 2022 · 1 yr 5 mos · San Francisco Bay Area

Truefoundry

Investor

Jan 2023 - Present · 3 yrs 5 mos

  • MLOps platform backed by Sequoia, Naval Ravikant, and Anthony Goldbloom (Kaggle)

Growthschool

Investor

Jun 2022 - Present · 4 yrs

Ultrahuman

Angel Investor

Oct 2021 - Present · 4 yrs 8 mos

Cognizer inc

Advisor

Jul 2021 - Present · 4 yrs 11 mos · San Francisco Bay Area

  • Advising on graph and NLP technologies to build effective solutions for extracting insights from text and unstructured data.

Inner fit

Angel Investor

Feb 2020 - Present · 6 yrs 4 mos

Samya.ai

Advisor

Feb 2020 - Sep 2021 · 1 yr 7 mos · Bengaluru, Karnataka, India

  • Technical Advisor on Machine Learning and Artificial Intelligence.
  • Company was acquired by Fractal

Uber ai

2 roles

Senior Research Scientist

Feb 2018 - Aug 2020 · 2 yrs 6 mos · San Francisco Bay Area

  • This group was led by Prof. Zoubin Ghahramani (https://en.wikipedia.org/wiki/Zoubin_Ghahramani).
  • I was involved in applied AI research on Uber datasets and deployed multiple models at scale at Uber. Specifically, I worked on:
  • + Large-scale graph representation learning algorithm for Uber Eats, fraud detection, and traffic prediction. This application mainly involves the use of Graph Convolutional Networks.
  • + Food preparation time prediction in Uber Eats using deep neural networks.
  • Tools: Python, R, PyTorch, TensorFlow, Hive, Docker, PySpark
  • Additionally, I ran the internal AI education initiative at Uber, which helped hundreds of engineers at Uber learn and use AI in their jobs.

Research Scientist

Mar 2017 - Jan 2018 · 10 mos · San Francisco Bay Area

  • User-Level Forecasting (advised by Chief Scientist, AI Labs):
  • Led the effort on predicting the number of trips for each rider/driver on the platform.
  • Developed an LSTM model with a custom loss function (zero-inflated Poisson) using PyTorch to predict trips for each individual driver in the short term (4-6 weeks). Incorporated incentive features and developed first-of-a-kind incentive sensitivity curves for the company at the aggregate level.
  • Self-Driving Car Simulation: Served as technical lead for developing an agent-based simulation in Python to help the business decide on the strategy for deploying self-driving cars.
  • Tools Used: Python, R, PyTorch, TensorFlow, Hive, Machine Learning, Deep Learning, BI Tools

Drishyam.ai

Co-Founder

Sep 2016 - Mar 2017 · 6 mos

  • Headed core technology and worked on developing patent-pending technology for inpainting of home improvement objects using computer vision.
  • The company was part of the NVIDIA Inception program.
  • Applied for provisional patent: “SYSTEM AND METHOD FOR IDENTIFYING, RECOMMENDING AND REPLACING VISUALLY SIMILAR PRODUCTS IN THE HOME IMPROVEMENT DOMAIN”
  • Drishyam AI was acquired by Mediaocean in March 2022.

Runnr

Head Of Data Science and Analytics

Oct 2015 - Dec 2016 · 1 yr 2 mos · Bengaluru Area, India

  • Runnr was acquired by Zomato.
  • Solved complex last-mile logistics problems in India with a data twist. Here is a brief summary of my work:
  • Led a team of 5 data scientists/analysts and 4 engineers while also working as an individual contributor.
  • Food Preparation Time (FPT) Prediction: Analyzed the data and built a Random Forest Regressor using scikit-learn to predict food preparation time based on hour, day, cost of order, etc. The model was used to optimally dispatch delivery riders and achieved a 7% increase in efficiency.
  • Grouping of Ecommerce Delivery: Developed a custom model along the lines of hierarchical clustering to group ecommerce deliveries. The model led to a 5% increase in the number of orders delivered with the same fleet and a similar SLA.
  • Driver Churn Prediction Model: Developed a churn prediction model using Logistic Regression to preempt driver churn. Operationalized the model, which led to a 15% decrease in driver churn.
  • Led several other data science projects using time series analysis, such as demand/supply prediction.
  • Tools Used: Python, R, BI Tools, SQL, Redshift SQL, Django, MongoDB.
  • My statistical analysis of the food delivery business in India: http://yourstory.com/2016/07/food-delivery-sustainable-business/

Springboard

Mentor

May 2015 - Dec 2017 · 2 yrs 7 mos · San Francisco Bay Area

  • Mentor for Data Science course. Making students fall in love with data.

Clearslide

Data Scientist

Mar 2014 - Sep 2015 · 1 yr 6 mos · San Francisco Bay Area

  • Worked on predicting the probability, timing, and amount of sales deal closes. Filed a patent on this work.
  • Tools Used: MySQL, Redshift, R, Python, Tableau.

Bank of america

Quantitative Finance Analyst

Apr 2013 - Feb 2014 · 10 mos · San Francisco Bay Area

  • Worked in the Capital Management Group on analytics using Teradata SQL and statistical software.

Facebook

Data Science Intern

Jan 2013 - Mar 2013 · 2 mos · Menlo Park CA

  • Worked in the product analytics team on big data projects.
  • Extensively programmed in Hive, SQL, R, Python.
  • Machine Learning, Feature Engineering.

Schlumberger asia services ltd

Senior Field Engineer

Feb 2009 - Feb 2012 · 3 yrs · Mumbai Area, India

  • Supervised entire drilling operations worth $8 million on rig sites, ensuring optimum utilization of resources and time.
  • Led and trained cross-cultural teams of 4-7 engineers involved in the entire drilling operation on the rig site.
  • Supervised and drilled the first-ever offshore high-temperature, high-pressure well in Eastern India, unlocking a future market for such wells worth $15 million.

Education

University of California, Berkeley

Masters — Engineering

Indian Institute of Technology, Bombay

Dual Degree (B.tech +M.tech) — Electrical Engineering (Communication and Signal Processing)

Harvard Business School Online

Executive Education — Leadership and Strategy

Jan 2017 - Jan 2017
