Bhavul Gauri

AI Researcher

London, England, United Kingdom10 yrs 4 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Led multiple AI projects boosting engagement for billions.
  • Mentored over 40 engineers in AI technologies.
  • Frequent speaker at conferences on AI advancements.
Stackforce AI infers this person is a SaaS expert specializing in AI and machine learning solutions.

Contact

Skills

Core Skills

Ai AgentsMachine LearningGenerative AiLarge Language Models (llm)Natural Language Processing (nlp)

Other Skills

AlgorithmsAmazon Web Services (AWS)AndroidArtificial IntelligenceBayesian inferenceCC++Cluster AnalysisComputer VisionData MiningData ScienceDeep LearningEvent ManagementFace RecognitionHDF5

About

Supporting state-of-the-art autonomous agent research to accelerate the path toward AGI and unlock practical, open-source Llama Agents for real-world tasks. Day to day involves end-to-end design, training and evaluation of autonomous agent pipelines—spanning simulation environments, multi-modal inputs, emergent behavior analysis, RL reward modelling and evals on real world benchmarks. Formerly: • TL, Generative AI for Ads @ Meta: Owned fine-tuning (SFT, DPO), RAG and Prompting Science for Ad Creative Generations. Boosted relevance metrics and ad metrics for 1 M+ advertisers via large-scale multimodal LLMs. Multiple shout-outs in media and quarterly earnings report. • Video Understanding: Led Reels ranking and relevance (transformers, graph ML) at Facebook/Instagram, boosting engagement for billions of users and building multiple firsts at Meta, including leading massive 20+ engg team for user interest modelling Also: Mentored 40+ engineers, delivered multiple 0→1 AI products, frequent conference speaker, and passionate open-source advocate. Open to collaborative research in agents, AGI, Alignment and beyond.

Experience

10 yrs 4 mos
Total Experience
2 yrs 1 mo
Average Tenure
4 yrs 10 mos
Current Experience

Meta

3 roles

Research Engineer at FAIR

Promoted

Oct 2025Present · 8 mos · Hybrid

  • Part of FAIR within Meta Superintelligence Labs org. Working on AI Agents for Coding, for ML Research and for Science.
  • More open source releases to come soon. Keep an eye out! :)
AI AgentsMachine LearningPython

Research Engineer (TL)

Apr 2025Sep 2025 · 5 mos · Hybrid

  • Part of Meta Superintelligence Lab, working and supporting self-improvement and a path towards AGI through Agents research for automating ML Experiments and ML Research.
Generative AIAI Agents

Machine Learning Engineer (TL)

Jul 2021Apr 2025 · 3 yrs 9 mos · Hybrid

  • Lead ML Engineer, Meta
  • Generative AI Team, Ads (2023–Present)
  • Direct multiple GenAI tracks (Diffusion Fine-Tuning, Prompting Science, Image RAG) for advanced image generation
  • Architected a novel ImageRAG system, boosting adoption by 40% through reference-based retrieval and generation solving brand alignment and creating competitive edge over Google, Adobe and Tiktok
  • Owned the relevance metric, improving it by 35% via fine-tuning LLaMa 3 (70B) with SFT and DPO, resulting in 22% higher overall adoption.
  • Enhanced diffusion models (SFT, LoRa) to raise quality precision from 60% to 85%.
  • Developed refined prompting frameworks on 70B LLaMa (CoT, ICL using few-shot), increasing CTR for Ads by 5%.
  • Video Understanding Team (2021–2023)
  • Led Reels emotion-prediction with prefix-finetuned Llama-7B, significantly boosting mAP.
  • Elevated Reels ranking using hourly consumption patterns, driving +1.12% DAUs.
  • Co-created a new multimodal transformer (SET) for efficient cross-domain processing.
  • Oversaw a 20+ cross-functional initiative to refine user-interest modeling, adding +3.15M Cold-Start DAUs.
  • Deployed a 46B-edge graph pipeline for Watch, gaining +7.4M 10-min DAUs.
  • Additional Highlights
  • Active LLM researcher (alignment, multimodality, agents); frequent speaker at meetups/conferences; active open-source contributor
  • Tech Stack: Python, PyTorch, SparkSQL, Presto, Hive, FAISS, Elasticsearch, BigGraph, complex ETL pipelines, C++, Hack
Large Language Models (LLM)Generative AINatural Language Processing (NLP)Stable DiffusionLeadership

Zapr media labs

2 roles

Research Scientist (Tech Lead)

May 2020Jun 2021 · 1 yr 1 mo

  • Led a team of ML Engineers to build novel NLU capabilities for chatbots which power a Voice Bot architecture.
  • Designed a pipeline to automate building a closed-domain bot given conversational audio recordings reducing bot building timeline from 6 weeks to 1 week using state of the art transformer models and clustering algorithms
  • Built multi-intent handling capabilities by designing LSTM based multilingual sentence segmentation pipeline, achieves SOTA f1-score on mixed-code languages
  • Integrated multilingual language models (XLM-RoBertA) into Rasa to build support for Indian languages understanding into chatbots without needing translation
  • Developed framework to automatically test intent classification models for various biases
Large Language Models (LLM)Generative AINatural Language Processing (NLP)

Senior Data Science Engineer

Oct 2018Apr 2020 · 1 yr 6 mos

  • 1. Key creator and owner for Dekho App. I own the Data Science aspects of Dekho and am responsible for building the recommendation system powering it completely - From ideation, to experimentation, and data processing, to building a highly scalable ML system that can serve >2M users. The recommendation system uses a hybrid approach combining popularity and content based filtering.
  • In Nov 2019, I have upgraded the system to use BERT (State of the art in Natural Language Processing) for content based filtering system.
  • 2. Have worked on some smaller computer vision projects such as Depth map prediction to generate bokeh effect (using DeepLens paper approach), Content frame detection (detect which tv show / movie is being watched on streaming platform from the screenshot), TV Video Ad Classifier, and Face detection projects using both classical OpenCV approaches as well as deep learning based approaches (SSD, U-Net, Inception, etc).
  • 3. Have given multiple presentations on the entire history of NLP till BERT, On Introduction to Machine learning, and on Recommendation Systems. Presentations attached herewith.
Large Language Models (LLM)

Foyr

Machine Learning Specialist

May 2018Sep 2018 · 4 mos

  • Foyr is a real estate start-up which aims to put the process of getting a custom designed flat from the floor planning to interior designing on the web. I'm trying to lay down the foundation for the AI pipeline at Foyr. Currently building convolutional network (deep learning) solutions to detect and extract relevant entities like walls, doors, chairs, dimensions from a floor planning image, which would further integrate with their web app so as to offer an interactive experience to the user and designer.

Endurance international group

2 roles

Software Developer - ML and Backend

Promoted

Jun 2016Sep 2018 · 2 yrs 3 mos

  • Authored an Artificial Intelligence based Logo Builder app that generates logos for businesses by brand name as input. Developed a font-pairing engine which utilized font vectors to output which two fonts would go together well. Genetic algorithms were used to improve the output and take user feedback in the loop. Flask was used to serve the Restful APIs and MongoDB powers the database of logos. Also designed a DSL to describe any design of a logo in a templated manner. Experimented a bit with GANs although it didn't work out well with them.
  • Minimized the support ticket resolution time by developing and deploying a smart JIRA support ticket bot which finds similar tickets and assigns ticket to correct person. Doc2Vec and Tf-idf embeddings were used to train the model for similarity matching.
  • Played a key role in building orchestration layer behind the WebPro panel through which web designers and resellers can maintain their businesses. Built RESTful APIs on top of springboot back-end. Applied Aspect Oriented Programming to develop an authentication layer, and added elasticsearch caching for instant search for orders at scale.
  • Authored region based tax engine which can consume special taxes like GST, EU VAT. This made integration of any new taxes easy to the billing module of Orderbox - a control panel that powers businesses like BigRock, Hostgator, domain.com and over 5M customers.
  • Tech Stack : Java, Python, Tensorflow, Gensim, Flask, Springboot, Elasticsearch, PostgreSQL.

Software Developer Intern

Jan 2016May 2016 · 4 mos

  • Upgraded Orderbox legacy code from Java 7 to Java 8 with 0 downtime. Orderbox is provisioning codebase behind thousands of web presence businesses, some of which are BigRock, Hostgator, Webhosting.info, domain.com, ResellerClub, etc.
  • Integrated PayU Latam as a payment gateway for all 5M+ customers of Orderbox. Also secured other pre-integrated payment gateways by making them use TLS 1.2 for communication.

Birla institute of technology and science, pilani

2 roles

AI Researcher

Sep 2015Dec 2015 · 3 mos

  • Inductive Logic Programming is a relatively new field that lies at the intersection of
  • logic programming and inductive machine learning. Inductive Logic Programming can
  • construct rules or hypotheses by learning from examples and background knowledge
  • something other fields of Machine learning do not do.
  • My work involved finding out if I feed examples to an Inductive Logic Programming System, would it construct relevant features which could train a great machine learning model with maximum entropy. The study aimed to combine the logical power of Inductive Logic Programming,
  • and Probabilistic wonder of Maximum Entropy model. It serves as a step
  • towards the big goal of combining logical learning and probabilistic learning, which
  • is together called Statistical Relation learning, or sometimes Probabilistic Inductive
  • Logic Programming.

Teaching Assistant, Artificial Intelligence Course

Aug 2015Dec 2015 · 4 mos

  • Designed lab questions and managed the labs. Aided faculty in questions for projects and evaluation.

Operating systems course, bits pilani k k birla goa campus

Teaching Assistant

Aug 2015Dec 2015 · 4 mos · India

  • Helped in making tutorial questions as well as lab questions and for handling the labs.

Indian red cross society (ircs)

Software Developer Internship

May 2013Jul 2013 · 2 mos · Greater Delhi Area

  • Developed a responsive website based on Drupal CMS for 'Blood Bank' department of the organization, so as to help spread awareness about them, their camps, and their activities on the internet.

Education

Birla Institute of Technology and Science, Pilani

Master of Science - MS — Mathematics

Jan 2011Jan 2016

Birla Institute of Technology and Science, Pilani

Bachelor’s Degree — Computer Science

Jan 2011Jan 2016

Mira Model Senior Secondary School

High School — Science

Jan 2009Jan 2011

Stackforce found 100+ more professionals with Ai Agents & Machine Learning

Explore similar profiles based on matching skills and experience