Nikhil Garg

Director of Engineering

San Francisco, California, United States · 13 yrs 6 mos experience
AI Enabled · AI ML Practitioner

Key Highlights

  • Led a team of 100 ML engineers at Facebook.
  • Built data infrastructure for ML teams at Fennel.
  • Developed planet-scale AI infrastructure at Meta.
Stackforce AI infers this person is a SaaS and AI Infrastructure expert with extensive experience in machine learning and engineering management.

Skills

Core Skills

Engineering Management, AI Infrastructure, Machine Learning, Data Infrastructure, PyTorch, Infrastructure Management, Platform Engineering, Software Engineering, Performance Optimization, Performance Engineering, Web Development

Other Skills

Algorithm Coaching, Machine Learning Research, Combinatorics Coaching, Algorithms, Python, Distributed Systems, C++, Software Development, JavaScript, Linux, Java, C, Computer Science

About

Building the future of data/AI Infra at Fennel. Previously, oversaw a team of ~100 ML engineers at Facebook working on various recommendation systems. Also led multiple teams building PyTorch.

Experience

Total Experience: 13 yrs 6 mos
Average Tenure: 4 yrs 2 mos
Current Experience: 11 mos

Meta

Director of Engineering

May 2025 - Present · 11 mos · San Francisco Bay Area

  • Building the next generation of AI infrastructure for Ads, touching the majority of the XXX billion $ revenue at Meta. Reach out if you want to help build planet-scale infra systems!
AI Infrastructure, Engineering Management

Fennel (acquired by Databricks)

CEO, Cofounder

Nov 2021 - May 2025 · 3 yrs 6 mos · San Francisco Bay Area

  • At Fennel, we built the data infrastructure for ML teams. Fennel was acquired by Databricks.
Data Infrastructure, Machine Learning

Facebook

Engineering Leader

Apr 2018 - Dec 2021 · 3 yrs 8 mos · Menlo Park, CA

  • 2021: Supported the PyTorch ecosystem foundation, consisting of ~60 elite engineers/managers building PyTorch distributed, trainer, and all of PyTorch's domain libraries (torchrec, torchaudio, torch-rl, etc.).
  • 2018-2021: Supported applied ML teams (100+ ML engineers) working on search ranking and feed recommendations for multiple product lines.
PyTorch, Machine Learning, Engineering Management

Quora

3 roles

Head of Platform and Infrastructure

Jul 2017 - Apr 2018 · 9 mos · Mountain View, SF Bay Area

  • Supported a team of ~40 engineers and managers working on both platform and infrastructure, the two technical engineering organizations at Quora. During this time, I was responsible for all company-wide technical efforts and metrics (e.g. perf, reliability, AWS cost, security, development velocity). I also started and led a group to formulate and implement the company-wide technical strategy.
Platform Engineering, Infrastructure Management

Software Engineering Manager

Promoted

Aug 2014 - Jul 2017 · 2 yrs 11 mos · Mountain View, SF Bay Area

  • Led various engineering teams, projects, and manager groups all the way from product engineering to ML and to infrastructure. Some examples:
  • Started use of Machine Learning in the Content Quality team to understand the quality of user-generated content like answers.
  • Started the ML Platform team to build centralized infrastructure for newsfeed and all other ML applications.
  • Led the Ads team to the launch of a self-serve advertiser platform and ML-based ad targeting.
  • Hired ~25% of all Quora engineering hires between July 2016 and Dec 2017.
  • Helped launch eng-wide matrix org structure by defining a horizontal unit called guild.
  • Led various high-impact engineering manager groups like recruiting and engineering standards.
Machine Learning, Engineering Management

Software Engineer

Oct 2012 - Aug 2014 · 1 yr 10 mos · Mountain View, SF Bay Area

  • Did a little bit of everything as an early engineer:
  • Worked in a one-person team responsible for the end-to-end performance of Quora.
  • Worked on the caching layer of our infrastructure and created a new distributed web server architecture called Ultralisk.
  • Launched some user-facing features like feed muting controls.
  • As a founding engineer of the content quality team, built several foundational systems to understand, monitor, and improve the quality of user-generated content at Quora.
  • Did much of the engineering and product work to develop an in-house content moderation system.
  • Built lots of productivity tools for the whole engineering team.
  • Led a successful initiative to bring step-function improvement to the company-wide code quality (examples of things that came out of this include: development of style guide, setting up code review process, development of a linter, development of better code push systems etc.)
Software Engineering, Performance Optimization

Indian Olympiad in Informatics

Algorithm Instructor

May 2012 - Jun 2012 · 1 mo · Bengaluru Area, India

  • Coach and problem setter at the training and selection camp for the Indian team to the IOI.
Performance Engineering, Web Development

Microsoft Research

Machine Learning Research Intern

May 2011 - Jul 2011 · 2 mos · Bangalore, India

  • Benchmarked linear approximations to RBF kernels for SVMs and multiple kernel learning under the guidance of Prof. Manik Varma.
Algorithm Coaching

Indian Mathematics Olympiad

Combinatorics Instructor

Jan 2011 - Jan 2011 · 0 mo · New Delhi Area, India

  • Taught combinatorics to Delhi-region mathematics olympiad candidates, graded their exams, and helped select the team.
Machine Learning Research

Credii

Software Engineering Intern

May 2010 - Jul 2010 · 2 mos · New Delhi Area, India

  • As one of the first three developers, I set up Credii's full-text search, designed the DB schema for the core app, and prototyped the frontend in JavaScript.
Combinatorics Coaching

Education

Indian Institute of Technology, Delhi

B. Tech — Computer Science and Engineering

Jan 2008 - Jan 2012

Halwasiya Vidya Vihar Bhiwani
