Dhyey Mavani

CTO

San Francisco, California, United States2 yrs 4 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Pioneered Rust-based model serving engine at LinkedIn.
  • Developed open-source Python packages with significant downloads.
  • Collaborated directly with CEO on strategic AI initiatives.
Stackforce AI infers this person is a highly skilled AI and Machine Learning engineer with a strong focus on research and development.

Contact

Skills

Core Skills

Artificial Intelligence (ai)Machine LearningQuantitative ResearchSoftware DevelopmentData AnalysisResearch And Development (r&d)

Other Skills

Multi-agent SystemsLarge Language Models (LLM)Product DevelopmentProduct ManagementExecutive-level CommunicationPython (Programming Language)TensorFlowPyTorchAPIsGenerative AIJavaAlgorithmsRust (Programming Language)Automated Machine Learning (AutoML)JavaServer Pages (JSP)

About

(+) ๐…๐จ๐ฅ๐ฅ๐จ๐ฐ for latest and diverse perspectives on ๐€๐ˆ, ๐„๐๐ฎ๐œ๐š๐ญ๐ข๐จ๐ง and ๐…๐ฎ๐ญ๐ฎ๐ซ๐ž ๐จ๐Ÿ ๐–๐จ๐ซ๐ค! For speaking engagements, media, and business inquiries: ddmavani2003@gmail.com. Dhyey is driven by complex problems at the intersection of mathematics, computer science, statistics, and economics. His liberal arts foundation equips him uniquely to build innovative solutions, leveraging strategic thinking and deep technical expertise. A strategic thinker at heart, Dhyey excels in dissecting intricate problems and devising actionable solutions. He believes in the power of collaboration, cherishing partnerships with professionals from diverse backgrounds to produce outstanding results. He has built AI infra, models, and agentic product capabilities at ๐‹๐ข๐ง๐ค๐ž๐๐ˆ๐ง after short but formative stints as CDN engineer at ๐€๐–๐’, and quantitative trader at ๐•๐š๐ฅ๐ค๐ฒ๐ซ๐ข๐ž. At LinkedIn, he delivered major LLM inference wins with speculative decoding, and significant GPU savings while pioneering LinkedIn's first Rust-based model serving engine for PyTorch Embedding Based Retrieval (EBR) models. He also integrated MCP into enterprise workflows that contributed to scalable model onboarding across Azure OpenAI and Kubernetes. He even proposed strategic ideas directly to CEO Ryan Roslansky, leading to cross-functional collaborations. He graduated from ๐€๐ฆ๐ก๐ž๐ซ๐ฌ๐ญ ๐‚๐จ๐ฅ๐ฅ๐ž๐ ๐ž with a ๐ญ๐ซ๐ข๐ฉ๐ฅ๐ž ๐ฆ๐š๐ฃ๐จ๐ซ in Computer Science, Mathematics, and Statistics, earning 2 ๐’๐ฎ๐ฆ๐ฆ๐š ๐‚๐ฎ๐ฆ ๐‹๐š๐ฎ๐๐ž theses, along with induction to ๐๐ก๐ข ๐๐ž๐ญ๐š ๐Š๐š๐ฉ๐ฉ๐š and ๐’๐ข๐ ๐ฆ๐š ๐—๐ข honor societies. His mathematical work includes building a Lean4-based machine-assisted proof framework for chip-firing and Graphical Riemannโ€“Roch alongside a Python package for simulations (chipfiring), while his statistics thesis led to โ€œccrvam,โ€ an open-source Python package for discrete copula modeling. He also completed six graduate-level math and statistics courses at ๐‚๐จ๐ฅ๐ฎ๐ฆ๐›๐ข๐š in one semester and continues advanced computational mathematics studies at ๐’๐ญ๐š๐ง๐Ÿ๐จ๐ซ๐. Outside the professional sphere, adventure calls out to Dhyey. Whether it's traversing new terrains, perfecting his archery shot, or engaging in a spirited badminton match, he's always up for a challenge. Other adventurous interests of Dhyey include jetskiing, ATV quad biking, and bungee jumping. ***Opinions, posts, and views expressed here are solely Dhyey Mavani's own and do not reflect that of his employers.***

Experience

2 yrs 4 mos
Total Experience
1 yr 2 mos
Average Tenure
1 yr 4 mos
Current Experience

Linkedin

3 roles

Machine Learning Engineer, Agentic AI (Special projects with C-suite)

Promoted

Jan 2025 โ€“ Present ยท 1 yr 4 mos

  • [Under NDA] Building agents with the office of CEO as the youngest machine learning research engineer working daily with Distinguished Engineers & SVP of Engineering
  • Led and published a research paper (coined "Support Tokens") with the Chief AI Officer.
Multi-agent SystemsArtificial Intelligence (AI)Machine LearningLarge Language Models (LLM)Product DevelopmentProduct Management

Software Engineer, Generative AI Inference

Jan 2025 โ€“ Jan 2025 ยท 0 mo

  • Optimized LLM inference in Hiring Agent (LangGraph); Increased throughput by 4x; Slashed latency by 66% with speculative decoding.
  • Built first-ever MCP integrations with Azure OpenAI deployment infra to improve model onboarding and k8s quota management.
  • Built SOTA fine-grained observability into the production vLLM and SGLang based hybrid serving stack with a centralized dashboard.
  • Recruited, Interviewed and Mentored Masters & PhDs on AI assisted coding and LLM serving specifics.
Software DevelopmentExecutive-level CommunicationPython (Programming Language)TensorFlowPyTorchMachine Learning+5

Software Engineer, GPU RAR (RecSys)

Jan 2024 โ€“ Jan 2024 ยท 0 mo ยท Sunnyvale, CA ยท On-site

  • Boldly pitched 3 novel product ideas to the CEO, Ryan Roslansky, and that led to further discussions with product leaders across 3 business units for formulating them for the future B2B2C strategies at LinkedIn.
  • Built & integrated an end-to-end Rust-based low-latency inference engine for Torch. Developed first production-ready PyTorch EBR kNN model with custom-filtering kernels.
  • In production machines, my work was being actively used to serve 5+ business use-cases leading to P99 inference latency reduction from 4 ms to 2 ms, and P99 tail data ingestion latency reduction from 70 ms to 7 ms. This also increased throughput by 4x, leading to GPU savings of ~ $1 million.
Executive-level CommunicationPython (Programming Language)TensorFlowPyTorchMachine LearningJava+6

Valkyrie trading

Quantitative Trader

Jan 2024 โ€“ Jan 2024 ยท 0 mo ยท Chicago, Illinois, United States ยท On-site

  • Developed a Monte-Carlo simulation-based strategy for theo-value estimation and shadowed quants/traders in treasuries and equity options desks. Gained hands-on experience in the execution and risk management of delta and gamma hedging market-making strategies through intensive mock sessions.
APIsSoftware DevelopmentQuantitative ResearchPython (Programming Language)

Amazon web services (aws)

Software Engineer

Jan 2023 โ€“ Jan 2023 ยท 0 mo ยท Greater Seattle Area ยท On-site

  • Populated a metrics dashboard, pushed latency optimizations to production at AWS CloudFront; Proposed & documented use-case of AutoML to analyze change-propagation/caching service logs to predict future outages
Automated Machine Learning (AutoML)JavaServer Pages (JSP)Python (Programming Language)SQLKotlinJava+3

Various companies

Fellow (list with details in description)

Jan 2022 โ€“ Jan 2023 ยท 1 yr ยท United States ยท On-site

  • โžก Columbia x Citadel Trading Challenge based on Auction Theory (New York, Apr 2023)
  • โžก Akuna Options 201 course (Virtual, Apr 2023)
  • โžก Schonfeld Early Engagement Summit (New York, Apr 2023)
  • โžก Amplify x Citadel Trading Challenge (New York, Feb 2023)
  • โžก D.E. Shaw & Co. Connect Event (New York, Sep 2022)
  • โžก D.E. Shaw & Co. Latitude Fellowship (New York, Aug 2022)
  • โžก Citadel Securities Invitational Terminal (Virtual, Aug 2022)
  • โžก Citadel Securities Invitational Datathon (Virtual, Jul 2022)
  • โžก Hudson River Trading BIPOC Tech Summit (Virtual, Jun 2022)
  • โžก Goldman Sachs Insights Program (Virtual, Apr 2022 - Jun 2022)
  • โžก Jane Street SEE Quant Research & Trading (New York, May 2022)
  • โžก Jane Street FOCUS Software Engineering (New York, Mar 2022)
  • โžก SIG Freshman Discovery Program (Virtual, Apr 2022)

Hp

Business Analyst & VC

Jan 2022 โ€“ Jan 2022 ยท 0 mo

  • Analyzed data using SQL & Excel, research and sourced deals for hybrid work model market based startups. Created a pitch deck with analysis for one of the companies to present to the relevant stakeholders.
Python (Programming Language)SQLData Analysis

Google

Computer Science Research Mentee (CSRMP Fellow Class of 2022B)

Jan 2022 โ€“ Jan 2022 ยท 0 mo ยท Remote ยท Remote

  • Selected as 1 of 90 fellows across the U.S. and Canada for the 2022B cohort. This fellowship,
  • focused on diversity in tech, recognizes academic excellence and commitment to impactful research. Fellows receive mentorship from leading experts and engage in cutting-edge computer science projects, developing advanced skills and contributing to real-world applications.
APIsResearch and Development (R&D)

Amherst college

Applied Researcher โ€“ Computational Mathematics & Statistics

Aug 2021 โ€“ Present ยท 4 yrs 9 mos

  • ccrvam - Python package on a novel discrete copula method (30k+ downloads in a few months) - summa cum laude honors
  • chipfiring - Python pakage on an open graph / number theory problem (7k+ downloads and industry adoption)
  • chip-firing-with-lean - First-ever formalization of chip firing dynamics in Lean4 (addressed a gap in Microsoft developed language); summa cum laude honors
  • QuantileFlow - state of the art performance quantile calculations in service logs (beating Datadogโ€™s Python; 6k+ downloads)
  • RBlocks - block-based programming interface for R; deployed in classrooms for accessibility; addressed gap in Googleโ€™s Blockly

Education

Amherst College

Bachelor's degree

Columbia University

Visiting Students Program (Exchange Semester) โ€” Graduate Coursework in Mathematics and Statistics with applications in Finance

AIT-Budapest

Study Abroad (Exchange Semester) โ€” Computer Software Engineering

Stanford University

NDO program (Sponsored by my employer)

Massachusetts Institute of Technology

Break Through Tech AI: Boston (Summer Extension Program) โ€” Artificial Intelligence

CodePath

Certificate in Advanced Software Engineering โ€” Computer Programming

Stackforce found 100+ more professionals with Artificial Intelligence (ai) & Machine Learning

Explore similar profiles based on matching skills and experience