Mu Cai

AI Researcher

Madison, Wisconsin, United States · 6 yrs 8 mos experience
Most Likely To Switch

Key Highlights

  • Expert in multimodal models and self-supervised learning.
  • Research experience at top tech companies.
  • Strong background in 3D perception and autonomous systems.
Stackforce AI infers this person is a specialist in AI Research with a focus on multimodal models and self-supervised learning.

Skills

Core Skills

Multimodal Models · Research · Self-supervised Learning · 3D Perception · Generative Modeling · Autonomous Systems · Signal Processing

Other Skills

Multimodal Image/Video/Agent LLM · Visual Content Representation · Multimodal foundation models · 3D scene understanding · Large Multimodal Models · Image translation · Autonomous Driving · Causal Inference

About

Work on Gemini Multimodal:
  • Multimodal Image/Video/Agent LLM.
  • Visual Content Representation (e.g. CLIP/DINO style).

Experience

Total Experience: 6 yrs 8 mos
Average Tenure: 1 yr
Current Experience: 1 yr 1 mo

Google DeepMind

Research Scientist

Mar 2025 - Present · 1 yr 1 mo · Mountain View, California, United States

  • Gemini Multimodal Research
Multimodal Image/Video/Agent LLM · Visual Content Representation · Multimodal Models · Research

Microsoft

Research Intern

Mar 2024 - Dec 2024 · 9 mos · Redmond, Washington, United States · On-site

  • Research on multimodal foundation models, especially for videos and multimodal agents.
Multimodal foundation models · Research · Multimodal Models

Cruise

Research Intern

May 2023 - Dec 2023 · 7 mos · Sunnyvale, California, United States · On-site

  • Large Multimodal Models.
Large Multimodal Models · Multimodal Models

University of Wisconsin-Madison

3 roles

Graduate Teaching Assistant

Jan 2023 - May 2023 · 4 mos · Madison, Wisconsin, United States

Research Assistant

Promoted

Jan 2022 - Apr 2025 · 3 yrs 3 mos · Madison, Wisconsin, United States

  • Multimodal foundation models, self-supervised learning, 3D scene understanding.
  • 2021 UW Madison Computer Science Summer Research Award
Multimodal foundation models · Self-supervised learning · 3D scene understanding · Multimodal Models

Graduate Teaching Assistant

Sep 2020 - Jan 2022 · 1 yr 4 mos · Madison, Wisconsin, United States

QCraft

Research Intern

May 2022 - Aug 2022 · 3 mos · San Jose, California, United States

  • Multi-modality self-supervised learning for 3D perception
Self-supervised learning · 3D perception

Kuaishou Technology

Research Intern

Jun 2020 - Dec 2020 · 6 mos · Beijing, China

  • Image translation, generative modeling
Image translation · Generative modeling

SenseTime 商汤科技

Research Intern

Dec 2019 - Jun 2020 · 6 mos · Beijing, China

  • 3D perception
3D perception

University of California, Berkeley

Visiting Student Researcher

Jul 2019 - Nov 2019 · 4 mos · Berkeley, California, United States

  • Autonomous Driving
Autonomous Driving · Autonomous Systems

City University of Hong Kong

Student Researcher

Jan 2019 - Mar 2019 · 2 mos · Hong Kong SAR

  • Signal Processing. Causal Inference.
Signal Processing · Causal Inference

Education

University of Wisconsin-Madison

Doctor of Philosophy - PhD — Computer Sciences

Jan 2020 - Jan 2025

Xi'an Jiaotong University

Bachelor's degree — Electrical Engineering

Aug 2016 - Jul 2020
