Alireza Khadem — AI Researcher
I am a Research Scientist on the GenAI Efficiency team at Google, where I work on improving the performance and efficiency of Gemini serving on Google TPU datacenters. I earned my PhD in Computer Science and Engineering from the University of Michigan, where my research focused on hardware–software co-design. My work centered on memory system optimization and the performance characterization, modeling, and optimization of HPC applications and ML models, including LLMs, CNNs, and GNNs. Previously, I interned at Apple and Microsoft Research, where I worked on design space exploration for LM serving on large-scale clusters and performance modeling of communication primitives.
Stackforce AI infers this person is a specialist in AI/ML and HPC with a focus on performance optimization and accelerator design.
Location: Sunnyvale, California, United States
Experience: 7 yrs 1 mo
Skills
- Accelerator Design
- Performance Optimization
- Hardware-software Co-design
Career Highlights
- Expert in hardware-software co-design for ML applications.
- Proven track record in performance optimization for TPU datacenters.
- Innovative solutions in accelerator design and memory systems.
Work Experience
Research Scientist (5 mos)
Microsoft
Research Intern (2 mos)
Apple
Design Verification Intern (3 mos)
University of Michigan
Graduate Student Instructor (9 mos)
Graduate Student Research Assistant (5 yrs 5 mos)
University of Tehran
Research Assistant (1 yr 3 mos)
Education
Doctor of Philosophy - PhD at University of Michigan
Master's degree at University of Michigan
Bachelor's degree at University of Tehran
High School Diploma at Shahid Beheshti High School, Kashan