Siddharth Choudhary — AI Researcher
I'm a Principal Applied Scientist at Amazon AGI, where I am a tech lead on the multimodal pre-training team for the Amazon Nova family of foundation models. My work focuses on developing next-generation AI systems that seamlessly integrate vision, speech, and language understanding with generative capabilities. Currently, I'm the tech lead for the next generation of Amazon Nova, architecting unified multimodal models that achieve state-of-the-art performance across understanding and generation tasks. My research spans AI and computer vision, from foundation models and vision-language systems to robotics and SLAM. I've published at top-tier venues including CVPR, IJRR, ICRA, and Nature Digital Medicine, with over 1,200 citations and an h-index of 14. My work on multimodal hallucination control was featured at CVPR 2024 and highlighted in AWS's keynote presentation. Before Amazon, I was a Principal Computer Vision Engineer at Magic Leap, where I architected the 3D object recognition system deployed in Magic Leap One. I earned my Ph.D. in Computer Science from Georgia Tech, focusing on distributed algorithms for multi-robot SLAM systems. I'm passionate about pushing the boundaries of multimodal LLMs and building systems that bridge the gap between understanding and generation. Check out https://itzsid.github.io/ for up-to-date information.
Stackforce AI infers this person is a leading expert in AI and computer vision with a focus on multimodal systems.
Location: Dublin, California, United States
Experience: 16 yrs 6 mos
Skills
- Multimodal AI
- Foundation Models
- Computer Vision
- Machine Learning
Career Highlights
- Tech lead on the multimodal pre-training team for the Amazon Nova family of foundation models.
- Published at top-tier venues with over 1,200 citations.
- Developed multimodal AI systems integrating vision, speech, and language.
Work Experience
Amazon AGI
Principal Applied Scientist (11 mos)
Senior Applied Scientist (1 yr 3 mos)
Amazon Web Services (AWS)
Senior Applied Scientist (1 yr 11 mos)
Amazon Lab126
Senior Applied Scientist (2 yrs 9 mos)
Magic Leap
Principal Computer Vision Researcher/Engineer (2 yrs 10 mos)
Fyusion, Inc
Research Intern (3 mos)
Georgia Institute of Technology
Graduate Research Assistant (5 yrs)
Google Summer of Code Scholar (3 mos)
IIIT Hyderabad
Research Assistant (2 yrs 3 mos)
DrishtiCare
Research Intern (1 yr 1 mo)
Education
Doctor of Philosophy (PhD) at Georgia Institute of Technology
Master of Science (M.S.) at International Institute of Information Technology Hyderabad (IIITH)
Bachelor of Technology at International Institute of Information Technology Hyderabad (IIITH)