Sahil Khose - AI Researcher
I'm a Ph.D. student in Computer Science at Georgia Tech, where I'm fortunate to be advised by Prof. Judy Hoffman. My research focuses on developing multimodal vision-language models that integrate spatial, semantic, and temporal reasoning with minimal supervision.
Recent work includes:
1. A 7B open-source VLM for open-vocabulary 3D scene graph generation.
2. SkyScenes, a synthetic aerial dataset for improving real-world segmentation, accepted at ECCV 2024.
3. A generalist multimodal LLM, where I designed a jointly-trained vision-audio model that outperforms larger generalist systems by reducing cross-modal interference.
I bring prior experience in domain generalization, zero-shot learning, and synthetic-to-real adaptation, focusing on making models robust to diversity, correlation, and semantic shifts in real-world environments. My goal is to build generalizable systems that require minimal labeled data yet remain reliable under distribution shifts. I also review for top conferences (NeurIPS, CVPR, ECCV) and have published across both vision and language communities.
💼 I'm currently looking for research internships for Summer 2026; feel free to reach out if you're hiring!
Website: https://sahilkhose.github.io/
Stackforce AI infers this person is a Computer Vision and AI Researcher with a focus on multimodal models.
Location: Atlanta, Georgia, United States
Experience: 12 yrs 3 mos
Career Highlights
- Developed a 7B open-source VLM for 3D scene graph generation.
- Designed a vision-audio model outperforming larger systems.
- Mentored 50 students in advanced machine learning techniques.
Work Experience
Georgia Institute of Technology
Doctoral Student (1 yr 7 mos)
Graduate Teaching Assistant (4 mos)
Graduate Research Assistant (3 yrs 2 mos)
FruitPunch AI Hyderabad
AI Expertise Head (11 mos)
Indian Institute of Science (IISc)
Research Intern (1 yr)
Research Society MIT Manipal
Research Mentor (1 yr 7 mos)
AI Division Member (7 mos)
Manipal Institute of Technology
Medical AI Research Assistant (1 yr 6 mos)
Project MANAS
AI Perception Developer (2 yrs 2 mos)
M. Prakash Academy - India
Student (5 yrs 2 mos)
Education
Doctor of Philosophy - PhD at Georgia Institute of Technology
Master of Science - MS at Georgia Institute of Technology
Bachelor of Technology at Manipal Institute of Technology