Ram Ramrakhya

AI Researcher

Atlanta, Georgia, United States8 yrs 6 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Expert in Machine Learning and Deep Learning.
  • Research on AI agents for human-like task execution.
  • Experience with multi-modal LLMs and computer vision.
Stackforce AI infers this person is a Machine Learning and AI specialist focused on research and development.

Contact

Skills

Core Skills

Machine LearningDeep LearningComputer VisionSoftware DevelopmentData Engineering

Other Skills

3D scene graphsAnalytics data processingAngularJSAutomationC++Collaborative platformsCore JavaData pipelinesDjangoETL pipelinesEmbodied agentsFeature designKerasMongoDBMulti-modal LLMs

About

Problem solving and deep learning enthusiast. My research interests are at the juncture of software engineering and AI, solving problems which lie at the intersection of Computer Vision, Machine Learning and Natural Language Processing.

Experience

Apple

Research Intern

Jan 2025Aug 2025 · 7 mos · New York, United States · On-site

  • Building generalist computer use agent by leveraging multi-modal LLMs for automating day-to-day task execution on web and UI on any device (iPhone, Android, iPad, etc).
Multi-modal LLMsAutomationUser InterfaceMachine Learning

Meta

Research Scientist Intern

May 2024Dec 2024 · 7 mos · Seattle, Washington, United States · On-site

  • Built a method to adapt multi-modal large language models using online reinforcement learning to train embodied agents that can interact with the environment and communicate with humans in natural language to resolve ambiguity when required.
Multi-modal large language modelsReinforcement LearningMachine Learning

Allen institute for ai (ai2)

Research Intern

May 2023Aug 2023 · 3 mos · Seattle, Washington, United States · On-site

  • Built a method leveraging off-the-shelf vision foundation models to learn visual common sense reasoning for localizing semantically meaningful placement for objects.
Vision foundation modelsVisual common sense reasoningComputer Vision

Mitsubishi electric research laboratories

Research Intern

May 2022Aug 2022 · 3 mos · Boston, Massachusetts, United States

  • Worked on building embodied agents that can learn to navigate and interact with objects using 3D scene graphs.
3D scene graphsEmbodied agentsMachine Learning

Georgia institute of technology

Graduate Research Assistant

Aug 2021Present · 4 yrs 7 mos · Atlanta, Georgia, United States

  • My research focuses on building agents that can see, reason, and interact reasonably. Currently, I am working on building embodied agents that can learn to explore from humans by imitating human-like behavior when trying to solve a task.
  • I am advised by Prof. Dhruv Batra and Abhishek Das and I collaborate closely with Prof. Devi Parikh.
Machine LearningDeep LearningComputer VisionNatural Language Processing

Cloudcv

2 roles

Google Summer of Code '20 - Mentor

Feb 2020Aug 2020 · 6 mos

  • Working as mentor for EvalAI project at CloudCV. Guiding contributors participating in Google Summer of Code 2019 to understand how open source development works.
  • Responsibilities involved are:
  • Reviewing work submitted
  • Helping them understand the project and resolve any difficulties faced.
  • Review the applicant proposal for a project
  • Help students throughout the GSOC timeline to complete the deliverables of the project.

Google Code In'19 - Organization Administrator

Dec 2019Jan 2020 · 1 mo

  • Working as a mentor for EvalAI project and managing other mentors at CloudCV. Guiding pre-university students to understand how open source development works.
  • Responsibilities involved are:
  • Reviewing work submitted
  • Helping them understand the project and resolve any difficulties faced.

Glance

Software Development Engineer 2

Aug 2019Aug 2021 · 2 yrs · Bengaluru, Karnataka, India

  • My work is around developing features that increase engagement of the platform using multiple software development frameworks and my knowledge of machine learning.
Software DevelopmentMachine Learning

Inmobi

2 roles

Software Development Engineer 2

Jul 2019Aug 2019 · 1 mo

  • Working on implementing ETL pipelines to process large scale analytics data on daily basis and implementing cache infrastructure support to improve the stability of our system.
ETL pipelinesAnalytics data processingData Engineering

Software Development Engineer

Jun 2018Jun 2019 · 1 yr

  • Working on implementing targeted content delivery system, content personalisation system using machine learning techniques and data pipelines for processing large scale analytics data.
Machine LearningData pipelines

Cloudcv

Google Summer of Code '18

Apr 2018Aug 2018 · 4 mos · Pune Area, India

  • Worked with CloudCV to make designing deep learning models easier with the help of Fabrik. Fabrik is an online collaborative platform to build, visualize and train deep learning models via a simple drag-and-drop interface. It allows researchers to collaboratively develop and debug models using a web GUI that supports importing, editing and exporting networks written in widely popular frameworks like Caffe, Keras, and TensorFlow.
Deep LearningCollaborative platformsMachine Learning

Interviewbit

Software Engineering Intern

Jan 2018Jun 2018 · 5 mos · Pune Area, India

  • Worked with InterviewBit for designing a platform to help users learn skills needed for technology jobs by adding end to end support for new features in InterviewBit website.
Feature designUser supportSoftware Development

Hackerearth

Problem Setter and Tester

Oct 2017Mar 2018 · 5 mos · Pune Area, India

  • Worked on designing algorithmic problems for competitive coding contests.

Nvidia

System Engineering Intern

Jul 2017May 2018 · 10 mos · Pune Area, India

  • Worked with NVIDIA's multimedia team for developing a test suite for NVIDIA's video encoder, providing highly efficient test suite execution environment which will allow developers to test the video encoder. And increasing the coverage of the tool which automates the tests by adding support for multiple category of tests for multimedia encoders and decoders.
Test suite developmentMultimedia encodingSoftware Development

Education

Georgia Institute of Technology

Doctor of Philosophy - PhD — Computer Science

Aug 2023May 2026

Georgia Institute of Technology

Master's degree — Computer Science

Aug 2021May 2023

Pune Institute of Computer Technology

Bachelor’s Degree — Information Technology

Jan 2015Jan 2018

Stackforce found 100+ more professionals with Machine Learning & Deep Learning

Explore similar profiles based on matching skills and experience

Ram Ramrakhya - AI Researcher | Stackforce