Rahul Sharma

Co-Founder

Seattle, Washington, United States10 yrs experience
Highly Stable

Key Highlights

  • PhD in Electrical and Computer Engineering from USC
  • Expert in video generation and understanding models
  • Experience with deep learning frameworks for computer vision
Stackforce AI infers this person is a Machine Learning and Computer Vision specialist in the tech industry.

Contact

Skills

Core Skills

Image ProcessingComputer VisionMachine Learning

Other Skills

AlgorithmsCC++Convolutional Neural NetworksCrowd Flow SegmentationData StructuresDeep LearningHTMLJavaLaTeXMatlabMicrosoft OfficePythonSignal Processing

About

I am an Applied Scientist at Amazon AGI, specializing in developing cutting-edge video generation models. I hold a PhD from the University of Southern California, where my research focused on advancing long-form video understanding models with a particular emphasis on entertainment media content.

Experience

10 yrs
Total Experience
2 yrs 2 mos
Average Tenure
1 yr 2 mos
Current Experience

Building in stealth mode

Founder

Mar 2025Present · 1 yr 2 mos · Seattle, Washington, United States · On-site

Amazon

2 roles

Applied Scientist

Promoted

Feb 2023Apr 2025 · 2 yrs 2 mos · Sunnyvale, California, United States · On-site

Applied Scientist II

May 2021Aug 2021 · 3 mos · California, United States

  • Worked with Trustworthy Alexa AI to explore the federated learning for NLP

Usc viterbi school of engineering

PHD Fellow

Aug 2017Jan 2023 · 5 yrs 5 mos · Los Angeles Metropolitan Area

Indian institute of technology, kanpur

Masters Student

Jul 2016May 2017 · 10 mos · IIT Kanpur

  • Worked on various projects related to image processing and computer vision.
  • Current working on the project named: Towards Multimodal Assessment of Speaker Performance in Public Speaking. We propose a computational framework for quantifying speaker performance in the context of public speaking. We do not consider the content of the talk, rather analyze the speech and the visual content to capture the verbal and non-verbal behavior of the speaker. Inspired by the celebrated acceptance of deep learning frameworks over conventional methods, we are trying to model the speaker's behaviour using various deep neural architectures.
  • In another of my recent works, I have developed a novel framework for segmenting crowd flow patterns from high density crowd videos. In this work, I have proposed an automated and completely unsupervised method to segregate crowd flows present in a video. The work has been accepted to appear in ICIP 2016.
  • Apart from this, I have also developed a framework for calculating the Pixel-wise saliency values in an image based on color contrast and structure contrast of the image.
Image ProcessingComputer VisionDeep LearningCrowd Flow Segmentation

Siemens technology india

Software Developer Internship

May 2015Jul 2015 · 2 mos · Greater Bengaluru Area

  • Worked with the Imaging and Computer Vision group.
  • Explored the implementation of Machine Learning Algorithms.
  • Developed a generic library for 'Random Forest Regression' using C++. Also worked on the implementation of Discriminative Deep Metric Learning using Convolutional Neural Networks.
Machine LearningC++Convolutional Neural Networks

Systemantics india pvt. ltd.

Research Internship

Dec 2014Dec 2014 · 0 mo · Greater Bengaluru Area

  • Worked on the project named 'Cloud access for robot monitoring and support'. Explored solutions for providing both local and remote access to the robot controller using embedded wireless modules. Investigated use of Spark and Edison wireless modules and tested cloud access with both modules.

Gt silicon pvt ltd

Research Internship

Jan 2014Jul 2014 · 6 mos · Greater Lucknow Area

  • Worked on the software implementation of a Foot Mounted Inertial Navigation System, also named ‘OPENSHOE’ which is an open source software created by Signal Processing Lab, KTH Royal Institute of Technology, Sweden.
  • Also created an Android Application which can communicate with the hardware of the OPENSHOE over Bluetooth socket and thus can collect the readings from the device and is further able to plot the walking paths of the user in real time.

Avanti fellows

Content Development Internship

May 2013Jul 2013 · 2 mos · Greater Delhi Area

  • Designed their curriculum based on the special peer learning pedagogy introduced by Dr. Eric Mazur (Prof. with the Physics department at Harvard University).
  • Created high quality conceptual tests for the aspirants of Engineering Entrance Exams such as JEE and AIEEE.

The learning point

Virtual Internship

May 2013Jul 2013 · 2 mos

  • The Learning Point is an initiative in open education, which aims to create a large scale and open repository of tutorials, quizzes and visualizations for students in high school and college.
  • I created high quality tutorials relating College level Mathematics and Chemistry. These tutorials were included in the Learning Point’s repository of tutorials.

Education

University of Southern California

Doctor of Philosophy - PhD — Electrical and Computer Engineering

Jan 2017Jan 2022

Indian Institute of Technology, Kanpur

BTech - MTech Dual Degree — Electrical Engineering

Jan 2012Jan 2017

Stackforce found 100+ more professionals with Image Processing & Computer Vision

Explore similar profiles based on matching skills and experience