Joel Moniz — AI Researcher

I am currently a Staff Machine Learning Engineer at DoorDash. Previously, I was in the Natural Language Modeling and Conversational Intelligence teams within Siri at Apple. I completed my master's degree (MS in Computer Science) at CMU, and graduated in December 2018. I completed an internship with Siri's Natural Language Modeling Team at Apple in the summer of 2018. Before my MS, I spent a year working as a Software Engineer in the Cybersource team at Visa. Prior to this, I did my bachelor's degree in Computer Science from Birla Institute of Technology and Science, Pilani, Pilani Campus, and completed my undergraduate thesis at Montreal Institute of Learning Algorithms, Université de Montréal on Deep Learning techniques in Computer Vision (face verification, image classification). I have interned twice with The Processing Foundation (as part of Google Summer of Code 2014 and 2015), with Webmobi, and with the Computer Division at Indira Gandhi Centre for Atomic Research. My interests span a broad range of sub-fields in Computer Science, including Deep Learning, Machine Learning, Computer Vision and Natural Language Processing. I am comfortable with Python, C++, C, Java and Git. I have extensive experience with various deep learning and machine learning frameworks, Tensorflow and Pytorch in particular. I am familiar with parallel frameworks such as OpenMP-MPI and Apache Spark. I am also extremely good at understanding and modifying existing code bases, and am quick to pick up technical concepts.

Stackforce AI infers this person is a Machine Learning Engineer with expertise in AI-driven applications across technology and fintech sectors.

Location: San Francisco, California, United States

Experience: 9 yrs 1 mo

Skills

Deep Learning
Machine Learning
Natural Language Processing
Software Engineering

Career Highlights

Led advanced projects in conversational AI at Apple.
Expert in deep learning and natural language processing.
Proven track record in improving machine learning systems.

Work Experience

DoorDash

Staff Machine Learning Engineer (1 yr 8 mos)

Apple

Senior Machine Learning Engineer (3 yrs 4 mos)

Machine Learning Engineer (2 yrs 3 mos)

Machine Learning Intern (3 mos)

Visa

Software Engineer (11 mos)

Mila - Quebec Artificial Intelligence Institute

Deep Learning Research Intern (5 mos)

The Processing Foundation

GSoC Participant (3 mos)

WebMobi

Big Data Intern (1 mo)

The Processing Foundation

GSoC Participant (3 mos)

Indira Gandhi Centre for Atomic Research

Research Intern (2 mos)

Education

Master of Science at Carnegie Mellon University

Bachelor of Engineering (Hons.) at Birla Institute of Technology and Science, Pilani

at Rajagiri Public School

Joel Moniz

AI Researcher

San Francisco, California, United States9 yrs 1 mo experience

Highly Stable

Key Highlights

Led advanced projects in conversational AI at Apple.
Expert in deep learning and natural language processing.
Proven track record in improving machine learning systems.

Stackforce AI infers this person is a Machine Learning Engineer with expertise in AI-driven applications across technology and fintech sectors.

Contact

Skills

Core Skills

Deep LearningMachine LearningNatural Language ProcessingSoftware Engineering

Other Skills

Foundation ModelsImage ProcessingText UnderstandingVideo Content AnalysisVideo AnalysisLarge Language ModelsConversational IntelligenceCoreference ResolutionData CollectionModelingData ModelingNamed Entity RecognitionOn-device SearchContact ResolutionSequence-to-Sequence Models

About

Experience

9 yrs 1 mo

Total Experience

1 yr 10 mos

Average Tenure

1 yr 8 mos

Current Experience

Doordash

Staff Machine Learning Engineer

Aug 2024 – Present · 1 yr 8 mos · San Francisco Bay Area · On-site

Technical/Team Lead, Menu Media Pod: Leading a team that uses Deep Learning and Foundation Models to understand and improve images, text, and videos. Projects include:
automatically understanding and moderating images to cut down the time for an image to be approved from days to minutes
inferring descriptions of menu items using various retrieval augmented generation, multimodal signals and common-sense reasoning
improving and re-plating dishes in photographs to improve presentability and to bring them up to menu quality without misrepresenting the dish, its content, or its ingredients
performing content understanding on videos to identify when a video, much like my life, centers around food, as well as using multimodal reasoning and retrieval augmented generation to identify what dishes were featured in a video

Deep LearningFoundation ModelsImage ProcessingText UnderstandingVideo Content AnalysisMachine Learning

Apple

3 roles

Senior Machine Learning Engineer

Promoted

Apr 2021 – Aug 2024 · 3 yrs 4 mos · San Francisco Bay Area

Senior Machine Learning Engineer working in Siri Conversational Intelligence Team
Orchestrated a team of skilled engineers to leverage contextual signals within Large Language Models to enhance Conversational Fluency and Disambiguation-related experiences in Siri and Apple Intelligence. Spearheaded highly collaborative efforts across cross-organizational teams. Role subsumed responsibilities below.
Project Lead: Led a team to improve how Siri handles conversational follow-ups, using on-device coreference resolution to resolve ordinal, deictic and descriptive references [which comprises a part of the system published at CRAC 2023]. Responsible for the end-to-end project lifecycle, including architecture design, data collection, modeling, platform and internationalization. Successfully delivered user-facing features, fostering cross-organizational collaboration. Current owner of Siri’s entire ML-powered reference resolution stack.
Modeling and Data Lead: Improved how Siri handles corrections and disfluencies using query rewriting.

Large Language ModelsConversational IntelligenceCoreference ResolutionData CollectionModelingNatural Language Processing+1

Machine Learning Engineer

Jan 2019 – Apr 2021 · 2 yrs 3 mos · San Francisco Bay Area

Machine Learning Engineer working in Natural Language Modeling sub-team within Siri's Natural Language Understanding division
Used on-device search and personal signals to move Siri’s parsing and contact resolution on-device in 13 locales, reducing errors by up to 20% (relative) over the server while simultaneously improving latency and privacy
Applied state-of-the-art Named Entity Recognition and Normalization techniques to improve how Siri processes named entities such as movies and songs [published at NAACL 2021 Industry Track and Interspeech 2021], including via the novel use of speech/multimodal signals [published at 3rd NLP4ConvAI Workshop]

Natural Language ProcessingNamed Entity RecognitionOn-device SearchContact Resolution

Machine Learning Intern

May 2018 – Aug 2018 · 3 mos · San Francisco Bay Area

Machine Learning Intern with the Siri Natural Language Modeling Team
Improved Named Entity Normalization by using Deep Generative Sequence-to-Sequence Models to learn commonly observed error patterns

Deep LearningSequence-to-Sequence Models

Visa

Software Engineer

Aug 2016 – Jul 2017 · 11 mos

Added point-of-sale support for the FDI Global and Amex Direct gateways; Implemented the auto-auth reversal after void service for Amex Direct; Improved the CTP Sale API
Designed and implemented a novel, near-zero technical debt run-time feature toggling framework
National Winner, Visa Global Hackathon: Implemented an online infrastructure for seamless, one-click microtransactions using VISA Checkout APIs

Software DevelopmentAPI DevelopmentSoftware Engineering

Mila - quebec artificial intelligence institute

Deep Learning Research Intern

Jan 2016 – Jun 2016 · 5 mos · Montreal, Canada Area

Undergraduate thesis on Deep Learning in Computer Vision:
Designed and implemented a novel Relative Face Pose Normalization and keypoint Depth Prediction with Recombinator Networks and a Spatial Transformer-like architecture without any ground-truth depths during either the training or testing stages, implemented using Theano and Lasagne
Devised Convolutional Residual Memory Networks, a novel Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) Network based deep architecture inspired by Highway Networks and Residual Networks, both for Structured Face Keypoint Prediction with drastically reduced computational complexity (by a factor of 23x) and for Image Classification achieving near state-of-the-art accuracies
Proposed Nested LSTMs, which outperform Stacked LSTMs on Character-level Sequence Prediction

Deep LearningComputer Vision

The processing foundation

GSoC Participant

May 2015 – Aug 2015 · 3 mos · Google Summer of Code

Created a new Read-Evaluate-Print-Loop (REPL) Mode for Processing
Worked on replacing the JEditTextArea with the more feature-packed RSyntaxTextArea, and on adding improved versions of several features (such as syntax and error highlighting and auto-complete support) into the new IDE

Software DevelopmentPredictive Modeling

Webmobi

Big Data Intern

Dec 2014 – Jan 2015 · 1 mo · Bengaluru Area, India

Worked on predictive customer scoring using Twitter feeds and other data signals such as information pertaining to firmographic data, company performance and so on, both with and without SAP HANA, and on integrating this predictive customer scoring system with Salesforce
Proposed and coded up a basic predictive model, including natural language processing tasks such as tweet pre-processing, named entity recognition and sentiment analysis, as well as interfaces to Twitter, Salesforce and SAP HANA APIs using Java

The processing foundation

GSoC Participant

May 2014 – Aug 2014 · 3 mos · Google Summer of Code

Worked on the improvement of the Contributions Manager
Added in new features such as a new type of contribution (examples-packages); enabled removal, install and update of contributions without restarting Processing; improved the UI of the Manager; improved how it handles the lack of an internet connection

Indira gandhi centre for atomic research

Research Intern

May 2014 – Jul 2014 · 2 mos · Kalpakkam, Tamil Nadu

Worked on the Application Oriented Benchmarking of a GPU Cluster using the Nanoscale Molecular Dynamics program (NAMD):
Analyzed what combination of CPUs and GPUs provides optimum performance, and how the performance varies for different CPU-GPU combinations
Analyzed how efficiently resources are used for a particular task and when adding an extra resource provides a speedup sufficient to warrant its addition
Analyzed how different network architectures and protocols affect performance using the network monitoring tool Ganglia