Joel Moniz

AI Researcher

San Francisco, California, United States9 yrs 1 mo experience
Highly Stable

Key Highlights

  • Led advanced projects in conversational AI at Apple.
  • Expert in deep learning and natural language processing.
  • Proven track record in improving machine learning systems.
Stackforce AI infers this person is a Machine Learning Engineer with expertise in AI-driven applications across technology and fintech sectors.

Contact

Skills

Core Skills

Deep LearningMachine LearningNatural Language ProcessingSoftware Engineering

Other Skills

Foundation ModelsImage ProcessingText UnderstandingVideo Content AnalysisVideo AnalysisLarge Language ModelsConversational IntelligenceCoreference ResolutionData CollectionModelingData ModelingNamed Entity RecognitionOn-device SearchContact ResolutionSequence-to-Sequence Models

About

I am currently a Staff Machine Learning Engineer at DoorDash. Previously, I was in the Natural Language Modeling and Conversational Intelligence teams within Siri at Apple. I completed my master's degree (MS in Computer Science) at CMU, and graduated in December 2018. I completed an internship with Siri's Natural Language Modeling Team at Apple in the summer of 2018. Before my MS, I spent a year working as a Software Engineer in the Cybersource team at Visa. Prior to this, I did my bachelor's degree in Computer Science from Birla Institute of Technology and Science, Pilani, Pilani Campus, and completed my undergraduate thesis at Montreal Institute of Learning Algorithms, Université de Montréal on Deep Learning techniques in Computer Vision (face verification, image classification). I have interned twice with The Processing Foundation (as part of Google Summer of Code 2014 and 2015), with Webmobi, and with the Computer Division at Indira Gandhi Centre for Atomic Research. My interests span a broad range of sub-fields in Computer Science, including Deep Learning, Machine Learning, Computer Vision and Natural Language Processing. I am comfortable with Python, C++, C, Java and Git. I have extensive experience with various deep learning and machine learning frameworks, Tensorflow and Pytorch in particular. I am familiar with parallel frameworks such as OpenMP-MPI and Apache Spark. I am also extremely good at understanding and modifying existing code bases, and am quick to pick up technical concepts.

Experience

9 yrs 1 mo
Total Experience
1 yr 10 mos
Average Tenure
1 yr 8 mos
Current Experience

Doordash

Staff Machine Learning Engineer

Aug 2024Present · 1 yr 8 mos · San Francisco Bay Area · On-site

  • Technical/Team Lead, Menu Media Pod: Leading a team that uses Deep Learning and Foundation Models to understand and improve images, text, and videos. Projects include:
  • automatically understanding and moderating images to cut down the time for an image to be approved from days to minutes
  • inferring descriptions of menu items using various retrieval augmented generation, multimodal signals and common-sense reasoning
  • improving and re-plating dishes in photographs to improve presentability and to bring them up to menu quality without misrepresenting the dish, its content, or its ingredients
  • performing content understanding on videos to identify when a video, much like my life, centers around food, as well as using multimodal reasoning and retrieval augmented generation to identify what dishes were featured in a video
Deep LearningFoundation ModelsImage ProcessingText UnderstandingVideo Content AnalysisMachine Learning

Apple

3 roles

Senior Machine Learning Engineer

Promoted

Apr 2021Aug 2024 · 3 yrs 4 mos · San Francisco Bay Area

  • Senior Machine Learning Engineer working in Siri Conversational Intelligence Team
  • Orchestrated a team of skilled engineers to leverage contextual signals within Large Language Models to enhance Conversational Fluency and Disambiguation-related experiences in Siri and Apple Intelligence. Spearheaded highly collaborative efforts across cross-organizational teams. Role subsumed responsibilities below.
  • Project Lead: Led a team to improve how Siri handles conversational follow-ups, using on-device coreference resolution to resolve ordinal, deictic and descriptive references [which comprises a part of the system published at CRAC 2023]. Responsible for the end-to-end project lifecycle, including architecture design, data collection, modeling, platform and internationalization. Successfully delivered user-facing features, fostering cross-organizational collaboration. Current owner of Siri’s entire ML-powered reference resolution stack.
  • Modeling and Data Lead: Improved how Siri handles corrections and disfluencies using query rewriting.
Large Language ModelsConversational IntelligenceCoreference ResolutionData CollectionModelingNatural Language Processing+1

Machine Learning Engineer

Jan 2019Apr 2021 · 2 yrs 3 mos · San Francisco Bay Area

  • Machine Learning Engineer working in Natural Language Modeling sub-team within Siri's Natural Language Understanding division
  • Used on-device search and personal signals to move Siri’s parsing and contact resolution on-device in 13 locales, reducing errors by up to 20% (relative) over the server while simultaneously improving latency and privacy
  • Applied state-of-the-art Named Entity Recognition and Normalization techniques to improve how Siri processes named entities such as movies and songs [published at NAACL 2021 Industry Track and Interspeech 2021], including via the novel use of speech/multimodal signals [published at 3rd NLP4ConvAI Workshop]
Natural Language ProcessingNamed Entity RecognitionOn-device SearchContact Resolution

Machine Learning Intern

May 2018Aug 2018 · 3 mos · San Francisco Bay Area

  • Machine Learning Intern with the Siri Natural Language Modeling Team
  • Improved Named Entity Normalization by using Deep Generative Sequence-to-Sequence Models to learn commonly observed error patterns
Deep LearningSequence-to-Sequence Models

Visa

Software Engineer

Aug 2016Jul 2017 · 11 mos

  • Added point-of-sale support for the FDI Global and Amex Direct gateways; Implemented the auto-auth reversal after void service for Amex Direct; Improved the CTP Sale API
  • Designed and implemented a novel, near-zero technical debt run-time feature toggling framework
  • National Winner, Visa Global Hackathon: Implemented an online infrastructure for seamless, one-click microtransactions using VISA Checkout APIs
Software DevelopmentAPI DevelopmentSoftware Engineering

Mila - quebec artificial intelligence institute

Deep Learning Research Intern

Jan 2016Jun 2016 · 5 mos · Montreal, Canada Area

  • Undergraduate thesis on Deep Learning in Computer Vision:
  • Designed and implemented a novel Relative Face Pose Normalization and keypoint Depth Prediction with Recombinator Networks and a Spatial Transformer-like architecture without any ground-truth depths during either the training or testing stages, implemented using Theano and Lasagne
  • Devised Convolutional Residual Memory Networks, a novel Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) Network based deep architecture inspired by Highway Networks and Residual Networks, both for Structured Face Keypoint Prediction with drastically reduced computational complexity (by a factor of 23x) and for Image Classification achieving near state-of-the-art accuracies
  • Proposed Nested LSTMs, which outperform Stacked LSTMs on Character-level Sequence Prediction
Deep LearningComputer Vision

The processing foundation

GSoC Participant

May 2015Aug 2015 · 3 mos · Google Summer of Code

  • Created a new Read-Evaluate-Print-Loop (REPL) Mode for Processing
  • Worked on replacing the JEditTextArea with the more feature-packed RSyntaxTextArea, and on adding improved versions of several features (such as syntax and error highlighting and auto-complete support) into the new IDE
Software DevelopmentPredictive Modeling

Webmobi

Big Data Intern

Dec 2014Jan 2015 · 1 mo · Bengaluru Area, India

  • Worked on predictive customer scoring using Twitter feeds and other data signals such as information pertaining to firmographic data, company performance and so on, both with and without SAP HANA, and on integrating this predictive customer scoring system with Salesforce
  • Proposed and coded up a basic predictive model, including natural language processing tasks such as tweet pre-processing, named entity recognition and sentiment analysis, as well as interfaces to Twitter, Salesforce and SAP HANA APIs using Java

The processing foundation

GSoC Participant

May 2014Aug 2014 · 3 mos · Google Summer of Code

  • Worked on the improvement of the Contributions Manager
  • Added in new features such as a new type of contribution (examples-packages); enabled removal, install and update of contributions without restarting Processing; improved the UI of the Manager; improved how it handles the lack of an internet connection

Indira gandhi centre for atomic research

Research Intern

May 2014Jul 2014 · 2 mos · Kalpakkam, Tamil Nadu

  • Worked on the Application Oriented Benchmarking of a GPU Cluster using the Nanoscale Molecular Dynamics program (NAMD):
  • Analyzed what combination of CPUs and GPUs provides optimum performance, and how the performance varies for different CPU-GPU combinations
  • Analyzed how efficiently resources are used for a particular task and when adding an extra resource provides a speedup sufficient to warrant its addition
  • Analyzed how different network architectures and protocols affect performance using the network monitoring tool Ganglia

Education

Carnegie Mellon University

Master of Science — Computer Science

Jan 2017Jan 2018

Birla Institute of Technology and Science, Pilani

Bachelor of Engineering (Hons.) — Computer Science

Jan 2012Jan 2016

Rajagiri Public School

Jan 1998Jan 2012

Stackforce found 100+ more professionals with Deep Learning & Machine Learning

Explore similar profiles based on matching skills and experience