Jiun-Hao (Aaron) Jhan

AI Researcher

Mountain View, California, United States2 yrs 10 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Led a team to improve AI model accuracy significantly.
  • Published research in top-tier conferences.
  • Developed a multilingual classifier for Samsung Bixby.
Stackforce AI infers this person is a Machine Learning Engineer with a focus on Natural Language Processing and AI solutions.

Contact

Skills

Core Skills

Natural Language Processing (nlp)Machine Learning

Other Skills

TransformerPyTorchSoftware DevelopmentObject-Oriented Programming (OOP)PythonKerasCC++C#UnityMySQLApplied Machine LearningReinforcement LearningImage ProcessingData Structures

About

I am a master's student who graduated from software engineering at Carnegie Mellon University. I am looking for SDE, Applied ML, or Data Science 2024 full-time opportunities. I have professional skills in programming languages like Python, C/C++, Java, Javascript, HTML, etc. I'm also familiar with frameworks or tools, such as Pytorch, Keras, Scikit-learn, MongoDB, MySQL, Node.js, Express.js, Git, etc. I worked as an AI data scientist intern at Samsung Research America, leading 3 AI data scientists in a project to build multilingual intents and slots classifier using Pytorch and HuggingFace transformer framework based on Amazon’s Massive dataset and Samsung Bixby data. During the internship, I Improved intents and slots accuracy by 15% and 25% and published the work in EMNLP 2022 workshop. Furthermore, I finetuned Large Languages Models (LLMs), like Dolly, GPT-J, to increase the performance in intents classification and slot filling. For Samsung, I created a prototype product of multilingual intents and slots classifier on Samsung Bixby, allowing Bixby to understand and reply with 50 additional languages. I have data science experience as a Research Assistant in Prof. Hung-Yi Lee’s labs at National Taiwan University (NTU), focusing on Natural Language Processing, Chatbot, and Reinforcement Learning. During graduate school at NTU, I published three papers on Deep Learning-related top conferences (NAACL, NIPS, etc..). Moreover, I participated in an industry-academia collaboration project with Intel Labs, cooperating with Dr. Sahay Saurav. Additionally, I was a software engineering intern at MediaTek where I worked in Python and gained leadership experience. I led a Central Control Unit (CCU) project combining the team's expertise in modem to conduct software verification without human supervision by using Android Debug Bridge (ADB). CCU helped save 60% of human resources and 20% of the testing time period. I am excited to discuss machine learning and software engineering. If you have any questions, feel free to contact me.

Experience

2 yrs 10 mos
Total Experience
1 yr 5 mos
Average Tenure
2 yrs 1 mo
Current Experience

Apple

Machine Learning Engineer

Apr 2024Present · 2 yrs 1 mo · Cupertino, California, United States · On-site

Icpc - international collegiate programming contest

Volunteer

Jul 2023Apr 2024 · 9 mos · Remote

Samsung research america (sra)

AI Data Scientist Intern

May 2022Apr 2023 · 11 mos · Mountain View, California, United States

  • Led a team of 3 AI data scientists in developing a multilingual intents and slots classifier using PyTorch and XLM-Roberta, achieving a 15% and 25% improvement in accuracy.
  • Presented a zero-shot learning method that incorporated generated translated data and delexicalization for intent and slots classifiers.
  • Achieved 3rd place in the MMNLU Competition and published the work in EMNLP 2022.
  • Established an experimental environment on AWS m5.xlarge machine for model evaluation, exploring architectures such as XLM-Roberta and mT5 for the intents and slots classifier project.
  • Developed a prototype product for a multilingual intents and slots classifier on Samsung Bixby, expanding its language capabilities from processing and responding in a single language to 51 languages.
  • Optimized Large Language Models (LLMs) such as Dolly and GPT-J for improved intent and slot classification, implementing the LoRA algorithm to reduce trainable parameters by 10,000 times and GPU memory requirements by 3 times while maintaining accuracy.
  • Executed prompt engineering in inference with ChatGPT and Dolly for automated data annotation, achieving an 80% reduction in time and cost for data annotation.
TransformerNatural Language Processing (NLP)PyTorchMachine Learning

National taiwan university

Research Assistant

Jun 2020Dec 2020 · 6 mos · Taipei City, Taiwan

  • Built a framework to train Empathetic Chatbots (CheerBots), based on BERT and OpenAI-GPT models, to make interlocutors feel more sympathy while chatting using Pytorch.
  • Utilized BERT-based emotion predictor to examine empathy valence between interlocutors’ emotional state as rewards and optimized rewards to arouse sympathy using Reinforcement Learning.
  • Developed a framework whereby several CheerBots are based on understanding users' implied feelings and replying empathetically for multiple dialogues turns.
  • Published "CheerBots: Chatbots toward Empathy and Emotion using Reinforcement Learning" in NIPS 2020 workshop.
  • Received the Best Paper Award and achieved SOTA performance on the EmpatheticDialogues dataset.

Intel labs

Industry-academia collaboration

Jun 2020Dec 2020 · 6 mos · Taipei City, Taiwan

  • Concentrated on a controllable chatbot to guide human beings combining expertise in Natural Language Processing and Meta-reinforcement Learning.
  • Proposed a framework to train Guiding Chatbots based on DialoGPT using Reinforcement Learning on Pytorch to induce the interlocutors to reply with intentions, such as long responses, joyful responses, and responses with specific words.
  • Established three different chatbots, GoogleBot, fixed-weight DialoGPT chatbot, and BERT-based retrieval chatbot, to simulate human beings’ behaviors.
  • Validated the effectiveness of chatbots to improve rewards by an average of 20 times over 3 intentions.
  • Published the work “Put Chatbot into its Interlocutor's Shoes: New Framework to learn Chatbot responding with Intention” on NAACL 2021.

Mediatek

Software Engineering Intern

Jul 2018Sep 2018 · 2 mos · Hsinchu City, Taiwan

  • Led 7 software engineers in a project, called Central Control Unit (CCU), to automate software verification pipelines on modem testing by using Android Debug Bridge and Python.
  • Developed an internal web tool to visualize modem testing results, resulting in a 50% time savings for testers and analysts during result checks and analysis.
  • Saved 60% of human resources (16 people to 5 people) and 20% of the testing time (18hrs to 14hrs).

Academia sinica, taiwan

Intern

Jul 2017Sep 2017 · 2 mos · Taipei City, Taiwan

  • Researched DNA sequence assembly in a group that combines bioinformatics expertise
  • Saved 250% of the time to perform the DNA sequences assembly using Canu against using PBcR Algorithm.
  • Accelerated the execution time of the read classification in the Sam file by 100% in a larger file (70G) and 20~50% in a smaller file (20G)

Education

Carnegie Mellon University

Master of Science - MS — Software Engineering

Jan 2022May 2023

National Taiwan University

Master of Science - MS

Feb 2018Jun 2020

National Sun Yat-sen University

Bachelor of Science - BS — Computer Science and Engineering

Sep 2013Jan 2018

Stackforce found 100+ more professionals with Natural Language Processing (nlp) & Machine Learning

Explore similar profiles based on matching skills and experience

Jiun-Hao (Aaron) Jhan - AI Researcher | Stackforce