Sameer Jain

Machine Learning Engineer

Mountain View, California, United States · 4 yrs 3 mos experience
Most Likely To Switch

Key Highlights

  • Expert in NLP and LLMs with practical experience.
  • Achieved state-of-the-art performance in multilingual classification.
  • Developed innovative frameworks for text evaluation.
Stackforce AI infers this person is a Machine Learning Engineer specializing in Natural Language Processing and AI-driven solutions.

Skills

Core Skills

Natural Language Processing, Machine Learning

Other Skills

Large Language Models (LLM), C, Python, C++, Java, Data Structures, Algorithms, Information Retrieval, Deep Learning, Neural Networks, Deep Neural Networks (DNN)

About

I am a recent graduate of Carnegie Mellon University's Language Technologies Institute, with a Master's degree specializing in Machine Learning and Natural Language Processing (NLP). My research involved using Large Language Models (LLMs) to evaluate the quality of machine-generated text. I have also applied LLMs to low-resource languages to help non-profit organizations such as the World Wildlife Fund efficiently identify conservation-related news in regional languages. I have 2 years of industry experience applying NLP to products at Samsung Research and Supernormal (a generative AI startup), along with an internship at Meta.

Experience

Total Experience: 4 yrs 3 mos
Average Tenure: 1 yr 5 mos
Current Experience: 2 yrs 1 mo

Pinterest

Machine Learning Engineer II

Mar 2024 - Present · 2 yrs 1 mo · Palo Alto, California, United States

Supernormal

Machine Learning Engineer

Apr 2023 - Jan 2024 · 9 mos · Remote

  • Supernormal captures meetings and uses the transcripts to automatically generate notes, summaries, and action items with large language models (LLMs)
  • Engineered prompts and fine-tuned LLMs to improve note quality, and developed metrics for quality evaluation
Large Language Models (LLM), Natural Language Processing, Machine Learning

Meta

Software Engineer Intern

Jun 2022 - Aug 2022 · 2 mos · Menlo Park, California, United States

  • Worked with the Creative Delivery Team on ad enhancement through augmented reality effects
  • Built pipelines to identify impactful AR effects, and made transformation infra guardrails configurable

Carnegie Mellon University

Graduate Research Assistant

Jan 2021 - Jul 2023 · 2 yrs 6 mos · Pittsburgh, Pennsylvania, United States

  • 1. Multilingual Classification of Environment-related News using LLMs
  • Collaborated with the World Wide Fund for Nature (WWF) to identify multilingual news content related to environmental conservation
  • Designed a multi-step prompt pipeline to perform multilingual classification for low-resource languages, using in-context learning, chain-of-thought reasoning, and self-reflection on LLMs
  • Achieved state-of-the-art performance using <6% of the training data required by existing models
  • 2. Evaluation of Generated Text using In-context Learning (ACL Findings '23)
  • Designed a framework to use large language models as evaluators of artificially generated text using in-context learning
  • Engineered a prompt that achieves state-of-the-art evaluation of text summarization without fine-tuning the evaluation model on large datasets
  • Link: https://aclanthology.org/2023.findings-acl.537.pdf
  • 3. A Mixed-method Reflexive Analysis of the Fairness, Accountability, and Transparency Conference (FAccT '22)
  • Identified areas of focus in the fair machine learning domain through community detection in fair-ML citation networks and a metastudy of the FAccT conference.
  • Uncovered the moral underpinnings of FAccT against the framework provided by the Moral Foundations Theory.
  • Link: https://dl.acm.org/doi/pdf/10.1145/3531146.3533107
Large Language Models (LLM), Natural Language Processing, Machine Learning

Samsung R&D Institute India

Software Engineer

Jul 2019 - Dec 2020 · 1 yr 5 mos · Bangalore Urban, Karnataka, India

  • Worked with the Natural Language Understanding division of the Voice Intelligence team on the development of Bixby, Samsung Electronics' personal assistant.
  • Implemented an NLU engine to run under on-device resource constraints, thereby reducing server costs and improving offline performance.

University of Zurich

Research Intern

Jan 2019 - May 2019 · 4 mos · Zurich, Switzerland

  • Worked at the Language and Space Lab on an undergraduate thesis titled "Copy mechanisms for Upstream Text Processing".
  • Designed an encoder-decoder model with soft attention and a character copying mechanism to improve low-resource performance.
  • Applied the architecture to normalization of Swiss German and to morphological segmentation of English, German, and Indonesian.

Laboratoire Parole et Langage

Research Intern

May 2018 - Jul 2018 · 2 mos · Aix-En-Provence Area, France

  • Worked within the framework of the ACORFORMED project, which aims to develop an embodied conversational agent (ECA) to train doctors to break bad news.
  • Developed a model that uses verbal and non-verbal features to estimate the levels of presence and co-presence a participant experiences while interacting with an embodied conversational agent in a virtual reality environment.
  • Built random forest and support vector machine models to estimate presence and co-presence from the features obtained.

MapmyIndia

Summer Intern

May 2017 - Jul 2017 · 2 mos · Greater Delhi Area

  • Built a prototype image classifier to identify objects frequently encountered on roads, as a component of an autonomous car development project.
  • Developed a CNN-based model through transfer learning from pre-trained VGG-16 weights.

Education

Carnegie Mellon University School of Computer Science

Master of Science - MS — Intelligent Information Systems (Language Technologies Institute)

Jan 2021 - Jan 2022

Birla Institute of Technology and Science, Pilani

Bachelor of Engineering — Computer Science

Jan 2015 - Jan 2019

Amity International School, Noida

High School — Central Board of Secondary Education
