Vaibhav Vats

AI Researcher

San Francisco, California, United States · 4 yrs 2 mos experience
Most Likely To Switch

Key Highlights

  • Reduced training data for LLMs by 90%
  • Improved performance of LLMs by 18%
  • Developed scalable frameworks for NLP applications
Stackforce AI infers this person is a Machine Learning Engineer with a focus on Natural Language Processing and Web Development.

Skills

Core Skills

Natural Language Processing, Machine Learning, Data Analysis, Web Development, Android Development

Other Skills

AWS (EC2, S3, Lambda), AirFlow, Angular, BERT, Bash/Shell Scripting, C/C++, CSS, Cassandra, Docker, ETL pipelines, FAISS, Firebase, Flask, GCP, HTML

About

MS CS graduate from the University of Southern California, Los Angeles.

Skills:
  • Areas worked in: Natural Language Processing, Computer Vision, Machine Learning, Android Application Development, Web Scraping, Web Development
  • Languages: Python, Java, C/C++, JavaScript, Swift, Bash/Shell Scripting, HTML, CSS, SQL, NoSQL
  • Libraries: Pandas, NumPy, PySpark, PyTorch, TensorFlow, Scrapy, Plotly, Transformers, NLTK, OpenCV
  • Tools: Spark, Hadoop, Docker, AirFlow, MLFlow, SwiftUI, Angular, NodeJS, jQuery, REST, JSON, Android
  • Data & Cloud: Kubernetes, ETL pipelines, Oracle, Cassandra, MySQL, MongoDB, HDFS, AWS (EC2, S3, Lambda), GCP, Solr

GitHub: @fazevaib

Experience

4 yrs 2 mos
Total Experience
1 yr 4 mos
Average Tenure
2 yrs 6 mos
Current Experience

Salesforce

Data Scientist - Interactive AI

Dec 2023 - Present · 2 yrs 6 mos · Palo Alto, California, United States · On-site

University of Southern California

NLP Research Assistant

Jan 2023 - Nov 2023 · 10 mos · Los Angeles Metropolitan Area · Remote

  • Research Assistant at Signal Analysis and Interpretation Laboratory (SAIL), USC
  • Developed and tested an ML pipeline to support generative language models for zero-shot social media intelligence on distributed data systems using Pandas, AirFlow, PySpark, and Cassandra.
  • Reduced the training data LLMs need to reach similar performance by 90%, and improved the multi-label classification performance of large language models such as GPT-3, GPT-3.5, and FLAN-T5 by 18%.
Pandas, AirFlow, PySpark, Cassandra, Natural Language Processing, Machine Learning

Apple

Machine Learning Intern

May 2022 - Aug 2022 · 3 mos · Seattle, Washington, United States

  • Developed and tested, from scratch, a Python-based tool to display and analyze complex data states from ML architecture using PySpark, PyArrow, and Plotly, improving accuracy by 52%.
  • Created and maintained an ETL pipeline for the tool to support multiple large language models and speed up visualization by 40%.
  • Built Python and Bash scripts and worked on a system pipeline covering feature extraction, data preparation, data wrangling, model selection, training, testing, and deployment on large-scale distributed data systems in high-computation environments using PySpark, Hadoop, HDFS, Docker, and Kubernetes.
PySpark, PyArrow, Plotly, Docker, Kubernetes, Machine Learning, +1

Information Sciences Institute

Research Assistant

Feb 2022 - Apr 2022 · 2 mos · Los Angeles Metropolitan Area

  • Constructed a scalable zero-shot entity linking framework in Python on WikiData, with dense mapping, at the Centre of Knowledge Graphs for the Knowledge Graph Toolkit, drawing on multiple data sources; parallelized the data ingestion process for faster processing.
  • Built and embedded APIs into the Toolkit to map unseen entities in text to WikiData nodes; improved mapping performance by 23% using BERT-based bi-encoders, cross-encoders, and FAISS.
Python, BERT, FAISS, Natural Language Processing, Machine Learning

Logicquad Technologies Inc

Research Assistant

Aug 2019 - Jun 2020 · 10 mos · New Delhi, Delhi, India

  • Created a web application for AQI prediction and real-time image captioning using Angular, JavaScript, and Python; designed and tested APIs using Flask; built scripts to streamline the training and testing process.
  • Engineered an Android application in Java for text messaging; integrated chatbots trained with a seq2seq model, and built authentication, a user profile page, and custom bots based on user emotion, using Firebase as the data store.
Angular, JavaScript, Flask, Firebase, Web Development, Android Development

Arbunize

Machine Learning Intern

Jul 2018 - Dec 2018 · 5 mos · New Delhi Area, India

  • Created a job recommendation system in a team of 8; implemented resume parsing, stable matching, collaborative filtering, and custom NER to improve job matching accuracy from 88.12% to 94% using Python, NLTK, Scrapy, and TensorFlow.
  • Deployed, tested, and maintained websites on GCP; created a custom ontology similarity matching algorithm to improve job recommendations using Protégé, OWL, and NLTK.
Python, NLTK, Scrapy, TensorFlow, Machine Learning, Data Analysis

Education

University of Southern California

Master of Science - MS — Computer Science

Aug 2020 - Dec 2022

Guru Gobind Singh Indraprastha University

Bachelor's degree — Computer Science

Jan 2015 - Jan 2019

Shanti Gyan Niketan School

SSSC — Mathematics and Computer Science
