Kush Jain

Data Scientist

Mumbai, Maharashtra, India4 yrs 8 mos experience
AI ML PractitionerAI Enabled

Key Highlights

  • Expert in developing LLM-based applications.
  • Proven track record in NLP and data science projects.
  • Strong experience in e-commerce and telecom sectors.
Stackforce AI infers this person is a Data Scientist specializing in NLP and AI solutions for e-commerce and telecom industries.

Contact

Skills

Core Skills

Large Language Models (llm)Natural Language Processing (nlp)Machine Learning

Other Skills

Applied Machine LearningCalculusConvolutional Neural Networks (CNN)Data AnalysisData ClassificationData ScrapingDeep LearningDeep Neural Networks (DNN)FastAPIFlaskGenerative AILinear AlgebraLinear RegressionLogistic RegressionMathematical Modeling

About

Data Scientist with 4 years of experience in data modelling, NLP, traditional ML algorithms and other parts of the data pipeline, such as data scraping, data visualization, feature engineering, etc. I also have experience in new-age technologies like Generative AI, LLMs and RAG based applications and have developed solutions to use them in industry use cases.

Experience

Fractal

Data Scientist

Sep 2024Present · 1 yr 6 mos · Mumbai, Maharashtra, India · Hybrid

Jio

Data Scientist

Jul 2021Sep 2024 · 3 yrs 2 mos

  • LLM-based external knowledge chatbots: Worked on creating an e-commerce retail consumer-facing LLM-based chatbot, automating the complete product ordering journey.
  • QnA RAG chatbots: Created a dashboard API where users can do QnA with their documents via LLMs and vector databases like ChromaDB and FAISS.
  • TM Forum Jio Async API Hackathon Challenge: Was a part of Jio’s Hackathon team of 5 people, where we developed TMF AIVA, a multilingual LLM-based QA chatbot for Telecom use cases. Came in 2nd place in the same.
  • LLM Dashboard: Developed a complete end-to-end dashboard for users to finetune their custom Q&A dataset on any LLM and interact with the finetuned LLM directly in our chatbot interface.
  • IndicLens Dashboard: Developed a complete end-to-end dashboard for users to use our novel IndicLens pipeline and power their product catalogue with our novel multimodal and multilingual search capabilities.
  • Multilingual Hate Speech Classification: Ran multiple experiments for generating rich embeddings for hate speech tweets data and optimal semantic understanding of the text. Work is currently submitted for a patent.
  • Domain Prediction: Developed an ML-based end-to-end pipeline for classifying a query into multiple domains, using multiple data processing, cleaning and feature engineering techniques. Currently deployed in MyJio app and increased search CTR by 15.2%.
  • Indic Transliteration and Translation: Developed an end-to-end transliteration and translation pipeline and REST APIs for major Indian regional languages for all domains of Jio’s e-commerce platforms, increasing media and e-commerce search CTR by 12.4%.
  • Query Category Prediction: Developed an end-to-end pipeline and REST APIs for recognising categories to which the query belongs at every category level using NLP feature engineering and modelling techniques. Increased product search page CTR by 24%.
Python (Programming Language)Data AnalysisPyTorchLarge Language Models (LLM)Data ScrapingNatural Language Processing (NLP)+7

Crayon data

Data Science Intern

Jan 2021Apr 2021 · 3 mos

  • Developed a PoC for NLP-based extraction of product attributes from product descriptions for fashion products and tested the same with different algorithmic methods.
Data ScrapingNatural Language Processing (NLP)Problem Solving

Couture.ai

Data Science Intern

Aug 2020Dec 2020 · 4 mos

  • Worked on various modules towards the development of search engine pipeline for a fashion e-commerce platform along with other data science analysis, research and experimentation tasks related to the pipeline.
Data ScrapingNatural Language Processing (NLP)Problem Solving

Jio

AI Intern

May 2020Aug 2020 · 3 mos

  • Worked on search query optimization for AJIO Fashion and developed a knowledge graph API scalable to multiple e-commerce domains, titled J.A.N.K.I (Just a New Knowledge Graph).
Data ScrapingNatural Language Processing (NLP)Problem Solving

Artificial intelligence institute at university of south carolina

Research Intern

Apr 2020Aug 2020 · 4 mos

  • Worked under Prof. Amit Sheth on the topic of multimodal analysis for toxic tweet classification, using natural language processing, knowledge graphs and basic image processing techniques along with literature review and experimentation for the same.
Natural Language Processing (NLP)Mathematical Modeling

Ayeai

Summer Intern

Jul 2019Jul 2019 · 0 mo

  • A work from home internship based on an introduction to Image Recognition models and traffic light detection system for the project: AyeAI Autonomous Ambulance.

Indian red cross society -ircs (national headquarters, new delhi india)

Summer Intern

May 2019Jul 2019 · 2 mos · Mumbai, Maharashtra, India

Jio

Jio Digital Champions Student Learning Program

May 2018Jun 2018 · 1 mo · Mumbai, Maharashtra, India

Education

Birla Institute of Technology and Science, Pilani

Bachelor of Engineering - BE — Electrical and Electronics Engineering

Jan 2017Jan 2021

KC College

HSC (Maharashtra Board) — Science

Jan 2015Jan 2017

Bright Start Fellowship International School

IGCSE

Jan 2006Jan 2015

Stackforce found 100+ more professionals with Large Language Models (llm) & Natural Language Processing (nlp)

Explore similar profiles based on matching skills and experience