Yugal Jain — AI Researcher
- Experienced in building , Data Engineering (ETL) pipelines, end-to-end NLP pipelines and deep learning architectures. - Proficient in Python, PyTorch, Sklearn, Pandas, Transformers, ElasticSearch, Flask, Docker, AWS, Spacy, NLTK. - Have worked on supervised NLP problems such as Sequence Classification tasks(Aspect classification), Multi-Label Text Classification , Document Classification , Sentiment Analysis and Unsupervised Tasks such as Hierarchic Topic Modeling and Top2Vec. * Projects - Umeed : Advanced Language Analytics Tool(Delhi Police) • Experimented with NLP models to combat fake news, hate speech, and abusive content. Cleaned pre-existing data and trained Deep Learning model to detect abuse, traumatic and disturbing content in images and videos shared on socialmedia. • This is being undertaken to shed some light on the practical prospects of stopping the cycle of online crime, harassment and abuse. • Worked with multiple stakeholders to reduce the legal liabilities of models below the human level bias. - Auto Content Moderation System • Developed an auto content moderation system to provide family friendly web shows and integrated with speech,text,and vision system which has ability to detect and mute abusive videos. Used video classifcation model to classify NSFW videos and CMU sphinx for Text to Speech(TTS) to detect abusive words in audio and replace it with beep. • Proposed a mode named SAFE MODE/FAMILY MODE . This mode uses our ACS(Auto Censoring System) to minimize explicit visuals and abusive language from the content user wants to watch. Designed it in a way that it’s easily integrable with OTT platforms. - Social Media Bias Free Bot • This bot can reply unbiasedly on comments posted by users on social media platforms like reddit,twitter or discord and will try to stop spreading racism, religious bias on these platforms through positive sentiment comments. • Trained GPT-2 based model on Jigsaw Toxic Comment Classifcation Dataset to detect toxicity in comments and AG-News Dataset to reply according to topic of comments posted on subreddits like politics, sports , technology etc and deployed on discord and reddit
Stackforce AI infers this person is a Machine Learning Engineer specializing in Natural Language Processing and AI solutions.
Location: Rohtak, Haryana, India
Experience: 1 yr 4 mos
Skills
- Data Science
- Natural Language Processing (nlp)
- Large Language Models (llm)
- Machine Learning
- Optical Character Recognition (ocr)
- Computer Vision
- Deep Learning
- Machine Translation
Career Highlights
- Developed AI-based translation system for 100+ languages.
- Engineered real-time speech-to-text capabilities for sales meetings.
- Implemented OCR solutions for automated text extraction.
Work Experience
H10 AI
Machine Learning Engineer (4 mos)
Valona Intelligence
Data Science Consultant (3 yrs 3 mos)
Expedite Commerce
Machine Learning Engineer (7 mos)
Trantor
Machine Learning Engineer (10 mos)
Optima Ideas, s.r.o.
Data Science Consultant (3 mos)
Ritsumeikan University
Research Collaborator (2 mos)
BlackNet
Machine Learning Engineer (1 yr 1 mo)
Education
Bachelor of Technology at Guru Gobind Singh Indraprastha University