Kajal Yadav

Data Scientist

Delhi, India4 yrs 7 mos experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Expert in NLP and Generative AI solutions.
  • Proven track record in building impactful data-driven applications.
  • Strong community engagement through teaching and content creation.
Stackforce AI infers this person is a Data Scientist specializing in AI and NLP across various industries.

Contact

Skills

Core Skills

Data ScienceNlpMachine LearningGenerative AiContent CreationDeep LearningBusiness Consulting

Other Skills

AI PromptingAWSAnalysisAnalytic Problem SolvingAnalytical SkillsBig DataBig Data AnalyticsBusiness Decision MakingC++ChatGPTCollaborative Problem SolvingCompetitive AnalysisContextual AnalysisContextual ResearchCreative Problem Solving

About

I'm a Data Scientist by profession, an AI enthusiast by passion, and at times night owl by habit — constantly fighting to rise early, & constantly driven by the pursuit of meaningful, scalable, and impactful solutions. With hands-on experience in Machine Learning, Deep Learning, NLP, Generative AI, and Computer Vision, I specialise in designing and deploying intelligent systems across domains like Healthcare, Retail, Entertainment, Finance, and Education, E-Commerce. My primary toolkit includes Python (go-to language), R, Java, along with SQL, AWS, and Big Data tools—paired with a deep understanding of statistics, probability, optimisation, and system thinking. Professionally, I've been building real-world NLP and GenAI solutions at Fractal, leveraging LLMs in enterprise-grade use cases and contributing to the evolution of scalable architectures. Earlier at Hexo AI, I worked hands-on with diffusion models, AWS cloud services, and end-to-end deployment of GenAI pipelines, including optimisation for performance, latency, and cost. These experiences shaped my ability to move fast, think in systems, and contribute meaningfully across both product and research-driven settings. Beyond work, I enjoy teaching, mentoring, and engaging with the community. I’ve led as a Data Science Consultant, guided project teams, and mentored peers in practical ML/NLP applications. My outreach includes 500+ YouTube subscribers, 12K+ LinkedIn followers, and 2K+ Medium readers, where I simplify AI concepts and share learnings—often blending storytelling with technical precision. I hold a Master’s in Data Science & AI from the Central University of Rajasthan and a Computer Science undergraduate degree from the University of Delhi, where I honed not just my technical skills, but the ability to critically analyse, iterate fast, and build with purpose. What defines me professionally is a strong bias for action, ownership mindset, and adaptability across structured and ambiguous setups—whether it's building recommendation models, deploying NLP pipelines, optimising systems, or translating research into working prototypes. "I move pieces with agency" as often quoted by my colleagues. 5 years down the line? I envision building AI-powered solutions that drive humanitarian impact at scale. Until then, I’m here to learn, lead, and build responsibly. If you’re seeking a curious mind, a collaborative teammate, and a problem-solver who turns coffee into intelligent systems and blogs—let’s connect.

Experience

Fractal

Data Scientist

Feb 2024Present · 2 yrs 1 mo · Gurugram, Haryana, India · Hybrid

  • ➢ Led 100% development of an end-to-end NLP & LLM-based Bot Detector, analyzing features such as repetition, overlap, & scripted responses to identify bot-like behavior in human chat support, enhancing support quality and human friendliness.
  • ➢ Worked in a team of 50 and assessed them 70% of time with different ongoing projects
Natural Language Processing (NLP)Large Language Models (LLM)Python (Programming Language)Data AnalyticsData ScienceNLP

Hexo

Machine Learning Engineer

Feb 2023Oct 2023 · 8 mos · Bengaluru, Karnataka, India · Remote

  • Building Hexo to outstand in generative AI space.
  • ➢ Reduced human involvement by 70%, retaining clients through loophole investigation and feature launches. Improved image generation tool by leveraging on prompt engineering.
  • ➢ Customized Diffusion models for clients by fine-tuning on specific data.
  • ➢ Enhanced existing generative AI models using deep learning frameworks like Pytorch, Transformers, Xformers, Segment_Anything.
  • ➢ Added features like Segmentation, Background Removal, Inpainting, Out-painting, Photorealism, Color Control, Upscale, Adapts for an end-to-end pipeline.
  • ➢ Worked with AWS services like EC2 instances to run big models on GPUs, parallel processing CPUs, worked with S3 bucket storages and used Sagemaker like services to deploy the machine learning models further.
Generative AIDeep LearningPrompt EngineeringAWSMachine Learning

News quick

Data Science Consultant

Feb 2022Feb 2023 · 1 yr · India · Remote

  • ➢ Built a news aggregator app prototype from scratch with 60% less human involvement.
  • ➢ This MVP evolved around 4 vital features article summary in a vernacular format categorized in different genres with a audio button to listen the summary of the news article.
  • ➢ Scraped Data of news articles from big publications like Times India, Times Magazine, TheWire, google News using BeautifulSoup, Scrapy.
  • ➢ Managed a complete database using MySQL to fetch stored articles summary, their respective genre, translated text,respective audio as per user's choice.
  • ➢ NLP based summarization using TextRank, BERT, GPT-3like algorithms; Cosine matrices; Libraries used: NLTK, SpaCy.
  • ➢Used ML models: Naive Bayes, SVM, neural networks to classify articles in respective genres as per the context.
  • ➢ To ensure vernacular format: collected user preferences data to support regional languages translation; Implemented language translation models; Fameworks: Snowball stemmer, Translate, tkinter, googletrans, google translate API (experimented between different libraries and APIs to select the best).
  • ➢ Converted text to audio using GTTS API; deployed features/ ML models using APIs on AWS.
Natural Language Processing (NLP)Data ScrapingMachine LearningData ScienceNLP

Omdena

Data Science consultant

Aug 2021Jan 2022 · 5 mos · New York, United States · Remote

  • Worked on two of Omdena's vital projects:
  • FIRST PROJECT:
  • "Improving Natural Disaster and Water Resource Management using Natural Language Processing"
  • ➢ Led a 50-member team at Omdena, tackling a 2-month challenge to enhance Water Resource Management (WRM) with NLP and space data for startup partner WEO's sustainability mission.
  • ➢ Developed a robust tool in 8 weeks, empowering WEO to swiftly extract critical information on floods and droughts.
  • ➢ Applied advanced NLP, machine learning, and deep learning for insightful analysis.
  • ➢ Defined and executed query topics (flood and drought evaluations, landcover mapping, urban climate monitoring) with NLP, NLTK, and word embeddings.
  • ➢ Achieved comprehensive data gathering from diverse sources.
  • ➢ Contributed significantly to an efficient data storage system, enabling streamlined data evaluation.
  • ➢ Implemented metrics for statistical insights, emphasizing accuracy, precision, and recall.
  • ➢ Demonstrated project impact through tangible outcomes: retrieval of a significant number of images and data points. Aligned goals with broader challenges of population growth, urbanization, and climatic uncertainties in WRM.
  • SECOND PROJECT:
  • "Anomaly Detection on Mars Using Deep Learning"
  • ➢ Worked in a team of 50 and assessed them 70% of time.
  • ➢ Detected anomalies on Mars Surface (86% confidence)
  • ➢ Preprocessed Image: annotated 200 images with anomalies using the
  • VGG image annotator & invented models for the project.
  • ➢ Diagnosed all anomalies using YOLOv4; achieved 86% confidence
  • ➢ Led 100% of the task & deployed model successfully on Stream-lit
Natural Language Processing (NLP)Deep LearningData AnalyticsData ScienceNLP

Initor global uk

2 roles

Data Science Consultant

Jan 2021Jun 2021 · 5 mos · Greater London, England, United Kingdom

  • Built end-to-end NLP pipeline from scratch, the major work includes :
  • Researched the domain and narrowed down the top potential sources to scrape data from
  • Data Scraping using Python Scripts and Octoparse software
  • Data Cleaning/ Normalizing /Pre-processing
  • Exploratory Data Analysis
  • Generating n-Grams/ Collocations
  • Text clustering
  • Topic Modeling
  • Grammar Patterns
  • Keywords Extraction for Data Labeling purpose.
  • Text summarization to understand the context of the data.
  • Sentiment analysis of text to build insights over the future services needed to be given.
Data ScrapingData AnalyticsNatural Language Processing (NLP)Data ScienceNLP

Data Science Intern

Sep 2020Dec 2020 · 3 mos · Greater London, England, United Kingdom

  • Worked as a Data Science Consultant to provide them with insights that will eventually help them to scale as a business.
  • I have worked on Researching & Scraping Data.
  • Built AI driven solution for marketing analytics in order to do better consultation.
Data ScrapingData AnalyticsMachine LearningData ScienceBusiness Consulting

Octoparse - octopus data inc.

Technical Content Writer

Sep 2020Dec 2020 · 3 mos · Tokyo, Japan

  • Octoparse is the SaaS platform that provides Scraping services.
  • I have written articles around web-scraping for them.
  • I have scraped all types of complex website structures such as e-commerce sites (Amazon), blogging sites, edtech-sites, OTT platforms (Netflix, Hotstar), Youtube Comments and many more not so popular websites.
Technical WritingData ScrapingContent Creation

Youtube

Youtube Content Creator

Jun 2020Mar 2023 · 2 yrs 9 mos · India

  • Sharing my learnings and knowledge by creating Data Science, ML, DL, AI related videos.
  • I always wanted education to be freely available to every ends of the world and what would be better than YT to host educational videos. So, Here I am not knowing everything but here to share my learnings on the way of my journey to become world's best data scientist.
  • I will keep posting every now and then about data science field, the fundamentals, the advance techs, the job roles, the kinda work we do, the new advancements or research happening worldwide and so on.
  • Check out my channel- https://www.youtube.com/channel/UCdwAaZMWiRmvIBIT96ApVjw
Data AnalyticsPython (Programming Language)Data ScrapingData ScienceContent Creation

Medium

Technical Writer

Jun 2020Mar 2023 · 2 yrs 9 mos · San Francisco, California, United States

  • To share my data science learnings and knowledge, I started contributing my work related to Deep learning, Machine learning, Artificial intelligence through blogs on Medium.
  • So far, I have covered the basic/fundamental topics on Scraping data, Pre-processing data, Exploratory data analysis, Modelling data, Data labelling methods, Topic modelling, End-to-end machine learning projects, The stand out data science project ideas, Data Science resources, building NLP pipeline, Text clustering using K-Means, Important ML Algorithms.
  • If you have any question or want to give feedback, feel free to DM me.
Technical WritingData ScienceMachine LearningContent Creation

Education

Central University Of Rajasthan

Masters — M.sc. Big Data Analytics

Jan 2019Jan 2021

Delhi University

Bachelors — Computer Science

Jan 2016Jan 2019

Kendriya Vidyalaya

Higher Secondary Certificate — Mathematics and Computer Science

Jan 2004Jan 2016

Stackforce found 100+ more professionals with Data Science & Nlp

Explore similar profiles based on matching skills and experience