Jitendra Jangid — CEO

Thank you for taking out time to know more about me:With ~8 years of experience as a Machine Learning Scientist, I have cultivated expertise in solving challenging problems in Recommender Systems, LLMs and Classical NLP, particularly within the dynamic landscapes of E-commerce, Healthcare, and Finance. I thrive on collaborative work, finding joy in growing alongside my teammates. Over the years, I've been entrusted by mentors and teams, reciprocating that trust with those I've had the privilege to work with. My passion for transforming data into impactful insights aligns seamlessly with my commitment to fostering strong, trust-based collaborations.My main areas of expertise are: - ML & Analytics: Regression, Classification, Random Forest, XGBoost, LightGBM, CatBoost, Stacking, Support Vector Machine, Clustering Techniques, Deep Neural Networks, Transformers, BERT, A/B Testing- LLM Engineering: Langchain, LlamaIndex, TxtAI, RAG, LLM As Judge, Agentic Frameworks, LLM-Finetuning (PEFT, QLoRA), Prompt Engineering (Chain-of-Thought), OpenAI, Anthropic, LiteLLM, Ollama, Embeddings- Libraries & Frameworks: Scikit-learn, Numpy, Pandas, PyTorch, TensorFlow, Keras, Hugging Face, FastAPI, Seaborn, Plotly, Gensim, NLTK, SpaCy, FAISS, Pinecone, Weaviate, StanfordCoreNLP, MLflow, Streamlit, Gradio- Languages & Big Data: Python, PySpark, R, SQL, CML Infrastructure: AWS (SageMaker, S3, Bedrock), Databricks, Airflow, Google Cloud Platform (Vertex AI),Microsoft Azure, Vector Databases- Management/Productivity Tools: Visual Studio Code, Jupyter, GitHub/BitBucket, Jira, Confluence, Tableau———————————————————————————————————————————————————Stay humble when you're doing well and hungry when you're not. Nothing last forever.- Unknown———————————————————————————————————————————————————

Stackforce AI infers this person is a Machine Learning expert with a focus on E-commerce and Healthcare applications.

Location: Gurgaon, Haryana, India

Experience: 7 yrs 2 mos

Skills

Generative Ai
Machine Learning
Recommendation Systems
Data Analytics

Career Highlights

Expert in Machine Learning and Generative AI.
Led successful data science teams in high-impact projects.
Proven track record in E-commerce and Healthcare domains.

Work Experience

IDFC FIRST Bank

Data Science Lead (Generative AI) (1 yr 6 mos)

Tokopedia

Senior Data Scientist (Home, Recommendations & Personalization) (1 yr 11 mos)

Zomato

Data Scientist (1 yr)

ZS

Data Science Associate Consultant (10 mos)

Data Science Associate (1 yr 11 mos)

Meru Cab Company Pvt. Ltd.

Data Scientist Intern (2 mos)

Education

Bachelor’s Degree at Indian Institute of Technology, Kharagpur

12th at New Public Senior Secondary School, Kota (Rajasthan)

10th at Tagore Senior Secondary School, Kuchaman City (Rajasthan)

Jitendra Jangid

CEO

Gurgaon, Haryana, India7 yrs 2 mos experience

AI EnabledAI ML Practitioner

Key Highlights

Expert in Machine Learning and Generative AI.
Led successful data science teams in high-impact projects.
Proven track record in E-commerce and Healthcare domains.

Stackforce AI infers this person is a Machine Learning expert with a focus on E-commerce and Healthcare applications.

Contact

Skills

Core Skills

Generative AiMachine LearningRecommendation SystemsData Analytics

Other Skills

A/B TestingAWS SageMakerArtificial Intelligence (AI)Azure DatabricksBERTBioBERTCCatBoostCollaborative FilteringComputer VisionCustomer AnalyticsDeep LearningDeepgramDistilRoBERTaEnglish

About

Experience

7 yrs 2 mos

Total Experience

1 yr 9 mos

Average Tenure

Current Experience

Idfc first bank

Data Science Lead (Generative AI)

Mar 2024 – Sep 2025 · 1 yr 6 mos · Remote

Leadership & Team Management (Generative AI Initiatives):
Spearheaded a team of 5 Data Scientists within the Interaction Squad (AI Labs), fostering cross-functional partnerships between Product Management, Engineering teams, and key Business Stakeholders.
Email Automation - Customer Support (Individual Contributor):
Architected and deployed an end-to-end automated email response system
Engineered a multi-component AI pipeline comprising: (1) LightGBM based junk filter (100% precision), (2) PII extraction using Presidio and GPT-4o for customer identification, (3) Intent Classification with fine-tuned DistilRoBERTa & deployed on AWS SageMaker (∼20ms latency) with 95% precision and 85% recall across 71 intents, (4) PySpark Data Pipeline for contextual enrichment.
Implemented a scalable multi-agent framework (Query Analyzer, SOP Executor, Response Generator) for complex queries using LLM-powered SOPs, enabling handling of sophisticated customer queries with potential to increase automation volume to >80%.
[Agent Scorecard Automation System]: Led development of an automated call evaluation system scaling from <1% to comprehensive coverage (MD Mention). Built a pipeline with Whisper for language detection, Deepgram for transcription, and finetuned Llama-3.1-8b for feature scoring, achieving >80% F1 score across all features (greetings, sentiment, agent pitch etc).
[Agent Training System]: Pioneered an innovative training system using AWS Polly for customer interaction simulation. Designed an LLM-based evaluation pipeline that compressed agent training time from 6 months to 1 month with real-time performance feedback.
[Voice Analytics Platform]: Orchestrated development of voice analytics with an interactive query system empowering business teams to extract actionable insights, sentiment analysis from call recordings, driving data-informed customer experience improvements.

Generative AITeam ManagementLightGBMAWS SageMakerDistilRoBERTaPySpark+1

Tokopedia

Senior Data Scientist (Home, Recommendations & Personalization)

Apr 2022 – Mar 2024 · 1 yr 11 mos · Gurugram, Haryana, India

[Product Recommendation Engine (PRE)]: Introduced personalization at a buyer‑product level to improve the browsing experience for more than 50 million Tokopedia buyers. Executed two‑stage development process to ensure the system’s robustness and scalability: [Stage 1] Generate most relevant products (out of ∼600 million products) for buyers at buyer segment x category ‑ level (CatBoost Ranking Model using buyer segment x product past orders/clicks and raw product features) and [Stage 2] Rank the generated relevant product as per buyers preferences at that point in time (Two Tower Model using buyer past order/click history, demographics & products price, title, category features). In offline evaluation, proposed RecSys surpassed existing solution (Best Seller algorithm) by ∼5% lift in Personalization Score@5, and in A/B test, PRE boosted Homepage Inspirational Widget CTR (+15%) and CVR (+10%).
[POC ‑ Next Category Prediction (NCP)]: Designed a recommendation engine that can remember sequential patterns of user’s actions to recommend more relevant and new categories. Implemented Next‑item Recommendation with Sequential Hypergraphs research paper using user’s past interactions (Click, Order, ATC, Wishlist, Discussion & Micro Actions) & recency as features, and last purchased category as target label. In offline evaluation achieved HR@1 ∼20% on out‑of‑time validation set which out‑performed baseline solution (user x category affinity‑based) by ∼45% lift.

CatBoostRecommendation SystemsA/B TestingData AnalyticsMachine Learning

Zomato

Data Scientist

Apr 2021 – Apr 2022 · 1 yr · Gurugram, Haryana, India

Homepage Personalization:
[Quick Cart Checkout]: Introduced a feature to simplify online food ordering with a single click checkout option on the homepage (readymade carts). Created user tags to identify whether a user has an affinity towards restaurant*dish or dish or restaurant using the user’s past order history. Based on these tags, the top 5 carts (restaurants + dishes) were identified for a user. Deployed a daily cron spark job on Airflow to update cart recommendations for incremental users.
[People Like You ‑ User Personalized Dish Recommendations]: Implemented a collaborative filtering based algorithm (LightFM ‑ Implicit feedback, matrix factorization based model) for user personalized dish recommendations using user’s past ordered history and user*dish interaction, user average order value.
Customer Analytics and Segmentation:
[Customer Intent Classification]: Created customer personas (explorers, repeaters, low/medium intent user) of those users who opened the app using various app interaction signals which customer leave behind in terms of search and clicks activities. Applied clustering techniques to identify these customer personas using session length, total # clicks, and property interaction etc. features

Collaborative FilteringLightFMCustomer AnalyticsMachine LearningData Analytics

Zs

2 roles

Data Science Associate Consultant

Jun 2020 – Apr 2021 · 10 mos

Led the team of 4 Junior Data Scientists in developing a robust information extraction and retrieval system to be used by various pharma clients

Data Science Associate

Jul 2018 – Jun 2020 · 1 yr 11 mos

Text Analytics Products:
Study Design Optimizer: Built a fully automated & end to end Natural Language Processing (NLP) pipeline that leverages state of the art models (BERT) & transfer learning to classify text and extract explicit/implicit information using syntactic and semantic relationships from protocol documents (PDFs) to make recommendations on optimizing clinical trial design. Reduced manual effort by 50-60% for protocol designing process and accelerated clinical programs through more efficient trial designs.
Design Intelligence: Developed an end to end product that combines entity recognition & linking (BioBERT/BioFlair), and weak supervised learning with deep neural networks to classify unstructured biomedical text from open source data (ClinicalTrials.Gov, PubMed) to expedite the clinical program planning and trial design process. Reduced time, effort & cost for clinical trial designing (weeks to hours) and automated insights to better inform future clinical trial investment.
Study Design Optimizer and Design Intelligence Products featured on Cision PR Newswire
Predictive modeling and Explainable AI Solutions:
Enrollment Rate Analysis: Analyzed the impact of study design attributes and KPIs on Enrollment Rate (Patient per Site per Month) using Generalized Linear Mixed-Effects Modeling (GLMM) to account for both fixed and random effects at protocol level
Enrollment Rate Prediction: Built a stack of machine learning models to predict the Enrollment Rate using various study design attributes and pre-trained biomedical embeddings

NLPBERTPredictive ModelingMachine LearningData Analytics

Meru cab company pvt. ltd.

Data Scientist Intern

May 2017 – Jul 2017 · 2 mos · Mumbai Area, India

Developed a Machine Learning model to predict driver refusal after job is awarded in order to take further action
Implemented LightGBM method (0.94 AUC) over imbalanced data, having 90% accuracy using booking details and driver’s past trend
Designed a Restful web API using the Flask microframework in Python and deployed it using WSGI server (Apache)
Company replaced existing model with proposed one in order to avoid refusal and driver profiling owing to its feasibility and accuracy

LightGBMFlaskMachine Learning