Ankit Singh — AI Researcher

👨‍💻 About Me I’m an applied AI Scientist with end-to-end expertise across Generative AI, NLP, Computer Vision, Graph Neural Networks, and E-commerce Re-ranking Systems. I build production-grade AI platforms that connect research to scalable deployment — from RAG / GraphRAG and multi-agent pipelines to feature-driven recommendation engines, observability, and model evaluation frameworks. -- 🧠 Core Expertise LLMs | RAG / GraphRAG | NLP | CV | GNNs | LangGraph | Multi-Agent AI | Retrieval & Re-ranking | MLOps | Model Serving | Observability | Recommendation Systems -- 🚀 Experience Highlights ✨ Generative AI & LLM Systems Fine-tuned LLaMA-3, Mistral, Phi-3, & Qwen (Full Precision / LoRA / QLoRA) for domain adaptation. Trained LLM-based rerankers via GRPO, optimizing NDCG /MRR and token-level ranking. Fine-tuned Cross Encoder/Colbert Reranker models using ModernBERT/BGE-v2-M3/BGE-M3 for boosting semantic search. Architected RAG + GraphRAG pipelines for multimodal retrieval across text, tables & visuals. Built dual-retriever chatbot (Vector DB + FHIR) for compliant clinical Q&A. Created semantic search & template recommendation engine indexing 5 M+ presentation constructs using multimodal embeddings + intent-aware ranking. Deployed Ray Serve + vLLM for distributed inference and low-latency serving. Designed multi-tenant evaluation & telemetry with Langfuse, OpenTelemetry, Grafana, LiteLLM for unified cost, latency & quality tracking (RAGAS, DeepEval). Built vendor-agnostic LLM API gateway (OpenAI, Claude, Cohere) with routing, analytics & compliance. Developed LangGraph multi-agent pipeline (Planner → Coder → Evaluator) for automated code generation to do data analysis for various downstream tasks like report generation, large excel understanding, slide generation via Claude + Aspose / Python-PPTX. --- 🎯 Ecommerce Recommendation & Re-Ranking Trained and deployed XGBoost & LightGBM models to improve template and content recommendations. Designed user-level feature store and reranking system at Walmart scale, leveraging session signals and contextual features for personalized search and reranking. --- 📷 Computer Vision & Document AI Fine-tuned YOLO, DINO, Faster RCNN for layout segmentation and multimodal understanding. Trained LayoutLM, LILT, GNNs for structure-aware reasoning & document relation extraction. --- ⚙️ Core Stack Python · LangGraph · LangChain · Ray Serve · vLLM · Bedrock Claude · Airflow · Kafka · Spark · Flink · FastAPI · Grafana · Langfuse · MLflow · Neo4j · Milvus · S3 · QLoRA · LLaMA-3 · Mistral · Phi-3

Stackforce AI infers this person is a highly skilled AI/ML engineer with expertise in Generative AI and E-commerce solutions.

Experience: 8 yrs 9 mos

Skills

Generative Ai
Llm Fine-tuning
Recommendation Systems
Mlops
System Architecture
Multi-agent Ai
Computer Vision
E-commerce Recommendation
Applied Machine Learning

Career Highlights

Expert in Generative AI and LLM fine-tuning.
Proven track record in building scalable AI platforms.
Strong background in E-commerce recommendation systems.

Work Experience

Prezent

Principal data Scientist (2 yrs 11 mos)

Walmart Global Tech India

Senior Data Scientist (2 yrs 6 mos)

Qualcomm

Data Scientist (1 yr 3 mos)

Wipro AI LAB

Data scientist (ML/AI Reasearch) (2 yrs 1 mo)

Education

Bachelor of Technology - BTech at National Institute of Technology Jamshedpur

Doctor of Philosophy - PhD at Indian Institute of Technology, Patna

Ankit Singh

AI Researcher

India8 yrs 9 mos experience

Most Likely To SwitchAI ML Practitioner

Key Highlights

Expert in Generative AI and LLM fine-tuning.
Proven track record in building scalable AI platforms.
Strong background in E-commerce recommendation systems.

Stackforce AI infers this person is a highly skilled AI/ML engineer with expertise in Generative AI and E-commerce solutions.

Contact

ankitnitjsr13@gmail.com LinkedIn

Skills

Core Skills

Generative AiLlm Fine-tuningRecommendation SystemsMlopsSystem ArchitectureMulti-agent AiComputer VisionE-commerce RecommendationApplied Machine Learning

Other Skills

RAGGraphRAGLangfuseGrafanaVector DBFHIR GraphQLMultimodal embeddingsIntent-aware rankingLLaMA-3MistralPhi-3Ray ServevLLMOpenAIClaude

About

Experience

8 yrs 9 mos

Total Experience

2 yrs 2 mos

Average Tenure

2 yrs 11 mos

Current Experience

Prezent

Principal data Scientist

Jul 2023 – Present · 2 yrs 11 mos · Remote

🔍 RAG & Clinical Chatbot System (Pharma – Trials, Compliance, Legal)
Built RAG + GraphRAG stack using hybrid dense–sparse multimodal retrieval.
Enabled retreival over text, tables, and visuals with adaptive query reformulation and grounded generation.
Developed dual-retriever chatbot (Vector DB + FHIR GraphQL) for traceable, compliant answers.
Added Langfuse + Grafana observability for recall drift, latency, and cost.
🧠 Semantic Search & Template Recommendation (5 M + Assets)
Created semantic retrieval + recommendation engine over 5 M presentation templates.
Used multimodal embeddings intent-aware ranking.
Delivered personalized suggestions by prompt, team, and visual preference.
🧪 Model Training & Serving (LLMs + Embedders + Rerankers)
Fine-tuned LLaMA-3, Mistral, Phi-3, Qwen with LORA/QLoRA; trained BGE - M3, ModernBERT for domain embeddings.
Finetuned Cross Encoder rerankers like Bge-v2-m3 and Qwen LLm based rerankers via GRPO; deployed it using Ray Serve + vLLM for distributed, low-latency inference.
Set up MLflow + Langfuse for experiment tracking, evaluation, and model observability.
🧩 LLM Vendor-Agnostic API Platform
Designed unified LLM API gateway (OpenAI, Claude, Cohere, Bedrock, Mistral).
Added policy routing, cost / latency analytics, security, PII redaction, and compliance controls.
Enabled multi-team access with centralized governance and performance monitoring.
🤖 Multi-Agentic Slide Generation (LangGraph + Claude + Aspose / Python-PPTX)
Built LangGraph pipeline (Planner → Coding Agent → Design Evaluator) for autonomous slide code generation.
Integrated web data retrieval + aesthetic evaluation for layout optimization.
Matched or surpassed Copilot-level design fidelity and brand consistency.
📚 Document AI & Layout Understanding
Trained YOLO / DINO / Grounding DINO for layout segmentation;
used LayoutLM / LILT / GNNs for visual-text relation extraction and structured summarization.

RAGGraphRAGLangfuseGrafanaVector DBFHIR GraphQL+2

Walmart global tech india

Senior Data Scientist

Jan 2021 – Jul 2023 · 2 yrs 6 mos · Bengaluru, Karnataka, India

1. Legal data classification
2. Design & Development of Feature Store
3. Design & Development of Data Monitoring tool.
4. Summarization of large pdf
5. Pdf data representation for helping various downstream task.

XGBoostLightGBMFeature StoreE-commerce Recommendation

Qualcomm

Data Scientist

Sep 2019 – Dec 2020 · 1 yr 3 mos · Greater Hyderabad Area

1. Quantization of machine learning model to work on low computing devices like Mobile, IOT etc.
2. Anomoly detection
3. Designing of dashboard to populate the intermediate result.

QuantizationAnomaly DetectionDashboard DesignApplied Machine Learning