Ankit Singh

AI Researcher

India8 yrs 9 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Expert in Generative AI and LLM fine-tuning.
  • Proven track record in building scalable AI platforms.
  • Strong background in E-commerce recommendation systems.
Stackforce AI infers this person is a highly skilled AI/ML engineer with expertise in Generative AI and E-commerce solutions.

Contact

Skills

Core Skills

Generative AiLlm Fine-tuningRecommendation SystemsMlopsSystem ArchitectureMulti-agent AiComputer VisionE-commerce RecommendationApplied Machine Learning

Other Skills

RAGGraphRAGLangfuseGrafanaVector DBFHIR GraphQLMultimodal embeddingsIntent-aware rankingLLaMA-3MistralPhi-3Ray ServevLLMOpenAIClaude

About

👨‍💻 About Me I’m an applied AI Scientist with end-to-end expertise across Generative AI, NLP, Computer Vision, Graph Neural Networks, and E-commerce Re-ranking Systems. I build production-grade AI platforms that connect research to scalable deployment — from RAG / GraphRAG and multi-agent pipelines to feature-driven recommendation engines, observability, and model evaluation frameworks. -- 🧠 Core Expertise LLMs | RAG / GraphRAG | NLP | CV | GNNs | LangGraph | Multi-Agent AI | Retrieval & Re-ranking | MLOps | Model Serving | Observability | Recommendation Systems -- 🚀 Experience Highlights ✨ Generative AI & LLM Systems Fine-tuned LLaMA-3, Mistral, Phi-3, & Qwen (Full Precision / LoRA / QLoRA) for domain adaptation. Trained LLM-based rerankers via GRPO, optimizing NDCG /MRR and token-level ranking. Fine-tuned Cross Encoder/Colbert Reranker models using ModernBERT/BGE-v2-M3/BGE-M3 for boosting semantic search. Architected RAG + GraphRAG pipelines for multimodal retrieval across text, tables & visuals. Built dual-retriever chatbot (Vector DB + FHIR) for compliant clinical Q&A. Created semantic search & template recommendation engine indexing 5 M+ presentation constructs using multimodal embeddings + intent-aware ranking. Deployed Ray Serve + vLLM for distributed inference and low-latency serving. Designed multi-tenant evaluation & telemetry with Langfuse, OpenTelemetry, Grafana, LiteLLM for unified cost, latency & quality tracking (RAGAS, DeepEval). Built vendor-agnostic LLM API gateway (OpenAI, Claude, Cohere) with routing, analytics & compliance. Developed LangGraph multi-agent pipeline (Planner → Coder → Evaluator) for automated code generation to do data analysis for various downstream tasks like report generation, large excel understanding, slide generation via Claude + Aspose / Python-PPTX. --- 🎯 Ecommerce Recommendation & Re-Ranking Trained and deployed XGBoost & LightGBM models to improve template and content recommendations. Designed user-level feature store and reranking system at Walmart scale, leveraging session signals and contextual features for personalized search and reranking. --- 📷 Computer Vision & Document AI Fine-tuned YOLO, DINO, Faster RCNN for layout segmentation and multimodal understanding. Trained LayoutLM, LILT, GNNs for structure-aware reasoning & document relation extraction. --- ⚙️ Core Stack Python · LangGraph · LangChain · Ray Serve · vLLM · Bedrock Claude · Airflow · Kafka · Spark · Flink · FastAPI · Grafana · Langfuse · MLflow · Neo4j · Milvus · S3 · QLoRA · LLaMA-3 · Mistral · Phi-3

Experience

8 yrs 9 mos
Total Experience
2 yrs 2 mos
Average Tenure
2 yrs 11 mos
Current Experience

Prezent

Principal data Scientist

Jul 2023Present · 2 yrs 11 mos · Remote

  • 🔍 RAG & Clinical Chatbot System (Pharma – Trials, Compliance, Legal)
  • Built RAG + GraphRAG stack using hybrid dense–sparse multimodal retrieval.
  • Enabled retreival over text, tables, and visuals with adaptive query reformulation and grounded generation.
  • Developed dual-retriever chatbot (Vector DB + FHIR GraphQL) for traceable, compliant answers.
  • Added Langfuse + Grafana observability for recall drift, latency, and cost.
  • 🧠 Semantic Search & Template Recommendation (5 M + Assets)
  • Created semantic retrieval + recommendation engine over 5 M presentation templates.
  • Used multimodal embeddings intent-aware ranking.
  • Delivered personalized suggestions by prompt, team, and visual preference.
  • 🧪 Model Training & Serving (LLMs + Embedders + Rerankers)
  • Fine-tuned LLaMA-3, Mistral, Phi-3, Qwen with LORA/QLoRA; trained BGE - M3, ModernBERT for domain embeddings.
  • Finetuned Cross Encoder rerankers like Bge-v2-m3 and Qwen LLm based rerankers via GRPO; deployed it using Ray Serve + vLLM for distributed, low-latency inference.
  • Set up MLflow + Langfuse for experiment tracking, evaluation, and model observability.
  • 🧩 LLM Vendor-Agnostic API Platform
  • Designed unified LLM API gateway (OpenAI, Claude, Cohere, Bedrock, Mistral).
  • Added policy routing, cost / latency analytics, security, PII redaction, and compliance controls.
  • Enabled multi-team access with centralized governance and performance monitoring.
  • 🤖 Multi-Agentic Slide Generation (LangGraph + Claude + Aspose / Python-PPTX)
  • Built LangGraph pipeline (Planner → Coding Agent → Design Evaluator) for autonomous slide code generation.
  • Integrated web data retrieval + aesthetic evaluation for layout optimization.
  • Matched or surpassed Copilot-level design fidelity and brand consistency.
  • 📚 Document AI & Layout Understanding
  • Trained YOLO / DINO / Grounding DINO for layout segmentation;
  • used LayoutLM / LILT / GNNs for visual-text relation extraction and structured summarization.
RAGGraphRAGLangfuseGrafanaVector DBFHIR GraphQL+2

Walmart global tech india

Senior Data Scientist

Jan 2021Jul 2023 · 2 yrs 6 mos · Bengaluru, Karnataka, India

  • 1. Legal data classification
  • 2. Design & Development of Feature Store
  • 3. Design & Development of Data Monitoring tool.
  • 4. Summarization of large pdf
  • 5. Pdf data representation for helping various downstream task.
XGBoostLightGBMFeature StoreE-commerce Recommendation

Qualcomm

Data Scientist

Sep 2019Dec 2020 · 1 yr 3 mos · Greater Hyderabad Area

  • 1. Quantization of machine learning model to work on low computing devices like Mobile, IOT etc.
  • 2. Anomoly detection
  • 3. Designing of dashboard to populate the intermediate result.
QuantizationAnomaly DetectionDashboard DesignApplied Machine Learning

Wipro ai lab

Data scientist (ML/AI Reasearch)

Aug 2017Sep 2019 · 2 yrs 1 mo · Bangalore

  • 1. Sentiment analysis on Indian region language datasets
  • 2. Abstractive summarisation
  • 3. Neural network based relation extraction
  • 4. Image quality improvement using GAN, Auto-encoders
  • 5. Few shot learning
  • 6. Object classification
  • 7. Haze removal from single image
Sentiment AnalysisAbstractive SummarizationGANsAuto-encodersApplied Machine Learning

Education

National Institute of Technology Jamshedpur

Bachelor of Technology - BTech — Electronics and Communications Engineering

Jan 2013Jan 2017

Indian Institute of Technology, Patna

Doctor of Philosophy - PhD — Computer Software Engineering

Stackforce found 100+ more professionals with Generative Ai & Llm Fine-tuning

Explore similar profiles based on matching skills and experience