Yash Singhal

AI Researcher

Delhi, India5 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Engineered production-grade AI infrastructure for patent analysis.
  • Delivered 10+ full-stack AI applications for international clients.
  • Authored 20+ technical articles read by 5,000+ developers.
Stackforce AI infers this person is a SaaS-focused AI Engineer with strong full-stack and MLOps expertise.

Contact

Skills

Core Skills

RagMlopsFull-stack DevelopmentCloud InfrastructureTechnical WritingDevops

Other Skills

PythonLangChainLangGraphvLLMPyTorchQdrantDockerKubernetesPrompt EngineeringLLM Fine-tuningFastAPINext.jsReact.jsAWSGCP

About

AI Engineer and final-year CS student. I build production LLM systems that ship. Currently interning at Researchwire Knowledge Solutions. I work on RAG pipelines for large-scale patent analysis, processing 1M+ documents with sub-second semantic search. I improved retrieval precision by 40% and reduced system latency by 30-40%. I cut unnecessary LLM calls by 50% and achieved 2-3x inference throughput improvement by deploying vLLM with hybrid ML strategies. Other highlights: IEEE-published researcher. Developed a Random Forest model (R2 = 0.54) for rainfall prediction, presented at IC3-2025. Delivered 10+ freelance AI applications for clients on AWS, Azure, and GCP. Authored 20+ technical articles read by 5,000+ developers. Built a multilingual RAG chatbot for 10+ languages with sub-500ms transcription latency. Deployed full MLOps pipelines on Kubernetes using Jenkins and ArgoCD, cutting release time to under 8 minutes. I take full ownership of every project. From system design and prompt engineering to Docker, Kubernetes, monitoring, and production optimization. I thrive in fast-moving environments where I can build and ship real things. Open to AI Engineer, Generative AI Engineer, and Full-Stack Engineer roles from mid-2026. Reach me at: yashsinghal9886@gmail.com Skills: RAG, LangChain, LangGraph, Multi-Agent Systems, vLLM, LLM Fine-tuning, PyTorch, Prompt Engineering, Pydantic, OpenAI API, Qdrant, Docker, Kubernetes, Jenkins, ArgoCD, CI/CD, Terraform, AWS, Azure, GCP, FastAPI, Next.js, React.js, Python, TypeScript, System Design, Data Pipelines, Technical Writing

Experience

5 mos
Total Experience
5 mos
Average Tenure
5 mos
Current Experience

Researchwire knowledge solutions pvt. ltd.

Artificial Intelligence Engineer Intern | RAG Pipelines, LLM Optimization, MLOps, Python, vLLM

Jan 2026Present · 5 mos · Hybrid

  • Building production-grade AI infrastructure for large-scale patent analysis.
  • Key achievements:
  • Engineered a production RAG pipeline using Docker-based Qdrant, processing 1M+ documents with sub-second semantic search and 40% improvement in retrieval precision.
  • Deployed a vLLM inference server achieving ~2-3x throughput improvement and 35% reduction in cost-per-request compared to baseline.
  • Introduced a hybrid ML + LLM strategy using lightweight classifiers for pre-filtering, cutting unnecessary LLM calls by 50% and lowering monthly compute costs.
  • Optimized LLM pipelines through prompt restructuring and request batching, reducing system latency by 30-40% and improving P95 response times from 3s to 1.8s.
  • Fine-tuned open-source LLMs on domain-specific patent corpora, improving information retrieval accuracy by ~20% over baseline.
  • Skills: Python, RAG, LangChain, LangGraph, vLLM, PyTorch, Qdrant, Docker, Kubernetes, Prompt Engineering, LLM Fine-tuning, FastAPI, MLOps
PythonRAGLangChainLangGraphvLLMPyTorch+7

Fiverr

Freelance AI and Full-Stack Engineer | Python, FastAPI, Next.js, AWS, Docker, Kubernetes

Jan 2025Present · 1 yr 5 mos

  • Building and deploying full-stack AI applications for international clients end-to-end.
  • Key achievements:
  • Delivered 10+ full-stack AI applications covering React/Next.js front-end, FastAPI back-end, and production cloud deployment on AWS, Azure, and GCP.
  • Expanded service offerings from web development to full MLOps and cloud infrastructure, growing average project value by 3x and achieving 60% repeat client engagement.
  • Architected and deployed containerized applications using Docker and Kubernetes, handling complete ownership from system design through production monitoring.
  • Maintained a 5-star rating across all client engagements by taking end-to-end ownership from architecture through post-launch support.
  • Skills: Python, FastAPI, Next.js, React.js, Docker, Kubernetes, AWS, GCP, Azure, Node.js, REST APIs, CI/CD, System Design
PythonFastAPINext.jsReact.jsDockerKubernetes+9

Hashnode

Technical Content Writer | DevOps, Kubernetes, AWS, Cloud Native, AI Engineering, LangChain

Jul 2024May 2025 · 10 mos

  • Writing in-depth technical content on DevOps, Cloud Native, and AI engineering for a global developer audience.
  • Key achievements:
  • Authored 20+ technical articles on Kubernetes, Docker, AWS, and AI engineering topics, reaching 5,000+ readers and building authority in the cloud-native community.
  • Translated complex infrastructure and MLOps concepts into clear, actionable guides — ranked among top-read posts on Hashnode by reader engagement.
  • Covered topics including EKS deployments, Terraform automation, ArgoCD GitOps workflows, and LLM application development.
  • Skills: Technical Writing, Kubernetes, Docker, AWS, Terraform, ArgoCD, LangChain, DevOps, Cloud Native, AI Engineering
Technical WritingKubernetesDockerAWSTerraformArgoCD+4

Education

Jaypee Institute Of Information Technology

Bachelor of Technology - BTech — Computer Science

Jan 2022Jan 2026

Amity International School, Vasundhara (Sector-6)

Apr 2022Present

Stackforce found 100+ more professionals with Rag & Mlops

Explore similar profiles based on matching skills and experience