Deepak Kumar โ Co-Founder
Building Production AI Systems | LLM Training & Evaluation โข RAG โข Applied AI โข MCP I have 7+ years of engineering experience, starting as a founding engineer building products from scratch to Lead engineer builing scaling real-time platform infrastructure to 100K+ concurrent users at Airmeet (a Zoom alternative). Along the way I've worked with researchers at Meta, OpenAI, and Google through Turing, contributing to LLaMA 4 training and model evaluation. I'm building production AI systems at Wrike - RAG pipelines, MCP servers, and agentic developer workflows, while writing about AI agents and automation workflows at tooljunction.io. ๐ช๐ต๐ฎ๐ ๐ ๐๐ฝ๐ฒ๐ฐ๐ถ๐ฎ๐น๐ถ๐๐ฒ ๐ถ๐ป: - AI & LLMs: LLM Fine-Tuning, RLHF, LLM Evaluation (BLEU, pass@k, RAGAS, CodeBLEU), Prompt Engineering, Instruction Tuning, Agentic Workflow Orchestration, MCP Servers - RAG & Search: RAG System Design, Vector Search, LlamaIndex, LangChain, LangGraph, Semantic Search, Embeddings - Backend & Infrastructure: Python, Node.js, TypeScript, Java, REST APIs, Microservices, Distributed Systems, Apache Kafka, AWS (Lambda, S3, EC2), Vertex AI, Redis, Elasticsearch, Docker, CI/CD - Real-Time Systems: High-concurrency event infrastructure, WebSockets, Firebase, low-latency service design at scale. ๐ช๐ต๐ฎ๐ ๐'๐๐ฒ ๐ฏ๐๐ถ๐น๐: - Trained LLaMA 4 using RLHF with AI researchers at Meta and OpenAI, optimised 500+ prompts for LLaMA 4 and Gemini, ran evaluation pipelines (BLEU, pass@k, RAGAS, CodeBLEU) across 1,000+ training examples, improving code generation accuracy by 23% and reducing hallucination rates by 15% - Built a Vertex AI-powered production RAG system handling 10K+ daily queries and an MCP server adopted organisation-wide at Wrike - enabling agentic developer workflows across engineering teams - Integrated GPT-4, Whisper API, and semantic matching into live event infrastructure running 100K+ concurrent users at <100ms latency (Kafka, AWS Lambda, AWS Bedrock) at Airmeet - Maintaining A to Z Resources for Developers (22K+ GitHub stars, 5K+ forks, 10K+ weekly visitors) ๐ช๐ต๐ฎ๐ ๐ ๐๐ผ๐ฟ๐ธ ๐๐ถ๐๐ต: LLMs | RLHF | LLM Fine-Tuning | LLM Evaluation | RAG | LlamaIndex | LangChain | LangGraph | MCP Server | Agentic AI | Prompt Engineering | RAGAS | Vertex AI | OpenAI API | Claude API | Whisper API | Distributed Systems | Apache Kafka | AWS Lambda | Node.js | Python | TypeScript | Docker | CI/CD | System Design | Microservices Contact: Email: dipakkr.co@gmaill.com Github: https://github.com/dipakkr Medium: https://medium.com/@dipakkr
Stackforce AI infers this person is a SaaS-focused AI Engineer with expertise in LLMs and real-time systems.
Location: Bengaluru, Karnataka, India
Experience: 7 yrs 2 mos
Skills
- Ai & Llms
- Rag & Search
- Backend & Infrastructure
- Frontend
- Product Management
Career Highlights
- 7+ years of engineering experience in AI systems.
- Built production AI systems handling 10K+ daily queries.
- Trained LLaMA 4, improving code generation accuracy by 35%.
Work Experience
Stealth
AI Consultant (7 mos)
Wrike
Senior Software Engineer - II (1 yr 2 mos)
Turing
LLM Engineer (10 mos)
Airmeet
Senior Software Engineer - II (9 mos)
Software Engineer - II (1 yr 11 mos)
Software Engineer (8 mos)
Flux Auto
Software Engineer (8 mos)
91Wheels
Founding Engineer (1 yr 4 mos)
FrontBench
Co-Founder (8 mos)
Malaviya National Institute of Technology Jaipur
Machine Learning Intern (3 mos)
Microsoft
Senior Microsoft Student Partner (1 yr 10 mos)
Internity Foundation
Software Engineering Intern (1 mo)