Robin S. — Senior Software Engineer
I’m a Senior LLM & AI Platform Engineer with 8+ years, with last 6 years focusing on machine learning, LLMs/NLP, and GenAI. I like taking LLM ideas from “cool demo” to reliable, observable, cost-aware production services. Initial 2 years, I worked on designing and building scalable, highly available backend & distributed systems with AWS. Recently I’ve been: • Designing RAG pipelines end-to-end: ingestion, chunking, embeddings (OpenAI/Hugging Face), vector DBs (Pinecone, Weaviate, FAISS), hybrid BM25 + dense search, reranking, and prompt registries. • Building LLM services on Kubernetes (EKS) with Docker, FastAPI, GitHub Actions CI/CD, and full telemetry (OpenTelemetry, Prometheus/Grafana, token/latency/cost metrics). • Working on LLM serving & LLMOps: vLLM/TGI, quantization, KV cache, batching, routing between managed APIs (OpenAI/Anthropic) and OSS models, plus profiling (torch.profiler, py-spy, basic CUDA) to tune performance and cost. • Prototyping agentic systems with LangChain/LangGraph: planner + tool-using agents (RAG search, productivity tools), structured tool-calls (Pydantic/JSON), traces for debugging, guardrails, and caching. I enjoy roles where I can: • Own LLM platforms end-to-end – RAG, agents, serving, evaluation, and observability • Improve retrieval & RAG quality (hybrid search, semantic search, evaluation harnesses: MRR, Recall@K, NDCG, LLM-as-judge) • Collaborate with product/infra/ML teams to build AI platforms and agentic workflows that actually move KPIs Open to roles focused on: LLM platforms, RAG, multi-agent systems, and AI infrastructure (serving, eval, observability). Core stack: • RAG & IR: OpenAI/HF models, Pinecone/Weaviate/FAISS, BM25 + hybrid search, reranking • LLMOps / MLOps: Docker, K8s (EKS/ECS), GitHub Actions, Terraform, MLflow, observability, SLOs • Cloud/backend: AWS (S3/ECS/EKS/Lambda/VPC/IAM, SageMaker), Kafka/Kinesis, OpenSearch/Elasticsearch, Python/Java/Node-TS
Stackforce AI infers this person is a SaaS-focused LLM and AI infrastructure engineer.
Location: Seattle, Washington, United States
Experience: 8 yrs 4 mos
Skills
- Llmops
- Rag
- Mlops
- Distributed Systems
- Microservices
Career Highlights
- Expert in building scalable LLM platforms.
- Proficient in RAG and LLMOps methodologies.
- Strong background in cloud-based distributed systems.
Work Experience
Shell
Senior Software Engineer (LLM/LLMops) (3 yrs 1 mo)
Thomson Reuters
Senior Software Engineer (ML/LLM Platform) (3 yrs 1 mo)
TradeRev
Software Developer (11 mos)
AT&T
Software Engineer (1 yr 3 mos)
Education
Bachelor of Technology (B.Tech.) at Guru Nanak Dev University