Shoury Sharma

Software Engineer

Bengaluru, Karnataka, India3 yrs 4 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in building scalable Generative AI systems.
  • Proven track record in improving AI infrastructure performance.
  • Strong collaboration with ML teams for production deployment.
Stackforce AI infers this person is a SaaS-focused Software Engineer specializing in Generative AI and infrastructure development.

Contact

Skills

Core Skills

Generative AiAi InfrastructureMlopsBackend ArchitectureFrontend DevelopmentReact

Other Skills

Unit TestingProject ManagementGitRedisReact.jsJavaScriptLangChainModel Context Protocol (MCP)Agentic AI DevelopmentKubernetesProgramming LanguagesSystems DesignGreenSock Animation Platform (GSAP)Fullstack EngineerWeb Pages

About

I’m a Software Engineer focused on building production-grade Generative AI systems and LLM infrastructure at scale. Currently, I work as Member of Technical Staff 2 at Nutanix, where I help design and deploy AI-driven infrastructure features for Nutanix AI (GPT-in-a-Box)—with a strong emphasis on scalable LLM serving, secure access (OAuth2 + OIDC), and high-performance containerized microservices. My work has directly improved inference latency and system reliability in enterprise AI environments. Before this, I spent over two years at Katonic.ai, where I built and shipped multiple enterprise GenAI applications, including RAG systems, multi-agent workflows, and analytics platforms for tracking model usage and cost. I’ve worked extensively with LangChain, vector databases (Pinecone, FAISS), Kubernetes, and cloud-native deployments on GCP, collaborating closely with ML teams to take ideas from prototype to production. I enjoy working at the intersection of LLMs, backend systems, and cloud infrastructure—turning complex AI workflows into reliable, scalable products. I’m particularly interested in LLMOps, RAG architectures, AI agents, and platform engineering for GenAI. If you’re building or scaling AI systems and want to exchange ideas, feel free to connect.

Experience

3 yrs 4 mos
Total Experience
2 yrs
Average Tenure
1 yr 4 mos
Current Experience

Nutanix

Member of Technical Staff-2

Jan 2025Present · 1 yr 4 mos · Bengaluru, Karnataka, India · Hybrid

  • Contributed towards feature development of Nutanix AI related to inferencing, model importing, MCP Server Deployment etc
  • Worked on SSO integration and Fine grain permissions architecture with On-Prem Solution
  • Worked on integrating internal tools with NAI AI Org
Unit TestingGenerative AIAI Infrastructure

Katonic ai

3 roles

Senior Full Stack Engineer

Jan 2024Jan 2025 · 1 yr

  • Comprehensively explored Generative AI libraries to integrate into the platform.
  • Worked on improving user journey aspects of MLOps and GenerativeAI.
Project ManagementGitGenerative AIMLOps

Full Stack Engineer

Promoted

Jan 2023Jan 2024 · 1 yr

  • Worked on improving platform performance by implementing server state management aswell as Backend architectural patterns and Frontend architectural pattern.
  • Implemented Kubernetes APIs to handle deployments and its logs.
  • Integrated license management for entire application and user based role managament for frontend using global state management system Redux and in backend using middlewares.
  • Worked on improvising User Interface and improving reliability and performance.
  • Worked with Auth management Keycloak by Redhat to cover entire deployments with authentication system.
  • Worked on Generative AI implementation for enterprise platform.
Unit TestingRedisGenerative AIBackend Architecture

Full Stack Engineer

Jun 2021Dec 2022 · 1 yr 6 mos

  • Migrated entire platform code from template EJS to SPA application with React.js as a framework.
  • Implemented socket programming to dynamically fetch logs from kubernetes API to UI.
  • Worked on UI implementation of installer for cloud platforms
React.jsJavaScriptFrontend DevelopmentReact

Education

Maulana Azad National Institute of Technology

Bachelor of Technology - BTech — Electrical and Electronics Engineering

Jan 2019Jan 2023

Stackforce found 100+ more professionals with Generative Ai & Ai Infrastructure

Explore similar profiles based on matching skills and experience