NAVEEN GOYAL

Software Engineer

Bengaluru, Karnataka, India · 10 yrs 10 mos experience
Most Likely To Switch · Highly Stable

Key Highlights

  • Architected large-scale AI systems with LLMs.
  • Led migration to agent-based architecture for AI workflows.
  • Proven track record in optimizing cloud-native infrastructures.
Stackforce AI infers this person is a SaaS-focused software engineer with expertise in AI-driven platforms and cloud-native architectures.

Skills

Core Skills

Large Language Model Operations (LLMOps) · Cloud Applications · Distributed Systems

Other Skills

Agent Development · Kubernetes · MLOps · Systems Design · Optimization · Go (Programming Language) · Java · Python · Observability · Performance Optimization · Microservices · Monitoring · DynamoDB · Graph-based Access Patterns · Database Partitioning

About

Staff/Principal-level Software Engineer with over a decade of experience building and scaling AI-driven platforms and distributed systems. Expertise spans Python, Go, and Java, with a strong focus on cloud-native architectures, microservices, and platform engineering. Currently leading the development of large-scale AI systems leveraging LLMs, RAG, and agent frameworks, with deep involvement in Kubernetes-based infrastructure and production-grade MLOps. Proven ability to design high-impact systems that improve performance, enhance reliability, and optimize infrastructure efficiency. Strong background in conversational AI, search, and workflow automation, combined with hands-on experience in system design and reliability engineering. Known for driving platform modernization, simplifying complex architectures, and enabling scalable, resilient solutions. A strategic technical leader who mentors engineers, influences architecture decisions, and partners closely with product teams to deliver systems that power meaningful business outcomes.

Experience

Total Experience: 10 yrs 10 mos
Average Tenure: 2 yrs 2 mos
Current Experience: 3 yrs 1 mo

Allen Digital

2 roles

Staff Engineer

Promoted

Apr 2024 - Present · 2 yrs 2 mos · Bengaluru · On-site

  • Architected and scaled an LLM-based AI platform handling millions of daily requests, significantly improving compute efficiency through asynchronous processing, connection pooling, and end-to-end observability.
  • Led the migration to an agent-based architecture using the Google Agent Development Kit (ADK), enabling reliable and scalable AI workflows with a seamless production rollout.
  • Built a unified RAG platform with intelligent caching and resilience mechanisms, eliminating redundant inference and substantially improving response latency.
  • Optimized model hosting on Kubernetes through rightsizing, autoscaling, and workload tuning, driving meaningful infrastructure cost efficiencies.
  • Consolidated fragmented search infrastructure into a unified architecture, improving query performance while simplifying data pipelines and operational complexity.
  • Introduced evaluation-gated deployment pipelines to ensure production quality, reducing the risk of faulty releases and strengthening system reliability.
  • Led backend modernization efforts, improving maintainability, reducing service sprawl, and enhancing CI/CD robustness.
  • Designed and built a conversational agent platform with an end-to-end orchestration layer, including configurable RAG pipelines, multimodal OCR integration, and safety systems—enabling scalable, low-latency, and high-quality automated query resolution.
Agent Development · Distributed Systems · Large Language Model Operations (LLMOps) · Cloud Applications · Kubernetes · MLOps · +2

Senior Software Engineer

Apr 2023 - Mar 2024 · 11 mos · Bengaluru · On-site

  • Query & Doubt Management Platform: Owned the core service powering the end-to-end query lifecycle, including creation, intelligent routing, and expert workflows. Improved platform availability, responsiveness, and operational reliability through scalable architecture and performance optimizations.
  • Leadership & Engineering Excellence: Led a team of engineers while driving key architectural decisions and platform direction. Established an observability-first, metrics-driven engineering culture leveraging OpenTelemetry, SLOs, experimentation frameworks, and standardized best practices across Go, Java, and Python services.
Cloud Applications · Go (Programming Language) · Java · Python · Observability · Performance Optimization · +1

Amazon

Software Development Engineer -2

Mar 2020 - Apr 2023 · 3 yrs 1 mo · Bengaluru · Remote

  • Owned event-driven microservices at massive scale, ensuring system reliability through effective use of throttling, retries, and autoscaling strategies.
  • Led operational readiness for high-traffic scenarios by implementing robust monitoring, detailed runbooks, and deployment safeguards to maintain system stability.
  • Migrated key workloads from relational models to graph-based access patterns, improving query efficiency and reducing latency for complex data relationships.
  • Designed and built scalable internal APIs on Amazon DynamoDB, enabling efficient and reliable data access for critical services.
  • Redesigned database partitioning strategies to eliminate hot spots and sustain high performance under heavy load.
Microservices · Monitoring · DynamoDB · Graph-based Access Patterns · Database Partitioning · Distributed Systems · +1

SCA Technologies

3 roles

Tech Lead

Jul 2019 - Feb 2020 · 7 mos · Gurgaon, India

  • Architected and optimized freight cost and delivery APIs with dynamic pricing for CRM systems.
  • Re-engineered Excel-based workflows into secure, modular web applications.
  • Built robust SQL Server pipelines that improved ETL and analytics.
  • Supported cross-functional teams in project delivery.
API Development · SQL Server · ETL · Analytics · Cloud Applications

Senior Software Engineer

Promoted

Jul 2018 - Jul 2019 · 1 yr · Gurgaon, India

Associate Software Engineer

Jun 2015 - Dec 2017 · 2 yrs 6 mos · Gurugram

  • Built RESTful APIs to replace manual integrations, enabling real-time data exchange and reducing operational overhead.
  • Contributed to ERP system rollouts, improving release quality, deployment processes, and defect tracking practices.
  • Developed and optimized ETL pipelines and database queries, enhancing reporting performance and data reliability.
  • Replaced manual spreadsheet-driven workflows with web-based internal tools, improving efficiency, traceability, and auditability.
  • Enhanced ETL pipelines with monitoring, alerting, and recovery mechanisms, reducing downtime during critical business operations.
  • Technologies: Java, SQL Server, Oracle, ERP Integrations
RESTful APIs · ETL · Database Queries · Cloud Applications

Power2SME

Senior Software Engineer

Dec 2017 - Jul 2018 · 7 mos · Gurgaon, India

  • Optimized payment and reconciliation services using Spring Boot and SQL, improving transaction throughput.
  • Initiated codebase modularization for service isolation and faster feature delivery.
  • Set up SonarQube and Nexus, laying CI/CD foundations.
  • Collaborated across teams to improve financial workflow automation.
Spring Boot · SQL · CI/CD · Cloud Applications

Education

Indian Institute of Technology, Roorkee

Bachelor of Technology (B.Tech.)

Jan 2011 - Jan 2015

Adarsh Vidya Mandir

INTER — Science

Jan 2009 - Jan 2011
