Avi Nagpal — SRE (Site Reliability Engineer)
As a Site Reliability Engineer, I specialize in building scalable, resilient systems and automating infrastructure operations. I’ve worked extensively with both AWS and Apple’s private cloud, deploying and managing Kubernetes clusters that support high-throughput applications handling 10K+ TPS. I bring hands-on experience across the SRE toolchain: automating with Shell and Python, managing deployments with Jenkins and Spinnaker (including Canary and Red/Black strategies), and configuring infrastructure with Ansible. I’ve also built end-to-end observability stacks using Prometheus and Grafana, with OpenTelemetry for tracing and a strong focus on real-time alerting and diagnostics. From configuring reverse proxies like NGINX to fine-tuning dashboards and system health metrics, I’ve worked across the stack to ensure availability, performance, and continuous improvement in production environments. Additionally, I’ve implemented auto-scaling strategies that scale services up or down based on real-time traffic patterns, optimizing both performance and cost.
Stackforce AI infers this person is a Cloud Infrastructure Engineer with strong DevOps capabilities.
Location: Hyderabad, Telangana, India
Experience: 13 yrs 4 mos
Skills
- Site Reliability Engineering
- Kubernetes
- DevOps
- AWS
- Monitoring
- Network Management
- Automation
- Reporting
- Linux Administration
- Web Development
Career Highlights
- Expert in building scalable, resilient systems.
- Extensive experience with AWS and Kubernetes.
- Proficient in automation using Shell and Python.
Work Experience
Apple
Site Reliability Engineer (5 yrs 2 mos)
Paytm
Senior DevOps Engineer (2 yrs)
Ericsson
Senior Automation Engineer (3 yrs 1 mo)
HCL Technologies
Senior Linux Administrator (2 yrs 4 mos)
NIIT Technologies Limited
Web Developer (9 mos)
AlfaIT Ltd.
J2EE Developer (2 mos)
Netmax Technologies
CCNA Project Trainee (2 mos)
Education
Bachelor of Engineering (B.E.) at Chitkara University
at RSV