Akarshi Kapoor — SRE (Site Reliability Engineer)
I’m a Lead Site Reliability/Software Engineer with 12+ years of experience building and operating large-scale distributed systems, observability platforms, and backend infrastructure. My work spans Cisco, Netflix, NTT Ltd., Bank of America, Accenture, and TCS, where I’ve led high-impact initiatives across Kubernetes, Kafka/MSK, Terraform, cloud infrastructure, observability, and automation. I specialize in designing reliable, scalable systems and helping teams execute complex roadmaps with high engineering standards. At Cisco, I architected an event-driven AI pipeline on Kafka/MSK and Kubernetes that processes 2B+ of streaming events with sub-second end-to-end latency. I also built a multi-tenant event streaming platform for a US Government project in just 30 days, owning infrastructure, networking, and security end to end. I’ve supported distributed systems operating at P99 latency below 200ms while also leading hiring and team development initiatives. At Netflix, I improved deployment performance by 30%, helped maintain 99.99% uptime during high-traffic conditions, and built AI-assisted reliability tooling that reduced MTTD by 60% and MTTR by 50%. Across earlier roles, I’ve delivered ~50% AWS cost savings, 35% performance improvements, and major reductions in operational toil through automation, cloud modernization, and CI/CD improvements. I’m passionate about building resilient platforms, leading strong engineering teams, and solving hard problems in distributed systems, observability, event-driven architecture, and AI-enabled operations.
Stackforce AI infers this person is a highly skilled Site Reliability Engineer specializing in large-scale distributed systems and cloud infrastructure.
Location: Bengaluru, Karnataka, India
Experience: 11 yrs 10 mos
Skills
- Site Reliability Engineering
- Kubernetes
- Apache Kafka
- Infrastructure Management
- Artificial Intelligence (ai)
- Devops
- Cloud Computing
Career Highlights
- Architected a billion-event processing pipeline.
- Achieved 99.99% uptime during high-traffic conditions.
- Delivered significant AWS cost savings through optimization.
Work Experience
Cisco
Lead Site Reliability Engineer (1 yr 8 mos)
Netflix
Senior Site Reliability Engineer (1 yr 10 mos)
NTT
Senior Software Engineer (1 yr 7 mos)
Bank of America
Senior Software Engineer (8 mos)
Accenture
Senior Software Engineer (2 yrs)
Tata Consultancy Services
Software Engineer (4 yrs 1 mo)
Education
B.Sc. Honors at Dayalbagh Educational Institute
MBA at Dayalbagh Educational Institute
Higher Secondary at St. Conrad’s Inter College