Neha Kundra

SRE (Site Reliability Engineer)

Noida, Uttar Pradesh, India
11 yrs 10 mos experience
Most Likely To Switch: Highly Stable

Key Highlights

  • Reduced incident response time by 78% through proactive measures.
  • Architected zero-downtime infrastructure for peak traffic.
  • Implemented AI-powered observability frameworks for faster issue detection.

Stackforce AI infers this person is a Senior Site Reliability Engineer specializing in cloud-native systems within the Fintech and Enterprise sectors.

Skills

Core Skills

Site Reliability Engineering, Cloud Computing, Amazon Web Services (AWS), Cloud Applications, Microservices, Linux System Administration

Other Skills

Distributed Systems, Grafana, Prometheus.io, Go (Programming Language), DevOps, Automation Tools, Performance Tuning, Networking, Infrastructure, Elasticsearch, Apache Kafka, Consul, Partitioning, Elastic Stack (ELK)

About

I am a Senior Site Reliability Engineering (SRE) Architect with 12+ years of experience specializing in the design and operation of hyper-scalable, resilient cloud-native systems that handle billions of daily requests. My career has spanned high-growth Fintech (Paytm) and large-scale Enterprise/Cloud (Oracle), culminating in leading critical infrastructure at Adobe.

My focus is driving measurable operational excellence:

  • Incident Reduction & MTTR: Led initiatives that reduced mean incident response time by 78% and decreased Mean Time to Recovery (MTTR) by 65% through proactive Chaos Engineering.
  • Scalability & Performance: Architected infrastructure that guarantees zero downtime while auto-scaling to absorb 300% traffic spikes during peak loads.
  • AI/ML for Observability: Designed and implemented distributed monitoring and anomaly detection frameworks (AI-Powered Observability) that identify issues 15x faster than traditional methods, shifting our organization to a proactive posture.
  • Systems Design & Automation: Deep expertise in systems design, infrastructure as code (IaC), and developing Python-based automation/remediation tools that significantly boost cross-team efficiency and reduce manual intervention.

I am passionate about applying software engineering principles to solve complex reliability and scaling challenges for the world’s largest applications.

Experience

Total Experience: 11 yrs 10 mos
Average Tenure: 1 yr 11 mos
Current Experience: 4 yrs 4 mos

Adobe

Computer Scientist II

Feb 2022 - Present · 4 yrs 4 mos · Noida, Uttar Pradesh, India

Cloud Applications, Distributed Systems, Cloud Computing, Microservices, Grafana, Prometheus.io, +4

Oracle

Senior Site Reliability Developer at OCI

Dec 2020 - Feb 2022 · 1 yr 2 mos · Bengaluru, Karnataka, India

Amazon Web Services (AWS), Performance Tuning, Prometheus, Networking, Infrastructure, Site Reliability Engineering

Paytm

Sr. DevOps Engineer

Jul 2018 - Dec 2020 · 2 yrs 5 mos · Noida

Elasticsearch, Apache Kafka, Cloud Applications, Consul, Performance Tuning, Microservices, +4

Acquia

DevOps Engineer

Jan 2017 - May 2018 · 1 yr 4 mos · New Delhi Area, India

Amazon Web Services (AWS), Cloud Applications, Performance Tuning, Networking, Red Hat Linux, Site Reliability Engineering

Cisco Systems, Inc. / Aricent Technology Holdings

Linux System Administrator

Jun 2015 - Dec 2016 · 1 yr 6 mos · Gurgaon

Networking, Linux System Administration, Red Hat Linux

Tech Mahindra (formerly Mahindra Satyam)

System Engineer

Jan 2014 - Feb 2015 · 1 yr 1 mo · Noida

Web Application Firewall, Content Distribution Networks, Red Hat Certified Engineer (RHCE), Networking, Linux System Administration, RHCSA, +2

Education

Kurukshetra University

Bachelor’s Degree — Engineering

Jan 2009 - Jan 2013

SMJPS

High School — Non Medical

Jan 2007 - Jan 2009
