Tarun Sharma — SRE (Site Reliability Engineer)
I am a Site Reliability and Cloud Engineer with 5+ years of experience operating and improving reliability for production-grade cloud systems at scale. Currently working at Amazon (AWS Managed Services), I focus on maintaining highly available, scalable infrastructure while reducing operational toil through automation and standardization. My work revolves around: • Incident management, root cause analysis, and postmortems • Improving system reliability through monitoring, alerting, and observability • Designing and operating distributed systems on AWS • Infrastructure as Code (Terraform, CloudFormation) • Performance optimization and cost efficiency I have hands-on experience managing multi-tenant cloud environments, handling critical production incidents, and collaborating across teams to ensure high availability and operational excellence. I hold 8 industry certifications across AWS, Azure, and GCP, and I am actively focused on advancing deeper into Site Reliability Engineering, particularly for large-scale distributed systems. I am open to global opportunities in SRE and production engineering roles.
Stackforce AI infers this person is a Cloud Infrastructure Engineer with a focus on Site Reliability Engineering.
Location: Bengaluru, Karnataka, India
Experience: 5 yrs 2 mos
Skills
- Site Reliability Engineering
- Distributed Systems
- Infrastructure As Code
- Devops
Career Highlights
- 5+ years of experience in Site Reliability Engineering.
- 8 industry certifications across AWS, Azure, and GCP.
- Expertise in managing multi-tenant cloud environments.
Work Experience
Amazon Web Services (AWS)
AWS Cloud Operations Engineer (AWS Managed Services) (1 yr 3 mos)
Rackspace Technology
AWS Cloud Administrator II (7 mos)
DXC Technology
AWS Cloud Engineer (3 yrs 4 mos)
Trainee (2 mos)
CDAC Mohali
Network Administrator (1 mo)
Education
Bachelor of Technology - BTech at Chandigarh Engineering College