Rahul Reddy

SRE (Site Reliability Engineer)

Hyderabad, Telangana, India4 yrs 4 mos experience

Most Likely To SwitchHighly Stable

Key Highlights

Expert in multi-cloud infrastructure with AWS, Azure, and GCP.
Proficient in Infrastructure as Code using Terraform and Ansible.
Strong background in Site Reliability Engineering and DevOps practices.

Stackforce AI infers this person is a Cloud Infrastructure Engineer specializing in Site Reliability Engineering within the SaaS industry.

Contact

Skills

Core Skills

Site Reliability EngineeringInfrastructure As Code (iac)Devops

Other Skills

Amazon Web Services (AWS)Python (Programming Language)TerraformGitLab CIGrafanaPrometheusCloudWatchKubernetesDocker.NET CoreDesign PatternsBashPowerShellPythonJenkins

About

Cloud & Site Reliability Engineer with 4 years of experience designing, automating, and operating distributed infrastructure across AWS, Azure, and GCP. Skilled in Infrastructure as Code (Terraform · Ansible), Kubernetes · Docker · Helm, CI/CD (Jenkins · GitLab CI), and observability (Prometheus · Grafana · CloudWatch · Azure Monitor). Experienced implementing SLIs/SLOs (Service Level Indicators / Objectives), incident response and root cause analysis (RCA), and building auto-healing systems that reduce MTTR and improve availability. Comfortable across DevOps / SRE / Platform / Infrastructure roles—focusing on reliability, cost optimization, and security in multi-cloud environments.

Experience

4 yrs 4 mos

Total Experience

2 yrs 2 mos

Average Tenure

3 yrs 1 mo

Current Experience

Teleperformance

Site Reliability Engineer

May 2023 – Present · 3 yrs 1 mo · Gurugram, Haryana, India

Engineered and maintained multi-cloud infrastructure (AWS & Azure) for high-availability and cost optimization.
Automated infrastructure provisioning via Terraform and GitLab CI, enabling repeatable deployments across environments.
Implemented observability stack (Grafana + Prometheus + CloudWatch) tracking SLIs / SLOs to improve uptime metrics by 30%.
Containerized legacy workloads using Docker and Kubernetes to standardize releases and reduce deployment errors.
Wrote Python / PowerShell automation scripts for monitoring and incident response, cutting manual operations by 35%.

Amazon Web Services (AWS)Python (Programming Language)TerraformGitLab CIGrafanaPrometheus+5

Curl

Engineer

Jan 2022 – Apr 2023 · 1 yr 3 mos · Bengaluru, Karnataka, India

Supported production C#/.NET microservices hosted on Linux and AWS/Azure hybrid clouds, ensuring >99.9% service uptime.
Automated recurring operational tasks (health checks, log rotation, restarts) using Bash/PowerShell/Python, reducing MTTR by 40%.
Collaborated with DevOps team to implement CI/CD pipelines in GitLab CI/Jenkins, enabling safe rollbacks and versioned deployments.
Monitored performance and error rates via Prometheus + Grafana dashboards, introducing new SLIs/SLOs that improved incident detection by 25%.
Executed infrastructure changes with Terraform and Ansible, standardizing provisioning for development and QA environments.
Managed containerized services on Docker and supported early Kubernetes adoption for staging workloads.
Conducted root cause analysis (RCA) for production incidents and documented corrective actions in ServiceNow and Confluence.
Partnered with developers to optimize application logs and metrics, improving troubleshooting speed by 30%.