Sri Sindhu Nunna

SRE (Site Reliability Engineer)

India · 1 yr 11 mos experience
AI Enabled · AI ML Practitioner

Key Highlights

  • Implemented automation for OS patching at Amazon.
  • Resolved over 6000 code quality issues.
  • Achieved cost reduction through infrastructure optimization.
Stackforce AI infers this person is a DevOps Engineer with strong expertise in cloud infrastructure and automation.

Skills

Core Skills

Kubernetes · Site Reliability Engineering · DevOps · AWS

Other Skills

Amazon EKS · Ruby · Amazon Route 53 · Argo · Argo Vault · Kubectl · Mockito · Sonar issues · Automation · Loki · Terraform · Grafana · System Monitoring · Monitoring · Elasticsearch

About

As a Computer Science and Engineering student at Gayatri Vidya Parishad, I am eager to apply my skills and knowledge in the field of DevOps. I have a strong passion for learning new technologies and improving the quality and security of software delivery. I recently completed a six-month stint as a DevOps Engineer II at Amazon, where I contributed to various projects and tasks using AWS resources and tools. I implemented a Quilt pipeline with LPT to automate OS patching for 60 orphan hosts, ensuring continuous integration and delivery and mitigating EC2 risks. I also migrated CDK from V1 to V2 and KeyMaster from V1 to C2, tested application code changes, and resolved 70+ customer cut tickets. Additionally, I mitigated SAS and Shepherd risks and created cases for more than 20,000 records. I value collaboration, innovation, and customer satisfaction, and I can bring diverse perspectives and experiences to the DevOps team.

Experience

Total Experience: 1 yr 11 mos
Average Tenure: 1 yr 11 mos
Current Experience: --

Phenom

Site Reliability Engineer I

Oct 2023 - Sep 2025 · 1 yr 11 mos · Hyderabad, Telangana, India · On-site

  • Scheduled weekly automation reports using Kestra to identify underutilized Kubernetes nodes from Prometheus and managed services (Redis, RDS, Loki logs); integrated metrics into Lightdash dashboards with data stored in Snowflake for comprehensive Node usage monitoring.
  • Resolved over 80 monitoring tickets, addressing issues promptly and maintaining system health.
  • Conducted Root Cause Analysis (RCA) for a critical pre-production downtime issue using Terraform.
  • Facilitated the onboarding process of DevLake for our team, enhancing collaboration and productivity.
  • Standardized Helm charts and automation scripts for extracting MongoDB and Redis configurations, streamlining environment setup and deployment.
  • Integrated automated REST API testing using the Robot Framework, improving QA coverage.
  • Increased unit test coverage by 60% through the implementation of Cursor AI agents, ensuring higher code reliability.
  • Resolved over 6000 code quality issues, significantly improving code maintainability and reducing tech debt.
  • Retrieved and analyzed data from ArgoCD APIs to monitor release pipelines; automated hourly Slack reports for enhanced visibility in release management.
  • Automated detection and cleanup of unused Persistent Volume Claims (PVCs) and stale Git branches, optimizing resource usage.
  • Strengthened security posture by implementing the Argo Vault plugin for secure secrets management in CI/CD pipelines.
  • Achieved a 5% cost reduction through infrastructure optimization and resource tuning, reduced alert fatigue by 3%, and improved query performance by automating index creation through Liquibase.
Kubernetes · Amazon EKS · Site Reliability Engineering

Amazon

DevOps Engineer II

Jan 2023 - Jun 2023 · 5 mos · Hyderabad · On-site

  • Implemented a Quilt pipeline with LPT to automate OS patching for 60 orphan hosts, ensuring CI/CD and mitigating EC2 risks.
  • Resolved 70+ customer cut tickets, providing efficient and effective solutions.
  • Mitigated SAS and Shepherd risks in a timely manner, ensuring the continuous stability of the pipeline.
  • Implemented case creation for more than 20,000 records using AWS resources.
  • Resolved EC2, Docker, SAS, Shepherd, and Merge From Live risks.
  • Migrated CDK from V1 to V2 and KeyMaster from V1 to C2.
  • Tested various application code changes and got 8 CRs approved.
Ruby · Amazon Route 53 · DevOps · AWS

Education

Gayatri Vidya Parishad College of Engineering (Autonomous)

Undergraduate, Computer Science and Engineering

Jan 2019 - Jan 2023
