S

Santhosh Deepu Patrayuni

SRE (Site Reliability Engineer)

Bengaluru, Karnataka, India19 yrs 9 mos experience
AI EnabledHighly Stable

Key Highlights

  • Over 13 years of IT experience with strong cloud expertise.
  • Led significant projects optimizing cloud infrastructures.
  • Certified in AWS and Kubernetes with a focus on reliability.
Stackforce AI infers this person is a Cloud Infrastructure and DevOps expert specializing in SaaS solutions.

Contact

Skills

Core Skills

KubernetesPythonDevopsAws

Other Skills

Agile MethodologiesAmazon EKSAmazon Web Services (AWS)AnsibleAppDynamicsApplication VirtualizationArgoCDAzure Kubernetes Service (AKS)BashChefContinuous IntegrationDesktop VirtualizationDockerGenerative AIGit

About

Site Reliability Engineer | DevOps Engineer | Cloud Engineer | AWS | Kubernetes | Docker | Ansible | Python | Linux | Terraform ** AWS and Kubernetes Certified Engineer having more than 13+ years of overall professional IT experience and 4+ years of experience in dealing with Cloud service providers like AWS, DevOps Implementation, Build & Release engineering and Strong Virtualization background. ** Technology skills used in my career include - AWS | Kubernetes | Docker | Jenkins | Chef | Ansible | Linux | Git | Python | Terraform | Splunk | AppDynamics | Wavefront | Nginx | Server Virtualization| Desktop Virtualization| Application Virtualization | Storage | Networking | Bash and many more ** My Skills also include - Design, Architect and Implement Highly Available/Fault Tolerant Scalable services which are running in Private, Public and Hybrid environments using AWS(Public), VMWare, vCloud and OpenStack(Private) - Design and Implemententation experience on Monolithic, SOA, Micro-Services and Serverless Architectures - Good understanding on CICD and Release Engineering - Strong experience on Virtualization & Storage

Experience

Nvidia

Staff Site Reliability Engineer

Jan 2024Present · 2 yrs 2 mos · Hybrid

Generative AIKubernetesGoogle Kubernetes Engine (GKE)Azure Kubernetes Service (AKS)Amazon EKSPython

Broadcom

Reliability Engineer 5

Oct 2023Jan 2024 · 3 mos · Bengaluru, Karnataka, India · On-site

Tanzu observability by wavefront

Staff Site Reliability Engineer

Aug 2020Oct 2023 · 3 yrs 2 mos · Bengaluru, Karnataka, India · Hybrid

  • Proven Leadership in Infrastructure and DevOps:
  • Guided as Lead Engineer in numerous projects, driving the architecture and implementation of changes to optimize Tanzu Observability (Wavefront) clusters.
  • Oversaw the management of expansive AWS and Google Cloud infrastructures, comprising over 2000+ cloud-computing platform resources.
  • Spearheaded updates to versions, scripts, and documentation of Ansible playbooks and Terraform modules, ensuring seamless deployment and maintenance.
  • Managed a portfolio of 500+ Kubernetes workloads , implementing standardization and configuration through ArgoCD, Kustomize, and Helm, resulting in increased operational efficiency.
  • Enhanced the usability of Kubernetes proxies, significantly reducing failure rates and improving overall system reliability.
  • Developed custom tools in Go and Python to expedite troubleshooting and implemented bots to support on-call responsibilities, enhancing incident response times.
  • Engineered a mutli sharded system for FoundationDB, substantially increasing speed and capacity in super-large clusters.
  • Optimized PagerDuty alerts, leading to a reduction in overall alerts and increased efficiency in alert management.
  • Contributed as an integral part of the on-call rotation, taking responsibility for PagerDuty alerts and successfully recovering failed systems and applications.
DevOpsPythonAgile MethodologiesContinuous IntegrationAmazon Web Services (AWS)Wavefront+8

Intuit india

Senior Site Reliability Engineer

Oct 2012Aug 2020 · 7 yrs 10 mos · India

  • Experience in orchestrating the migration of services between Private Data Centers and AWS, showcasing versatility in adapting to evolving infrastructure requirements.
  • Experience in Infrastructure as Code (IaC) like Terraform & AWS CloudFormation, ensuring seamless management and scalability.
  • Employing a comprehensive suite of monitoring tools including Splunk, AppDynamics as APM, Wavefront, Telegraf with InfluxDB, AWS CloudWatch etc., to uphold optimal system performance and reliability.
  • Wrote many scripts and contributed in writing tools and services aimed at enhancing operational efficiency.
  • Experience in working with SQL databases like MySQL, Oracle and Non-SQL databases like AWS DynamoDB
  • Leveraging advanced troubleshooting techniques for Linux systems using tools like top, htop, vmstat, iostat, tcpdump, etc., ensuring prompt identification and resolution of performance issues.
  • Experience in automating mandane and manual tasks using Python
DevOpsPythonAmazon Web Services (AWS)JenkinsSite Reliability EngineeringAWS

Tata consultancy services

Senior IT Analyst

Jan 2011Oct 2012 · 1 yr 9 mos · Bangalore, India · On-site

Dxc technology

3 roles

VMware & VDI Administrator

Promoted

Apr 2010Jan 2011 · 9 mos

VMware Administrator

Promoted

Sep 2009Apr 2010 · 7 mos

System Administrator

Jul 2006Sep 2009 · 3 yrs 2 mos

Education

Andhra University

Masters in Computer Applications — Computer Science

Jan 2003Jan 2006

Andhra University

Bachelor of Science - BS — Computer Science

Apr 2000Apr 2003

Stackforce found 100+ more professionals with Kubernetes & Python

Explore similar profiles based on matching skills and experience