Sumit Prasad

SRE (Site Reliability Engineer)

Bengaluru, Karnataka, India9 yrs 11 mos experience
AI ML PractitionerAI Enabled

Key Highlights

  • Streamlined canary deployment process for production stability.
  • Achieved PCI compliance certification through thorough audits.
  • Designed CI/CD platform enhancing deployment efficiency.
Stackforce AI infers this person is a SaaS Infrastructure Engineer with expertise in Site Reliability Engineering and DevOps.

Contact

Skills

Core Skills

Site Reliability EngineeringTechnical LeadershipDevopsKubernetesInfrastructure Capacity PlanningContinuous Integration And Continuous Delivery (ci/cd)

Other Skills

AWSAmazon Web Services (AWS)AnsibleArgoArgoCDArtificial Intelligence for DesignAzure Kubernetes Service (AKS)Change ManagementCommunicationCustomer ServiceDockerElastic Stack (ELK)GrafanaHigh Availability (HA)Infra compute

About

As an Staff SRE, I oversee the deployment, scaling, and reliability of large-scale distributed software applications using Kubernetes and other cutting-edge technologies. I have streamlined the canary deployment process, achieved PCI compliance certification, and conducted proof-of-concept evaluations on various tools and platforms. I have also acquired multiple certifications in DevOps, site reliability engineering, and Kubernetes. I am passionate about process automation, efficiency optimization, and customer satisfaction. I collaborate with cross-functional teams and deliver innovative solutions that enhance the performance and stability of the systems.

Experience

9 yrs 11 mos
Total Experience
1 yr 10 mos
Average Tenure
5 mos
Current Experience

Groww

Site Reliability Engineering Lead

Dec 2025Present · 5 mos · Bengaluru, Karnataka, India · On-site

Site Reliability EngineeringInfra securityInfra computeInfra observabilityNetwork Infrastructure ArchitectureTechnical Leadership

Angel one

Staff Site Reliability Engineer

Aug 2024Dec 2025 · 1 yr 4 mos · Bengaluru, Karnataka, India · On-site

Amazon Web Services (AWS)DevOpsKubernetesPython (Programming Language)TerraformArgoCD

Myntra

2 roles

Technical Lead SRE

Promoted

Apr 2022Aug 2024 · 2 yrs 4 mos

  • ° Streamlined the canary deployment process using Argo Rollout, reducing risk and minimizing the impact of potential issues on production systems.
  • ° Leveraged Argo CD's declarative configuration and GitOps principles to achieve consistent and auditable deployments for platform components across multiple environments.
  • ° Offer exceptional operational support and engineering expertise for multiple large-scale distributed software applications.
  • ° Developed and executed effective strategies for cluster upgrades, ensuring minimal impact on production workloads and seamless transitions.
  • ° Successfully achieved PCI compliance certification by completing a thorough audit.
Linux KernelArgoLeadershipSystem ArchitectureAzure Kubernetes Service (AKS)Kubernetes+13

Senior Site Reliability Engineer

Jan 2021Apr 2022 · 1 yr 3 mos

  • ° Orchestrated the deployment and scaling of containerized applications across the Kubernetes clusters, optimizing resource utilization and minimizing downtime.
  • ° Conducted proof-of-concept (PoC) evaluations on Linkerd, Thanos, Argo CD, Argo Rollouts, and Haproxy Ingress Controller.
  • ° Implemented robust monitoring and alerting systems to proactively detect and address issues in the Kubernetes clusters, ensuring high reliability and performance.
  • ° Actively participated in capacity planning, scaling, and capacity management to meet evolving business needs.
  • ° Oversaw the configuration, deployment, and maintenance of self managed HAProxy load balancers to ensure optimal performance and scalability for large-scale production applications.
Linux KernelAzure Kubernetes Service (AKS)Problem SolvingChange ManagementInfrastructure Capacity PlanningUnix Administration+5

Bounce

Senior Devops Engineer

Nov 2019Jan 2021 · 1 yr 2 mos · Bangalore

  • ° Designed a CI/CD platform using AWS ECS, ECR, Jenkins, and Bitbucket.
  • ° Containerized numerous applications using Docker.
  • ° Created an observability platform utilizing Prometheus and Grafana to monitor infrastructure.
  • ° Implemented centralized logs management using open-source components: Kibana, Fluentd, and
  • Elasticsearch.
Linux KernelJenkinsElastic Stack (ELK)DockerPrometheusAmazon Web Services (AWS)+8

Ola (ani technologies pvt. ltd)

2 roles

Senior Devops Engineer

Promoted

Apr 2019Nov 2019 · 7 mos · Bengaluru, Karnataka, India

  • ° Collaborated on the implementation of AWS and Azure services to support the development of diverse web applications' infrastructure.
  • ° Orchestrated the seamless migration of the entire cloud infrastructure from AWS to Azure, ensuring a successful transition.
  • ° Took proactive measures in executing cost optimization initiatives using scripting languages like Python and Bash.
Linux KernelChange ManagementInfrastructure Capacity PlanningUnix AdministrationHigh Availability (HA)Interpersonal Skills

Devops Engineer (OlaCabs and Foodpanda)

Apr 2017Mar 2019 · 1 yr 11 mos · Bengaluru, Karnataka, India

  • ° Actively participated in 24/7 on-call schedules, which encompassed monitoring application health, incident management, and server maintenance.
  • ° Orchestrated the onboarding of applications, load balancing, dynamic configuration, and monitoring.
Change ManagementUnix AdministrationHigh Availability (HA)

Bobcares

2 roles

Linux System Administrator

Promoted

Jan 2017Mar 2017 · 2 mos

  • ° Proficiently troubleshooted Linux servers and various CMS platforms including WordPress, Joomla, and Magento, resolving issues promptly and effectively.
  • ° Conducted CMS installations and performed Tomcat installations, ensuring smooth deployment and configuration.
  • ° Proactively implemented server patching and server hardening techniques to enhance security and prevent unauthorized access.
  • ° Actively monitored server load and employed measures to mitigate brute force attacks on CMS sites, maintaining a secure environment.
  • ° Ensured compliance with PCI Data Security Standards (PCI DSS) by identifying and resolving security vulnerabilities on servers.
  • ° Utilized Nagios Monitoring System to actively monitor processes and services on VPS and Dedicated servers, ensuring optimal performance and availability.
Unix Administration

Junior linux system administrator

Jan 2016Dec 2016 · 11 mos

  • Basic Troubleshooting of the Linux servers and the CMS like WordPress, Joomla, Magento etc., CMS installation, Tomcat installation, Server patching, Server Hardening and Prevention of hacking by continuous monitoring the server load and by stopping the brute force attacks in the CMS sites.
Unix Administration

Education

Sikkim Manipal Institute Of Technology

Btech — Electronics and Communications Engineering

Sir Tashi Namgyal Sr. Sec School

+2 — Science

West Point Sr. Sec School

Class 10 — Science

Stackforce found 100+ more professionals with Site Reliability Engineering & Technical Leadership

Explore similar profiles based on matching skills and experience