Sagar Utekar

SRE (Site Reliability Engineer)

Pune, Maharashtra, India · 7 yrs 7 mos experience
Highly Stable

Key Highlights

  • 8+ years in DevOps and SRE roles
  • Expert in Kubernetes and cloud technologies
  • Proven track record in automation and efficiency
Stackforce AI infers this person is a SaaS Infrastructure Engineer with strong expertise in DevOps and cloud technologies.

Skills

Core Skills

Kubernetes, DevOps, Observability, Cloud Computing

Other Skills

Prometheus, Grafana, ELK Stack, PagerDuty, Terraform, ArgoCD, Jenkins, AWS, Helm, KOPS, Azure, Python, Go, Bash, Kyverno

About

DevOps+SRE+Cloud Mentor | GSoC22 | SRE@CrowdStrike | CKS | CKA | CKAD | KCNA | KCSA | Terraform & Prometheus Certified | AWS | Azure | Ansible | Jenkins | Python | Go | ELK | Cloud

8+ years of experience managing client-facing projects, troubleshooting technical issues, and working with engineering and customers.

  • Experience maintaining internet-facing, production-grade applications in virtualized environments.
  • Experience writing software in Java, Python, Go, and Node.js.
  • Experience with cluster deployment and orchestration technologies, including Chef, Salt, Ansible, Docker, Kubernetes, Helm, OpenStack, Jenkins, and KOPS.
  • Knowledge of managing Kubernetes in large production environments.
  • Experience with monitoring and alerting infrastructure using ELK, Prometheus, Grafana, PagerDuty, Slack, and Datadog.
  • Experience with scalable networking technologies (e.g., load balancers, firewalls) and web standards (e.g., REST APIs, web security mechanisms).
  • Experience in system administration tasks in Linux, Unix, or Windows, and familiarity with standard IT security practices (e.g., encryption, certificates, key management).
  • Demonstrated understanding of open source server software (e.g., NGINX, RabbitMQ, Redis, Elasticsearch).

Experience

KubeStellar

CNCF KubeStellar Sandbox Project Maintainer

Dec 2025 - Present · 3 mos

CrowdStrike

Site Reliability Engineer

Aug 2025 - Present · 7 mos · India · Remote

VMware

2 roles

Site Reliability Engineer MTS3

Promoted

Mar 2023 - Jul 2025 · 2 yrs 4 mos · India · Remote

  • Promoted from MTS2 to MTS3 for exceptional contributions in Kubernetes migration, security automation, and cost optimization.
  • Managed and scaled Kubernetes clusters using Helm, KOPS, and AWS, ensuring high availability, scalability, and performance for critical workloads.
  • Designed and implemented monitoring and alerting systems using Prometheus, Grafana, ELK Stack, and PagerDuty, reducing incident response time by 30%.
  • Automated infrastructure provisioning and deployment processes using Terraform, ArgoCD and Jenkins to standardize declarative infrastructure management and automated deployments, reducing manual intervention and improving deployment efficiency.
  • Collaborated with cross-functional teams to improve system reliability, scalability, and security, ensuring 99.9% uptime for production environments.
  • Conducted root cause analysis (RCA) for critical incidents and implemented preventive measures to reduce recurrence.
  • Led on-call rotations, swiftly resolving critical incidents and minimizing business impact through proactive monitoring and automation.
  • Developed runbooks and incident management playbooks, streamlining resolution processes and reducing response times.
  • Automated log analysis and remediation tasks using Python and shell scripting, cutting manual toil by 35%.
Kubernetes, Prometheus, Grafana, ELK Stack, PagerDuty, Terraform +3

Site Reliability Engineer MTS2

Mar 2021 - Feb 2023 · 1 yr 11 mos · India · Remote

  • Migrated from self-managed Kubernetes clusters to Amazon EKS and Azure AKS, enhancing scalability, security, and operational efficiency.
  • Strengthened security across infrastructure, containers, and cloud environments by enforcing Kyverno/OPA policies, RBAC, and CIS Kubernetes benchmarks.
  • Optimized cloud costs by 20% through autoscaling, right-sizing resources, and leveraging AWS Spot Instances for non-production workloads.
  • Developed automation scripts in Python, Go, and Bash for infrastructure provisioning, log analysis, metrics collection and remediation, reducing manual toil.
Kubernetes, AWS, Azure, Python, Go, Bash +2

YouTube

YouTuber (SRE + DevOps + Cloud)

Mar 2023 - Present · 3 yrs · Remote

  • Sagar.Utekar helps you learn DevOps, Cloud, SRE, programming, and open source, and prepares you for interviews.
  • Topics are explained in simple language with hands-on tutorials.

Avaya

Senior Technical Associate

Aug 2019 - Mar 2021 · 1 yr 7 mos · Pune Area, India

  • Automated repetitive operational tasks using Docker, Kubernetes, Terraform, and Jenkins, improving operational efficiency by 25%.
  • Implemented observability solutions using Datadog and Azure Kubernetes Service (AKS) for real-time monitoring, logging, and troubleshooting of infrastructure.
  • Worked on infrastructure optimization, reducing cloud costs by 15% through resource utilization analysis and scaling strategies.
  • Provided technical guidance and mentorship to junior team members, fostering a culture of continuous learning and improvement.
  • Implemented Infrastructure as Code (IaC) best practices, migrating legacy infrastructure to Terraform-managed environments and improving deployment speed, consistency, and system reliability.
  • Worked on high-priority production issues, reducing MTTR (Mean Time to Resolution) by 30% through improved incident handling and automation.
  • Collaborated with security teams to enforce IT security policies around certificates, encryption, and access controls.
Docker, Kubernetes, Terraform, Jenkins, Datadog, DevOps +1

GS Lab

Software Engineer

Jul 2018 - Aug 2019 · 1 yr 1 mo · Pune, Maharashtra, India

  • Led a team of 5 members to design and implement CI/CD pipelines using Jenkins, Docker, Kubernetes, and SonarQube, reducing deployment time by 20%.
  • Played a key role in developing an IBM product, delivering training sessions on Docker, Kubernetes, Helm, and Python to internal teams.
  • Automated testing and deployment processes, improving release cycle efficiency and reducing manual errors.
  • Worked closely with Dev & QA teams, implementing automated test pipelines and improving test coverage and deployment success rates.
  • Maintained high-traffic, client-facing production applications in virtualized and containerized environments, ensuring reliability and performance.
Jenkins, Docker, Kubernetes, Python, DevOps, Cloud Computing

Atos

Intern

Feb 2018 - May 2018 · 3 mos · Pune, Maharashtra, India

  • Led a team of 4 members to optimize Unix server performance, creating a proof-of-concept (POC) and developing shell scripts for performance monitoring and optimization.
  • Conducted performance analysis and implemented improvements, resulting in a 15% increase in server efficiency.
  • Gained hands-on experience with Linux/Unix systems, shell scripting, and server optimization techniques.

Education

Pune Institute of Computer Technology

Bachelor of Engineering - BE — Computer Science

Jan 2015 - Jan 2018

Dr. Babasaheb Ambedkar Technological University

Diploma — Information Technology

Jan 2012 - Jan 2015
