Sujit Patel

SRE (Site Reliability Engineer)

Bengaluru, Karnataka, India10 yrs 2 mos experience
Most Likely To Switch

Key Highlights

  • Expert in designing scalable Kubernetes platforms.
  • Proven track record in cloud cost optimization.
  • Strong background in automation and CI/CD practices.
Stackforce AI infers this person is a SaaS Infrastructure Engineer with a focus on reliability and automation.

Contact

Skills

Core Skills

KubernetesAwsMonitoringCloud OptimizationDevops

Other Skills

TerraformPythonGitOpsAlertingPrometheus.ioDockerJenkinsGitLab CIAnsibleBashGoogle Cloud Platform (GCP)ManagementTroubleshootingElastic Stack (ELK)Grafana

About

I’m a Lead Site Reliability Engineer with 9+ years of experience building, scaling, and securing cloud infrastructure for fast-moving engineering teams. I enjoy solving deep technical problems, simplifying infrastructure, and creating systems that are reliable, observable, and cost-efficient. Over the years, I’ve worked across AWS, Kubernetes, Terraform, and large-scale production systems, leading migrations, EKS upgrades, Karpenter adoption, observability improvements, and automation initiatives that remove operational pain. My strengths include: • Designing scalable and reliable Kubernetes platforms • Cloud optimization (compute, storage, RDS) • Automation using Terraform, Python, and Bash • Improving monitoring, alerting, and SRE processes • Building CI/CD pipelines with GitHub Actions, GitLab CI, Jenkins & ArgoCD • Reducing toil and improving developer experience • Incident management, on-call ownership & RCA culture I love working on infrastructure that actually makes engineers’ lives easier. Always open to discussing SRE strategy, platform engineering, and modern cloud solutions. If you're exploring something exciting in this space, feel free to connect!

Experience

10 yrs 2 mos
Total Experience
2 yrs
Average Tenure
2 yrs 8 mos
Current Experience

Freshworks

Lead Site Reliability Engineer

Sep 2023Present · 2 yrs 8 mos · Bengaluru, Karnataka, India · Hybrid

  • Leading reliability, scalability, and platform initiatives across large-scale Kubernetes workloads.
  • Designed and executed EKS upgrade strategies, Karpenter migration, and cluster hardening.
  • Improved observability and alerting systems to reduce MTTR and improve signal quality.
  • Driving automation across infra workflows using Terraform, Python, and GitOps practices.
  • Collaborating with product, security, and engineering teams to ensure platform stability.
KubernetesAWSTerraformPythonGitOps

Nurture.farm

Senior Production Engineer

Apr 2022Sep 2023 · 1 yr 5 mos · Bengaluru, Karnataka, India

  • Reduced cloud infrastructure costs by 30% through compute optimization, database tuning, and improved provisioning.
  • Streamlined alerting/monitoring systems, reducing noise and improving issue detection.
  • Automated infra management using Terraform for more consistent and scalable deployments.
  • Improved RDS performance & reduced instance footprint through right-sizing.
  • Mentored junior engineers and participated in on-call + incident response rotations.
AWSTerraformPrometheus.ioCloud Optimization

Oye rickshaw: the first & last mile ride co.

Site Reliability Engineer

Sep 2020Apr 2022 · 1 yr 7 mos · New Delhi, Delhi, India

  • Led the implementation of a container-based microservices architecture using Docker and Kubernetes
  • Designed and implemented a Kubernetes cluster, using tools such as Kong, Nginx-Ingress, Fluentd, and Prometheus
  • Worked closely with development teams to implement automated CI/CD production/stage/dev pipelines using Jenkins, GitLab CI and ArgoCD, resulting in faster delivery of new features
  • Implemented logging, monitoring, and alerting using open-source tools such as Node Exporter, Promtail, Loki, Prometheus, and Grafana, which enables the team to quickly identify and resolve issues
  • Implemented automation using Jenkins, bash and python scripts resulting in 50% reduction in time spent on manual tasks
  • Proficient in managing databases such as Postgresql, MongoDB, and Amazon Redshift
  • Implemented a data warehouse using DMS and Redshift for the analytics team, to enable them to easily access and analyze large sets of data
  • Worked with IoT Applications such as Mosquitto MQTT
  • Led the migration of systems and infrastructure to Microsoft Azure, ensuring a smooth transition and minimal disruption to operations
DockerKubernetesJenkinsGitLab CIDevOps

Unify technologies

Sr. Devops Engineer

Sep 2019Aug 2020 · 11 mos · Gurgaon, India

  • Experience with CI/CD practices, pipelines, and workflows
  • Proficient in using automation software such as Jenkins, and Ansible
  • Experience in managing and supporting enterprise Logging, Alerting, and Monitoring technologies
  • Handled incident management and troubleshot complex application problems
  • Have a demonstrable ability to work effectively in a team-oriented environment, managing numerous priorities and projects simultaneously.
  • Languages worked on – Java, Python, Bash.
JenkinsAnsibleBashDevOps

Neostencil, inc.

2 roles

DevOps Engineer

Promoted

Mar 2019Aug 2019 · 5 mos

  • Maintain infrastructure using Amazon Web Services (EC2, RDS, Route53, IAM, SNS) and Google Cloud Platform (Compute Engine, Storage, VPC Network), and Microsoft Azure (Virtual Machines, Storage Accounts)
  • Utilize Nginx for load balancing, reverse proxy, and web server functions
  • Monitor and troubleshoot production systems to ensure optimal performance and availability
  • Adept in Linux Administration and Scripting (Bash, Python)
  • Familiar with CI/CD using GitLab
AWSGoogle Cloud Platform (GCP)Bash

Tech Support

Jan 2016Mar 2019 · 3 yrs 2 mos

  • Implemented and installed new system configurations at client sites
  • Performed regular maintenance on network infrastructure
  • Developed and uploaded various lecture units to course pages on website
  • Monitored classroom videos and resolved any technical issues that arose
  • Ensured complete data management, maintenance, and backups
  • Managed and supported on-field operations team
  • Installed and configured computer hardware, operating systems, and applications
  • Assisted management with scheduling, service protocols improvements, and quality assurance
  • Provided support, including procedural documentation and relevant reports
  • Troubleshoot system and network problems, diagnosing and solving hardware or software faults
ManagementTroubleshooting

Education

Veer Kunwar Singh University, Arrah

B.Sc. — Mathematics

Jan 2014Jan 2017

Govt. Sarvodaya Bal Vidyalaya, CBSE, Delhi

10+2 CBSE — Science

Stackforce found 100+ more professionals with Kubernetes & Aws

Explore similar profiles based on matching skills and experience