Suyash Singhai

SRE (Site Reliability Engineer)

Mumbai, Maharashtra, India10 yrs experience

Key Highlights

  • Led successful migration of Kubernetes workloads.
  • Achieved significant cost savings through automation.
  • Enhanced system performance and reliability across projects.
Stackforce AI infers this person is a Site Reliability Engineer with expertise in cloud infrastructure and automation in SaaS environments.

Contact

Skills

Core Skills

Site Reliability EngineeringKubernetesObservabilityCloud MigrationInfrastructure AutomationFull Stack Development

Other Skills

AWSAWS EKSAerospikeAmazon EKSAmazon Relational Database Service (RDS)Amazon Web Services (AWS)Apache AirflowApache KafkaArgoCDAtlantisBackbone.jsBootstrapCC++CI/CD

About

Passionate and results-driven Site Reliability Engineer with extensive experience in automating and optimizing complex infrastructure systems. Proficient in Kubernetes, Terraform, CI/CD pipelines, and cloud platforms including GCP and AWS. Adept at migrating large-scale workloads, enhancing system performance, and driving cost efficiencies. My journey in tech began with a strong foundation in full stack web development, where I honed my skills in creating robust, scalable applications. Transitioning into site reliability engineering, I have led critical infrastructure projects, automated deployment processes, and implemented cutting-edge monitoring and logging solutions. Key Highlights: Successfully migrated Kubernetes workloads and MSSQL/Redis clusters, achieving significant cost savings. Automated multi-region infrastructure setup using Terraform and ArgoCD, ensuring seamless replication and management. Developed and deployed secure and scalable web applications, leveraging modern tech stacks and CI/CD pipelines. Played a crucial role in hiring and mentoring new talent, fostering a collaborative and high-performance team environment. Committed to continuous learning and staying at the forefront of technology, I am always eager to tackle new challenges and contribute to impactful projects. Let’s connect and explore opportunities to innovate and drive success together.

Experience

10 yrs
Total Experience
2 yrs 9 mos
Average Tenure
1 yr 9 mos
Current Experience

Pocket fm

Senior Site Reliability Engineer

Sep 2024Present · 1 yr 9 mos · Bengaluru, Karnataka, India

  • Leading the Resiliency & Security SRE pod at Pocket FM, enhancing reliability and security across production systems.
  • Re-architected the observability stack, decreasing alert latency from 12 minutes to under 2 minutes.
  • Spearheaded the migration from EC2 to Kubernetes, implementing multi-environment GitOps deployment pipelines.
  • Improved application uptime from 98.5% to 99.95% through proactive reliability engineering initiatives.
KubernetesTerraformGitOpsObservabilitySecuritySite Reliability Engineering

Media.net

Site Reliability Engineer

Jan 2023Aug 2024 · 1 yr 7 mos · Mumbai, Maharashtra, India · On-site

  • Led the operation of Kubernetes workloads in GKE, automating infrastructure with Terraform and ArgoCD. Optimized CI/CD pipelines with Jenkins and Atlantis, and migrated MSSQL and Redis clusters to GCP, achieving significant cost savings and enhanced system reliability.
KubernetesTerraformArgoCDCI/CDGCPSite Reliability Engineering

Confidential

2 roles

SRE Consultant

Dec 2021Dec 2022 · 1 yr · United States · Remote

  • Migrated workloads from Azure Kubernetes to AWS EKS, enhancing performance and reliability
  • with a multi-region setup. Led the migration of MongoDB and Memcached to MongoDB Atlas,
  • reducing costs and improving efficiency.
  • Implemented ArgoCD and AWS CodeBuild/CodePipeline for robust CI/CD pipelines, and
  • configured Prometheus and Grafana for real-time monitoring. Deployed RabbitMQ for efficient
  • messaging and optimized AWS EKS for scalability.
  • Acted as a critical part of a multi-team effort to deliver, manage, and maintain configuration
  • automation to meet business needs, and created and maintained configuration standards for
  • software and infrastructure.
AWS EKSMongoDBArgoCDPrometheusGrafanaSite Reliability Engineering+1

DevOps Consultant

Jun 2020Nov 2021 · 1 yr 5 mos · United States · Remote

  • Consulted on system reliability and scalability, migrating workloads from Azure Kubernetes to
  • AWS EKS, enhancing performance and efficiency in a multi-regional setup. Automated
  • infrastructure setup and management with Terraform and ArgoCD across multiple clusters.
  • Implemented cost-saving solutions by moving Elasticsearch to Kubernetes on spot instances,
  • reducing costs by 80%, and established CI/CD pipelines with ArgoCD and AWS
  • CodeBuild/CodePipeline. Configured Prometheus and Grafana for real-time monitoring.
  • Set up mixed-node MongoDB clusters on Kubernetes with failover mechanisms, ensuring primary
  • nodes on-demand and secondary nodes on spot instances with PVCs.
  • Recommended, developed, and implemented system enhancements that improved the performance
  • and reliability of the system, including installing, upgrading/patching, monitoring, problem
  • resolution, configuration management, and security.
TerraformArgoCDPrometheusGrafanaMongoDBSite Reliability Engineering+1

Iconia studios

Full Stack Web Developer

Mar 2014Jun 2018 · 4 yrs 3 mos · Bhopal, Madhya Pradesh · On-site

  • Developed MERN stack web applications with responsive design and strong security measures. Deployed and managed applications on AWS, using Jenkins for CI/CD pipelines. Set up and maintained infrastructure for WordPress-based websites, and managed DNS through Cloudflare. Demonstrated expertise in installing, operating, and troubleshooting a variety of open-source technologies.
MERN StackAWSJenkinsWordPressFull Stack Development

Education

Indraprastha Institute of Information Technology, Delhi

Master of Technology - MTech — Computer Science

Jan 2020Jan 2022

AKS University Satna

Master of Computer Applications - MCA — Computer Science

Jan 2017Jan 2019

Sikkim Manipal University - Distance Education

BCA — Computer Science

Jan 2014Jan 2017

Stackforce found 100+ more professionals with Site Reliability Engineering & Kubernetes

Explore similar profiles based on matching skills and experience