Ishu Gupta

DevOps Engineer

Mumbai, Maharashtra, India · 3 yrs experience
Highly Stable

Key Highlights

  • Expert in scaling high-availability production systems.
  • Proficient in GCP, Terraform, and Kubernetes.
  • Strong focus on observability and automation in SRE.
Stackforce AI infers this person is a Cloud Infrastructure Engineer with a strong focus on Site Reliability Engineering.

Skills

Core Skills

Site Reliability Engineering (SRE), DevOps, Cloud Infrastructure, Software Development

Other Skills

Terraform, Google Cloud Platform (GCP), Jenkins, Grafana, ELK, InfluxDB, Kubernetes, Linux, Ansible, ASP.NET, ReactJS, MySQL, Jest, Operating Systems, Technical Skillset

About

DevOps / Site Reliability Engineer with ~3 years of experience building and operating high-availability, low-latency production systems at scale. Currently working at Jio Platforms, supporting large-scale SSAI platforms with millions of concurrent users.

I specialize in production reliability, observability, and automation: defining SLIs/SLOs, reducing alert fatigue, improving MTTR, and eliminating toil using Infrastructure as Code (Terraform) and Python automation.

I've worked extensively on GCP-based architectures, predictive autoscaling, Kubernetes, CI/CD pipelines, and cost optimization, ensuring systems remain stable during extreme traffic spikes (e.g., IPL events).

🎓 IIT (BHU) Varanasi graduate
🔧 Interests: SRE, Production Engineering, Distributed Systems, Cloud Infrastructure
📩 Open to DevOps / SRE opportunities

Experience

Total Experience: 3 yrs
Average Tenure: 3 yrs
Current Experience: 3 yrs

Jio Platforms Limited (JPL)

2 roles

DevOps / Site Reliability Engineer – Jio Platforms

May 2024 – Present · 2 yrs 1 mo · On-site

  • Scaled high-availability production systems to support 5M+ concurrent sessions, maintaining 99.99% uptime during peak traffic events.
  • Engineered predictive autoscaling for GCP Managed Instance Groups using custom metrics to eliminate scaling lag.
  • Revitalized observability stacks (Grafana, ELK, InfluxDB), reducing alert fatigue by 40% through user-centric SLIs/SLOs.
  • Implemented Canary deployments and automated rollbacks via Jenkins to safeguard error budgets.
  • Architected multi-region DR using modularized Terraform (IaC) for 300+ servers.
  • Optimized latency-critical pipelines (<50ms budget) through TCP tuning and connection pooling.
  • Drove a 30% cloud cost reduction via capacity planning and preemptible instance strategies.
  • Hardened data pipelines using Kafka and KeyDB, maintaining a 99% cache hit ratio.
Terraform, Google Cloud Platform (GCP), Jenkins, Grafana, ELK, InfluxDB, +3

Graduate Engineering Trainee

Jun 2023 – May 2024 · 11 mos · On-site

  • Gained hands-on experience with Linux administration and cloud fundamentals using AWS and GCP, focusing on server setup, networking, and monitoring.
  • Learned to automate infrastructure using Terraform and Ansible, creating basic deployment scripts and managing configurations in cloud environments.
Linux, Terraform, Ansible, Cloud Infrastructure

Amazon

Amazon ML Summer School

Jul 2022 – Jul 2022 · 0 mo

  • Selected for the program to learn ML topics and interact with Amazon scientists.
  • Studied key ML topics, including Supervised Learning, Deep Neural Networks, Dimensionality Reduction, Unsupervised Learning, Probabilistic Graphical Models, and Sequential Learning.

Kritikal Solutions

Software Engineer Intern

May 2022 – Jul 2022 · 2 mos · Remote

  • Developed a web application for the staff portal using MVC architecture with ASP.NET, ReactJS, and MySQL.
  • Implemented CRUD functionalities including search, create, view, edit, and delete operations to manage employee data.
  • Implemented unit and integration tests for the ReactJS application using Jest and React Testing Library.
ASP.NET, ReactJS, MySQL, Jest, Software Development

Education

Indian Institute of Technology (Banaras Hindu University), Varanasi

Bachelor of Technology (BTech), Electronics Engineering

Jul 2019 – May 2023

Jawahar Navodaya Vidyalaya - JNV

Bundi, Rajasthan

Jul 2017 – Apr 2019

Jawahar Navodaya Vidyalaya - JNV

Morena, Madhya Pradesh

Jun 2012 – Jun 2017
