Gaurav G S

SRE (Site Reliability Engineer)

Bengaluru, Karnataka, India3 yrs 10 mos experience

Key Highlights

  • 4+ years in Site Reliability Engineering for fintech.
  • Expert in high-availability payment systems.
  • Proficient in AI infrastructure and cloud platforms.
Stackforce AI infers this person is a Fintech Infrastructure Specialist with strong expertise in Site Reliability Engineering and Cloud Automation.

Contact

Skills

Core Skills

Site Reliability EngineeringAutomationInfrastructure Migration

Other Skills

SQLDebuggingApache ZooKeeperGlusterFSTerraformAmazon Web Services (AWS)Amazon DynamoDBLarge Language Model Operations (LLMOps)Incident HandlingAIOpsGo (Programming Language)Distributed SystemsSoftware ObservabilityWorkflow orchestrationKubernetes

About

- Site Reliability and Platform Engineer with 4+ years of experience operating mission-critical fintech infrastructure across Linux, observability, CI/CD automation, incident response, production migrations, and compliance-sensitive environments. - At PhonePe and Mastercard, I have worked on high-availability payment and banking platforms involving Linux services, Nginx, ZooKeeper, MariaDB, Splunk/Grafana-based troubleshooting, infrastructure migrations, data-center validation, release automation, RCA, DR drills, and operational readiness. - I am currently focused on building AI infrastructure and cloud platform depth across LLM serving, Kubernetes, Terraform, observability, MCP runtime controls, AIOps remediation, and production-grade reliability automation. My project work includes AI-assisted RCA/remediation systems and distributed workflow orchestration with durable state, retries, leases, worker heartbeats, backpressure, audit trails, Prometheus/Grafana observability, Kubernetes/Kustomize, Argo CD, and GitHub Actions. - Target roles: AI Infrastructure Engineer, GenAI Platform Engineer, SRE, Platform Engineer, Cloud Infrastructure Engineer, LLMOps Engineer, and Production Engineering roles.

Experience

3 yrs 10 mos
Total Experience
2 yrs 8 mos
Average Tenure
1 yr 2 mos
Current Experience

Mastercard

Site Reliability Engineer 2

Apr 2025Present · 1 yr 2 mos · Pune District

  • Part of Franchise and Legal Solutions handling applications generating 77 million dollars annually.
  • Taking care of app deployments, enhancing automation using XLR and Jenkins, debugging app level issues for clients such as Standard Chattered Bank, Bank of Brazil, etc.
  • Lead PCI certification audits for Global Rule Investigation Program which is crucial for Site Data Protection(SDP) and compliant to Financial Organisations.
  • Handling oncalls, lead data migrations, recovered corrupted financial data and kept systems running smooth.
SQLDebuggingSite Reliability EngineeringAutomation

Phonepe

2 roles

Site Reliability Engineer 1

Promoted

Aug 2022Apr 2025 · 2 yrs 8 mos

  • Maintaining UPI infrastructure involving bank PSP infra involving 49% market share for online UPI transactions in India.
  • Migrated Yes Bank PSP container orchestrator infra from Mesos to Drove resulting in efficient resource utilisation seamlessly without downtime.
  • Migrated Axis and Yes Bank infrastructure from one DC to another.
  • Optimised the DB backup verification process through a script that parallelises processes, achieving a 33% reduction in execution time and automated using Salt stack (Infrastructure as Code).
  • Created scripts for infra components like Nginx and Zookeeper for granular metric observation and monitoring allowing transparency over transactional flow, quicker identification of issues and integrating with observation tools(Grafana).
  • Managed compliance audit resolutions based on RBI mandates, conducted feature testing across UPI infrastructure components, actively participated in DR(Disaster Recovery) Drills, on calls (24x7).
SQLApache ZooKeeperSite Reliability EngineeringInfrastructure Migration

Site Reliability Engineer

Mar 2022Jul 2022 · 4 mos

Apache ZooKeeperGlusterFS

Google india

Google Cloud CR Program

Dec 2020Mar 2021 · 3 mos

  • Successfully completed Architecting with Google Compute Engine Specialization from Google Cloud EDU.
  • Attained GCP Qwiklabs badges on Cloud Engineering, Architecture, Data Science & ML.
  • View all of my 31 badges at:
  • https://google.qwiklabs.com/public_profiles/781d58c2-edd4-43e6-9eb1-7a8053343dfe

Yebilo

Digital Marketing Intern

May 2020Aug 2020 · 3 mos

  • Worked as a Team Lead for a group of 5 members of Digital Marketing Interns.
  • Marketing Analysis on Social Media sites, Marketing Strategies, SEO, SEM, metadata, graphic designing were all executed during the tenure.
  • Received a Certificate of Appreciation.

Indian institute of technology, roorkee

Machine Learning Summer Intern

Mar 2020May 2020 · 2 mos

  • Part of IIT-R Cognizance 2020 Internship Fest.
  • Successfully completed two projects on ML models, DS visualizations, NLP & Sentiment Analysis.
  • Team Lead of major project.

Education

CMR Institute Of Technology

BE - Bachelor of Engineering — Information Science & Engineering

Jan 2018Jan 2022

Stackforce found 100+ more professionals with Site Reliability Engineering & Automation

Explore similar profiles based on matching skills and experience