Saurav S.

Software Engineer

Bengaluru, Karnataka, India8 yrs 8 mos experience
Most Likely To Switch

Key Highlights

  • Over seven years in Software and Site Reliability Engineering.
  • Expert in building scalable and reliable infrastructure.
  • Proven track record in cost reduction and performance optimization.
Stackforce AI infers this person is a SaaS and Cloud Infrastructure expert with a strong focus on Site Reliability Engineering.

Contact

Skills

Core Skills

Distributed SystemsSite Reliability EngineeringNetworkingCloud Computing

Other Skills

AWSAlgorithmsAmazon Web Services (AWS)Android DevelopmentApache SparkCC (Programming Language)Chaos EngineeringCost ManagementData Structures And AlgorithmsDatadogDevOpsDisaster RecoveryDockerDomain Name System (DNS)

About

I am a Staff Software Engineer, where I work with a team of engineers to design, build, and maintain scalable, reliable, and secure infrastructure and services. I have over seven years of experience in Software and Site Reliability Engineering. My core competencies include Software Development, Linux, Python, GoLang, Kubernetes, Distributed Systems and Cloud Platform .I am passionate about solving complex problems, learning new technologies, and contributing to the mission of Twilio to fuel the future of communications by enabling developers and businesses to build powerful and innovative communication solutions.

Experience

8 yrs 8 mos
Total Experience
1 yr 5 mos
Average Tenure
2 yrs 2 mos
Current Experience

Twilio

Staff Software Engineer

Apr 2024Present · 2 yrs 2 mos · India · Remote

Amazon Web Services (AWS)Distributed SystemsTerraformKubernetesGo (Programming Language)Team Leadership+2

Media.net

Senior Engineer

May 2023Apr 2024 · 11 mos · Bengaluru, Karnataka, India · Hybrid

  • Implemented Chaos Engineering using Chaos Mesh , Identified the issues with the circuit breaker Java library which potentially can cause revenue loss.
  • Worked on identifying/fixing scaling issues of in-house Prometheus setup.
  • Migrated the proxies from nginx to envoy, with zero downtime for infrastructure serving peak traffic of 640K rps, re-wrote the lua scripts for processing request headers with minimal latency.
  • Live migration of redis from self-managed redis to Google MemoryStore .
  • Helped the development team in identifying the Kubernetes features they can use for full filing the Business Needs and onboarded few services with complete monitoring.
  • Reduced oncall toil , started actively tracking the issues and bringing the culture for Handover and Discussing oncall issues every day during standup and remediation measures.
  • Worked on cross cluster communication via cilium cluster mesh feature, to avoid GLB Cost and Data Egress Cost.
  • Mentored Intern and drove the implementation of Atlantis.
PythonDistributed SystemsChaos EngineeringKubernetesGoogle Cloud Platform (GCP)Site Reliability Engineering

Coinbase

Software Engineer

Apr 2022Feb 2023 · 10 mos · Remote

  • Implemented and enhanced Service Level Objectives (SLOs) to optimize critical endpoints, ensuring the achievement of SLOs and SLA targets.
  • Spearheaded AWS Infrastructure Cost Reduction efforts, successfully slashing annual costs by $850K.
  • Led production readiness as the Directly Responsible Individual (DRI) for several critical XXM $ products.
  • Introduced End-to-End (E2E) testing for key user workflows, elevating the overall customer experience.
  • Orchestrated and crafted a comprehensive Kubernetes Migration Plan
RubyDistributed SystemsDatadogSite Reliability EngineeringDockerGo (Programming Language)

Linkedin

SRE

Sep 2021Apr 2022 · 7 mos · Bengaluru, Karnataka, India

  • Working on Data Science and Artificial Intelligence Workbench .
HadoopApache SparkDistributed SystemsGrafanaPython (Programming Language)Kubernetes+1

Oracle

Software Engineer II , Virtual Networking OCI

Feb 2020Sep 2021 · 1 yr 7 mos · Bengaluru, Karnataka, India

  • I was part of Virtual Cloud Network Team, And handled responsibilities from Control Plane, Data Plane and IP Management Team. I was also part of organisational critical project like Automated Région Build and Launching of New Regions , Improving Canary and Building Automated Deployment Pipeline for deploying the services across all the host and all the DCs. I was also part of oncall roster for handling any production outages and bugs which impacted user.
Distributed SystemsTerraformGrafanaIP managementPython (Programming Language)Java+2

Amazon web services (aws)

Cloud Network Engineer

Jul 2017Feb 2020 · 2 yrs 7 mos · Bengaluru Area, India

  • Helped the companies with troubleshooting Network and Operating syatem related technical issues
  • Designing Cloud Network Solutions, troubleshooting complex networking issues providing unique costumer solutions.
  • Developed AWS solutions with Load Balancers, VPN, Direct Connect, VPC, Route 53, CloudWatch, AutoScaling, CloudFront, Lambda, S3 and WAF firewall to meet companies cloud architectural requirements and perform system administration
  • Trained a team of 30 engineers on Networking AWS Services and selected as primary lead to handle escalations
  • Conducted network performance analysis with standard networking tools.
  • Developed Tools to ease investigation and troubleshooting, Also collaborated with AWS Config Team to develop config rules.
  • Assisted TAM and SA by developing tools for security audit for India Top 100 AWS customer.
Amazon Web Services (AWS)Internet Protocol Suite (TCP/IP)Distributed SystemsTcpdumpLinuxProgramming+4

Education

Cochin University of Science and Technology

Bachelor of Technology (B.Tech.) — Computer Science

Stackforce found 100+ more professionals with Distributed Systems & Site Reliability Engineering

Explore similar profiles based on matching skills and experience