Sonu Giri

Director of Engineering

Bengaluru, Karnataka, India13 yrs 8 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Built a team from scratch with zero attrition.
  • Saved PayPal millions by replacing vendor software.
  • Designed innovative solutions for high availability.
Stackforce AI infers this person is a Fintech Infrastructure Engineer with strong DevOps and software development expertise.

Contact

Skills

Core Skills

DevopsSoftware Development

Other Skills

C++GolangPythonKubernetesDockerGitTerraformGrafanaDatadogServiceNowLogicMonitorCloud SQLWebAssemblySlackInfluxDb

About

Over 12 years of experience in Software Design and Development. Currently serving as a Software Engineering Manager at PayPal's Infra Software Services Team, with expertise in mentoring engineers, driving innovation, and aligning technology strategies with business objectives. Managing a team of 11 FTE and 8 contingent workers. Built team from scratch with zero attrition over last 5+ years. Few Projects: * Network Observability and Monitoring: Architected and Developed an experience-focused network connectivity and latency monitoring tool featuring capabilities such as ping mesh, HTTP/TCP endpoint monitoring, ping monitoring, and LDAP monitoring. This solution successfully helped PayPal save millions of dollars in licensing costs by replacing vendor software. * Reduced MTTD by integrating Datadog, LogicMonitor, and ServiceNow with LLM via MCP and Slack, streamlining issue detection and diagnosis for Network Operations engineers * Enhanced infrastructure resilience by deploying high availability and disaster recovery through hybrid-cloud Kubernetes clusters, migrating to DBA-managed Cloud SQL, and enabling an active-active service model. * Designed the Compute at Edge Platform to optimize app deployment on Fastly infrastructure, automating builds, packaging services as WebAssembly binaries, enabling controlled rollouts, instant rollbacks, and improving reliability, downtime, and audit traceability. * Built CDNs error breach observability pipelines with autonomous 5xx anomaly detection and Slack alerting, eliminating manual dashboard monitoring and accelerating issue detection. * Proactively drives incident response for infrastructure software services by implementing alerting pipelines, enabling faster detection and resolution. * Architected solution to move from static storage allocation to dynamic allocation with built in self-service saving PayPal $400k annually in storage cost. Skills: Programming Language: C++, Golang, Python Database: MySQL, Redis, InfluxDb Operating System: Linux DevOps: Kubernetes, Docker, ArgoCD, GitOps, Jenkins, Terraform, Git, Grafana, Ansible Public Cloud: GCP, Azure, AWS.

Experience

13 yrs 8 mos
Total Experience
4 yrs 6 mos
Average Tenure
5 yrs 7 mos
Current Experience

Paypal

3 roles

Software Engineering Manager - 2

Promoted

Jan 2023Present · 3 yrs 4 mos

  • Established and scaled a team from the ground up, growing from 0 to 12 members with zero attrition over a 5-year period. Provided mentorship to interns, junior engineers, and senior engineers, fostering growth and development across all levels.
  • Utilized MCP and LLM to significantly reduce MTTD: Led the team in integrating tools such as Datadog, LogicMonitor, and ServiceNow with LLM through MCP and a Slack interface, enhancing the efficiency and convenience for Network Operations engineers in detecting and diagnosing issues or failures.
  • High Availability and Disaster Readiness for Infra: Implemented high availability and disaster recovery (DR) capabilities within the infrastructure by deploying multiple Kubernetes instances across a hybrid-cloud setup, transitioning from self-managed to DBA-managed Cloud SQL, and enabling an active-active service model by relocating key components outside the cluster.
  • Compute @ Edge Deployment Platform: Designed and architected the CEP Platform to streamline application deployment on Fastly's infrastructure by automating builds, packaging services as WebAssembly binaries, and enabling controlled rollouts with instant rollbacks. It improved reliability, minimized downtime, and enhanced deployment traceability for troubleshooting and audits.
  • Built CDNs error breach observability pipelines with autonomous 5xx anomaly detection and Slack alerting, eliminating manual dashboard monitoring and accelerating issue detection.
  • Grafana and Influx Migration to Datadog: I spearheaded the creation of dashboards for all services migrated to Datadog, leveraging its advanced alerting and analytics capabilities to simplify management and enhance operational efficiency.
  • Proactively drives incident response for infrastructure software services by implementing alerting pipelines, enabling faster detection and resolution.
C++GolangPythonKubernetesDockerGit+6

Member Of Technical Staff 2

Promoted

Apr 2022Jan 2023 · 9 mos

  • Summary: Leading multiple projects at PayPal Infra Software team, providing mentoring and guidance, hiring and growing team, handling scrum meetings, cross collaboration with other teams, providing design and architecture expertise.
  • Worked on below Projects:
  • Vendor Maintenance Tracking: Designed and architected a Vendor Maintenance Service to automate the processing of vendor emails, extracting maintenance window details and seamlessly integrating them into the alerting pipeline. This solution eliminated the need for the NetOps team to manually track hundreds of emails from numerous vendors on a regular basis.
  • Vulnerability Management: An automated vulnerability patching application designed to efficiently address security vulnerabilities across computing resources, significantly reducing the time and effort required for manual intervention.
GolangPythonTerraformKubernetesSoftware Development

Member Of Technical Staff 1

Sep 2020Mar 2022 · 1 yr 6 mos

  • Worked on following projects/features from end to end.
  • Built Network Health Monitoring software to monitor network connectivity and latency across devices in Data Center. This solution is similar to ThousandEyes, pingmesh used at Microsoft, Argos used at LinkedIn.
  • Software to monitor Endpoints (services hosted inside/outside datacenter). Similar to ThousandEyes ,kentik endpoint monitoring solution.
  • Setup CD pipeline using argo on kubernetes.
  • API Security: Secure the APIs exposed by services using mTLS.
  • A Framework to automate UI/backend code generation to enhance software development cycle.
  • Tech Used: Golang, Python, InfluxDb, Redis, Grafana, Kubernetes, docker.
GolangPythonInfluxDbRedisGrafanaKubernetes+2

School of ai,mlblr

ML Intern

May 2019Jun 2019 · 1 mo · Bangalore

  • Involved understanding foundations of Deep Learning, Convolution Neural Networks along with hands on assignment.
  • https://github.com/sonugiri1043/machine_learning

Arista networks, inc.

Software Engineer

Jul 2013Sep 2020 · 7 yrs 2 mos · Bengaluru Area, India

  • Worked on several broad areas of Arista's EOS, which is the Operating System which runs on Arista switches.
  • Developed Terraform provider for CloudEOS ( OS which runs on VM ).
  • Feature Integration on VM (vEOS) ( Jan 2017 – July 2017 )
  • Integrated physical switch features on VM( vEOS ). Some of the use cases of vEOS are
  • lightweight branch office router, cloud edge router, NFV.
  • Path Tracer for OAM of Switches ( Aug 2016 – Jan 2017 )
  • Implemented a Path Tracer for Operation and Management ( OAM ). OAM functions are important
  • for fault management and performance monitoring. https://tools.ietf.org/html/draft-pang-nvo3-
  • vxlan-path-detection-00
  • Support for MAC Auth Bypass (MAB) ( Feb 2016 – Jul 2016 )
  • There are a lot of devices that do not support 802.1X authentication, and yet need to be used
  • in a protected network environment. Eg. network printers, wireless phones etc. With MAC
  • Authentication Bypass (MAB), the switch tries to authenticate with the AAA server on behalf of
  • the connected device.
  • License Management ( Nov 2014 – Jun 2016 )
  • Helped with designing and implementation of license management system to replace the earlier
  • Honour system with no license enforcement.
  • High Availability for License Management ( Oct 2015 – Jan 2016 )
  • Designed mechanisms to ensure high-availability for the license management system. Used
  • clustering for HA, rsync and inotify for synchronisation of state.
  • Audio Video Bridging(AVB) ( Jul 2013 – Nov 2014 )
  • Implemented an AVB endpoint simulation based on Open-AVB.
  • https://github.com/AVnu/OpenAvnu

Aston university

Research Internship and B.Tech Project

May 2012Apr 2013 · 11 mos · Birmingham, United Kingdom

  • Constructed open-source tool to generate 2D Natural and Urban Landscapes for testing ecological theories. It involved using machine learning techniques to learn from real data and use it to generate artificial landscape.

Education

Indian Institute of Technology, Ropar

B.Tech — Computer Science

Jan 2009Jan 2013

Army School Kota

Senior Secondary +2 — PCM

Jan 2006Jan 2008

Army School Roorkee Cantt

High School

Jan 2001Jan 2006

Stackforce found 100+ more professionals with Devops & Software Development

Explore similar profiles based on matching skills and experience