Nishant Garg

Software Engineer

United States9 yrs 10 mos experience
Highly Stable

Key Highlights

  • Led critical infrastructure development for Azure Storage.
  • Improved tracking ID generation time from 300 ms to 3–7 ms.
  • Designed automated airline refund systems enhancing operational efficiency.
Stackforce AI infers this person is a Backend-heavy Software Engineer specializing in Cloud Computing and Distributed Systems.

Contact

Skills

Core Skills

Cloud InfrastructureDistributed SystemsSoftware DevelopmentApi Development

Other Skills

API IntegrationAlgorithmsAspect-Oriented Programming (AOP)Azure DevOps ServicesAzure StorageCC#C++CSSCassandraCore JavaData ModelingData ProcessingData StructuresDatabase Selection

About

Senior Software Engineer with 9+ years of backend development expertise across Microsoft, Amazon, and MakeMyTrip. I specialize in designing and delivering large-scale distributed systems with a strong focus on reliability, scalability, and resilience. At Microsoft, I’ve led the development of critical infrastructure for Azure Storage, including self-reconfiguring services, service-aware reconfiguration, and production-scale virtualized test environments. Improving failover reliability, reduced deployment risks, and enabled elastic scaling across regional clusters. Previously at Amazon, I owned end-to-end delivery of scalable systems for global tracking ID generation and carrier configurations, significantly improving latency and operational efficiency across fulfillment pipelines.

Experience

9 yrs 10 mos
Total Experience
3 yrs
Average Tenure
7 mos
Current Experience

Meta

Software Engineer

Nov 2025Present · 7 mos · Bellevue, Washington, United States

  • Serverless Compute

Microsoft

2 roles

Senior Software Engineer (L62->64)

Promoted

Jun 2022Oct 2025 · 3 yrs 4 mos

  • Azure Storage
  • Led Azure Storage Impact Manager framework, introducing coordinated approval gates and safe workflows for high-risk operations (e.g., node reboots, cluster upgrades), significantly reducing unplanned outages and enhancing service reliability.
  • Led service-aware reconfiguration initiatives for Azure Storage using the Self-Reconfiguring Service model in Service Fabric, enabling autonomous lifecycle transitions, elastic scaling, and robust failover capabilities.
  • Developed and managed production-scale simulation clusters on Azure VMs for Azure Storage Testing, supporting safe, isolated
  • validation of new architectural patterns and deployment flows.
  • Engineered resilient solutions for account migration cancellation and rollback across distributed storage clusters.
  • Functional interviewer for SDE I & II, primary mentor for interns, SDE-1 and SDE-2s.
Azure StorageService FabricSelf-Reconfiguring ServicesProduction-scale TestingService-aware ReconfigurationCloud Infrastructure+1

SDE-2 (L62)

Dec 2020Jun 2022 · 1 yr 6 mos

  • Azure Storage
  • Worked on a globally distributed, large-scale data processing and orchestration system that provisions users and tenants in Microsoft Teams. The system reduced customer SLA from 24 hours to 15 minutes, with P99 latencies under 5 seconds, and processes over 5 million updates per hour in each of the three geo-regions where it is deployed.
  • As a Senior Software Engineer on the team, I contribute to feature development, long-term architecture design, and cross-team collaboration to ensure alignment with Microsoft Teams’ product vision. I actively mentor junior engineers and help drive the team toward shared goals. Responsibilities span the entire product lifecycle, including requirements analysis, development, deployment, DevOps, and maintenance — with a strong focus on scalability, fault-tolerance, and performance.
Microsoft TeamsData ProcessingOrchestration SystemDistributed SystemsCloud Infrastructure

Amazon

Software Developer-2 (L5)

Sep 2018Dec 2020 · 2 yrs 3 mos · Hyderabad, Telangana, India · On-site

  • Global Transportation System
  • Led the end-to-end design and implementation of Amazon’s Tracking ID system, including the Tracking-ID Configuration, Generation, Sequence Monitoring, and Validator Service. My responsibilities spanned high- and low-level design, carrier configuration analysis and migration, data modeling, database selection and setup, as well as the development of validation systems. I architected and delivered highly available, scalable solutions for tracking ID generation and configuration, integrated machine learning models for predictive sequence monitoring, and implemented automated, multi-channel stakeholder notifications. Rigorous testing including shadow and reverse shadow modes ensured reliability and minimized issues.
  • IMPACT: Reduced tracking ID generation time from 300 ms to 3–7 ms, cut carrier configuration update time from 7 days to 15 minutes, improved operational efficiency, proactively prevented sequence exhaustion, and significantly reduced the number of operational issues enocuntered.
  • Held end-to-end ownership and maintenance of a Tier-1 system essential to Amazon Retail operations, with direct impact on global shipment delivery and risk of outage.
  • Served as a functional interviewer for SDE I & II positions, primary mentor for interns and SDE-1s, and secondary mentor for SDE-2s.
Tracking ID SystemData ModelingDatabase SelectionDistributed SystemsCloud Infrastructure

Makemytrip.com

Software Developer

Jun 2016Aug 2018 · 2 yrs 2 mos · Gurugram, Haryana, India · On-site

  • Flights
  • Led the integration and automation of critical airlines and GDS APIs, including the full suite of TravelPort post-sale flight APIs (Retrieval, Document Access, Cancellation, Provider Split PNR, Refund, and Fee Details). Designed and delivered an automated refund system for leading airlines such as Indigo, GoAir, SpiceJet, Jet Airways, Air India, AirAsia, and FlyDubai, utilizing generic request translators and Java connectors for robust, scalable solutions. Pioneered India’s first automated special claim system for extraordinary events, leveraging Drools rules for dynamic validation and refund calculation. Developed full and partial PNR cancellation and retrieval processes with unified data handling across multiple airline APIs, significantly enhancing operational efficiency and reliability.
  • Technologies: Java 8, MyStric, MySQL, Spring Framework, Metrics, Kafka, Aspect-Oriented Programming, Drools, RxJav
API IntegrationJavaMySQLSoftware DevelopmentAPI Development

Education

National Institute of Technology Kurukshetra

Bachelor’s Degree — Information Technology

Jan 2012Jan 2016

S.D Public School

High School — Mathematics

Jan 1999Jan 2012

Stackforce found 100+ more professionals with Cloud Infrastructure & Distributed Systems

Explore similar profiles based on matching skills and experience