Amith Reddy

Engineering Manager

Bengaluru, Karnataka, India · 10 yrs 6 mos experience

Key Highlights

  • Proven track record in building scalable consumer products.
  • Expert in leading cross-functional engineering teams.
  • Strong focus on developer experience and operational excellence.
Stackforce AI infers this person is a SaaS and AI/ML expert with a strong focus on Site Reliability Engineering.

Skills

Core Skills

Cloud Computing · Distributed Systems · Site Reliability Engineering · Data Engineering · Software Development

Other Skills

AJAX · AWS · Amazon Web Services (AWS) · Ansible · Apache Kafka · Asynchronous Components · C · CI/CD · CSS · Cascading Style Sheets (CSS) · Cassandra · Cloud Database Lifecycle Management · Cloud-Native Architecture · Communication · Configuration Management

About

Engineering leader with experience building consumer products and running mission-critical systems at scale. I focus on bringing people together by leading with empathy, setting clear expectations, and giving honest feedback. I’ve worked through all stages of growth (0 → 1 → N) with teams from different backgrounds, and know the full software lifecycle—from design and development to deployment and operations. I enjoy building strong teams, mentoring, designing reliable systems, and improving developer experience. Lately, I’m interested in how Generative AI can be applied to create useful, real-world products.

Experience

Booking Holdings (NASDAQ: BKNG)

Engineering Manager, Site Reliability

Dec 2025 - Present · 3 mos · Bengaluru, Karnataka, India · Hybrid

Booking.com

Engineering Manager, Site Reliability

Jul 2024 - Nov 2025 · 1 yr 4 mos · Amsterdam Area

  • Responsible for the Kafka and Flink platform at Booking.com

Glovo

3 roles

Engineering Manager, Storage, Platform

Promoted

Aug 2022 - Jul 2024 · 1 yr 11 mos

  • Led the data storage and streaming platform team of 6 engineers, drove two promotions, represented Glovo at tech conferences, and led the company-wide data contracts OKR for OLTP sources
  • Led the team in building a self-service cloud database lifecycle management platform using a Kubernetes operator, enabling product teams to own their databases. Owing to its success, it was eventually adopted by the central platform team at Delivery Hero (parent org)
  • Led the team through the migration of all streaming workloads from Kinesis to Confluent Cloud, improving end-to-end event latency from seconds to milliseconds and completing the 2-year initiative all the way through the long tail
Data Storage · Streaming Platform · Cloud Database Lifecycle Management · Kubernetes · Kafka · Confluent Cloud · +2

Software Engineer IV - Site Reliability

Apr 2022 - Jul 2022 · 3 mos

  • Served as Global Incident Commander for Sev 0/Sev 1 incidents; led post-incident reviews, service health reviews, and production readiness scores to reduce MTTR by ~20%.

Software Engineer III - Site Reliability

Mar 2021 - Mar 2022 · 1 yr

  • Championed SRE culture across the org by setting up weekly operational reviews, RFC reviews, and production readiness scores, and by supporting product teams along their journey from monolith to microservices. Responsible for maintaining core infra libraries
  • Led a task force to stabilise the critical monolith MySQL database, enabling the business to expand its user base by 30% during a high-growth phase. Key contributor to modernising the entire infrastructure on AWS (from EC2 to EKS, Kinesis to Kafka).

Rapido - India's largest bike taxi

Senior Site Reliability Engineer

Jun 2019 - Oct 2020 · 1 yr 4 mos

  • Core member of the team that built a platform on GCP with hundreds of microservices for a distributed system serving ~1M orders per day, moving from almost no automation to a nearly fully automated system within a year
  • Re-architected the dispatch system to meet business needs for efficient and controllable order matching, with asynchronous components, fault resilience, and seamless integration of ML components.
  • Designed and implemented a secure, private, cloud-native custom network for the entire infrastructure, with custom subnets per workload. Key contributions to setting up platform components including Kubernetes, CI/CD, Istio, infra automation and observability, and self-hosted databases
  • Helped migrate ~100 production services from manual deployment to fully automated deployment across cloud providers with minimal downtime
  • Implemented SDKs for location cache, logging, and developer tools for using the platform.
Infrastructure Modernization · AWS · MySQL · Microservices · Cloud Computing · Site Reliability Engineering

Gamooga

Senior Software Engineer

Jan 2019 - May 2019 · 4 mos · Hyderabad, Telangana, India

  • Contributed to building a custom OLAP engine over LevelDB that records over 1 billion events per day. Developed an algorithm for event flow analytics queries (to identify the different paths users take in a consumer application) on top of this engine
  • Tech Stack: Java, LevelDB, Apache Kafka
  • Instrumented a real-time event processing system (serving up to 12K RPS) to expose metrics efficiently, automated central data store backups, and improved overall monitoring and observability of the system.
  • Tech Stack: Node.js, Prometheus, Aerospike, Greenplum, Apache Kafka
Microservices · GCP · Kubernetes · CI/CD · Cloud Computing · Site Reliability Engineering

Nicheai Private Limited

Software Engineer

Apr 2018 - Nov 2018 · 7 mos · Hyderabad, Telangana, India

  • Designed and developed a deployment product that sets up an optimized serving stack for a deep learning/machine learning model pipeline, providing observability out of the box
  • Tech Stack: Python, Docker, Flask, TensorFlow Serving, gRPC
OLAP Engine · Real-time Event Processing · Monitoring · Data Engineering · Site Reliability Engineering

Nowfloats

Software Engineer

Oct 2016 - Mar 2018 · 1 yr 5 mos · Hyderabad Area, India

  • 1. Core member of the design team for a hybrid human agent/bot chat platform (https://ana.chat) and major contributor to the bot service, which served peak traffic of up to 90 RPS per replica
  • Tech Stack: Python, Flask, Redis, SQS, Apache Thrift, SQL, MongoDB
  • 2. Designed and developed, from scratch, a distributed system of microservices that reads and writes text and image data across multiple social/discovery platforms. This system processes around 100K events per day.
  • Tech Stack: Node.js, Python, SQS
  • 3. Implemented continuous deployment pipelines, containerized a system of 20 microservices, and set up monitoring and centralized logging
  • Tech Stack: Shell Script, Python, Grafana, ELK, Nginx, AWS Services
Deployment Product · Deep Learning · Machine Learning · Software Development

Ekincare

Full Stack Web Developer

Oct 2015 - Sep 2016 · 11 mos · Greater Hyderabad Area

  • Built data integration pipelines for consumer health data from partner organisations, ensuring HIPAA compliance
  • Responsible for the monolith backend of the app, which serves end consumers, enterprises, and customer support agents
  • Tech Stack: Ruby on Rails, SQL, Redis, Sidekiq, AWS
Hybrid Chat Platform · Microservices · Continuous Deployment · Software Development

Freelancer

Web Developer

Jun 2014 - Oct 2015 · 1 yr 4 mos · Greater Hyderabad Area

  • 1. Worked on a bookmarking and collaboration app from scratch
  • Tech Stack: Node.js, ReactJs
  • 2. Worked with a publishing enterprise on a couple of products (successstory.com and template.net) to develop the front end components
  • Tech Stack: HTML, JavaScript, CSS
Data Integration Pipelines · HIPAA Compliance · Software Development

Education

Indian Institute of Technology, Madras

Dual Degree

Jan 2009 - Jan 2014
