Hemanth kumar Mangalore

CEO

Bengaluru, Karnataka, India · 21 yrs 11 mos experience
Most Likely To Switch · Highly Stable

Key Highlights

  • Achieved 99.99% uptime across platforms.
  • Led teams to reduce incidents by 80%.
  • Mentored next-generation reliability leaders.
Stackforce AI infers this person is a Fintech Infrastructure and Reliability Engineering expert.

Skills

Core Skills

Site Reliability Engineering · Platform Infrastructure · Crisis Management · Cloud Computing · Operations Management · Systems Engineering · Network Engineering

Other Skills

SRE · Observability · Enterprise IT · Regulatory Compliance · Reliability Engineering · Incident Resolution · Incident Management · Platform Transformation · Team Building · Cloud Strategy · Performance Engineering · Automation · Cloud Infrastructure · Change Management · Azure Services

About

Technology leader with 20+ years of experience leading SRE, platform infrastructure, observability, and enterprise IT across global financial and technology companies including Yahoo, Freecharge, Intuit, Microsoft, and Walmart. My expertise spans building, scaling, and operating large-scale distributed systems on AWS and Azure, ensuring high availability, resilience, security, and business-centric reliability for mission-critical platforms. I thrive at the intersection of technology, operations, and business, translating complex engineering challenges into measurable business outcomes.

Key Highlights & Achievements:

  • Reliability & Resilience: Architected and led platforms achieving 99.99% uptime, built geo-distributed, self-healing infrastructures, and drove initiatives that reduced incidents by 80%.
  • Crisis Leadership: Calm and decisive under market-impacting incidents; trusted by boards, regulators, and auditors for transparent, rapid incident resolution and root cause analysis.
  • Strategic Platform Leadership: Led enterprise-wide IT transformations, integrating observability, automation, AI/ML-driven monitoring, and cloud modernization for scalable operations.
  • Team & Culture Building: Hired, mentored, and scaled high-performing global SRE and DevOps teams, fostering ownership, innovation, and a reliability-first culture.
  • Regulatory & Board Engagement: Influenced regulatory frameworks and policies on technical outages, glitches, and systemic risk; aligned platform reliability with compliance and governance objectives.
  • Business-Centric Mindset: Operationalized engineering excellence into tangible business outcomes, ensuring uptime, speed, and customer trust directly contribute to organizational success.

I am passionate about bridging engineering, operations, and business strategy, mentoring the next generation of leaders, and building systems that are resilient, scalable, and trusted by customers and regulators alike.

Experience

Total Experience: 21 yrs 11 mos
Average Tenure: 2 yrs 8 mos
Current Experience: 4 yrs 7 mos

Angel one

2 roles

Senior Vice President - Platform, SRE and Cloud

Promoted

Apr 2023 – Present · 3 yrs 1 mo

  • Lead a cross-functional portfolio covering SRE, Observability, Platform Infra, Operations, and Enterprise IT, ensuring resiliency and uptime for mission-critical financial markets infrastructure.
  • Built and institutionalized a Business-Centric Reliability framework, linking platform reliability directly to customer trust, regulatory compliance, and financial performance.
  • Serve as executive crisis commander, leading through high-stakes, market-impacting incidents, ensuring rapid recovery and transparent communication with boards and regulators.
  • Influence policy-making with regulators on defining and managing “technical glitches” and outage frameworks, advocating proportionality, transparency, and systemic resilience.
  • Scaled systems to handle tens of millions of users and trades per day, modernizing infrastructure with cloud adoption, automation, and AI/ML-driven observability.
  • Drive enterprise-wide IT transformation, ensuring security, compliance, and velocity across business functions.
  • Mentor and grow next-generation reliability leaders, building a culture of ownership, calmness under pressure, and operational excellence.
SRE · Observability · Platform Infrastructure · Enterprise IT · Cloud Computing · Crisis Management · +2

VP Engineering

Sep 2021 – Mar 2023 · 1 yr 6 mos

  • Platform Transformation: Led initiatives that reduced incidents by 80%, achieved 99.99% uptime, and built geo-distributed, self-healing architectures.
  • Crisis Leadership: Calm under fire during market-impacting incidents, providing decisive guidance and transparent communication to boards, auditors, and regulators.
  • Team Building: Hired and mentored high-performing global SRE and DevOps teams, fostering a culture of ownership, innovation, and operational excellence.
Incident Management · Platform Transformation · Team Building · Site Reliability Engineering

Walmart

Senior Engineering Manager - Site Reliability & Performance Engineering @ Walmart

Aug 2018 – Sep 2021 · 3 yrs 1 mo · Bengaluru, Karnataka, India

  • As SRE leader, directly responsible for SRE & Operations functions across 4 markets, enabling the company's Online Grocery offerings worldwide at large scale.
  • Defined and executed technology strategy with a Cloud First approach.
  • Strategic driver for the Walmart fulfilment Infra and SRE org, evaluating and building on everything from tooling adoption and roadmap planning to setting strong operating principles and identifying and advocating for operational efficiencies across a scaling business.
  • Oversaw large, often cross-functional, organization-wide initiatives in alignment with Platform Strategy and Operations, collaborating with other pillars.
SRE · Operations Management · Cloud Strategy · Site Reliability Engineering

Intuit

Senior Engineering Manager - Site Reliability Engineering @ Intuit

Jul 2017 – Aug 2018 · 1 yr 1 mo

  • Leader of Site Reliability Engineering Team for TurboTax, an Intuit Company that generates over $2.1B in revenue per year.
  • Matured the SRE organization from a purely live-site-oriented on-call team into one tightly aligned with the Product Engineering team and its sprint cycles, performing reliability/performance code fixes and building telemetry and end-to-end automation for greater efficiency and reliability of the product.
  • Managed people, budgets, infrastructure, and the reliability, availability, and scalability of business-critical customer-facing applications. Responsibilities also included hiring, technical leadership of initiatives, change management oversight, incident management, risk management, and problem management.
  • Tax Season Readiness
SRE · Performance Engineering · Automation · Site Reliability Engineering

Freecharge

Director- SRE & Cloud Infra

Jun 2016 – Jun 2017 · 1 yr

Responsibilities included:
  • Site Reliability Engineering
  • 24x7 Operation Center
  • Security
  • Database Engineering
  • Release Engineering
  • Change Management
  • Corp IT
SRE · Cloud Infrastructure · Change Management · Site Reliability Engineering

Microsoft

3 roles

Microsoft Azure

Promoted

Aug 2015 – Jun 2016 · 10 mos

  • Led, built and maintained Azure Services worldwide.
  • Managed incidents, communications and other incident-related activities for Microsoft Azure infrastructure, platforms and online services.
  • Partnered with the Microsoft Engineering & Business Groups on the operational support experience and drove improvements and automation.
  • Attracted, developed and retained a global team of high-performing engineers.
  • Drove a culture of learning, best-practice sharing, quality mindset and customer obsession across delivery communities.
Azure Services · Incident Management · Automation · Cloud Computing

Lead - Edge & Network Operations

Jan 2014 – Aug 2015 · 1 yr 7 mos

  • Focused on IT strategy and roadmap planning, problem solving and decision-making, and customer satisfaction and engagement.
IT Strategy · Customer Engagement · Operations Management

Operations Manager

Feb 2013 – Dec 2013 · 10 mos

  • Led the development, management and efficient execution of Crisis Response for Major Incidents, service-impacting high-priority events and cross-Microsoft events, including major business launches. Established strategies and plans to drive efficiencies across the operations and its partner Business Groups through analytics and associated process and tool improvements. Responsibilities included collaboration with other global teams and leadership of the OC, including planning, metrics and KPI development, SLA management, and day-to-day incident and crisis management specific to Service Operations and Crisis Management.
  • Led the 24x7 Service Operations for the OC, focused on driving site up and cost down in incident and crisis management for online services and service components including Outlook.com, Bing.com, Ads and OneDrive, as well as Shared Services like AD and DNS and MOC Tooling and Monitoring services.
Crisis Management · Service Operations · Operations Management

Yahoo

3 roles

Tech Lead, Service Engineering

Apr 2010 – Jan 2013 · 2 yrs 9 mos

  • Systems Engineering for Linux/FreeBSD-based applications in Perl/PHP/Java with database/replication capabilities and both high-availability and scalability requirements. The whole nine yards of SE: architect, build, deploy, monitor, automate and run 24x7. Worked with a small team to deliver on these goals while improving efficiency wherever possible.
  • Handled Yahoo!'s Ad Stack and cloud computing infrastructure.
Systems Engineering · High Availability

Sr Systems Engineer

Promoted

Nov 2007 – Mar 2010 · 2 yrs 4 mos

  • Systems Engineering for FreeBSD-based applications in Perl/PHP with database/replication capabilities and both high-availability and scalability requirements. The whole nine yards of SE: architect, build, deploy, monitor, automate and run 24x7.
  • Handled systems engineering for Personalization and the Open Content Platform (RSS Database) during these years.
Systems Engineering · High Availability

NOC Engineer

Jan 2006 – Nov 2007 · 1 yr 10 mos

  • Managed the Yahoo! NOC in Bangalore, which supports the systems, storage and network infrastructure of the Yahoo! production environment, with 24x7 operations run by engineers in multiple shifts. Responsibilities included process improvement and automation, ensuring SLAs were met, people and technology management, and analyzing and implementing various monitoring tools.
Network Management · Process Improvement · Network Engineering

Vinciti aq

Network eng

Oct 2004 – Jan 2006 · 1 yr 3 mos

  • Deploying and managing Vinciti's proprietary software, Vinprobe (a remote monitoring tool).
  • Managing the IT infrastructure of a client company, including LAN and WAN. This includes managing Cisco 2600 routers, switches and Windows servers.
  • Capacity Planning: Analysing current utilization of links and routers/switches and recommending capacity enhancements.
Network Management · Capacity Planning · Network Engineering

E4e

Network engineer

Oct 2004 – Jan 2006 · 1 yr 3 mos

Vinciti networks

Network Engineer

Jan 2004 – Jan 2006 · 2 yrs

  • Deploying and managing Vinciti's proprietary software, Vinprobe (a remote monitoring tool).
  • Managing the IT infrastructure of a client company, including LAN and WAN. This includes managing Cisco 2600 routers, switches and Windows servers.
  • Capacity Planning: Analysing current utilization of links and routers/switches and recommending capacity enhancements.

Education

Indian Institute of Management, Calcutta

Jan 2016 – Jan 2017

Visvesvaraya Technological University

BE — Computers

Jan 2000 – Jan 2004

Bethany High School
