santosh vangur

CTO

Hyderabad, Telangana, India25 yrs 11 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Led large-scale transformations in enterprise platforms.
  • Achieved $100M+ in cost savings through optimizations.
  • Championed D&I initiatives to enhance employee experience.
Stackforce AI infers this person is a SaaS and Cloud Services expert with a focus on engineering leadership and operational excellence.

Contact

Skills

Core Skills

Software EngineeringCloud ComputingSite Reliability EngineeringService EngineeringDatabase Management

Other Skills

Agile MethodologiesAgile Project ManagementC#CommunicationCultural CompetencyCultural DiversityData CenterData ManagementDatabasesDistributed SystemsEnterprise ArchitectureEnterprise SoftwareMarket KnowledgeMicrosoft AzureMicrosoft SQL Server

About

As a Global Capability Center leader and Engineering site leader at Microsoft, I bring 25 years of experience building and scaling high-impact engineering hubs across the U.S., China, and India. I have grown organizations from incubation to 200+ FTEs and 300+ vendor resources, managing multimillion-dollar Capex/Opex budgets. By setting up multi-discipline teams across software engineering, product management, IT, SRE, and NOC, and partnering with stakeholders worldwide, I have significantly reduced the cost of running the business across people, process, and product while enabling GCCs to become strategic innovation engines and driving the vision, roadmap and delivering results with deep partnerships across stakeholders. I am passionate about technology innovation and have led large-scale transformations that shaped the future of enterprise platforms. My experience spans developing end-to-end product features, driving system stabilization, and modernizing technology stacks for performance and scalability. I have led the transition from on-premises to cloud, built agile DevOps teams, integrated acquisitions, and delivered $100M+ in cost savings through service and operational optimizations. With expertise across Azure, Dynamics 365, Power Platform, Bing, and Commerce, I thrive at balancing deep technical execution with strategic business outcomes. Equally important is my focus on people and culture. As the D&I lead for India Dynamics 365, I have championed inclusion and driven initiatives to strengthen employee experience at Microsoft IDC. I believe in building transparent, collaborative, and customer-first organizations where people grow and thrive. By mentoring engineers, managers, and leaders, I cultivate high-performing teams that embrace accountability, innovation, and excellence. I have also been part of improving engineering excellence, employee development programs at IDC Microsoft.

Experience

Microsoft corporation

5 roles

Principal GEM/ Sr.Director of Engineering - Dynamics 365 & Power Platform

Promoted

May 2017Present · 8 yrs 10 mos

  • Head of Engineering – Connectors & Agent Data Platform
  • Lead Microsoft’s 1400+ Connectors Platform, powering integration with ISVs and external data sources.
  • Drive knowledge assets for Microsoft Copilot Studio, ensuring secure, scalable integration.
  • Own the Connectors Certification & Governance Platforms (design & runtime), enabling compliance and trust.
  • Spearhead the AI-first transition, defining strategy for the Model Context Protocol (MCP) to enable copilots and agents.
  • Head of Engineering – Microsoft ERP (F&O Data Platform)
  • Owned end-to-end strategy & execution across runtime, lifecycle, batch, integration, file store, auditing, authorization, and data engine.
  • Transformed ERP delivery to true SaaS by reducing supported versions to N–2 via continuous updates.
  • Improved stability—cut incidents and customer calls by 50% with AI-driven self-healing and copilots.
  • Unified workstreams across platforms, driving $10M annual COGS savings (projected $50M in 3 years).
  • Scaled the engineering org to 100+ engineers and 200+ vendor resources, enabling execution at global scale.
  • Group Engineering Manager – D365 Observability, Infra & Data Management
  • Delivered next-gen Data Retention Services, cutting storage costs while boosting performance.
  • Improved incident auto-detection from 55% → 90%+ across an 8,000-developer org via observability platforms (telemetry, anomaly detection, visualization, flighting).
  • Led RFPs for monitoring solutions (e.g., New Relic) to gain external “outside-in” signals.
  • Migrated on-premises D365 → cloud, consolidating multiple versions into a unified SaaS platform.
  • Strengthened platform reliability with data-driven OKRs, insights, and reporting, increasing uptime and customer trust.
  • Drove security & compliance programs meeting stringent requirements across Europe and India.
Strategic InitiativesPresentationsTeam IntegrationMarket KnowledgeCultural DiversityStrategic Planning+4

Director/Principal Engineering Manager - Azure

Promoted

Jan 2015Feb 2017 · 2 yrs 1 mo

  • Driving strategy, vision, and transformation to instill an engineering mindset, agile practices, and a data-driven, metrics-oriented culture with a growth mindset.
  • Part of the core SRE team for Microsoft Azure, defining strategy and running the command center for crisis management, handling large-scale outages across multiple geographies, data centers, and infrastructure.
  • Accountable for Azure availability and reliability (24x7) from India—focused on improving incident detection, reducing mitigation time, and strengthening SRE capabilities. Conduct regular executive reviews of live site issues and drive a data-driven RCA/RI culture.
  • Lead Azure capacity management, building and scaling clusters globally with deep automation and process optimization.
  • Deliver key software engineering projects, including diagnostic tools, end-to-end incident automation, and an NLP-driven bot for workflow automation, dashboards, and task orchestration.
  • Responsible for problem management, identifying recurring issues, closing gaps, and implementing long-term fixes.
  • Build and lead the India engineering organization—hiring, performance management, attrition management, and college recruitment (conducted 100+ interviews, hired 80+ employees over 5 years).
  • As site lead, manage multiple functions and partner with Staffing, HR, Finance, Legal, and Tax to ensure alignment and operational excellence.
Strategic InitiativesPresentationsTeam IntegrationMarket KnowledgeCultural DiversityStrategic Planning+4

Principal Engineering Manager

Promoted

Apr 2012Dec 2014 · 2 yrs 8 mos

  • Led the Cloud Reliability Hub in India, responsible for 24x7 health monitoring, incident correlation, response, and engagement across Microsoft’s infrastructure, platforms, services, and third-party providers to ensure maximum site availability for online services.
  • Managed and scaled teams handling Incident Management, Network Operations, Edge SRE, Crisis Management, and Multi-Service Global Outages, including executive communications during critical events.
  • Drove operational optimizations across the organization—reducing ticket volumes through auto-triaging, alert elimination, self-healing, and automation, achieving $100M+ in TCO reduction over 3 years.
  • Designed and deployed systems, tools, and processes to reduce detection and mitigation times, improving overall reliability and resilience of Microsoft’s online services.
  • Strengthened the SRE culture, embedding data-driven practices, faster RCA/RI cycles, and enhanced automation, while mentoring and developing high-performing teams.
Strategic InitiativesPresentationsTeam IntegrationMarket KnowledgeCultural DiversityStrategic Planning+4

Engineering Manager

Promoted

Aug 2006Mar 2012 · 5 yrs 7 mos

  • Leading the service engineering team for Commerce platforms which included subscription, payment gateway, Risk/fraud, payouts) which supports global payments, global markets, generates revenues in few Billion$, has 35 internal partners (Xbox, Windows Phone, Azure, AdCenter, Office 365, Windows live etc.), external partners and dependencies.
  • Built systems that will move towards 0-minute down time deployments. Automated high availability setup for back end SQL server for high OLTP and reduced down time to seconds.
  • Delivered large scale releases, holiday seasons, product launches – prepared readiness of the services, infra to ensure successful events.
  • Integrated the acquisitions of external companies Jellyfish, Massive into Microsoft and was part of Swat team to integrate this into Live search.
  • Designed data center build out for BCP levels, data center migrations.
  • Ensured the billing platform is secure and complaint (PCI, SOX)
  • Developed playbooks, instrumentation, monitoring, standard deviation, detection, hardware virtualization, scale outs, dashboards, metrics.
  • Implemented self-healing, risk assessment of services, failures, security – avoid from DDOS and other attacks.
  • Designed and Managed the infrastructure around 2000 + servers which includes Front end, SQL servers, middle tiers, 100’s of Terabytes of data and other infra includes load balancers, firewalls, IPS, TORS, routers, switches, arbor etc. (The network infra is being managed by shared services) and storage area networks (SAN - EMC, Hitachi)
PresentationsStrategic PlanningCommunicationService EngineeringCloud Computing

Database Engineer, Senior Software Engineer

Feb 2001Jul 2006 · 5 yrs 5 mos

  • Managed thousands of SQL servers for critical services in Microsoft – worked extensively on High availability – Log shipping, clustering, database mirroring, performance management, capacity management, consolidation by virtualization, scripting repetitive alerts, fixing root cause, developing scripts, improving SQL performance, replication, data center setup, data center migrations, installation and upgrade of environments. Developed frameworks to automate the rollouts, upgrades, performance testing.
PresentationsDatabase Management

Volt

Database Engineer

Nov 1999Feb 2001 · 1 yr 3 mos · Greater Seattle Area

  • Managed thousands of SQL servers for critical services in Microsoft – worked extensively on High availability – Log shipping, clustering, database mirroring, performance management, capacity management, consolidation by virtualization, scripting repetitive alerts, fixing root cause, developing scripts, improving SQL performance, replication, data center setup, data center migrations, installation and upgrade of environments

Education

University of Washington

1 year Course in Technology Management — Technlogy management

Jan 2008Jan 2009

Gulbarga University

Bachelors in Engineering — Mechanical

Jan 1993Jan 1997

Stackforce found 100+ more professionals with Software Engineering & Cloud Computing

Explore similar profiles based on matching skills and experience