T

Tushar Sappal

DevOps Engineer

Greater Vancouver, Canada11 yrs 1 mo experience
Highly Stable

Key Highlights

  • Expert in building cost-efficient cloud platforms.
  • Proven track record of achieving 99.9% uptime SLAs.
  • Skilled in driving compliance initiatives across multiple standards.
Stackforce AI infers this person is a Cloud Infrastructure and Site Reliability Engineering expert in the SaaS industry.

Contact

Skills

Core Skills

Site Reliability EngineeringCloud InfrastructureSoftware Engineering ManagementWebopsCloud Infrastructure ManagementInfrastructure ArchitectureTechnical Operations ManagementSre Culture Development

Other Skills

Agile MethodologiesAmazon Web Services (AWS)AutomationCloud ApplicationsCloud ComputingCommunicationCybersecurityDevOpsDistributed SystemsDockerIncident ManagementInformation TechnologyInfrastructure-as-CodeJavaKubernetes

About

Engineering Leader with 12+ years building and scaling Site Reliability and Cloud Infrastructure teams across hyper-growth startups and Fortune 500 companies. Expert in leading distributed teams to deliver highly available, cost-efficient cloud platforms (AWS, Azure, GCP, Kubernetes) serving millions of users. Proven track record managing cloud budgets, achieving 99.9% uptime SLAs, and driving compliance initiatives (SOC2, FedRAMP, CMMC). Skilled in building SRE cultures focused on automation, observability, and continuous improvement.

Experience

Blackpoint cyber

Senior Software Engineering Manager

May 2025Dec 2025 · 7 mos · Canada · Remote

  • Lead cross-functional SRE team supporting MDR SOC platform serving enterprise customers with 99.9% uptime SLA commitments.
  • Reduced critical incidents by 80% MoM through systematic service maturity program, implementing SLO-based monitoring, automated remediation, and comprehensive postmortem processes across distributed Kubernetes infrastructure
  • Improved MDR SOC availability from 90% to 99.9% through capacity planning, chaos engineering, and ML-powered anomaly detection that reduced false alerts by 60%
  • Drove annual cost optimization through right-sizing initiatives, reserved instance strategies, and automated resource lifecycle management
  • Led CMMC Type 2 compliance initiatives, establishing security controls, audit processes, and documentation standards for platform certification.
Solution ArchitectureCloud ApplicationsRisk AssessmentRisk ManagementCybersecurityCloud Computing+3

Pantheon platform

Software Engineering Manager

Mar 2024May 2025 · 1 yr 2 mos · Canada · Remote

  • Built and managed distributed Engineering team of 8 engineers across North America supporting WordPress/Drupal WebOps SaaS platform serving 300K+ sites.
  • Led company wide initiatives for SOC2 Type 1 and Type 2 compliance accreditations for the platform and
  • supported services for Pantheon.
  • Led the development of new services within the infrastructure portfolio resulting in improvement of
  • ownership, reliability, and incident management, thereby resulting in a 60% reduction till date in detection and mitigation time.
  • Led initiatives for service maturity and in general system design consulting, platform management, and
  • capacity planning for overall Webops platform.
  • Led support of agile delivery of a growing portfolio of SaaS applications, product releases, infrastructure and optimizations in concert with Professional Services and QE team.
  • Led the maturity of processes , tooling and automation production incidents, performing root cause analysis, identification cation and resolution of underlying problem patterns - driving development of self-healing solutions.
  • Led and drove continuous improvement for supported applications, in areas such as monitoring,
  • operational tasks, automation, continuous integration, deployments and performance tuning.
Cloud ApplicationsProgram ManagementSoftware DevelopmentDevOpsAgile MethodologiesSoftware Engineering Management+1

Microsoft

Senior Software Engineering Manager

Sep 2020Mar 2024 · 3 yrs 6 mos · Vancouver, BC

  • Senior manager overseeing team of SREs and SWEs supporting Azure-based PaaS platforms (Kubernetes, Service Fabric) for Microsoft Defender product portfolio, managing $50M+ annual cloud infrastructure budget.
  • Delivered multi million dollar annual cost savings through data-driven capacity planning, multi-tenancy optimizations, and automated resource governance policies across Azure infrastructure
  • Achieved 80% reduction in incident MTTD/MTTR by establishing centralized observability platform, automated remediation playbooks, and cross-team incident command protocols
  • Drove SLO-based reliability culture, implementing error budgets, blameless postmortems, and quarterly reliability reviews with product leadership
  • Built self-service infrastructure platform enabling 200+ engineering teams to provision compliant environments in <15 minutes vs. previous multi-day manual processes
  • Led multi-quarter compliance initiatives delivering SOC2 Type 1/2, FedRAMP High, DISA IL5, and PCI DSS certifications, collaborating with security, legal, and audit teams
  • Recruited, mentored, and retained high-performing team in competitive market; 90%+ retention rate with 3 engineers promoted to senior/lead roles
Cloud ComputingSoftware ManagementDevOpsSite Reliability EngineeringKubernetesCloud Infrastructure Management

Vmware

Technical Team Lead

Dec 2018Sep 2020 · 1 yr 9 mos

  • Architected and delivered production-grade Kubernetes PaaS serving 100+ internal/external microservices across multiple geographic regions with 99.95% uptime
  • Established foundational SRE practices: comprehensive observability stack (Prometheus, Grafana, ELK), automated deployment pipelines, incident management processes, and on-call rotation framework
  • Accelerated SOC2 Type 1 and FedRAMP compliance certification, enabling VMware to pursue federal and enterprise market segments
  • Implemented one-click infrastructure deployment and auto-remediation capabilities, reducing operational toil by 70%
Cloud ApplicationsSoftware DevelopmentDevOpsSite Reliability EngineeringInfrastructure Architecture

Adobe

Senior Member of Technical Staff

Jun 2012Apr 2018 · 5 yrs 10 mos · India

  • Transformed 10-person Technical Operations team from reactive support model to proactive SRE organization aligned with product engineering, supporting multi-cloud (AWS, Azure, private datacenter) SaaS/PaaS platforms.
  • Cultural Transformation:
  • Restructured team using SRE principles and Agile methodologies, establishing embedded DevOps model with regional product teams across 3 global offices
  • Recruited and mentored 20 SREs/SWEs; developed career progression framework and training programs resulting in 5 internal promotions to lead roles
  • Infrastructure Modernization:
  • Reduced infrastructure provisioning from weeks to minutes through Infrastructure-as-Code adoption (Terraform, Ansible, CloudFormation) across hybrid cloud environment
  • Built centralized database administration team, modernizing database operations and driving NoSQL adoption strategy
  • Delivered 50+ product releases through improved deployment pipelines, blue-green deployments, and automated rollback mechanisms
  • Operational Excellence:
  • Improved MTTD/MTTR by 40% through comprehensive monitoring strategy, runbook automation, and chaos engineering practices
  • Designed and executed disaster recovery and business continuity plans, achieving <4-hour RTO for critical services
  • Drove cost optimization initiatives reducing cloud spend by 25% through reserved instances, rightsizing, and automated resource cleanup
Cloud ApplicationsSoftware DevelopmentDevOpsSite Reliability EngineeringTechnical Operations ManagementSRE Culture Development

Education

The University of British Columbia

Certifacte — Cloud Security and Digital Transformation

Birla Institute of Technology and Science, Pilani

Master of Technology - MTech — Software Systems

Jan 2013Jan 2015

DIT UNIVERSITY

Bachelor of Technology (BTech) — Computer Science and Engineering

Jan 2008Jan 2012

Stackforce found 100+ more professionals with Site Reliability Engineering & Cloud Infrastructure

Explore similar profiles based on matching skills and experience