Dzevad Trumic

CTO

San Francisco, California, United States22 yrs 6 mos experience

Key Highlights

  • Proven leader in Site Reliability Engineering.
  • Expertise in building scalable internet infrastructure.
  • Strong background in managing high-performance engineering teams.
Stackforce AI infers this person is a SaaS Infrastructure and Reliability Engineering expert.

Contact

Skills

Core Skills

Site Reliability EngineeringCloud ComputingInfrastructure ManagementSoftware DevelopmentSystem Architecture

Other Skills

CDNDNSDDoSProtocolsProxies and CachingDeveloper ExcellenceKubernetesSecurity PlatformLoad BalancingHigh AvailabilitySecurityProduction InfrastructureDeveloper ProductivitySRECorp IT

About

Engineering leader who enjoys building strong teams to solve complex problems.

Experience

22 yrs 6 mos
Total Experience
4 yrs 11 mos
Average Tenure
2 yrs 7 mos
Current Experience

Cloudflare

VP of Engineering

Oct 2023Present · 2 yrs 7 mos · San Francisco Bay Area

  • Helping Build a Better Internet with an amazing crew.
  • I run Foundational Engineering, where my exceptionally talented teams cover many of the services and building blocks of a Better Internet:
  • CDN
  • DNS (including our amazing public Resolver 1.1.1.1)
  • DDoS
  • Protocols
  • Proxies and Caching
  • Developer Excellence
  • Kubernetes
  • Security Platform
  • Along with many low-level primitives upon which Cloudflare builds its many rich products
  • Unimog - https://blog.cloudflare.com/unimog-cloudflares-edge-load-balancer
  • Pingora - https://blog.cloudflare.com/pingora-open-source
  • 1.1.1.1 Resolver - https://www.cloudflare.com/learning/dns/what-is-1.1.1.1/
  • FL - https://blog.cloudflare.com/building-cloudflare-on-cloudflare/
  • And much more.
CDNDNSDDoSProtocolsProxies and CachingDeveloper Excellence+4

Observe, inc.

Senior Director of Engineering, Head of Infrastructure

Apr 2023Oct 2023 · 6 mos · San Mateo, California, United States · On-site

  • Super excited to do an ambitious startup again and to deliver state of the art modern infrastructure to underpin the best Observability product on the planet.
  • Overseeing a talented and amazing group across:
  • Production Infrastructure
  • Developer Productivity (CI/CD, SCM, GitOps/IaC)
  • SRE
  • Security
  • Corp IT
Production InfrastructureDeveloper ProductivitySRESecurityCorp ITInfrastructure Management+1

Robinhood

Senior Director, Head of Reliability

Mar 2022Oct 2022 · 7 mos · Menlo Park, California, United States · Hybrid

  • Reliability, evolved.
  • Leading engineering groups across:
  • SRE
  • Observability (Metrics, Logging, Tracing)
  • Capacity Engineering (Cloud spend management)
  • Load testing and Fault tolerance
  • Incident Management
SREObservabilityCapacity EngineeringLoad testingIncident ManagementSite Reliability Engineering+1

Goldman sachs

Managing Director, Global Head of SRE

Apr 2018Mar 2022 · 3 yrs 11 mos · London Area, United Kingdom · On-site

  • Founded SRE at Goldman Sachs.
  • As Global Head of SRE, I'm responsible for:
  • Embedded SRE teams for SecDb, AppleCard, Marquee, Observability
  • CRE (Customer Reliability Engineering)
  • Observability Platform for the firm (Metrics/Monitoring, Alerting, Logging, Tracing)
  • Firmwide Reliability strategy
  • Post-mortem / root-cause analysis standards
Embedded SRE teamsObservability PlatformReliability strategyPost-mortem analysisSite Reliability EngineeringSystem Architecture

Google

Senior Staff Site Reliability Engineer / Tech Lead / Manager

May 2003Apr 2018 · 14 yrs 11 mos · Mountain View, CA & London, UK · On-site

  • 2010 - 2018 | Traffic Team, Network Infrastructure SRE, Google Cloud
  • Managing a group of very talented folks within Traffic Team: the frontline between billions of users and the Google supercomputer.
  • Our job is high availability of, and universally fast access to, Google Cloud and all other Google services.
  • Led Edge deployment strategy for all frontend traffic.
  • Specialization in:
  • Frontend load balancing
  • Content Caching & CDN
  • Automation
  • Distributed systems
  • High availability
  • DNS
  • DDoS
  • Software Defined Networking (Espresso: https://research.google.com/pubs/pub46316.html)
  • Monitoring
  • 2006 - 2010 | Systems Software, Platforms
  • Tech Lead of Burn-in software that stress tests every machine at Google.
  • Focus on hard drive and DRAM testing.
  • 2003 - 2006 | Hardware Systems, HWOps
  • Data center repair software automation.
Traffic TeamNetwork Infrastructure SREHigh availabilityDNSDDoSMonitoring+2

Education

San José State University

Computer Science

University of Google searches and Wikipedia articles

Stackforce found 100+ more professionals with Site Reliability Engineering & Cloud Computing

Explore similar profiles based on matching skills and experience