Dzevad Trumic

CTO

San Francisco, California, United States22 yrs 6 mos experience

Key Highlights

Proven leader in Site Reliability Engineering.
Expertise in building scalable internet infrastructure.
Strong background in managing high-performance engineering teams.

Stackforce AI infers this person is a SaaS Infrastructure and Reliability Engineering expert.

Contact

Skills

Core Skills

Site Reliability EngineeringCloud ComputingInfrastructure ManagementSoftware DevelopmentSystem Architecture

Other Skills

CDNDNSDDoSProtocolsProxies and CachingDeveloper ExcellenceKubernetesSecurity PlatformLoad BalancingHigh AvailabilitySecurityProduction InfrastructureDeveloper ProductivitySRECorp IT

About

Engineering leader who enjoys building strong teams to solve complex problems.

Experience

22 yrs 6 mos

Total Experience

4 yrs 11 mos

Average Tenure

2 yrs 7 mos

Current Experience

Cloudflare

VP of Engineering

Oct 2023 – Present · 2 yrs 7 mos · San Francisco Bay Area

Helping Build a Better Internet with an amazing crew.
I run Foundational Engineering, where my exceptionally talented teams cover many of the services and building blocks of a Better Internet:
CDN
DNS (including our amazing public Resolver 1.1.1.1)
DDoS
Protocols
Proxies and Caching
Developer Excellence
Kubernetes
Security Platform
Along with many low-level primitives upon which Cloudflare builds its many rich products
Unimog - https://blog.cloudflare.com/unimog-cloudflares-edge-load-balancer
Pingora - https://blog.cloudflare.com/pingora-open-source
1.1.1.1 Resolver - https://www.cloudflare.com/learning/dns/what-is-1.1.1.1/
FL - https://blog.cloudflare.com/building-cloudflare-on-cloudflare/
And much more.

CDNDNSDDoSProtocolsProxies and CachingDeveloper Excellence+4

Observe, inc.

Senior Director of Engineering, Head of Infrastructure

Apr 2023 – Oct 2023 · 6 mos · San Mateo, California, United States · On-site

Super excited to do an ambitious startup again and to deliver state of the art modern infrastructure to underpin the best Observability product on the planet.
Overseeing a talented and amazing group across:
Production Infrastructure
Developer Productivity (CI/CD, SCM, GitOps/IaC)
SRE
Security
Corp IT

Production InfrastructureDeveloper ProductivitySRESecurityCorp ITInfrastructure Management+1

Robinhood

Senior Director, Head of Reliability

Mar 2022 – Oct 2022 · 7 mos · Menlo Park, California, United States · Hybrid

Reliability, evolved.
Leading engineering groups across:
SRE
Observability (Metrics, Logging, Tracing)
Capacity Engineering (Cloud spend management)
Load testing and Fault tolerance
Incident Management

SREObservabilityCapacity EngineeringLoad testingIncident ManagementSite Reliability Engineering+1

Goldman sachs

Managing Director, Global Head of SRE

Apr 2018 – Mar 2022 · 3 yrs 11 mos · London Area, United Kingdom · On-site

Founded SRE at Goldman Sachs.
As Global Head of SRE, I'm responsible for:
Embedded SRE teams for SecDb, AppleCard, Marquee, Observability
CRE (Customer Reliability Engineering)
Observability Platform for the firm (Metrics/Monitoring, Alerting, Logging, Tracing)
Firmwide Reliability strategy
Post-mortem / root-cause analysis standards

Embedded SRE teamsObservability PlatformReliability strategyPost-mortem analysisSite Reliability EngineeringSystem Architecture

Google

Senior Staff Site Reliability Engineer / Tech Lead / Manager

May 2003 – Apr 2018 · 14 yrs 11 mos · Mountain View, CA & London, UK · On-site

2010 - 2018 | Traffic Team, Network Infrastructure SRE, Google Cloud
Managing a group of very talented folks within Traffic Team: the frontline between billions of users and the Google supercomputer.
Our job is high availability of, and universally fast access to, Google Cloud and all other Google services.
Led Edge deployment strategy for all frontend traffic.
Specialization in:
Frontend load balancing
Content Caching & CDN
Automation
Distributed systems
High availability
DNS
DDoS
Software Defined Networking (Espresso: https://research.google.com/pubs/pub46316.html)
Monitoring
2006 - 2010 | Systems Software, Platforms
Tech Lead of Burn-in software that stress tests every machine at Google.
Focus on hard drive and DRAM testing.
2003 - 2006 | Hardware Systems, HWOps
Data center repair software automation.