David Nguyen

SRE (Site Reliability Engineer)

San Francisco, California, United States7 yrs 5 mos experience
Highly Stable

Key Highlights

  • Expert in site reliability and scalability.
  • Proficient in Linux kernel development.
  • Strong background in back-end development.
Stackforce AI infers this person is a Systems Reliability Engineer with a strong focus on Infrastructure and SaaS.

Contact

Skills

Core Skills

LinuxConfiguration ManagementPython

Other Skills

BashPostgresFlaskVue.jsSaltStackPrometheusGrafanaGoElasticsearchDockerMachine LearningC/C++MySQLX86x86 Assembly

About

I'm a system administrator who strives to improve site reliability and scalability, but I'm also a software engineer who has experience with back-end development and an interest in operating systems.

Experience

7 yrs 5 mos
Total Experience
1 yr 10 mos
Average Tenure
--
Current Experience

Cloudflare, inc.

Systems Reliability Engineer

Jul 2019Aug 2025 · 6 yrs 1 mo · San Francisco, CA

  • ● Managed the fleet of edge servers of the Cloudflare global network
  • Instrumented monitoring and alerting for server health metrics using Prometheus and Grafana ensuring SLOs/SLAs
  • ◦ Triaged and mitigated production incidents across various Cloudflare products. Examples:
  • ▪ Debugged BGP anycast route configuration
  • ▪ Ran packet captures in named network namespaces for Magic Transit
  • ▪ Coredump retrieval for Nginx-fl (load balancer)
  • ● Responsible for change management of SaltStack configuration management codebase
  • ● Architected server provisioning pipeline
  • ◦ Provisioning information in Postgres behind a Go REST endpoint that synchronizes with Netbox
  • ◦ Wrote Vue.js frontend and Flask backend web app to import data from manufacturer
  • ● Maintained tooling for software release management
  • ◦ Partitioned server fleet into different release environments like Canary or dogfooding
  • ◦ Wrote PL/pgsql stored procedures to keep partitions current with hardware lifecycle
  • ◦ Full stack for web app so engineers can make release plans using said environments
  • ● Linux kernel development. Backported BPF LSM for ARM from 6.4 to 6.1
  • ◦ Familiarity with virtme, qemu, strace, and GDB
LinuxConfiguration ManagementPythonBashPostgresFlask+4

Trackit

Software Engineering Intern

May 2016Sep 2016 · 4 mos · Los Angeles, California

  • ● Implemented a RESTful API in​ ​Flask (Python) that queries an Elasticsearch database of
  • Amazon AWS logs to forecast future costs using linear regression
  • ● Automated IBM GPFS deployment with SaltStack
  • ● Wrote Dockerfiles to provision production and development environments
PythonFlaskElasticsearchSaltStackDocker

Synopsys

IT Support Administrator

Feb 2014Aug 2014 · 6 mos · Mountain View, California

  • ● Tier 1 desktop support and software troubleshooting
  • ● General software, like VPN and VNC client, install and configuration
  • ● RSA SecureID and Active Directory administration

Simplenetwks corporation

Junior System Administrator

Jun 2013Dec 2013 · 6 mos · Worcester, Massachusetts

  • ● Configured Oracle Linux VMs on Oracle VM Hypervisor
  • ● Deployed a Git server and Puppet server running MCollective
  • ● Automated deployments and set configurations through shell scripting

Education

UCLA

Bachelor of Science (B.S.) — Mathematics of Computation

Jan 2014Jan 2018

De Anza College

Associate of Arts (A.A.) — Computer Information Systems: Systems Programming

Jan 2012Jan 2014

De Anza College

Associate of Arts (A.A.)

Jan 2012Jan 2014

Stackforce found 100+ more professionals with Linux & Configuration Management

Explore similar profiles based on matching skills and experience