Mattie Toia

CTO

New York, New York, United States20 yrs 9 mos experience

Highly Stable

Key Highlights

Led global teams in cloud engineering at Google.
Expert in site reliability and infrastructure management.
Significant contributions to air traffic management systems.

Stackforce AI infers this person is a Cloud Computing and Infrastructure expert with extensive experience in Site Reliability Engineering.

Contact

Skills

Core Skills

Cloud EngineeringSite Reliability EngineeringInfrastructure ManagementSoftware DevelopmentWeb DevelopmentReal-time Systems

Other Skills

Google Cloud PlatformObservabilityCustomer Reliability EngineeringDevOpsContinuous Integration/Continuous DeliveryDeveloper InfrastructureData StorageJavaPythonGoogle Web ToolkitOSGiSNMPCSSSun Certified Java ProgrammerHTML5

About

Experience working in development, integration, operations, management, and leadership roles on mission critical software services. Specialties: Programming Languages – Experience in numerous languages including Java (Sun Certified), C++, C, Python, Perl, Ruby, Javascript, PHP, tcl/tk, ADA, sed, awk Operating Systems – Extensive experience in Unix environments including Linux, Solaris, and Alpha (Tru64 Unix) systems.

Experience

20 yrs 9 mos

Total Experience

5 yrs 2 mos

Average Tenure

0 mo

Current Experience

Uber

Vice President Engineering, Infrastructure

Jun 2026 – Present · 0 mo · New York, NY · Hybrid

Shopify

3 roles

Vice President of Infrastructure

Promoted

Apr 2024 – May 2025 · 1 yr 1 mo

Director of Engineering, Infrastructure

Jan 2023 – Apr 2024 · 1 yr 3 mos

Director of Engineering, Production Platform

May 2021 – Jan 2023 · 1 yr 8 mos

Google

6 roles

Director Of Engineering

Oct 2019 – May 2021 · 1 yr 7 mos

Director of Engineering in Site Reliability Engineering at Google responsible for two major areas of Google Cloud Platform: Observability and Customer Reliability Engineering (CRE) - two global teams on the order of 100 engineers in size.
Observability services includes Google's global scale Time Series Database (TSDB), Alert Management, Debug Logging, Cloud Monitoring, Cloud Logging, and the Cloud APM suite.
Customer Reliability Engineering engages with GCP customers to help them build reliable software on the GCP platform as well as share insight in SRE best practices and techniques.

Engineering Director

Promoted

Nov 2018 – Oct 2019 · 11 mos

Product Area Lead for DevOps Infrastructure SRE
DevOps Infrastructure SRE are the teams responsible for the reliability, scalability, and efficiency of Google's source, build, continuous integration/continuous delivery (CI/CD), monitoring, and alerting services. These services include our global scale source control, distributed build/test environments, metric collection infrastructure, debug logs, and more. My teams are responsible for both internal systems as well as Google's Cloud developer products.
I also am the interim SRE Site Lead for NYC with all local teams across all SRE product areas in New York reporting through me.

Site Reliability Engineering Manager

Promoted

Apr 2016 – Nov 2018 · 2 yrs 7 mos

Product Area Lead for Developer Infrastructure SRE
Developer Infrastructure SRE are the teams responsible for the reliability, scalability, and efficiency of Google's source, build, continuous integration, and test services. These services include our global scale source control and distributed build/test environments. My teams are responsible for both internal systems as well as Google's Cloud developer products.
I also manage a number of Corp Eng SRE which include the teams responsible for Google's internal IT/Enterprise platforms and our virtualizaton infrastructure for these system.

Tech Lead / Manager, Site Reliability Engineering

Promoted

Jan 2014 – Apr 2016 · 2 yrs 3 mos

Manager of both an infrastructure storage SRE team as well as for the Persistent Disk SRE team in Google's New York office. Persistent Disk is the block storage device offered as part of Google Cloud Platform's (GCP) Compute Engine (GCE) product.
Day to day activities involve deployment and operational support of production services, resource allocation and planning, and development of automation and monitoring software for Google's global scale data storage systems. Work is in conjunction with other Core Storage and Cloud Storage teams worldwide.

Site Reliability Engineer - Software Engineer

Oct 2012 – Jan 2014 · 1 yr 3 mos

Responsible for the operations, reliability, and availability of Google's data storage infrastructure. Day to day activities involve deployment and operational support of production services, resource allocation and planning, and development of automation and monitoring software for Google's global scale data storage systems.

Software Engineer

Aug 2011 – Oct 2012 · 1 yr 2 mos

Software engineering working on web applications using Google's highly scalable infrastructure to meet the needs of customers. Day to day work includes using the following frameworks and languages:
Java
Python
Google Web Toolkit
Google Guice
Google AppEngine
JUnit
EasyMock

Raytheon, network centric systems

Senior Software Engineer

Jun 2006 – Aug 2011 · 5 yrs 2 mos

Senior Software Engineer II working in the Air Traffic Management (ATM) group. Responsibilities include leading a team of software developers, providing software size, cost, and effort estimation for bids, and reviewing design, code, and verification procedure. The software product must perform in realtime and conform to world air traffic safety standards.
Projects, in chronological order starting with the most recent:
Lead designer for a modular software based approach to support electronic flight strips and external system interfaces using Java and OSGi to bridge research in SoA
Software development manager of the Gardermoen Tower upgrade project. This project is bringing the system in use in Scandinavia’s second largest airport to Solaris 10 using the latest of Oracles’s Sun hardware.
Lead the design and development of a Java based Control and Monitoring display which used SNMP to provide enterprise status information to technical supervisors at air traffic control centers and towers.
Provided technical consultation for research study for the German aviation authority, the DFS, on porting their legacy Raytheon system from a DEC Alpha platform to a x86 Linux platform.
Managed the day to day software development of enhancements to Avinor’s Air Traffic Control system performed by a team of 10 engineers.
Upgraded and enhanced a database management system for adapting system and geographic information for Air Traffic Control on the Solaris 10 platform.
Development of an auto-generated web knowledge base of all hardware and software errors reported to users of the ATC system using Perl and XML.

Raytheon

Software Engineer

Jun 2006 – Jan 2006 · 7 mos

Software Engineer, International Air Traffic Control

Tufts university

Residential Computing Consultant

Aug 2004 – Jun 2006 · 1 yr 10 mos

Worked for Tufts Online, a computer repair service offered by Tufts University to students. Held office hours and responded to field tickets to fix broken computer hardware and clean virus infected computers. Also was responsible for network trouble shooting and bad network/phone/cablel jack inspection.