Mattie Toia

CTO

New York, New York, United States20 yrs 9 mos experience
Highly Stable

Key Highlights

  • Led global teams in cloud engineering at Google.
  • Expert in site reliability and infrastructure management.
  • Significant contributions to air traffic management systems.
Stackforce AI infers this person is a Cloud Computing and Infrastructure expert with extensive experience in Site Reliability Engineering.

Contact

Skills

Core Skills

Cloud EngineeringSite Reliability EngineeringInfrastructure ManagementSoftware DevelopmentWeb DevelopmentReal-time Systems

Other Skills

Google Cloud PlatformObservabilityCustomer Reliability EngineeringDevOpsContinuous Integration/Continuous DeliveryDeveloper InfrastructureData StorageJavaPythonGoogle Web ToolkitOSGiSNMPCSSSun Certified Java ProgrammerHTML5

About

Experience working in development, integration, operations, management, and leadership roles on mission critical software services. Specialties: Programming Languages – Experience in numerous languages including Java (Sun Certified), C++, C, Python, Perl, Ruby, Javascript, PHP, tcl/tk, ADA, sed, awk Operating Systems – Extensive experience in Unix environments including Linux, Solaris, and Alpha (Tru64 Unix) systems.

Experience

20 yrs 9 mos
Total Experience
5 yrs 2 mos
Average Tenure
0 mo
Current Experience

Uber

Vice President Engineering, Infrastructure

Jun 2026Present · 0 mo · New York, NY · Hybrid

Shopify

3 roles

Vice President of Infrastructure

Promoted

Apr 2024May 2025 · 1 yr 1 mo

Director of Engineering, Infrastructure

Jan 2023Apr 2024 · 1 yr 3 mos

Director of Engineering, Production Platform

May 2021Jan 2023 · 1 yr 8 mos

Google

6 roles

Director Of Engineering

Oct 2019May 2021 · 1 yr 7 mos

  • Director of Engineering in Site Reliability Engineering at Google responsible for two major areas of Google Cloud Platform: Observability and Customer Reliability Engineering (CRE) - two global teams on the order of 100 engineers in size.
  • Observability services includes Google's global scale Time Series Database (TSDB), Alert Management, Debug Logging, Cloud Monitoring, Cloud Logging, and the Cloud APM suite.
  • Customer Reliability Engineering engages with GCP customers to help them build reliable software on the GCP platform as well as share insight in SRE best practices and techniques.

Engineering Director

Promoted

Nov 2018Oct 2019 · 11 mos

  • Product Area Lead for DevOps Infrastructure SRE
  • DevOps Infrastructure SRE are the teams responsible for the reliability, scalability, and efficiency of Google's source, build, continuous integration/continuous delivery (CI/CD), monitoring, and alerting services. These services include our global scale source control, distributed build/test environments, metric collection infrastructure, debug logs, and more. My teams are responsible for both internal systems as well as Google's Cloud developer products.
  • I also am the interim SRE Site Lead for NYC with all local teams across all SRE product areas in New York reporting through me.

Site Reliability Engineering Manager

Promoted

Apr 2016Nov 2018 · 2 yrs 7 mos

  • Product Area Lead for Developer Infrastructure SRE
  • Developer Infrastructure SRE are the teams responsible for the reliability, scalability, and efficiency of Google's source, build, continuous integration, and test services. These services include our global scale source control and distributed build/test environments. My teams are responsible for both internal systems as well as Google's Cloud developer products.
  • I also manage a number of Corp Eng SRE which include the teams responsible for Google's internal IT/Enterprise platforms and our virtualizaton infrastructure for these system.

Tech Lead / Manager, Site Reliability Engineering

Promoted

Jan 2014Apr 2016 · 2 yrs 3 mos

  • Manager of both an infrastructure storage SRE team as well as for the Persistent Disk SRE team in Google's New York office. Persistent Disk is the block storage device offered as part of Google Cloud Platform's (GCP) Compute Engine (GCE) product.
  • Day to day activities involve deployment and operational support of production services, resource allocation and planning, and development of automation and monitoring software for Google's global scale data storage systems. Work is in conjunction with other Core Storage and Cloud Storage teams worldwide.

Site Reliability Engineer - Software Engineer

Oct 2012Jan 2014 · 1 yr 3 mos

  • Responsible for the operations, reliability, and availability of Google's data storage infrastructure. Day to day activities involve deployment and operational support of production services, resource allocation and planning, and development of automation and monitoring software for Google's global scale data storage systems.

Software Engineer

Aug 2011Oct 2012 · 1 yr 2 mos

  • Software engineering working on web applications using Google's highly scalable infrastructure to meet the needs of customers. Day to day work includes using the following frameworks and languages:
  • Java
  • Python
  • Google Web Toolkit
  • Google Guice
  • Google AppEngine
  • JUnit
  • EasyMock

Raytheon, network centric systems

Senior Software Engineer

Jun 2006Aug 2011 · 5 yrs 2 mos

  • Senior Software Engineer II working in the Air Traffic Management (ATM) group. Responsibilities include leading a team of software developers, providing software size, cost, and effort estimation for bids, and reviewing design, code, and verification procedure. The software product must perform in realtime and conform to world air traffic safety standards.
  • Projects, in chronological order starting with the most recent:
  • Lead designer for a modular software based approach to support electronic flight strips and external system interfaces using Java and OSGi to bridge research in SoA
  • Software development manager of the Gardermoen Tower upgrade project. This project is bringing the system in use in Scandinavia’s second largest airport to Solaris 10 using the latest of Oracles’s Sun hardware.
  • Lead the design and development of a Java based Control and Monitoring display which used SNMP to provide enterprise status information to technical supervisors at air traffic control centers and towers.
  • Provided technical consultation for research study for the German aviation authority, the DFS, on porting their legacy Raytheon system from a DEC Alpha platform to a x86 Linux platform.
  • Managed the day to day software development of enhancements to Avinor’s Air Traffic Control system performed by a team of 10 engineers.
  • Upgraded and enhanced a database management system for adapting system and geographic information for Air Traffic Control on the Solaris 10 platform.
  • Development of an auto-generated web knowledge base of all hardware and software errors reported to users of the ATC system using Perl and XML.

Raytheon

Software Engineer

Jun 2006Jan 2006 · 7 mos

  • Software Engineer, International Air Traffic Control

Tufts university

Residential Computing Consultant

Aug 2004Jun 2006 · 1 yr 10 mos

  • Worked for Tufts Online, a computer repair service offered by Tufts University to students. Held office hours and responded to field tickets to fix broken computer hardware and clean virus infected computers. Also was responsible for network trouble shooting and bad network/phone/cablel jack inspection.

Education

Tufts University School of Engineering

Bachelor of Science — Electrical Engineering

Jan 2002Jan 2006

Tufts University

Master of Science — Engineering Management

Jan 2009Jan 2011

Stackforce found 100+ more professionals with Cloud Engineering & Site Reliability Engineering

Explore similar profiles based on matching skills and experience