Joseph P.

DevOps Engineer

Los Angeles, California, United States · 12 yrs 11 mos experience
AI Enabled · AI ML Practitioner

Key Highlights

  • Expert in designing human-agent systems for complex infrastructure.
  • Proven track record in AI orchestration and agentic workflows.
  • Strong leadership in DevOps and cloud infrastructure management.
Stackforce AI infers this person is a DevOps and Cloud Infrastructure expert in the Healthcare and SaaS industries.

Skills

Core Skills

AI Orchestration · Software Observability · Site Reliability Engineering · DevOps · Kubernetes · AWS · GCP · Web Development · PHP

Other Skills

Token Economics · Agentic Coding · AI Pair Programming / AI-Assisted Programming · Agentic Workflows · Human+AI Teams or (Human+AI Collaboration) · Prompt Engineering · LLM Evaluation · Context Engineering · Google Kubernetes Engine (GKE) · Performance Testing · Technical Leadership · Cross-functional Collaborations · Google Cloud Platform (GCP) · Incident Command · Troubleshooting

About

I design systems where humans and AI agents collaborate to solve complex infrastructure problems. My foundation is in DevOps and platform engineering, with deep expertise in GCP, AWS, Kubernetes, Terraform, observability, and regulation-compliant cloud architecture. I’ve spent years building the infrastructure that keeps complex technology systems running reliably at scale. Outside of work, I perform on indie improv teams around Los Angeles. I trained at the Upright Citizens Brigade and continue to train with various independent schools and coaches.

Experience

Total Experience: 12 yrs 11 mos
Average Tenure: 2 yrs 1 mo
Current Experience: 8 mos

Synthesis Health

Senior DevOps Engineer

Sep 2025 - Present · 8 mos · Los Angeles Metropolitan Area · Remote

  • Design and implement agentic coding workflows using Claude Code for both greenfield development and legacy codebase modernization, creating detailed specifications that AI agents execute with human oversight at key decision points.
  • Develop custom orchestration skills, commands, and subagent workflow patterns. Track cost-per-session economics and evaluate model selection tradeoffs (model routing, quality/cost analysis) across project types.
  • Delivered training on AI and agentic coding using ChatGPT custom GPTs, Claude Projects, Claude Code agents, skills, and commands, and Beads.
  • Built production synthetic monitoring for a medical imaging platform, providing the first-ever proactive detection of service degradation before customer impact.
  • Implemented stress testing that identified a critical performance bottleneck, blocking a release that would have caused production outages.
  • Created observability dashboards enabling data-driven capacity planning, replacing guesswork with real user-load metrics.
  • Partnered with the Viewer team (Viewer is a strategically important product) to instrument the image rendering pipeline, uncovering 1,000+ daily failures previously invisible to the business.
  • Work recognized by the CEO and executive leadership within the first 60 days.
AI Orchestration · Token Economics · Agentic Coding · AI Pair Programming / AI-Assisted Programming · Agentic Workflows · Human+AI Teams or (Human+AI Collaboration) · +11

Convergenz

SRE Contractor

Feb 2025 - Sep 2025 · 7 mos · Los Angeles Metropolitan Area · Remote

  • At an international social media company, supported the stability of the livestreaming feature including the moderation capability.
  • Adjusted the capacity of bare-metal Kubernetes clusters to meet demand and reduce cost. Reduced one cluster by 33% and two other clusters by 50%. Documented the plan and created the change documents and change tickets.
  • Participated in 24/7 on-call shifts for two weeks out of every 4-5 weeks, first as secondary and then as primary on call. Diagnosed and resolved issues within our SLAs while on call. A typical week had about 75-100 alarms.
  • Participated in retros for P0 incidents.
  • Wrote 2 SOPs and updated 3 others.
  • Promoted the use of alarm statistics to reduce the number of alarms. We now have a process for proposing changes to alarms that are not warranted or that need a different severity level.
Troubleshooting · Site Reliability Engineering · Linux · Kubernetes · Bash

TEKsystems

Software Engineer III at TEKsystems at Apple

Jun 2024 - Feb 2025 · 8 mos · Los Angeles, California, United States · Remote

  • Created proofs of concept for the GitOps deployment tools Flux and ArgoCD and provided direction for migrating to a continuous delivery platform that will make it easier for our stakeholder teams to deploy their applications to Kubernetes clusters. Created an ArgoCD configuration management plugin to detect Python Pulumi projects and deploy code developed with the Python Pulumi Kubernetes SDK.
  • Refactored a Pulumi (Python) monolith stack into multiple stacks and migrated core Python code to a packaged SDK to make our infrastructure code more scalable.
  • Added new environments for integration tests and for health-check alerts. This built confidence in migrating to new EKS clusters managed by Apple’s implementation of Crossplane, and it facilitated moving off an EKS cluster managed by a team that no longer supported it.
  • Took the lead in transitioning ownership of Weights and Biases instances to centralize management of Apple’s account with Weights and Biases. This helps Apple negotiate consistent pricing for licenses.
  • Automated audits of Weights and Biases users to track which divisions and teams were using it, which cut the effort from one day to under an hour.
  • Automated onboarding of applications to one of our team’s services. This turned an error-prone process that took 15 minutes into one that takes less than a minute.
  • Proved that we can use Apple’s secrets management service in Apple’s CI/CD instead of the secrets manager for Apple’s CI/CD tool. This reduced the number of places secrets were managed from multiple secret stores to one.
Interpersonal Skills · Pulumi · Python · Amazon Web Services (AWS) · Amazon EKS · Grafana · +7

Freshpulse

Co-Founder

Aug 2023 - Jan 2024 · 5 mos · United States · Remote

  • Set up the "running the business" vendors and services, such as lawyers, collaboration tools, business creation, employment contracts, and banking accounts, to ensure the company was compliant and able to pay the bills.
  • Led customer discovery sessions to learn current pain points in the market.
  • Recruited a product designer to create the Figma prototypes that could be shown to investors and prospective clients.
  • Pitched potential investors.

Thoughtworks

Senior Infrastructure Consultant

Nov 2021 - Jun 2024 · 2 yrs 7 mos · Remote

  • For a FAANG company, I migrated Java Spring Boot microservice APIs from on-premises to AWS, optimizing costs and improving infrastructure efficiency. The APIs calculated the per-project breakdown of the company’s cloud infrastructure costs, which was important for managing the company’s cloud spend. Developed documentation and demos so that the application could be managed by the application team after I rolled off.
  • For the Department of Veterans Affairs, worked as both a software engineer and infrastructure SME on a pilot project to stream Fitbit data from patients to the VA’s system to help health care workers work with patients. The tasks included updating the frontend React code and unit tests. Discovered that the React tests were not run in GitHub Actions and that tests were failing. Fixed the failing tests and made sure they ran in CI. Worked on the Python application that ran batch jobs to process data. As part of the infrastructure improvements, I cut a 2.75GB Docker image to half its size and reduced fresh download time from 20 minutes to 4 minutes.
  • For the Department of Veterans Affairs, worked in both software engineering and infrastructure roles on a project that provides a platform for applications that help clinicians treat their patients. I was brought on to help the team grow its knowledge of infrastructure-as-code and infrastructure management best practices.
Grafana · Terraform · GitHub Actions · Python · DevOps · Amazon Web Services (AWS) · +23

SADA

Senior Cloud Infrastructure Engineer

Feb 2019 - Nov 2021 · 2 yrs 9 mos · North Hollywood, California · Remote

  • Worked as a Google Partner Engineer with Google PSO for Google’s GCP clients, including Airbnb and a FAANG company.
  • Created CI/CD jobs in GitLab for Packer image creation. Wrote a Python script that uses Google’s Python client library for the Bulk VM API. Wrote Python scripts to detect empty nodes in GKE clusters and send a Slack notification using the Python Kubernetes client, Slack’s SDK, and Google’s Pub/Sub. Wrote a Python Cloud Function to send notifications to Slack when an auto-upgrade for GKE node pools happens. This work was for an established logging and observability company as part of Google PSO.
  • Wrote production-ready Terraform modules for a healthcare company that needed to import its existing infrastructure into Terraform. The project was part of Google PSO.
  • Wrote a Flask app that essentially provided the functionality of Terraform Enterprise for project and network creation. My contribution was the use of the Python client library for Cloud Build to deploy Terraform code. The work was for a FAANG company through Google PSO.
  • Created Terraform modules and infrastructure code for the Google Cloud IAP connector (Identity-Aware Proxy connector) for Airbnb on behalf of Google PSO. Modules included shared VPC, IAM, and Kubernetes cluster. The code was included in the open source Terraform modules for Google Cloud Platform that Google maintains.
  • Created Terraform modules and infrastructure code for customers migrating from AWS to Google Cloud.
  • Defined and implemented how Cloud Start workshops were done. These were design and discovery workshops done with clients. Wrote a delivery guide so we could scale how SADA did these types of engagements.

Omniex Holdings, Inc.

DevOps Engineer

Feb 2018 - Feb 2019 · 1 yr · Greater Los Angeles Area

  • The company was purchased by Gemini in 2022.
  • Made architectural decisions regarding techniques and tools such as Okta, Artifactory, Travis, Lucidchart, Elastic Stack, Bitwarden, and Terraform. Introduced the practices of writing postmortems and runbooks and of working in an agile way.
  • Mentored software engineers to get our OMS (a front end and middleware application for trading crypto) into Terraform. Originally this was manually created in AWS using RDS, Elastic Beanstalk, and aws-cli.
  • Created RPM packages as needed for C++ engineers, which reduced toil for the software engineers.
  • Created CI for EMS, a C++ project, with Docker and Travis CI.
  • Automated the build of a bare-metal server with Bash scripts.
  • Instituted monitoring and alerts with Elasticsearch, Kibana, Fluentd, Watch and Metricbeat using Ansible for the IaC.
  • Set up a simple internal DNS server using Dnsmasq.
  • Updated the Palo Alto (a firewall) configuration through the GUI to do things such as add VPN users and update IP whitelists.
Terraform · Python · DevOps · Amazon Web Services (AWS) · Google Cloud Platform (GCP) · Continuous Integration and Continuous Delivery (CI/CD) · +10

Beachbody

2 roles

DevOps Engineer

Jun 2016 - Feb 2018 · 1 yr 8 mos · Santa Monica, CA

  • Increased the team’s velocity by mentoring the team to adopt code standards, automated tests, isolated development environments, code reviews, trunk-based development, continuous integration, code reuse instead of boilerplate code, documentation first, and other clean code and pragmatic programming practices.
  • Used Bash, Python, and Puppet to automate creation of Oracle’s IDMs stack, which took environment creation from one month to two hours.
  • Used Terraform to migrate Atlassian applications (Jira and Confluence) to AWS, which decreased maintenance costs.
Terraform · DevOps · Amazon Web Services (AWS) · Continuous Integration and Continuous Delivery (CI/CD) · Elasticsearch · Kibana · +9

Software Engineer

Mar 2014 - Jun 2016 · 2 yrs 3 mos · Santa Monica, CA

  • Created WordPress themes and plugins for the company’s blog and worked with business stakeholders on requirements. This enabled the company to provide content to 11 million monthly users and turned the project into a prestige project within the company.
  • Created WordPress themes and plugins for the business stakeholders who ran Beachbody Summit, the annual event that attracted about 20,000 Beachbody MLM participants. This enabled the business to make its own changes to the site when needed. It also reduced the time to create a site each year from months to a few weeks.
  • Migrated two LAMP stacks from on-premises to AWS ECS and RDS, which made the applications more reliable and reduced incidents from weekly to none.
  • Migrated a legacy project to use OOP PHP, CI, the Composer package manager, autoloading, build scripts, Symfony components, unit testing, code linting, functional testing, and Vagrant and then docker-compose for LAMP and LEMP stacks. This improved code quality and reduced the time to create new features.
  • Delivery lead for the migration from multiple platforms to a single platform for Beachbody LIVE, which improved the ability of the business stakeholders to grow the business. I worked with multiple teams across the organization to coordinate the effort. I led hiring, growing the team from just me to 5 engineers and 3 QA. This allowed Beachbody LIVE, an important strategic business unit, to expand because its staff could update the site themselves.
Terraform · DevOps · Amazon Web Services (AWS) · Continuous Integration and Continuous Delivery (CI/CD) · Interpersonal Skills · Python · +6

Self-employed

Freelancer

Jan 2012 - Jan 2014 · 2 yrs · Greater Los Angeles Area

  • Web developer/programmer
  • Customized themes for WordPress, Drupal, and Magento
Continuous Integration and Continuous Delivery (CI/CD) · Interpersonal Skills · PHP · LAMP · Nginx · JavaScript · +7

Education

University of Pittsburgh

Bachelor of Science (BS) — Mathematics

Jan 2004 - Jan 2005
