Seth Black

SRE (Site Reliability Engineer)

Milton, Florida, United States9 yrs experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Built innovative automation solutions for incident management.
  • Recognized as Mentor of the Year for internship program.
  • Expert in transforming operational pain into efficient systems.
Stackforce AI infers this person is a Site Reliability Engineer with expertise in SaaS and FinOps.

Contact

Skills

Core Skills

Site Reliability EngineeringAutomationIncident ManagementFinopsObservabilityApplication Performance MonitoringClinical Laboratory ManagementMedical Laboratory Science

Other Skills

ScriptingNew RelicAWSPagerDutyCMDBLambdaEventBridgeGraphQLNew Relic ProgrammabilityJavaScriptLaboratory MedicineEmployee TrainingClinical MicrobiologyHematologyWeb Development

About

👋 Hey, I’m Seth, a Site Reliability Engineer who likes solving the problems everyone else quietly assumes are impossible. Most of my time is spent at the intersection of incidents, observability, automation, and FinOps: taking noisy, manual operational pain and turning it into event-driven systems that quietly do the right thing in the background. Recent work I’m proud of: Built a PagerDuty compliance automation that continuously audits teams, escalation policies, and schedules against our standards, flags gaps, and keeps on-call data clean so pages actually reach the right person. Reverse-engineered New Relic billing and turned opaque, month-behind invoices into near real-time usage and cost visibility for finance, with breakdowns by ingest type, account, and application. Designed a New Relic ↔ CMDB tagging pipeline that pulls entities from New Relic, checks CMDB for the source-of-truth tags, and automatically updates tags in New Relic to kill drift and make dashboards/alerts reflect reality. Implemented multiple serverless workflows on AWS (Lambda + EventBridge) to handle recurring SRE tasks, audits, reports, config checks, data sync, cutting manual toil and making our operational hygiene repeatable. Every year I mentor interns, pairing them with real SRE/automation projects and coaching them on incident response, observability, and shipping production-grade work. My toolbox includes New Relic, PagerDuty, ThousandEyes, AWS (Lambda, EventBridge, EC2, etc.), Bash, PowerShell, Python, and React/TypeScript. I care about: Clear, calm incident response and ownership on the bridge Strong observability and tagging hygiene so we’re not flying blind Pragmatic FinOps, where engineers and finance see the same reality Coaching and collaboration, because reliability is a team sport Off-call, I’m usually hanging out with my dogs, yelling at the TV during Chiefs games, and unapologetically being a Swiftie. If you’re working on tough reliability, observability, or cost-visibility problems, especially ones people keep calling “impossible”, I’m always up for comparing notes.

Experience

9 yrs
Total Experience
3 yrs 6 mos
Average Tenure
5 yrs 1 mo
Current Experience

Upwork

Freelance Site Reliability Engineer (Upwork)

Nov 2025 – Present · 6 mos · Florida, United States · Remote

  • I take on freelance work through Upwork, focusing on incidents, on-call design, automation, AI services, and FinOps. Typical engagements include cleaning up PagerDuty setups, designing observability and tagging strategies in New Relic, and building small AWS Lambda + EventBridge automations to handle recurring SRE tasks like audits, configuration checks, usage snapshots, and data sync. I also help teams make sense of opaque vendor billing (like New Relic) by turning it into clear, near-real-time internal cost visibility per app or team.

Cdk global

2 roles

Site Reliability Engineer

Promoted

Nov 2022 – Present · 3 yrs 6 mos · Remote

  • As a Site Reliability Engineer at CDK Global, I focus on incidents, observability, automation, and FinOps, with a bias toward solving the problems everyone else quietly assumes are impossible. I led a month-long server rebuild war room during a critical cyber incident, coordinating cross-team efforts to rebuild and secure a large fleet of servers while restoring key services safely and with minimal downtime.
  • I spend a lot of time turning manual operational pain into event-driven automation. I built a PagerDuty compliance system that continuously audits teams, escalation policies, and schedules against our on-call standards, flags gaps, and keeps contact data and routing clean so pages actually reach the right person. I also reverse-engineered New Relic’s billing model and built automations to turn opaque, month-behind vendor invoices into near real-time usage and cost visibility for finance, with detailed breakdowns by ingest type, account, and application.
  • To keep observability accurate, I designed a New Relic ↔ CMDB tagging pipeline that pulls entities from New Relic, compares them to CMDB records, and automatically updates tags in New Relic to match the CMDB source of truth. Alongside that, I maintain a portfolio of AWS Lambda + EventBridge workflows and scripting (Bash, PowerShell, Python) that handle recurring SRE tasks such as audits, configuration checks, data sync, backup management, and configuration backup/restore, significantly reducing manual toil and standardizing our operational hygiene. I also drove a collaborative application discovery and knowledge base effort so critical application information, ownership, and dependencies are documented and discoverable during incidents instead of being tribal knowledge. Every year I mentor interns on real SRE and automation projects, and in 2023 I was recognized as Mentor of the Year for the internship program.
ScriptingNew RelicAWSIncident ManagementAutomationSite Reliability Engineering

Application Service Engineer

May 2022 – Dec 2022 · 7 mos · Remote

  • During my tenure as an Application Service Engineer within the Site Reliability Engineering (SRE) organization at CDK Global, I've had the privilege of spearheading initiatives that have significantly contributed to the stability and efficiency of our critical applications. My role has been diverse, requiring a blend of technical proficiency, inventive problem-solving, and a collaborative mindset.
  • One notable accomplishment was the creation of a customized NewRelic JavaScript application tailored specifically to our organization's unique needs. This tool has proven instrumental in our ability to monitor, analyze, and optimize application performance in real-time, with a focus on customer satisfaction.
  • Additionally, I took a lead role in developing a comprehensive process for collaborative application discovery. This process encompasses the creation and maintenance of a knowledge base, serving as a central repository for vital application-related information. By fostering seamless collaboration across cross-functional teams, this approach has considerably improved our ability to identify, address, and enhance application performance and reliability.
  • Integral to my responsibilities has been the successful operationalization of these processes. By seamlessly integrating them into our daily workflows, I've ensured that they become an intrinsic part of our application management strategy. This operationalization has not only enhanced our team's efficiency but also significantly reduced downtime and potential disruptions.
  • Throughout my journey as an Application Service Engineer, I've demonstrated a knack for effective communication and problem-solving. Working closely with colleagues from diverse backgrounds, I've consistently conveyed technical concepts in a clear and understandable manner, fostering collaboration and synergy.
GraphQLNew Relic ProgrammabilityApplication Performance Monitoring

Sre project

SRE | Co-Founder

Apr 2021 – Present · 5 yrs 1 mo · Remote

Santa rosa medical center

2 roles

Clinical Laboratory Manager

Promoted

Mar 2020 – Apr 2022 · 2 yrs 1 mo · Milton, Florida, United States

  • Managed all aspects of a hospital clinical laboratory during the COVID-19 pandemic, leading a multidisciplinary team through high-volume, high-stakes operations. I was responsible for workflow design, staffing, quality control, regulatory compliance, and budget management. I focused on improving turnaround times, maintaining accreditation during inspections, and expanding testing capacity while keeping quality and patient safety front and center. The experience cemented my bias toward clear processes, calm under pressure, and data-driven decision making, skills I now apply to SRE and incident management.
Laboratory MedicineEmployee TrainingClinical Laboratory Management

Medical Laboratory Scientist

May 2017 – Mar 2020 · 2 yrs 10 mos · Milton, Florida, United States

  • Worked as a Medical Laboratory Scientist across multiple hospitals, performing a wide range of diagnostic testing in high-stakes environments. I owned specimen processing, analysis, instrument maintenance, and documentation while adhering to CLIA, HIPAA, OSHA, and hospital policies. The role required precision, reliability, and collaboration with physicians and nurses to support patient care, giving me a strong foundation in operational discipline and working calmly under pressure.
Clinical MicrobiologyHematologyMedical Laboratory Science

Education

Missouri State University

Bachelor of Science - BS — Clinical Laboratory Science/Medical Technology/Technologist

Jan 2015 – Present

Stackforce found 100+ more professionals with Site Reliability Engineering & Automation

Explore similar profiles based on matching skills and experience