Kenny Sheridan — CEO
After eight years as a Meteorologist in the U.S. Marine Corps, I moved into building reliable infrastructure systems for high-performance compute environments. My work centers on designing and implementing software systems that manage GPU-backed workloads across their full lifecycle, not on manual operations or ad-hoc administration.I design and build vendor-agnostic platforms spanning bare metal, virtual machines, and Kubernetes, with an emphasis on predictable performance and clear system behavior. This includes system onboarding, design, implementation, and validation for NVIDIA Hopper/Blackwell & AMD Instinct environments. A large part of my work involves co-designing management planes and host-level agents for asset discovery, inventorying, topology awareness, and cluster profiling. These systems encode infrastructure knowledge directly into software, allowing behavior to be reasoned about, tested, and automated rather than manually managed. I also build orchestration, deployment, and validation frameworks for GPU fleets, alongside distributed performance testing systems that measure networking, storage throughput and latency, and collective GPU operations (NCCL/RCCL/MPI). Then correlate infrastructure behavior directly to model training and inference performance, making regressions visible at the workload level. In addition, I build cloud-native developer tooling like CLIs and libraries that allow engineers to provision, inspect, validate, and extend infrastructure programmatically. The tools are designed to integrate with cloud-native workflows, emphasizing explicit state, repeatability, and testability. Alongside internal systems, I build and maintain open-source Rust libraries (crates) that my company relies on for vendor-agnostic infrastructure automation. These crates serve as core building blocks across provisioning, validation, performance testing, and developer tooling.I also spend a fair amount of time on-call, troubleshooting real production issues in code, debugging distributed behavior, performance regressions, and failure modes across compute, networking, databases, and storage. That feedback loop directly informs system designs and hardeningMost infrastructure software is written in rust paired with Nix/NixOS for reproducible builds and controlled rollout. Related work includes RDMA (RoCE), TCP/IP, gRPC, and hardware-aware system design.The goal is to treat infrastructure and software as explicit, measurable, and maintainable systems that scale cleanly across baremetal, VMs, and Kubernetes
Stackforce AI infers this person is a high-performance computing and AI infrastructure expert with a strong background in Rust and cloud technologies.
Location: Seattle, WA, United States
Experience: 18 yrs
Skills
- Infrastructure Engineering
- Ai Infrastructure
- Supercomputing Engineering
- Performance Optimization
- Ai/ml Infrastructure Design
- Vendor-agnostic Solutions
- Performance Automation
- Cloud Infrastructure
- Hardware Testing
- Infrastructure Management
- System Administration
- It Support
- Meteorology
- Instruction
Career Highlights
- Expert in building high-performance AI infrastructure.
- Proficient in Rust for scalable compute solutions.
- Former Marine with strong leadership and training skills.
Work Experience
Andromeda
Member of Technical Staff - Infrastructure Product (0 mo)
San Francisco Compute Company
Supercomputing Engineer (1 yr 7 mos)
TensorWave
Senior AI & HPC Infrastructure Engineer (2 mos)
ServiceNow
Senior Hardware & Software Performance Automation Engineer (1 yr 9 mos)
Senior Hardware Test Engineer (2 yrs 1 mo)
Hardware Test Engineer (2 yrs 9 mos)
NexLevel Information Technology
System Administrator (1 yr 9 mos)
United States Marine Corps
Staff Meteorologist (3 yrs 6 mos)
Senior Meteorologist (8 yrs 1 mo)
Education
Atmospheric Sciences and Meteorology at Community College of the Air Force