Kolton Andrus

CEO

Denver, United States20 yrs 11 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Pioneered the world's first Reliability Management product.
  • Expert in fault tolerance and resilience engineering.
  • Proven track record in leading successful tech startups.
Stackforce AI infers this person is a SaaS and E-commerce expert with a focus on reliability and performance engineering.

Contact

Skills

Other Skills

Distributed SystemsJavaAgile MethodologiesHibernateJavaScriptTomcatLinuxXMLSoftware DevelopmentC++EclipseSpringSQLPerlAJAX

About

CEO and Founder of Gremlin, the world's first Reliability Management product helping companies avoid outages and build more resilient systems.

Experience

20 yrs 11 mos
Total Experience
4 yrs 2 mos
Average Tenure
10 yrs 4 mos
Current Experience

Gremlin

3 roles

Chief Executive Officer

Promoted

Jun 2025Present · 11 mos

CTO and Founder

Feb 2022Jun 2025 · 3 yrs 4 mos

CEO and Founder

Jan 2016Feb 2022 · 6 yrs 1 mo

  • Built the first iterations of the product by hand
  • Led the sales operation and closed initial deals
  • Developed the branding concepts and initial product-market fit
  • Raised funding from Amplify, Index, and Redpoint

Netflix

Senior Software Engineer

Sep 2013Jan 2016 · 2 yrs 4 mos · San Francisco Bay Area

  • Hyper-focused on fault tolerance, resilience, and performance
  • Managed the resolution of large scale production incidents
  • Maintained the existing Netflix API and built the next generation API
  • Designed 'FIT' -- Netflix's failure injection service testing

Amazon

2 roles

Software Development Manager

May 2012Aug 2013 · 1 yr 3 mos · Greater Seattle Area

  • Optimized the Retail Website and built tools to identify regressions
  • Managed a team of engineers and analyzed potential improvements
  • Delivered several optimizations which reduced wait time for millions of customers

Software Development Engineer

Jul 2009May 2012 · 2 yrs 10 mos · Greater Seattle Area

  • Senior engineer on the Retail Website Availability and Latency teams
  • Responsible for the reliability and performance of all Amazon websites
  • Designed and implemented Amazon's Failure Injection Service
  • Managed the resolution of Retail Website incidents
  • Investigated and engineered solutions for complex failure modes
  • Worked across teams and systems to improve the customer experience

Mindshare technologies

Software Engineer

May 2007Jul 2009 · 2 yrs 2 mos · Greater Salt Lake City Area

  • Responsible for hundreds of thousands of reports nightly
  • Wrote a system to analyze a customer’s experience in real time
  • Implemented custom reporting for the McDonald’s pilot
  • Provided a consistent user experience across web and PDF views

Meridias capital

Software Engineer

Jun 2005May 2007 · 1 yr 11 mos

  • Created a compliance rules engine for validating processed loans
  • Migrated the mortgage processing system from Java Swing to a web interface
  • Automated flood certification and real-time interest rate collection
  • Designed and implemented commission calculation logic

Education

University of Utah

MS — Computer Science

Jan 2005Jan 2007

University of Utah

BS — Computer Science

Jan 2001Jan 2005

Stackforce found 100+ more professionals with Distributed Systems & Java

Explore similar profiles based on matching skills and experience