Zhenzhong Xu

CTO

Mountain View, California, United States20 yrs 7 mos experience

Key Highlights

  • Expert in MLOps and DataOps systems.
  • Led teams to build scalable data infrastructure.
  • Proven track record in AI-driven business acceleration.
Stackforce AI infers this person is a SaaS and Data Infrastructure expert with a strong focus on MLOps and real-time data processing.

Contact

Skills

Core Skills

Mlops

Other Skills

Distributed SystemsSoftware DesignCloud ComputingData StreamingJavaScalaMultithreadingSoftware EngineeringApache FlinkBig DataScalabilityBusiness IntelligenceDatabasesAmazon Web Services (AWS)Windows Azure

About

Hi, I am a software builder, coach, and motorcycle racer. I am currently working on bridging ML and Data infrastructure. Expertise & Interests: - MLOps and DataOps systems - Scale software & human systems - Real-time Data Infrastructure, Stream Processing - Large scale/fault-tolerant distributed systems - Concurrent transactional systems - Use AI to raise human intelligence

Experience

20 yrs 7 mos
Total Experience
2 yrs 11 mos
Average Tenure
1 yr 2 mos
Current Experience

Meta

Engineering Leader

Mar 2025Present · 1 yr 2 mos · San Francisco Bay Area · On-site

  • Business Acceleration Foundation: AI assisted business workflow to accelerate Ads revenue generation.
  • ML Governance: reliability governance and revenue acceleration in Monetization org by developing governance frameworks, risk scoring models, and cross-functional collaboration strategies that optimize changes flow, driving revenue growth while ensuring system stability.

Kite ai

Consulting Advisor

Jan 2025Jan 2026 · 1 yr · San Francisco Bay Area

  • Accelerating an AI ecosystem for everyone. Help KiteAI with Management and AI/Data infrastructure advisory.

Voltron data

VP of Engineering

Jan 2024Oct 2024 · 9 mos · Remote

  • Bridging Languages, Hardware, and People.
  • I led Ibis OSS Engineering team and Streaming Data Movement Engine team for a more modular and composable data AI/Analytics ecosystem.
MLOps

Sylphai

Advisor

Jun 2023Nov 2023 · 5 mos · San Francisco Bay Area · Hybrid

  • Startup business advisor for execution, team building, PMF, etc.
MLOps

Claypot ai

Co-Founder & CTO

Sep 2021Jan 2024 · 2 yrs 4 mos · San Francisco Bay Area · Hybrid

  • (Acquired by Voltron Data in Jan 2024)
  • At Claypot, we built MLOps products, including real-time model performance monitoring and a real-time feature engineering platform.
  • My role as CTO of the early-stage startup includes product strategy, product engineering architecture and execution, product market validation, solution engineering, team leadership and partnership strategy.
  • https://www.forbes.com/sites/adrianbridgwater/2024/01/25/voltron-claypot-deal-cooks-up-faster-ai-engine-power/
MLOps

Netflix

3 roles

Engineering Manager - Stream Processing Platform

Jan 2021Nov 2021 · 10 mos

  • I led Stream Processing Platform Team. We manage a fully managed Apache Flink Analytical Stream Processing platform and Mantis (Netflix OSS) Operational Stream processing platform to provide high leverage real-time data solutions for all organizations in Netflix.
  • Grew an exceptionally strong 10-person team.
  • Expanded use case coverage to analytical, operational, and ML across all Eng organizations in Netflix.
  • Stabilized operations and re-focused the team with a cohesive vision between Flink and Mantis products.
  • Netflix is a data-driven company, handling trillions of events per day to answer many application and business related questions. At the center of providing scalable solutions to these challenges is the Netflix Real Time Data Infrastructure team.
MLOps

Engineering Manager - Flink Platform

Promoted

Oct 2019Jan 2021 · 1 yr 3 mos

  • I led Flink Stream Processing Team (within Real-time Data Infrastructure).
  • Scaled Flink platform to more than 20T processed events per day.
  • Scaled to thousands of stateless use cases and dozens of high-value stateful use cases across all engineering organizations in Netflix, covering analytical and ML workloads.
  • Grew an exceptionally strong 6-person team.

Senior Software Engineer

Apr 2015Sep 2019 · 4 yrs 5 mos

  • Built Netflix Real-time Data Pipeline Infrastructure that processes ~1+ trillion events a day, and 10 million per second during peak.
  • Built Stream Processing as a Service platform to unlock data value in real-time.

Microsoft (windows azure fabric controller)

2 roles

Software Development Engineer

Oct 2012Apr 2015 · 2 yrs 6 mos · Redmond, WA

  • Worked on Windows Azure Fabric Controller - kernel of Microsoft cloud platform.
  • Tech lead for below major areas:
  • Responsible for hyper scale cloud infrastructure management service.
  • Fault tolerating system design. Ensure tenant availability SLA over 99.95%.
  • Tenant service recovery/healing systems.
  • Scale up/out large system design and planning.
  • Layered microservice architectural redesign for new generation of Fabric Controller system.
  • Managed over 500,000 servers and growing in production.
  • etc.

Software Development Engineer

Dec 2007Oct 2012 · 4 yrs 10 mos · Redmond, WA

  • From the ground up, designed, lead, implemented, shipped and maintained two generations of highly scalable, reliable, distributed Video Workflow processing system etc. The WF system is one of the most vital parts of the entire MSN Video backend infrastructure, allowed our team to move quickly on implementing new features and move onto different devices; Managed to maintain 97%+ SLA for all partners;
  • Built and maintained backend content delivery infrastructure;
  • Built a flighting (experimentation) solution, built mathematical models on data driven feature and was proven to be successful.
  • Designed, implemented numerous data warehouse and analytic cube systems to support both BI and production troubleshooting.

Kmg software, inc

AI Architect

Jan 2006Dec 2007 · 1 yr 11 mos · Madison, Wisconsin Area

Nrg mobile

Software Engineer

May 2005Dec 2005 · 7 mos · Portland, Oregon Area

  • Custom VoIP technology with Microsoft technology ecosystem.

Education

Stanford University Graduate School of Business

LEAD Certificate: Personal Leadership

Jan 2019Jan 2020

Portland State University

BS — Computer Science

Jan 2001Jan 2005

Stackforce found 100+ more professionals with Mlops

Explore similar profiles based on matching skills and experience