Andrew (Zhaoxing) Li

Software Engineer

San Francisco, California, United States10 yrs 2 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Expert in Cloud Computing and Big Data solutions.
  • Led development of innovative data governance features.
  • Finalist in Cisco's Global Innovation Challenge.
Stackforce AI infers this person is a SaaS and Cybersecurity expert with strong skills in Cloud Computing and Big Data.

Contact

Skills

Core Skills

Cloud ComputingSoftware DevelopmentBig DataData GovernanceBlockchain

Other Skills

JavaPythonData MiningSDNOpenFlowMachine LearningBig Data AnalyticsCLinuxRESTful WebServicesDockerC++ScalaDatabasesNetwork Function Virtualization

About

Working on Unity Catalog (https://www.unitycatalog.io/) - we bring access control, data discovery, lineage and sharing into the data lake world and beyond. My demo of Lakehouse Federation at Data + AI summit 2024: https://www.databricks.com/dataaisummit/session/lakehouse-federation-discover-query-and-govern-any-data-unity-catalog

Experience

10 yrs 2 mos
Total Experience
3 yrs 4 mos
Average Tenure
5 yrs 11 mos
Current Experience

Databricks

5 roles

Staff Software Engineer (Unity Catalog)

Promoted

Feb 2025Present · 1 yr 2 mos

  • Make Unity Catalog feature rich and simple to use.
JavaPythonCloud ComputingSoftware Development

Senior Software Engineer (Unity Catalog)

Aug 2023Feb 2025 · 1 yr 6 mos

  • Unity Catalog (https://www.unitycatalog.io/).
  • Hive Metastore federation (Dec 2024 launch!) https://www.databricks.com/blog/announcing-public-preview-hive-metastore-and-aws-glue-federation-unity-catalog
  • Lakehouse Federation Blogs:
  • https://www.databricks.com/blog/announcing-general-availability-lakehouse-federation
  • https://www.databricks.com/blog/introducing-lakehouse-federation-capabilities-unity-catalog
  • Data + AI summit demo: https://www.databricks.com/dataaisummit/session/lakehouse-federation-discover-query-and-govern-any-data-unity-catalog
  • Lakeguard: Making shared access to Spark environment efficient, secure, and feature rich: https://www.databricks.com/blog/whats-new-shared-clusters-unity-catalog
JavaPythonCloud ComputingBig DataData Mining

Software Engineer (Unity Catalog)

Jun 2022Jul 2023 · 1 yr 1 mo

  • Working on Data Governance - we strive to bring permission control, table/view/ML dataset management, data discovery, lineage and sharing into the unstructured data lake world (S3/Azure Blob/Google Cloud Storage) and beyond (Data Warehouses and Relational Databases). Checkout Matei's announcement on Unity Catalog and Delta Sharing: https://www.youtube.com/watch?v=aRMfxPZxnfc.
  • For the last 6 months, I have been working on this amazing new feature called Lakehouse Federation (checkout our blog post): https://www.databricks.com/blog/introducing-lakehouse-federation-capabilities-unity-catalog
  • Before that, I had been working on managed/external file system support for Unity Catalog GA! Simon has a good video summarizing what I worked on: https://www.youtube.com/watch?v=aAswROA1bM8
JavaPythonCloud ComputingData Governance

Software Engineer (Databricks Compute)

Mar 2020May 2022 · 2 yrs 2 mos

  • Databricks Compute Fabrics: We work tirelessly so that more than 10 million of Virtual Machines are happily crunching exabytes of data every day. TL;DR: we host spark, and make sure it works out of the box :). Checkout Matei's keynote on our large scale cloud service:
  • https://www.slideshare.net/matei/scaling-databricks-to-run-data-and-ml-workloads-on-millions-of-vms?next_slideshow=239227656
  • Over the two years I've mainly worked on three projects:
  • 1. Secure Cluster Connectivity https://docs.databricks.com/security/secure-cluster-connectivity.html that allows customers to trust connecting their cloud compute to Databricks servers so we can process data in their account. This is the project that taught me how to build a reliable and secure network service.
  • 2. Cluster scoped library management https://docs.databricks.com/libraries/cluster-libraries.html. This is the first service I've owned end to end - we built a new microservice from scratch and make sure it's robust.
  • 3. Cluster management https://docs.databricks.com/clusters/index.html
  • I've mainly worked on instance acquirement and container environment setup, but also many other components in the general cluster creation & acquirement workflow. This experience taught me enormous system/OS knowledge and how to build robust cloud compute services.
  • Other projects I've served:
  • Enhanced Security Monitoring that provides realtime audit trail of activities on your Spark VMs to detect potential security anomalies.
  • https://docs.databricks.com/administration-guide/cloud-configurations/aws/enhanced-security-mode.html
  • Databricks Fedramp Moderate Certification: https://marketplace.fedramp.gov/#!/product/databricks?sort=productName&productNameSearch=databricks
  • Design of the dataplane -> controlplane mTLS framework that allows API interfaces private to Databricks compute to become possible.
JavaPythonCloud ComputingBig Data

Software Engineer Intern (Databricks Billing and Usage Infra)

May 2019Aug 2019 · 3 mos · San Francisco, California, United States

  • Working on the Growth/Billing and Usage Infra Team as an SDE intern, building Spark pipelines and platform services for Databricks' Billing and Usage Infrastructure, which enables customers with better visibility & analysis towards their Databricks subscription.
JavaPython

University of illinois at urbana-champaign

Teaching Assistant

Oct 2018Mar 2020 · 1 yr 5 mos · Urbana-Champaign Area

  • Spring & Fall 2019
  • CS498 Internet of Things
  • Prof Matthew Caesar
  • Course developer and lead TA for the first run of the class.
  • http://iot.cs.illinois.edu/
  • Independently developed the Car IoT lab, which allows students to build a toy car with object detection capabilities and a CAN & Ethernet based sensor network leveraging Arduino and Raspberry Pi
  • Fall2018
  • CS436 Computer Network Lab
  • Prof Matthew Caesar
  • http://caesar.cs.illinois.edu/courses/cs436.f18/index.html
JavaPythonCloud Computing

Cisco

2 roles

Software Engineer (Chief Technology and Architecture Office)

Sep 2016Aug 2018 · 1 yr 11 mos · Shanghai, China

  • Distributed Digital Forensics:
  • Designed a distributed digital forensics system capable of aggregating, correlating and visualizing digital network artifacts collected from multiple sources. Combined BlockChain hashing and Zero-Knowledge Proofs to form the Chain of Custody required by Digital Forensics. Pitched in front of Cisco SVPs during the live final of Cisco’s Global Innovation Everywhere Challenge as a finalist (6/556).
  • Distributed Service Mesh Audition:
  • Participated in developing a BlockChain based Service Mesh audition service, leveraging Envoy proxy to track and store application flow traces into a distributed ledger. Submitted patent draft based on the design which has been approved as a US patent.
  • Flow based Congestion Detection:
  • Combined application flows sampled from Cisco’s Vector Packet ProcessingEngine with BGP analytics provided by the open-source project SNAS.io, achieving real-time congestion detection in WAN network based on flow triangulation, using 75% fewer data points compared to NetFlow based approach.
  • Secured DNS: Developed Secured DNS and participated in developing the In-Vehicle Network (IVN) controller for Cisco Connected Cars solution. Work was demoed at BMW Group IT Fair 2017 and earned noticeable attention
  • Patent (Granted 2021-10-05):
  • Layer 7 Proxy for Immutable Application Audit Trails
  • Finalist in Cisco’s worldwide Innovation Everywhere Challenge (6/556)
JavaPythonCloud ComputingBlockchain

Cisco International Internship Program

Aug 2015Jul 2016 · 11 mos · San Jose, California, USA

  • Various SDN apps based on OpenDaylight platform:
  • VPP-ethernetVPN in fd.io, 2016-4 – 2016-7
  • Using vxlan tunnels to connect bridge domains of vpps, thus forming virtual bridge domains. After that, a bgp-evpn control plane is used to propagate MAC addresses between them.
  • PCE-OF controller, 2015-10 – 2016-4
  • PoC project for a famous US Service Provider's SDN-WAN workshop. It leverages the Opendaylight platform to create a PCE module based on OpenFlow Protocol. The project provides a powerful policy definition language which could be used to find and manage paths inside a pure OpenFlow network. I was responsible for the architecture and implementation of the module.
  • Open SDN Controller, 2015-8 – 2015.9
  • Built a full set of northbound(Rest) API for OpenDaylight application inside OSC (A Cisco Version of ODL), which is used to do CRUD operations on network elements and policies.
  • Callisto the Virtual Router, 2015-4 – 2015-7
  • This project leverages the OpenDaylight platform, enabling Cisco IOS-XRv to talk NETCONF with forwarding boxes. The Callisto controller acts like a virtual router on top of all the boxes, controlling and managing underlay interfaces.
SDNOpenFlow

Education

University of Illinois Urbana-Champaign

Master's degree — Computer Science

Jan 2018Jan 2019

Beijing University of Posts and Telecommunications

Bachelor's degree — Software Engineering

Jan 2012Jan 2016

Shenyang No.2 High School

High School Diploma

Jan 2009Jan 2012

Stackforce found 100+ more professionals with Cloud Computing & Software Development

Explore similar profiles based on matching skills and experience