Xuhui Zhan

AI Researcher

Nashville, Tennessee, United States3 yrs 5 mos experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Founding member of a machine learning team.
  • Expertise in Large Language Models and Computer Vision.
  • Significant cost reductions through innovative AI solutions.
Stackforce AI infers this person is a skilled AI and Machine Learning engineer with a focus on Computer Vision and Large Language Models.

Contact

Skills

Core Skills

Machine LearningComputer VisionLarge Language Models (llm)Graph Neural NetworkData ScienceUniversity LecturingDeep Learning

Other Skills

Recommendation systemSoftware DevelopmentAWSGoTerraformKubernetesAirflowGitHub ActionsIndustrial CamerasMultimodalityBig Data ScalingOptical Character Recognition (OCR)PyGPyTorchFine-tuning

About

TL;DR: Engineer and researcher, specialized in Computer Vision, Large Language Model and Multimodal Model development and deployment.

Experience

3 yrs 5 mos
Total Experience
11 mos
Average Tenure
1 yr
Current Experience

Treverse llc.

Applied Scientist

Sep 2025Present · 8 mos · Nashville Metropolitan Area · Hybrid

  • TL;DR: Founding member of the machine learning team, building production ML systems and shared AI infrastructure from the ground up, with a focus on scalable and cost-efficient deployment of modern AI methods.
  • Key initiatives:
  • Multilingual translation service: Built a Go-based microservice deployed on AWS that performs LLM-powered translation and evaluation of Spanish/English item metadata used in production workflows.
  • Unified AI platform: Developing a centralized AI/ML platform on AWS using Terraform, Kubernetes, Airflow, Argo CD, and GitHub Actions to support standardized model training, CI/CD pipelines, and monitored deployments across ML projects, A/B testing.
  • Vision-based presort system: Building an edge-first computer vision pipeline for high-speed barcode detection and decoding of irregularly shaped items, integrated with industrial cameras and conveyor hardware.
  • Recommendation systems: Designing marketplace recommendation models that generate personalized item suggestions and improve engagement for hundreds of thousands of users.
Recommendation systemSoftware DevelopmentLarge Language Models (LLM)Computer VisionMachine Learning

Stealth ai startup

Founding Engineer and Researcher

Jun 2025Sep 2025 · 3 mos

Large Language Models (LLM)Optical Character Recognition (OCR)

Ieee transactions on big data

Reviewer

May 2025Present · 1 yr · Remote

Computer VisionMultimodalityLarge Language Models (LLM)Big Data Scaling

Vanderbilt university

3 roles

Graduate Research Assistant

Jun 2024May 2025 · 11 mos · Nashville, Tennessee, United States · Hybrid

  • Advisor: Prof. Tyler Derr (Network and Data Science Lab)
  • TL;DR: Worked on various research projects across multimodal models, foundation models, graph neural networks and recommender systems, aimed at developing an innovative unified fusion framework to potentially replace LLaVA (and other soft prompt/token based multimodal frameworks), achieving improved performance and efficiency when integrating multiple modalities.
  • Proposed a unified fusion framework that seamlessly integrates vision, graph, and language modalities.
  • Vision–language: adapted fusion framework on the LLaVA code‑base, eliminated the alignment pre‑training stage; relied on fine‑tuning alone cuts total training time ≈ 25 % and halves data needs while matching SOTA performance on nine public benchmarks. (See the project page for Inverse-LLaVA at https://inverse-llava.github.io)
  • Graph–text: built an advanced TGAT‑LLM pipeline leveraging proposed fusion framework to generate rich textual attributes for social‑network data; full experimental runs on Venmo and Amazon Reviews are in progress, with encouraging preliminary results.
Recommendation systemLarge Language Models (LLM)Graph Neural NetworkPyGPyTorch

Lead AI & Data Science Researcher

Promoted

Oct 2023May 2025 · 1 yr 7 mos · Nashville, Tennessee, United States · Hybrid

  • Advisor: Prof. Ray Friedman (AI Negotiation Lab) | Paid Position
  • TL;DR: Prompt engineering, Retrieval-Augmented Generation (RAG), Large Language Model (LLM) research and applications—including fine-tuning, quantization, and curated data pipelines—as well as interdisciplinary collaboration and web development.
  • Drive end‑to‑end LLM research for negotiation related applications—designing, experimenting, and optimising models.
  • Auto‑coding project: cut per‑transcript cost from $5 k to $3 (>99 % savings) through an LLM pipeline refined via active‑learning loops.
  • AI subject‑pool project: developed a novel algorithm that injects population‑level variation into LLM agents, creating realistic, diverse negotiator profiles.
  • Collaborate closely with negotiation faculty and data‑science peers; lead model ops and mentor junior coders.
  • Work presented at the 2025 AI Negotiation Summit (Harvard/MIT) and the IACMR annual meeting.
Python (Programming Language)Large Language Models (LLM)Machine LearningProject Management

Professional Research Assistant

Oct 2023Jan 2025 · 1 yr 3 mos · Nashville, Tennessee, United States · Hybrid

  • Advisor: Prof. Markus Eberl | Paid Position
  • TL;DR: Computer vision-related research and applications, including handling imbalanced datasets, designing evaluation metrics, and implementing MLOps practices.
  • SOTA classification pipeline: built an end‑to‑end Vision Transformer workflow that outperforms legacy methods.
  • Provenance by vision: devised a sample‑similarity algorithm that traces mortar movement routes by comparing visual features alone, offering a non‑destructive alternative to chemical assays.
Python (Programming Language)Machine LearningComputer VisionProject Management

Data science at vanderbilt university

Graduate Teaching Assistant

Aug 2023May 2025 · 1 yr 9 mos · Nashville, Tennessee, United States · Hybrid

  • Position funded by the Vanderbilt Graduate Teaching Fellowship
  • Led labs, designed quizzes, graded coursework, and held weekly office hours for:
  • DS 5620 Probability & Statistical Inference (Grad), DS 5690 Generative AI: Theory & Practice (Grad) and DS 3100 Fundamentals of Data Science (Undergrad)
  • Served as the dedicated statistics tutor for Master of Finance students in Econometrics I.
Data ScienceGenerative AIStatisticsUniversity LecturingPython (Programming Language)R (Programming Language)

Suzhou agv robot co. ltd.

Algorithm Engineer

Jun 2022Apr 2023 · 10 mos · Suzhou, Jiangsu, China · On-site

  • TL;DR: End-to-end, cost-sensitive, and scalable machine learning pipelines covering design, development, and deployment (OpenCV, YOLO, PyTorch → Django, ONNX, Async, Celery, Redis, CPython), as well as Robot Operating System (ROS) software development and deployment in C++.
  • Optimized and deployed high‑precision localization algorithms for Automated Guided Vehicle (AGV) robots, improving positional accuracy and enabling rollout to new product lines.
  • Built an end‑to‑end vision storage‑detection system (YOLO · OpenCV · PyTorch→ONNX · Django) that integrates with the central server for large‑scale, multi‑AGV coordination and visualization; standardized hardware and deployment playbooks for rapid installs in  more than  30 warehouses.
Python (Programming Language)Data ScienceDeep LearningComputer VisionC++Big Data Scaling+2

Education

Vanderbilt University

Master of Science - MS — Data Science

Aug 2023May 2025

Beijing Normal-Hong Kong Baptist University

Bachelor of Science - BS

Sep 2018Jun 2022

Hong Kong Baptist University

Bachelor of Science - BS — Data Science

Sep 2018Jun 2022

Stackforce found 100+ more professionals with Machine Learning & Computer Vision

Explore similar profiles based on matching skills and experience