Xuhui Zhan

AI Researcher

Nashville, Tennessee, United States · 3 yrs 5 mos experience

Most Likely To Switch · AI Enabled

Key Highlights

Founding member of a machine learning team.
Expertise in Large Language Models and Computer Vision.
Significant cost reductions through innovative AI solutions.

Stackforce AI infers this person is a skilled AI and Machine Learning engineer with a focus on Computer Vision and Large Language Models.

Skills

Core Skills

Machine Learning · Computer Vision · Large Language Models (LLM) · Graph Neural Network · Data Science · University Lecturing · Deep Learning

Other Skills

Recommendation system · Software Development · AWS · Go · Terraform · Kubernetes · Airflow · GitHub Actions · Industrial Cameras · Multimodality · Big Data Scaling · Optical Character Recognition (OCR) · PyG · PyTorch · Fine-tuning

About

TL;DR: Engineer and researcher specializing in the development and deployment of computer vision, large language, and multimodal models.

Experience

Total Experience: 3 yrs 5 mos

Average Tenure: 11 mos

Current Experience: 1 yr

Treverse LLC

Applied Scientist

Sep 2025 – Present · 8 mos · Nashville Metropolitan Area · Hybrid

TL;DR: Founding member of the machine learning team, building production ML systems and shared AI infrastructure from the ground up, with a focus on scalable and cost-efficient deployment of modern AI methods.
Key initiatives:
Multilingual translation service: Built a Go-based microservice deployed on AWS that performs LLM-powered translation and evaluation of Spanish/English item metadata used in production workflows.
Unified AI platform: Developing a centralized AI/ML platform on AWS using Terraform, Kubernetes, Airflow, Argo CD, and GitHub Actions to support standardized model training, CI/CD pipelines, A/B testing, and monitored deployments across ML projects.
Vision-based presort system: Building an edge-first computer vision pipeline for high-speed barcode detection and decoding of irregularly shaped items, integrated with industrial cameras and conveyor hardware.
Recommendation systems: Designing marketplace recommendation models that generate personalized item suggestions and improve engagement for hundreds of thousands of users.

Recommendation system · Software Development · Large Language Models (LLM) · Computer Vision · Machine Learning

Stealth AI Startup

Founding Engineer and Researcher

Jun 2025 – Sep 2025 · 3 mos

Large Language Models (LLM) · Optical Character Recognition (OCR)

IEEE Transactions on Big Data

Reviewer

May 2025 – Present · 1 yr · Remote

Computer Vision · Multimodality · Large Language Models (LLM) · Big Data Scaling

Vanderbilt University

3 roles

Graduate Research Assistant

Jun 2024 – May 2025 · 11 mos · Nashville, Tennessee, United States · Hybrid

Advisor: Prof. Tyler Derr (Network and Data Science Lab)
TL;DR: Worked on research projects spanning multimodal models, foundation models, graph neural networks, and recommender systems. The goal was a unified fusion framework that could potentially replace LLaVA (and other soft-prompt/token-based multimodal frameworks) while improving performance and efficiency when integrating multiple modalities.
Proposed a unified fusion framework that seamlessly integrates vision, graph, and language modalities.
Vision–language: adapted the fusion framework to the LLaVA codebase and eliminated the alignment pre-training stage; relying on fine-tuning alone cuts total training time by ≈ 25% and halves data needs while matching SOTA performance on nine public benchmarks. (See the project page for Inverse-LLaVA at https://inverse-llava.github.io)
Graph–text: built an advanced TGAT-LLM pipeline that leverages the proposed fusion framework to generate rich textual attributes for social-network data; full experimental runs on Venmo and Amazon Reviews are in progress, with encouraging preliminary results.

Recommendation system · Large Language Models (LLM) · Graph Neural Network · PyG · PyTorch

Lead AI & Data Science Researcher

Promoted

Oct 2023 – May 2025 · 1 yr 7 mos · Nashville, Tennessee, United States · Hybrid

Advisor: Prof. Ray Friedman (AI Negotiation Lab) | Paid Position
TL;DR: Prompt engineering, Retrieval-Augmented Generation (RAG), Large Language Model (LLM) research and applications—including fine-tuning, quantization, and curated data pipelines—as well as interdisciplinary collaboration and web development.
Drove end-to-end LLM research for negotiation-related applications: designing, experimenting with, and optimizing models.
Auto-coding project: cut per-transcript cost from $5k to $3 (>99% savings) through an LLM pipeline refined via active-learning loops.
AI subject‑pool project: developed a novel algorithm that injects population‑level variation into LLM agents, creating realistic, diverse negotiator profiles.
Collaborated closely with negotiation faculty and data-science peers; led model ops and mentored junior coders.
Work presented at the 2025 AI Negotiation Summit (Harvard/MIT) and the IACMR annual meeting.

Python (Programming Language) · Large Language Models (LLM) · Machine Learning · Project Management

Professional Research Assistant

Oct 2023 – Jan 2025 · 1 yr 3 mos · Nashville, Tennessee, United States · Hybrid

Advisor: Prof. Markus Eberl | Paid Position
TL;DR: Computer vision-related research and applications, including handling imbalanced datasets, designing evaluation metrics, and implementing MLOps practices.
SOTA classification pipeline: built an end‑to‑end Vision Transformer workflow that outperforms legacy methods.
Provenance by vision: devised a sample‑similarity algorithm that traces mortar movement routes by comparing visual features alone, offering a non‑destructive alternative to chemical assays.

Python (Programming Language) · Machine Learning · Computer Vision · Project Management

Data Science at Vanderbilt University

Graduate Teaching Assistant

Aug 2023 – May 2025 · 1 yr 9 mos · Nashville, Tennessee, United States · Hybrid

Position funded by the Vanderbilt Graduate Teaching Fellowship
Led labs, designed quizzes, graded coursework, and held weekly office hours for:
DS 5620 Probability & Statistical Inference (Grad), DS 5690 Generative AI: Theory & Practice (Grad), and DS 3100 Fundamentals of Data Science (Undergrad).
Served as the dedicated statistics tutor for Master of Finance students in Econometrics I.

Data Science · Generative AI · Statistics · University Lecturing · Python (Programming Language) · R (Programming Language)

Suzhou AGV Robot Co., Ltd.

Algorithm Engineer

Jun 2022 – Apr 2023 · 10 mos · Suzhou, Jiangsu, China · On-site

TL;DR: End-to-end, cost-sensitive, and scalable machine learning pipelines covering design, development, and deployment (OpenCV, YOLO, PyTorch → Django, ONNX, Async, Celery, Redis, CPython), as well as Robot Operating System (ROS) software development and deployment in C++.
Optimized and deployed high‑precision localization algorithms for Automated Guided Vehicle (AGV) robots, improving positional accuracy and enabling rollout to new product lines.
Built an end-to-end vision storage-detection system (YOLO · OpenCV · PyTorch→ONNX · Django) that integrates with the central server for large-scale, multi-AGV coordination and visualization; standardized hardware and deployment playbooks for rapid installs in more than 30 warehouses.

Python (Programming Language) · Data Science · Deep Learning · Computer Vision · C++ · Big Data Scaling · +2 more