Cheng-Kang Chou

Product Engineer

Taiwan2 yrs 4 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Developed Taiwan's most downloaded ASR model.
  • Proposed innovative ASR training framework accepted at ASRU 2025.
  • Expertise in predictive analytics and explainable AI.
Stackforce AI infers this person is a skilled AI/ML researcher with a focus on ASR and data science.

Contact

Skills

Core Skills

Quantitative FinanceAsrData ScienceInformation RetrievalFinancial EngineeringAudio ProcessingSpeech RecognitionCloud ComputingBackend Development

Other Skills

Portfolio OptimizationDeep LearningLarge Language Models (LLM)Predictive analyticsexplanable AIResearch EngineeringData QualityActive LearningSpeech ProcessingRetrieval-Augmented Generation (RAG)Recommendation system財務分析Stochastic CalculusLinuxGAN

About

github: https://github.com/forbes110 check my ASR model: https://huggingface.co/MediaTek-Research/Breeze-ASR-25

Experience

2 yrs 4 mos
Total Experience
9 mos
Average Tenure
1 yr 4 mos
Current Experience

台大人工智慧應用社 ntu ai club

Technical Lead

Feb 2025Present · 1 yr 4 mos

  • Mentor of the research team collaborating with DeepMentor Inc. for Taiwan localized embedded LLM system.

聯發科技

Deep Learning Researcher

Feb 2025Jun 2025 · 4 mos

  • Focusing on Multimodal Fusion & LLM-enhanced ASR techniques
  • 1. Build most downloaded Taiwan open source model: Breeze-ASR-25(https://huggingface.co/MediaTek-Research/Breeze-ASR-25)
  • 2. Proposed self-refining framework for ASR training with TTS models, paper accepted by ASRU 2025

國立臺灣大學

2 roles

Teaching Assistant

Jan 2025May 2025 · 4 mos

  • Teaching Assistant of “Deep learning and its application”

Research Assistant

Sep 2023Dec 2024 · 1 yr 3 mos

  • Research assistant of productivity optimization lab,
  • research about data quality, predictability analysis, and explainable AI.
  • Built a large data science pipeline, which contains
  • 1. Data preprocessing with SOTA imputatition algorithm, data quality and property assessment
  • 2. Active learning interface to re-label and active prompt adjust the model weight.
  • 3. Deep learning model with knowledge distillation to reduce 25% model parameters with only 2% performance compensation.
  • 4. SHAP model feature explainable module and counterfactual explanation for robust model adjustment
  • Working on entropy bound estimation with nn and kernel density estimation methods.
Predictive analyticsexplanable AIResearch EngineeringData Science

國立臺灣大學電機資訊學院

3 roles

Undergraduate Researcher

Sep 2024May 2025 · 8 mos

  • Research Member of NTU EE SPML Lab supervised by Prof. Hung-Yi Lee, focusing on the open-source Taiwan Native Whisper and addressing streaming hallucination issues in ASR models.
ASRSpeech ProcessingLarge Language Models (LLM)

Undergraduate Researcher

May 2024Jan 2025 · 8 mos

  • Research Member of NTU CSIE IR Lab, Prof. Pu-Jen Cheng, working on research of cross-modality GEN-IR and cold start group recommendation systems.
Information RetrievalRetrieval-Augmented Generation (RAG)Recommendation system

Undergraduate Researcher

Feb 2024Jul 2024 · 5 mos

  • Research member of NTU CSIE Financial Algorithm Lab, Prof. Yuh-Dauh Lyuu.
財務分析Research EngineeringFinancial EngineeringDeep LearningStochastic Calculus

Google developer student club @ ntu

Technical Lead

Jul 2024Jun 2025 · 11 mos

  • Project support & course design about software development, data engineering, cloud resource management, statistical analytics and deep learning

United link co., ltd

AI Research Engineer

May 2024Sep 2024 · 4 mos

  • I process parallel audio data from different devices in Taiwanese and Hakka dialects. Due to the varying start and end times of recordings across devices, and with data from 8 recording devices, it's necessary to uniformly shift and remove silent segments from the beginning or end to avoid affecting the information content.
  • The traditional approach is to lock onto one device's data and use Dynamic Time Warping (DTW) as a standard, aligning other devices to it.
  • This method consumes enormous space for longer durations. I employed a correlation calculation method, reducing the space complexity to O(n), and used FFT to calculate correlation, achieving a time complexity of O(n log n).
  • Co-authored: Channel-aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition, submitted to ICASSP 2025
  • Using GAN and channel encoder for data augmentation to address the domain mismatch problem in speech recognition.
  • This research improves on the original framework for data augmentation targeting out-of-domain noise, used for training denoising systems. Here, an additional encoder is designed for the GAN to incorporate device information. The augmented data then needs to be validated through training on the ASR model. I was responsible for building the ASR model, as well as establishing baselines and toplines on various devices to verify model performance. I also adapted code from different frameworks, porting code from speechbrain and huggingface, to ensure consistency in the experimental setup.
Audio ProcessingSpeech ProcessingLarge Language Models (LLM)Linux

Worldquant

Quantitative Finance Research Consultant

Apr 2024Present · 2 yrs 2 mos

  • Produce predictive signals (alphas) to employ financial strategies focused on market inefficiencies.
Portfolio OptimizationQuantitative Finance

聚上雲 epic cloud

Cloud Architect Training Intern

Apr 2024Jun 2024 · 2 mos

  • Training about the Google Cloud Platform(GCP) Professional Cloud Architect
Google Cloud Platform (GCP)Cloud Computing

Cmoney

Backend/Data Engineer

Jul 2022Oct 2022 · 3 mos · On-site

  • 1. Improve the log delivery update system with ELK (Elasticsearch - Logstash - Kibana),
  • Standardize and automate important information previously stored in respective databases and special mailboxes.
  • 2. Develop text parser to automatically capture and parse information from websites, and update it into the format required by the financial analysis group.
  • 3. Adjust the large files developed by the company using the waterfall development method and optimize large crawlers → Parser → Database pipeline, updates the data and diversion mode from the early stage, including dynamic web crawlers, Regular Expression, SQL, mongodb and tortoisegit
SQLC#GitBackend Development

Education

National Taiwan University

Bachelor's degree — Information Management

Sep 2020Jun 2025

National Taiwan University

Master's degree — EECS,Graduate Institute of Communication Engineering

Stackforce found 100+ more professionals with Quantitative Finance & Asr

Explore similar profiles based on matching skills and experience