I am currently pursuing a master's degree in the Vision Learning Lab at National Taiwan University, advised by Prof. Yu-Chiang Frank Wang.
EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction
arXiv 2025
Improving Speech Emotion Recognition in Under-Resourced Languages via Speech-to-Speech Translation with Bootstrapping Data Selection
ICASSP 2025
QuAVF: Quality-aware Audio-Visual Fusion for Ego4D Talking to Me Challenge
1st place winner of the CVPR 2023 Ego4D Workshop Challenge
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
ICLR 2025
ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos
NeurIPS 2024 Datasets and Benchmarks Track