Minghui Fang
~Minghui_Fang1
8
Total papers
4.0
Average submissions per year
Average rating
Acceptance: 3/8
Venue distribution
ICLR
7
NeurIPS
1
Published papers (8)
2025: 7 papers
4
AVSET-10M: An Open Large-Scale Audio-Visual Dataset with High Correspondence
ICLR 2025 withdrawn
5
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control
ICLR 2025 withdrawn
4
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
ICLR 2025 Poster
4
OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
ICLR 2025 withdrawn
4
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
ICLR 2025 Poster
3
MindLoc: A Secure Brain-Based System for Object Localization
ICLR 2025 withdrawn
6
Advancing Multimodal Unified Discrete Representations
ICLR 2025 withdrawn