Impact Index

69.68/100

Top 2.7%

Site-wide rank #1,743

Papers published: 25

Average rating: 5.0

Average output: 8.3 papers/year

Xize Cheng

PhD student @ Zhejiang University · China · OpenReview

Research interests

Spoken dialogue systems · Omni-modal understanding

SpatialHand: Generative Object Manipulation from 3D Perspective

ICLR 2026 · Poster

Vox-Infinity: Benchmarking the Limits of Long-Context Spoken Language Models

ICLR 2026 · Rejected

MARS-Sep: Multimodal-Aligned Reinforced Sound Separation

ICLR 2026 · Poster

AlignSep: Temporally-Aligned Video-Queried Sound Separation with Flow Matching

ICLR 2026 · Poster

WavReward: Spoken Dialogue Models With Generalist Reward Evaluators

ICLR 2026 · Rejected

OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios

ICLR 2026 · Rejected

Character Beyond Speech: Leveraging Role-Playing Evaluation in Large Audio Language Models via Reinforcement Learning

ICLR 2026 · Withdrawn

VoxDialogue: Can Spoken Dialogue Systems Understand Information Beyond Words?

ICLR 2025 · Poster

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

ICLR 2025 · Poster

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

ICLR 2025 · Poster

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

ICLR 2025 · Poster

MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization

ICLR 2025 · Rejected

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control

ICLR 2025 · Withdrawn

OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios

ICLR 2025 · Withdrawn

T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback

ICLR 2025 · Withdrawn

AVSET-10M: An Open Large-Scale Audio-Visual Dataset with High Correspondence

ICLR 2025 · Withdrawn

Noise-Robust Audio-Visual Speech-Driven Body Language Synthesis

ICLR 2025 · Withdrawn

Dynamic Switching Teacher: How to Generalize Temporal Action Detection Models

ICLR 2025 · Withdrawn

MindLoc: A Secure Brain-Based System for Object Localization

ICLR 2025 · Withdrawn

Collaborators (20)