PaperHub

Zehan Wang

~Zehan_Wang2

Total papers: 26
Submissions per year: 13.0
Average rating: 5.4
Accepted: 14/26
Venue distribution
ICLR: 18
NeurIPS: 7
ICML: 1

Published papers (26)

2025 (16)

5.0
4

T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback

ICLR 2025 withdrawn
8.7
4

Orient Anything V2: Unifying Orientation and Rotation Understanding

NeurIPS 2025 Spotlight
5.5
4

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

ICML 2025 Poster
6.3
4

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

ICLR 2025 Poster
6.0
4

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

ICLR 2025 Poster
4.4
5

Noise-Robust Audio-Visual Speech-Driven Body Language Synthesis

ICLR 2025 withdrawn
5.8
4

Improving Long-Text Alignment for Text-to-Image Diffusion Models

ICLR 2025 Poster
4.8
4

AVSET-10M: An Open Large-Scale Audio-Visual Dataset with High Correspondence

ICLR 2025 withdrawn
6.6
5

VoxDialogue: Can Spoken Dialogue Systems Understand Information Beyond Words?

ICLR 2025 Poster
4.0
4

Dynamic Switching Teacher: How to Generalize Temporal Action Detection Models

ICLR 2025 withdrawn
5.8
5

Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision

ICLR 2025 Poster
5.0
4

OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios

ICLR 2025 withdrawn
5.2
5

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control

ICLR 2025 withdrawn
6.5
4

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

ICLR 2025 Poster
4.3
6

Advancing Multimodal Unified Discrete Representations

ICLR 2025 withdrawn
2.3
3

MindLoc: A Secure Brain-Based System for Object Localization

ICLR 2025 withdrawn

2024 (10)