Shengpeng Ji
~Shengpeng_Ji1
13
Total papers
6.5
Submissions per year
Average rating
Accepted: 6/13
Venue distribution
ICLR
12
ICML
1
Published papers (13)
2025: 12 papers
5
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control
ICLR 2025 withdrawn
4
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
ICLR 2025 Poster
6
Advancing Multimodal Unified Discrete Representations
ICLR 2025 withdrawn
3
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization
ICLR 2025 Rejected
4
Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
ICLR 2025 withdrawn
4
IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models
ICML 2025 Poster
4
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
ICLR 2025 Poster
4
T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback
ICLR 2025 withdrawn
5
VoxDialogue: Can Spoken Dialogue Systems Understand Information Beyond Words?
ICLR 2025 Poster
4
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
ICLR 2025 Poster
3
MindLoc: A Secure Brain-Based System for Object Localization
ICLR 2025 withdrawn
4
OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
ICLR 2025 withdrawn