影响力指数

45.5/100

前 10.7%

全站排名 #6,877

发表论文16 篇

平均评分5.2

年均产出5.3 篇/年

Shengpeng Ji

Researcher@HY LLM Team·中国·OpenReview

研究方向

Speech and LLM

5.5

Vox-Infinity: Benchmarking the Limits of Long-Context Spoken Language Models

ICLR 2026Rejected

4.5

WavReward: Spoken Dialogue Models With Generalist Reward Evaluators

ICLR 2026Rejected

一作

3.5

OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios

ICLR 2026Rejected

6.6

VoxDialogue: Can Spoken Dialogue Systems Understand Information Beyond Words?

ICLR 2025Poster

6.5

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

ICLR 2025Poster

一作

6.3

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

ICLR 2025Poster

6.0

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

ICLR 2025Poster

5.8

Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

ICLR 2025Withdrawn

5.7

MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization

ICLR 2025Rejected

5.2

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control

ICLR 2025Withdrawn

一作

5.0

T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback

ICLR 2025Withdrawn

5.0

OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios

ICLR 2025Withdrawn

4.9

IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models

ICML 2025Poster

4.3

Advancing Multimodal Unified Discrete Representations

ICLR 2025Withdrawn

三作

2.3

MindLoc: A Secure Brain-Based System for Object Localization

合作者 (20)