sheng zhao
~sheng_zhao1
8
论文总数
4.0
年均投稿
平均评分
接收情况5/8
会议分布
ICLR
6
NeurIPS
2
发表论文 (8 篇)
20253 篇
4
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers
ICLR 2025Rejected
4
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching
NeurIPS 2025Poster
-
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis
ICLR 2025desk_rejected
20245 篇
4
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
ICLR 2024Rejected
4
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
ICLR 2024Spotlight
3
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
NeurIPS 2024Poster
4
GAIA: Zero-shot Talking Avatar Generation
ICLR 2024Poster
4
PromptTTS 2: Describing and Generating Voices with Text Prompt
ICLR 2024Poster