PaperHub

Yuki Mitsufuji

~Yuki_Mitsufuji1

Total papers: 25
Submissions per year: 12.5
Average rating: 5.7
Acceptance: 15/25
Venue distribution
ICLR: 19
ICML: 3
NeurIPS: 3

Published papers (25)

2025 (19)

4.8
4

Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space

ICLR 2025 Rejected
7.2
4

Supervised Contrastive Learning from Weakly-Labeled Audio Segments for Musical Version Matching

ICML 2025 Poster
4.8
4

A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation

ICLR 2025 Withdrawn
6.0
4

MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation

ICLR 2025 Poster
4.8
3

VCT: Training Consistency Models with Variational Noise Coupling

ICML 2025 Poster
6.3
3

Distillation of Discrete Diffusion through Dimensional Correlations

ICML 2025 Poster
6.5
4

Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models

ICLR 2025 Poster
7.3
3

Weighted Point Set Embedding for Multimodal Contrastive Learning Toward Optimal Similarity Metric

ICLR 2025 Spotlight
5.5
4

G2D2: Gradient-guided Discrete Diffusion for image inverse problem solving

ICLR 2025 Rejected
3.5
4

Mitigating Embedding Collapse in Diffusion Models for Categorical Data

ICLR 2025 Withdrawn
4.4
5

FLEXOUNDIT: Variable-Length Diffusion Transformer for Text-to-Audio Generation

ICLR 2025 Withdrawn
7.0
4

SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation

ICLR 2025 Poster
5.0
4

Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation

ICLR 2025 Rejected
3.5
4

OpenMU: Your Swiss Army Knife for Music Understanding

ICLR 2025 Withdrawn
6.4
4

Enhancing 3D Reconstruction for Dynamic Scenes

NeurIPS 2025 Poster
6.0
4

HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning

ICLR 2025 Poster
5.4
5

Orator: LLM-Guided Multi-Shot Speech Video Generation

ICLR 2025 Rejected
6.0
3

Mining your own secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models

ICLR 2025 Poster
5.8
4

GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models

ICLR 2025 Rejected