Yuki Mitsufuji
~Yuki_Mitsufuji1
25
论文总数
12.5
年均投稿
平均评分
接收情况15/25
会议分布
ICLR
19
ICML
3
NeurIPS
3
发表论文 (25 篇)
202519 篇
4
Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
ICLR 2025Rejected
4
Supervised Contrastive Learning from Weakly-Labeled Audio Segments for Musical Version Matching
ICML 2025Poster
4
A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
ICLR 2025withdrawn
4
MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation
ICLR 2025Poster
3
VCT: Training Consistency Models with Variational Noise Coupling
ICML 2025Poster
3
Distillation of Discrete Diffusion through Dimensional Correlations
ICML 2025Poster
4
Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models
ICLR 2025Poster
3
Weighted Point Set Embedding for Multimodal Contrastive Learning Toward Optimal Similarity Metric
ICLR 2025Spotlight
4
G2D2: Gradient-guided Discrete Diffusion for image inverse problem solving
ICLR 2025Rejected
4
Mitigating Embedding Collapse in Diffusion Models for Categorical Data
ICLR 2025withdrawn
5
FLEXOUNDIT: VARIABLE-LENGTH DIFFUSION TRANSFORMER FOR TEXT-TO-AUDIO GENERATION
ICLR 2025withdrawn
4
SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation
ICLR 2025Poster
4
Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation
ICLR 2025Rejected
4
OpenMU: Your Swiss Army Knife for Music Understanding
ICLR 2025withdrawn
4
Enhancing 3D Reconstruction for Dynamic Scenes
NeurIPS 2025Poster
4
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
ICLR 2025Poster
5
Orator: LLM-Guided Multi-Shot Speech Video Generation
ICLR 2025Rejected
3
Mining your own secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
ICLR 2025Poster
4
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
ICLR 2025Rejected
20246 篇
4
Towards reporting bias in visual-language datasets: bimodal augmentation by decoupling object-attribute association
ICLR 2024Rejected
2
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer
ICLR 2024Poster
3
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher
NeurIPS 2024Poster
4
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
ICLR 2024Poster
4
Manifold Preserving Guided Diffusion
ICLR 2024Poster
4
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping
NeurIPS 2024Poster