Zhengyuan Yang
~Zhengyuan_Yang1
19
论文总数
9.5
年均投稿
平均评分
接收情况15/19
会议分布
ICLR
9
NeurIPS
7
ICML
2
COLM
1
发表论文 (19 篇)
202514 篇
4
Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization
ICLR 2025Poster
4
Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation
NeurIPS 2025Poster
4
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs
NeurIPS 2025Poster
4
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
NeurIPS 2025Spotlight
4
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning
NeurIPS 2025Poster
4
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding
ICML 2025Poster
4
OmniContrast: Vision-Language-Interleaved Contrast from Pixels All at once
ICLR 2025Rejected
4
MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models
ICLR 2025withdrawn
4
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark
ICML 2025Oral
4
GenXD: Generating Any 3D and 4D Scenes
ICLR 2025Poster
3
EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
ICLR 2025Poster
4
VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
NeurIPS 2025Poster
5
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
ICLR 2025Poster
4
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
ICLR 2025Spotlight
20245 篇
6
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
COLM 2024Poster
4
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
ICLR 2024withdrawn
3
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
NeurIPS 2024Poster
4
DisCo: Disentangled Control for Realistic Human Dance Generation
ICLR 2024withdrawn
4
Interfacing Foundation Models' Embeddings
NeurIPS 2024Poster