Yu-Xiong Wang
~Yu-Xiong_Wang1
30
论文总数
15.0
年均投稿
平均评分
接收情况19/30
会议分布
ICLR
19
NeurIPS
10
ICML
1
发表论文 (30 篇)
202516 篇
4
MR. Video: MapReduce as an Effective Principle for Long Video Understanding
NeurIPS 2025Poster
4
RTDiff: Reverse Trajectory Synthesis via Diffusion for Offline Reinforcement Learning
ICLR 2025Poster
4
3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing
ICLR 2025Poster
4
Video Diffusion Models Learn the Structure of the Dynamic World
ICLR 2025withdrawn
4
LayeredGS: Efficient Dynamic Scene Rendering and Point Tracking with Multi-Layer Deformable Gaussian Splatting
ICLR 2025withdrawn
5
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
ICLR 2025Poster
5
ReferEverything: Towards segmenting everything we can speak of in videos
ICLR 2025Rejected
4
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
ICLR 2025Poster
3
Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision
ICLR 2025Rejected
4
Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image
NeurIPS 2025Poster
3
Latent Wasserstein Adversarial Imitation Learning
ICLR 2025Rejected
4
Self-Guided Hierarchical Exploration for Generalist Foundation Model Web Agents
NeurIPS 2025Poster
5
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models
ICLR 2025Poster
3
Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
ICML 2025Poster
3
One Token per Highly Selective Frame: Towards Extreme Compression for Long Video Understanding
NeurIPS 2025Poster
5
Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
ICLR 2025Rejected
202414 篇
4
AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation
ICLR 2024Rejected
5
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories
ICLR 2024Rejected
3
ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing
NeurIPS 2024Poster
4
InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction
NeurIPS 2024Poster
4
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
NeurIPS 2024Spotlight
4
SceneCraft: Layout-Guided 3D Scene Generation
NeurIPS 2024Poster
4
Is Pre-training Truly Better Than Meta-Learning?
ICLR 2024Rejected
4
InstructG2I: Synthesizing Images from Multimodal Attributed Graphs
NeurIPS 2024Poster
4
Robust Model-Based Optimization for Challenging Fitness Landscapes
ICLR 2024Poster
4
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
ICLR 2024Spotlight
4
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models
ICLR 2024Rejected
4
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
NeurIPS 2024Poster
4
Aligning Large Multimodal Models with Factually Augmented RLHF
ICLR 2024Rejected
3
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
ICLR 2024Poster