Yu Wang

~Yu_Wang3

31

论文总数

15.5

年均投稿

5.8

平均评分

接收情况20/31

会议分布

ICLR

17

NeurIPS

9

ICML

3

COLM

2

发表论文 (31 篇)

202523 篇

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

ICLR 2025Poster

ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning

NeurIPS 2025Poster

FlightBench: Benchmarking Learning-based Methods for Ego-vision-based Quadrotors Navigation

ICLR 2025withdrawn

Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization

ICML 2025Poster

Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network

ICML 2025Poster

Few-shot In-context Preference Learning using Large Language Models

ICLR 2025Rejected

Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding

ICLR 2025Poster

Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation

NeurIPS 2025Poster

VeSX: A Framework Featured by Verification, Self-Correction and In-context Learning for Web Automation Tasks

ICLR 2025Rejected

Reward-Robust RLHF in LLMs

ICLR 2025Rejected

Distilled Decoding 2: One-step Sampling of Image Auto-regressive Models with Conditional Score Distillation

NeurIPS 2025Poster

What Can RL Bring to VLA Generalization? An Empirical Study

NeurIPS 2025Poster

Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs

ICLR 2025Rejected

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

NeurIPS 2025Poster

Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving

ICML 2025Poster

Towards Accurate and Efficient Sub-8-Bit Integer Training

ICLR 2025withdrawn

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

ICLR 2025Poster

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

NeurIPS 2025Poster

LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K

ICLR 2025Rejected

LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K

COLM 2025Poster

Mixture of Attention Spans: Optimizing LLM Inference Efficiency with Heterogeneous Sliding-Window Lengths

COLM 2025Poster

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

ICLR 2025Rejected

ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

ICLR 2025Poster

20248 篇

A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models

ICLR 2024Poster

Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game

ICLR 2024Rejected

Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

ICLR 2024Poster

GML-NeRF: Gate-guided Mutual Learning Framework for Neural Rendering

ICLR 2024Rejected

Rad-NeRF: Ray-decoupled Training of Neural Radiance Field

NeurIPS 2024Poster

SemSA: Semantic Sparse Attention is hidden in Large Language Models.

ICLR 2024withdrawn

DiTFastAttn: Attention Compression for Diffusion Transformer Models

NeurIPS 2024Poster

Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study

NeurIPS 2024Poster

合作者 (20)

Xuefei Ning18 篇

Guohao Dai13 篇

Huazhong Yang10 篇

Shengen Yan10 篇