PaperHub

Weinan Zhang

~Weinan_Zhang1

30
论文总数
15.0
年均投稿
5.8
平均评分
接收情况20/30
会议分布
ICLR
15
NeurIPS
13
ICML
2

发表论文 (30 篇)

202517

6.4
3

Flexible Realignment of Language Models

NeurIPS 2025Poster
6.1
4

Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport

ICML 2025Poster
6.8
4

Uni-RL: Unifying Online and Offline RL via Implicit Value Regularization

NeurIPS 2025Poster
7.3
4

Information-Theoretic Reward Decomposition for Generalizable RLHF

NeurIPS 2025Poster
4.0
3

Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games

ICLR 2025withdrawn
4.9
4

Large Language Models are Demonstration Pre-Selectors for Themselves

ICML 2025Poster
7.0
3

GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

NeurIPS 2025Poster
5.3
4

Large Language Models are Demonstration Pre-Selectors for Themselves

ICLR 2025Rejected
5.7
3

ContraDiff: Planning Towards High Return States via Contrastive Learning

ICLR 2025Poster
6.3
4

Reconstruction-Guided Policy: Enhancing Decision-Making through Agent-Wise State Consistency

ICLR 2025Poster
5.3
4

DyDiff: Long-Horizon Rollout via Dynamics Diffusion for Offline Reinforcement Learning

ICLR 2025Rejected
7.3
4

KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills

NeurIPS 2025Poster
6.4
4

AgentNet: Decentralized Evolutionary Coordination for LLM-based Multi-Agent Systems

NeurIPS 2025Poster
4.5
4

RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation

ICLR 2025Rejected
6.4
4

ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning

NeurIPS 2025Poster
6.0
4

MobileUse: A Hierarchical Reflection-Driven GUI Agent for Autonomous Mobile Operation

NeurIPS 2025Poster
6.8
5

Robust Function-Calling for On-Device Language Model via Function Masking

ICLR 2025Spotlight

202413