Zhang-Wei Hong
~Zhang-Wei_Hong1
8
Total papers
4.0
Submissions per year
Average rating
Accepted: 6/8
Venue distribution
ICLR
5
NeurIPS
2
ICML
1
Publications (8 papers)
2025: 5 papers
6
ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization
ICLR 2025 (Poster)
4
ImageNet-RIB Benchmark: Large Pre-Training Datasets Don't Guarantee Robustness after Fine-Tuning
ICLR 2025 (Rejected)
4
ReGen: Generative Robot Simulation via Inverse Design
ICLR 2025 (Poster)
4
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
ICML 2025 (Poster)
4
RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning
NeurIPS 2025 (Poster)