Paper Hub
Hao Bai
~Hao_Bai1
Total papers: 5
Submissions per year: 2.5
Average rating: 5.5
Accepted: 4 / 5
Venues: NeurIPS 3, ICLR 2
Published papers (5)
2025 (3 papers)
4.3
3
Improving Neuron-level Interpretability with White-box Language Models
ICLR 2025
Withdrawn
4.8
5
Digi-Q: Learning VLM Q-Value Functions for Training Device-Control Agents
ICLR 2025
Poster
6.4
4
Thinking vs. Doing: Improving Agent Reasoning by Scaling Test-Time Interaction
NeurIPS 2025
Poster
2024 (2 papers)
6.0
4
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
NeurIPS 2024
Poster
5.8
4
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
NeurIPS 2024
Poster
Collaborators (20)
Yifei Zhou: 4 papers
Aviral Kumar: 3 papers
Sergey Levine: 3 papers
Alane Suhr: 2 papers
Jiayi Pan: 2 papers
Yi Ma: 2 papers
Shengbang Tong: 2 papers
Mert Cemri: 1 paper