Baolin Peng
~Baolin_Peng2
Total papers: 11
Submissions per year: 5.5
Average rating
Accepted: 9/11
Venue distribution
ICLR: 6
NeurIPS: 4
ICML: 1
Publications (11 papers)
2025: 8 papers
3
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
ICLR 2025 Withdrawn
4
ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning
ICLR 2025 Poster
4
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
ICLR 2025 Oral
5
CollabLLM: From Passive Responders to Active Collaborators
ICML 2025 Oral
6
Latent Action Pretraining from Videos
ICLR 2025 Poster
4
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
NeurIPS 2025 Poster
4
Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
NeurIPS 2025 Poster
4
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
NeurIPS 2025 Poster