Bei Li
~Bei_Li1
6
论文总数
3.0
年均投稿
平均评分
接收情况5/6
会议分布
ICLR
3
NeurIPS
2
ICML
1
发表论文 (6 篇)
20254 篇
4
Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective
ICLR 2025Poster
3
InteractiveCOT: Aligning Dynamic Chain-of-Thought Planning for Embodied Decision-Making
ICLR 2025Rejected
4
GRAM: A Generative Foundation Reward Model for Reward Generalization
ICML 2025Poster
5
MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
NeurIPS 2025Poster