Haoming Jiang
~Haoming_Jiang1
9
论文总数
4.5
年均投稿
平均评分
接收情况7/9
会议分布
NeurIPS
4
ICLR
2
COLM
2
ICML
1
发表论文 (9 篇)
20257 篇
4
Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs
ICLR 2025Rejected
4
Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models
NeurIPS 2025Poster
4
RRO: LLM Agent Optimization Through Rising Reward Trajectories
COLM 2025Poster
4
Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions
ICLR 2025withdrawn
4
Discriminative Finetuning of Generative Large Language Models without Reward Models and Human Preference Data
ICML 2025Poster
3
Ask a Strong LLM Judge when Your Reward Model is Uncertain
NeurIPS 2025Poster
5
Self-Rewarding PPO: Aligning Large Language Models with Demonstrations Only
COLM 2025Poster