Impact Index

46.01/100

Top 10.4%

Site-wide rank: #6,677

Papers published: 10

Average rating: 5.6

Average output: 3.3 papers/year

Songyang Gao

Researcher @ Shanghai Artificial Intelligence Laboratory · China · OpenReview

Research Areas

Reasoning · LLM Alignment · LLM Pretraining · Robustness

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

ICLR 2026 · Poster

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

ICLR 2026 · Rejected

LONG-HORIZON REASONING AGENT FOR OLYMPIAD-LEVEL MATHEMATICAL PROBLEM SOLVING

ICLR 2026 · Rejected

The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner

ICLR 2026 · Poster

Pre-Trained Policy Discriminators are General Reward Models

NeurIPS 2025 · Poster

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning

NeurIPS 2025 · Poster

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

COLM 2025 · Poster

AgentGym: Evaluating and Evolving Large Language Model-based Agents across Diverse Environments

ICLR 2025 · Rejected

Collaborators (20)