影响力指数

42.43/100

前 12.7%

全站排名 #8,194

发表论文9 篇

平均评分5.1

年均产出3.0 篇/年

Jiaxiang Li

Researcher@Facebook·美国·OpenReview

研究方向

LLM Alignment · Reinforcement Learning · Optimization

Memory-Efficient LLM Pretraining via Minimalist Optimizer Design

ICLR 2026Rejected

A Tale of Two Problems: Multi-Objective Bilevel Learning Meets Equality Constrained Multi-Objective Optimization

ICLR 2026Rejected

Muon Outperforms Adam in Tail-End Associative Memory Learning

ICLR 2026Poster

Aligning Frozen LLMs by Reinforcement Learning: An Iterative Reweight-then-Optimize Approach

ICLR 2026Rejected

ADARL: Adaptive Low-Rank Structures for Robust Policy Learning under Uncertainty

ICLR 2026Desk Rejected

Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment

ICLR 2025Spotlight

Policy optimization can be memory-efficient: LLM Alignment Through Successive Policy Re-weighting (SPR)

ICLR 2025Rejected

合作者 (20)

博后导师7 篇