Paper Hub
Jiaxiang Li
~Jiaxiang_Li1
Total papers: 4
Submissions per year: 2.0
Average rating: 5.5
Accepted: 3 / 4
Venue distribution
NeurIPS: 2
ICLR: 2
Published papers (4)
2025 (2 papers)
3.5
4
Policy optimization can be memory-efficient: LLM Alignment Through Successive Policy Re-weighting (SPR)
ICLR 2025
Rejected
7.3
4
Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment
ICLR 2025
Spotlight
2024 (2 papers)
5.8
4
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
NeurIPS 2024
Poster
5.3
3
SLTrain: a sparse plus low rank approach for parameter and memory efficient pretraining
NeurIPS 2024
Poster
Collaborators (14, 8 listed)
Mingyi Hong: 4 papers
Siliang Zeng: 3 papers
Alfredo Garcia: 2 papers
Chenliang Li: 2 papers
Kaixiang Lin: 1 paper
Xinnan Zhang: 1 paper
Akiko Takeda: 1 paper
Andi Han: 1 paper