Paper
Hub
搜索
Toggle language
Shengyi Huang
~Shengyi_Huang1
6
论文总数
3.0
年均投稿
6.4
平均评分
接收情况
6
/
6
会议分布
COLM
4
ICLR
2
发表论文 (6 篇)
2025
3 篇
5.8
4
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
ICLR 2025
Poster
8.0
3
Tulu 3: Pushing Frontiers in Open Language Model Post-Training
COLM 2025
Poster
7.0
4
2 OLMo 2 Furious (COLM’s Version)
COLM 2025
Poster
2024
3 篇
5.3
4
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
ICLR 2024
Poster
6.4
5
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
COLM 2024
Poster
6.3
4
Zephyr: Direct Distillation of LM Alignment
COLM 2024
Poster
合作者 (20)
NL
Nathan Lambert
3 篇
CW
Christopher Wilhelm
2 篇
FB
Faeze Brahman
2 篇
HI
Hamish Ivison
2 篇
HH
Hannaneh Hajishirzi
2 篇
JM
Jacob Morrison
2 篇
LM
Lester James Validad Miranda
2 篇
LS
Luca Soldaini
2 篇
查看全部 20 位合作者