Paper
Hub
搜索
Toggle language
Akbir Khan
~Akbir_Khan1
4
论文总数
4.0
年均投稿
6.0
平均评分
接收情况
3
/
4
会议分布
ICLR
4
发表论文 (4 篇)
2025
4 篇
6.3
4
Language Models Learn to Mislead Humans via RLHF
ICLR 2025
Poster
4.5
4
Shell Games: Control Protocols for Adversarial AI Agents
ICLR 2025
withdrawn
6.3
4
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
ICLR 2025
Poster
7.0
3
Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats
ICLR 2025
Poster
合作者 (20)
AB
Aryan Bhatt
2 篇
BS
Buck Shlegeris
2 篇
EP
Ethan Perez
2 篇
HH
He He
2 篇
JW
Jiaxin Wen
2 篇
SF
Shi Feng
2 篇
BC
Bartłomiej Cupiał
1 篇
DP
Davide Paglieri
1 篇
查看全部 20 位合作者