Fabien Roger
~Fabien_Roger1
5
论文总数
2.5
年均投稿
平均评分
接收情况4/5
会议分布
NeurIPS
4
ICLR
1
发表论文 (5 篇)
20254 篇
4
Do Unlearning Methods Remove Information from Language Model Weights?
ICLR 2025Rejected
3
Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models
NeurIPS 2025Poster
3
Why Do Some Language Models Fake Alignment While Others Don't?
NeurIPS 2025Spotlight
4
Quantifying Elicitation of Latent Capabilities in Language Models
NeurIPS 2025Poster