Samuel Albanie
~Samuel_Albanie2
Total papers: 9
Submissions per year: 4.5
Average rating
Accepted: 7/9
Venue distribution
ICLR: 5
NeurIPS: 3
COLM: 1
Published papers (9)
2025 (5 papers)
4
Needle Threading: Can LLMs Follow Threads Through Near-Million-Scale Haystacks?
ICLR 2025 Poster
4
Inverse Constitutional AI: Compressing Preferences into Principles
ICLR 2025 Poster
3
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
COLM 2025 Poster
4
GAMEBOT: Gaming Arena for Model Evaluation - Battle of Tactics
ICLR 2025 Withdrawn
4
Democratizing Evaluation with Infinity-Benchmarks: Sample-Level Heterogeneous Testing Over Arbitrary Capabilities
ICLR 2025 Withdrawn
2024 (4 papers)
3
Visual Data-Type Understanding does not emerge from scaling Vision-Language Models
ICLR 2024 Poster
4
On scalable oversight with weak LLMs judging strong LLMs
NeurIPS 2024 Poster
5
Efficient Lifelong Model Evaluation in an Era of Rapid Progress
NeurIPS 2024 Poster
4
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
NeurIPS 2024 Poster