Graham Neubig
~Graham_Neubig1
Total papers: 35
Submissions per year: 17.5
Average rating
Accepted: 27/35
Venue distribution
ICLR: 16
COLM: 9
ICML: 5
NeurIPS: 5
Papers (35)
2025 (20 papers)
4
Inducing Programmatic Skills for Agentic Tasks
COLM 2025 Poster
3
Training Software Engineering Agents and Verifiers with SWE-Gym
ICML 2025 Poster
4
RAGGED: Towards Informed Design of Scalable and Stable RAG Systems
ICML 2025 Poster
4
RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems
ICLR 2025 Rejected
4
Beyond Browsing: API-Based Web Agents
ICLR 2025 Withdrawn
4
Repetition Improves Language Model Embeddings
ICLR 2025 Poster
5
Agent Workflow Memory
ICLR 2025 Rejected
3
Demystifying Long Chain-of-Thought Reasoning
ICML 2025 Poster
4
Agent Workflow Memory
ICML 2025 Poster
3
Training Task Experts through Retrieval Based Distillation
ICLR 2025 Withdrawn
4
Checklists Are Better Than Reward Models For Aligning Language Models
NeurIPS 2025 Spotlight
4
Do LLMs Understand Your Translations? Evaluating Paragraph-level MT with Question Answering
COLM 2025 Poster
3
Better Instruction-Following Through Minimum Bayes Risk
ICLR 2025 Spotlight
4
M-Prometheus: A Suite of Open Multilingual LLM Judges
COLM 2025 Poster
4
Overtrained Language Models Are Harder to Fine-Tune
ICML 2025 Poster
4
Harnessing Webpage UIs for Text-Rich Visual Understanding
ICLR 2025 Poster
4
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
COLM 2025 Poster
4
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
ICLR 2025 Poster
5
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
ICLR 2025 Rejected
4
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
ICLR 2025 Poster
2024 (15 papers)
4
An Incomplete Loop: Instruction Inference, Instruction Following, and In-Context Learning in Language Models
COLM 2024 Poster
4
Discovering Divergences between Language Models and Human Brains
ICLR 2024 Rejected
4
Divergences between Language Models and Human Brains
NeurIPS 2024 Poster
3
Alignment for Honesty
NeurIPS 2024 Poster
4
Self-Guide: Better Task-Specific Instruction Following via Self-Synthetic Finetuning
COLM 2024 Poster
4
What Are Tools Anyway? A Survey from the Language Model Perspective
COLM 2024 Poster
4
Fine-grained Hallucination Detection and Editing for Language Models
COLM 2024 Poster
4
VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
COLM 2024 Poster
5
Do LLMs exhibit human-like response biases? A case study in survey design
ICLR 2024 Rejected
4
MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures
NeurIPS 2024 Poster
4
Learning Performance-Improving Code Edits
ICLR 2024 Spotlight
3
FacTool: Factuality Detection in Generative AI - A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
ICLR 2024 Rejected
4
Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale
NeurIPS 2024 Poster
3
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
ICLR 2024 Spotlight
3
WebArena: A Realistic Web Environment for Building Autonomous Agents
ICLR 2024 Poster