影响力指数

93.07/100

前 0.4%

全站排名 #248

发表论文49 篇

平均评分5.3

年均产出16.3 篇/年

William Yang Wang

Full Professor@UC Santa Barbara·美国·OpenReview

研究方向

Vision and Language · Computational Social Science · Information Extraction

LogicReward: Incentivizing LLM Reasoning via Step-Wise Logical Supervision

ICLR 2026Poster

DevOps-Gym: Benchmarking AI Agents in Software DevOps Cycle

ICLR 2026Poster

Dynamic Speculative Agent Planning

ICLR 2026Poster

SOPBench: Evaluating Language Agents at Following Standard Operating Procedures and Constraints

ICLR 2026Rejected

Adversarial Training for Process Reward Models

ICLR 2026Desk Rejected

Do Larger Language Models Generalize Better? A Scaling Law for Implicit Reasoning at Pretraining Time

ICLR 2026Rejected

Self-Resource Allocation in Multi-Agent LLM Systems

ICLR 2026Rejected

PromptArmor: An Essential Baseline for Prompt Injection Defenses

ICLR 2026Rejected

Cost-effective Agent Test-time Scaling via Budget-Aware Thinking

ICLR 2026Rejected

HexMachina: Self-Evolving Multi-Agent System for Continual Learning of Catan

ICLR 2026Withdrawn

MuSLR: Multimodal Symbolic Logical Reasoning

NeurIPS 2025Poster

Weak-to-Strong Jailbreaking on Large Language Models

ICML 2025Poster

ThoughtTerminator: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

COLM 2025Poster

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

ICLR 2025Poster

T2V-Turbo-v2: Enhancing Video Model Post-Training through Data, Reward, and Conditional Guidance Design

ICLR 2025Poster

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

ICLR 2025Poster

MMSci: A Dataset for Graduate-Level Multi-Discipline Multimodal Scientific Understanding

ICLR 2025Rejected

COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement

ICLR 2025Rejected

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

COLM 2025Poster

Gödel Agent: A Self-Referential Framework Helps for Recursively Self-Improvement

ICLR 2025Rejected

Weak-to-Strong Jailbreaking on Large Language Models

ICLR 2025Rejected

Discovering Factor Level Preferences to Improve Human-Model Alignment

ICLR 2025Rejected

Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models

ICLR 2025Rejected

Generalization v.s. Memorization: Tracing Language Models’ Capabilities Back to Pretraining Data

ICLR 2025Poster

VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for MLLMs

ICLR 2025Withdrawn

TC-Bench: Benchmarking Temporal Compositionality in Conditional Video Generation

ICLR 2025Rejected

Can Editing LLMs Inject Harm?

ICLR 2025Rejected

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

ICML 2025Poster

Compact Multimodal Context Represenations Using Visual Tokens

ICLR 2025Rejected

Pixelated Instructions: Can Multimodal Large Language Models Follow Printed Instructions in Images?

ICLR 2025Rejected

SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement

ICLR 2025Poster

Detecting Training Data of Large Language Models via Expectation Maximization

ICLR 2025Rejected

DebUnc: Improving Large Language Model Agent Communication Via Uncertainty Metrics

ICLR 2025Withdrawn

合作者 (20)

Antonis Antoniades