Banghua Zhu

Assistant Professor @ University of Washington · United States · OpenReview

Research Interests

Reinforcement Learning · Computational Economics · Foundation Models · Information Theory · Theoretical Statistics · Robust Statistics · Large Language Models

5.5

MMMG: A Comprehensive and Reliable Benchmark for Multitask Multimodal Generation

ICLR 2026 · Rejected

Corresponding author

4.5

Unlocking Long-Horizon Agentic Search with Large-Scale End-to-End RL

ICLR 2026 · Poster

4.0

Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regression

ICLR 2026 · Poster

4.0

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

ICLR 2026 · Withdrawn

6.3

Taming Overconfidence in LLMs: Reward Calibration in RLHF

ICLR 2025 · Poster

Third author

6.3

How to Evaluate Reward Models for RLHF

ICLR 2025 · Poster

6.1

From Crowdsourced Data to High-quality Benchmarks: Arena-Hard and Benchbuilder Pipeline

ICML 2025 · Poster

6.0

Watermarking using Semantic-aware Speculative Sampling: from Theory to Practice

ICLR 2025 · Rejected

6.0

Bench-O-Matic: Automating Benchmark Curation from Crowdsourced Data

Collaborators (20)