Huazheng Wang
~Huazheng_Wang1
8
论文总数
4.0
年均投稿
平均评分
接收情况6/8
会议分布
ICLR
4
ICML
2
NeurIPS
2
发表论文 (8 篇)
20254 篇
4
Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition
ICML 2025Poster
4
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement
ICLR 2025Poster
4
Design-Based Bandits Under Network Interference: Trade-Off Between Regret and Statistical Inference
NeurIPS 2025Poster
3
Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems
ICML 2025Spotlight
20244 篇
3
On Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems
ICLR 2024Rejected
4
RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement Learning
NeurIPS 2024Poster
4
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
ICLR 2024Poster
4
Adversarial Attacks on Combinatorial Multi-Armed Bandits
ICLR 2024Rejected