Tuo Zhao
~Tuo_Zhao2
19
Total papers
9.5
Submissions per year
Average rating
Accepted: 16/19
Venue distribution
NeurIPS
8
COLM
5
ICLR
4
ICML
2
Published papers (19)
2025: 11 papers
5
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
NeurIPS 2025 Spotlight
4
Deep Reinforcement Learning from Hierarchical Preference Design
ICML 2025 Poster
5
A Minimalist Example of Edge-of-Stability and Progressive Sharpening
NeurIPS 2025 Poster
4
NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models
COLM 2025 Poster
5
Model Tells Itself Where to Attend: Steerable Prompting for Reliable Reading Comprehension of LLM
ICLR 2025 Withdrawn
3
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation
COLM 2025 Poster
4
Discriminative Finetuning of Generative Large Language Models without Reward Models and Human Preference Data
ICML 2025 Poster
4
Adversarial Training of Reward Models
COLM 2025 Poster
4
Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models
NeurIPS 2025 Poster
3
Ask a Strong LLM Judge when Your Reward Model is Uncertain
NeurIPS 2025 Poster
5
Self-Rewarding PPO: Aligning Large Language Models with Demonstrations Only
COLM 2025 Poster
2024: 8 papers
4
Provable Acceleration of Nesterov's Accelerated Gradient for Asymmetric Matrix Factorization and Linear Neural Networks
NeurIPS 2024 Poster
3
HART: Efficient Adaptation via Regularized Autoregressive Parameter Generation
ICLR 2024 Rejected
4
Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks
NeurIPS 2024 Poster
4
Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks
ICLR 2024 Rejected
4
Efficient Hybrid Long Sequence Modeling with State Space Augmented Transformers
COLM 2024 Poster
4
Robust Reinforcement Learning from Corrupted Human Feedback
NeurIPS 2024 Poster
4
Adaptive Preference Scaling for Reinforcement Learning with Human Feedback
NeurIPS 2024 Poster
4
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
ICLR 2024 Poster