Tuo Zhao
~Tuo_Zhao1
4
论文总数
4.0
年均投稿
平均评分
接收情况1/4
会议分布
ICLR
4
发表论文 (4 篇)
20244 篇
3
On Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems
ICLR 2024Rejected
4
Deep Reinforcement Learning from Weak Hierarchical Preference Feedback
ICLR 2024Rejected
5
Efficient Long Sequence Modeling via State Space Augmented Transformer
ICLR 2024Rejected
3
LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models
ICLR 2024Oral