Wei Shen
~Wei_Shen11
7
论文总数
7.0
年均投稿
平均评分
接收情况5/7
会议分布
NeurIPS
4
ICML
2
ICLR
1
发表论文 (7 篇)
20257 篇
4
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
NeurIPS 2025Poster
3
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation
ICLR 2025Rejected
4
Policy Filtration for RLHF to Mitigate Noise in Reward Models
ICML 2025Poster
4
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning
NeurIPS 2025Rejected
5
HPSERec: A Hierarchical Partitioning and Stepwise Enhancement Framework for Long-tailed Sequential Recommendation
NeurIPS 2025Poster
4
What Do Latent Action Models Actually Learn?
NeurIPS 2025Poster
4
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
ICML 2025Poster