Hao Peng
~Hao_Peng4
21
论文总数
10.5
年均投稿
平均评分
接收情况14/21
会议分布
ICLR
16
NeurIPS
3
COLM
1
ICML
1
发表论文 (21 篇)
202514 篇
4
The Best Instruction-Tuning Data are Those That Fit
NeurIPS 2025Spotlight
4
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
NeurIPS 2025Poster
4
Retrieval Head Mechanistically Explains Long-Context Factuality
ICLR 2025Oral
4
FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs
ICLR 2025withdrawn
4
$\textbf{PLUM}$: Improving Code LMs Using On-Policy Preference Learning Powered by Automatic Test Cases
ICLR 2025withdrawn
4
Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities
ICLR 2025withdrawn
5
A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts
ICLR 2025Poster
4
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
NeurIPS 2025Poster
4
S2-Attention: Hardware-Aware Context Sharding Among Attention Heads
ICLR 2025Rejected
5
Eliminating Position Bias of Language Models: A Mechanistic Approach
ICLR 2025Poster
5
Free Process Rewards without Process Labels
ICML 2025Poster
4
Scaling Diffusion Language Models via Adaptation from Autoregressive Models
ICLR 2025Poster
4
Advancing LLM Reasoning Generalists with Preference Trees
ICLR 2025Poster
4
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
ICLR 2025Poster
20247 篇
4
Efficiency Pentathlon: A Standardized Benchmark for Efficiency Evaluation
ICLR 2024Rejected
4
FiLM: Fill-in Language Models for Any-Order Generation
ICLR 2024Rejected
6
LETI: Learning to Generate from Textual Interactions
ICLR 2024withdrawn
4
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
ICLR 2024Spotlight
3
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
ICLR 2024Poster
4
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
ICLR 2024Poster
4
Source-Aware Training Enables Knowledge Attribution in Language Models
COLM 2024Poster