Yilun Zhao
~Yilun_Zhao1
9
论文总数
4.5
年均投稿
平均评分
接收情况5/9
会议分布
ICLR
6
COLM
2
NeurIPS
1
发表论文 (9 篇)
20255 篇
4
PuzzlePlex: A Benchmark to Evaluate the Reasoning and Planning of Large Language Models on Puzzles
ICLR 2025Rejected
4
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models
ICLR 2025Poster
4
MSRS: Evaluating Multi-Source Retrieval-Augmented Generation
COLM 2025Poster
4
ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning
ICLR 2025Poster
4
ML-Bench: Evaluating Large Language Models for Code Generation in Repository-Level Machine Learning Tasks
ICLR 2025Rejected
20244 篇
-
FinDA: A New Dataset for Query-focused and Trustworthy Document Analysis Generation
ICLR 2024withdrawn
3
AN ENTROPY PERSPECTIVE IN KNOWLEDGE DISTILLATION
ICLR 2024withdrawn
5
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in LLMs
NeurIPS 2024Poster
4
Evaluating LLMs at Detecting Errors in LLM Responses
COLM 2024Poster