Xiang Yue
~Xiang_Yue1
25
论文总数
12.5
年均投稿
平均评分
接收情况18/25
会议分布
ICLR
14
COLM
5
NeurIPS
3
ICML
3
发表论文 (25 篇)
202519 篇
4
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
ICLR 2025Poster
5
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
ICLR 2025Rejected
4
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
COLM 2025Poster
3
Visual Perception in Text Strings
ICLR 2025Rejected
3
Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
COLM 2025Poster
4
Teach Multimodal LLMs to Comprehend Electrocardiographic Images
ICLR 2025withdrawn
4
Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA
ICLR 2025Rejected
3
Demystifying Long Chain-of-Thought Reasoning
ICML 2025Poster
4
Overtrained Language Models Are Harder to Fine-Tune
ICML 2025Poster
3
Underestimated Privacy Risks for Minority Populations in Large Language Model Unlearning
ICML 2025Poster
4
MixEval-X: Any-to-any Evaluations from Real-world Data Mixture
ICLR 2025Spotlight
3
Underestimated Privacy Risks for Minority Populations in Large Language Model Unlearning
ICLR 2025Rejected
3
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
ICLR 2025Rejected
4
Harnessing Webpage UIs for Text-Rich Visual Understanding
ICLR 2025Poster
4
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations
COLM 2025Poster
4
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks
ICLR 2025Poster
4
LIME: LESS IS MORE FOR MLLM EVALUATION
ICLR 2025Rejected
4
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
ICLR 2025Poster
4
MuPT: A Generative Symbolic Music Pretrained Transformer
ICLR 2025Poster
20246 篇
5
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
ICLR 2024Spotlight
4
MAmmoTH2: Scaling Instructions from the Web
NeurIPS 2024Poster
5
Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization
NeurIPS 2024Poster
4
MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures
NeurIPS 2024Poster
4
VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
COLM 2024Poster
5
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
COLM 2024Poster