PaperHub

Ge Zhang

~Ge_Zhang5

31
Total papers
15.5
Submissions per year
6.1
Average rating
Accepted: 19/31
Venue distribution
ICLR
23
NeurIPS
6
COLM
2

Publications (31 papers)

2025 (20 papers)

5.8
4

OmniBench: Towards The Future of Universal Omni-Language Models

ICLR 2025 Rejected
6.7
3

VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text

ICLR 2025 Poster
6.4
5

General-Reasoner: Advancing LLM Reasoning Across All Domains

NeurIPS 2025 Poster
6.8
4

FlexWorld: Progressively Expanding 3D Scenes for Flexible-View Exploration

NeurIPS 2025 Poster
5.8
5

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

ICLR 2025 Poster
4.6
5

KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation

ICLR 2025 Withdrawn
4.3
4

ING-VP: MLLMs Cannot Play Easy Vision-based Games Yet

ICLR 2025 Rejected
6.5
4

McEval: Massively Multilingual Code Evaluation

ICLR 2025 Poster
5.8
5

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

ICLR 2025 Rejected
4.8
4

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

ICLR 2025 Withdrawn
5.5
4

M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation

ICLR 2025 Rejected
5.8
4

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models

ICLR 2025 Poster
7.0
4

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks

ICLR 2025 Poster
5.5
4

MIO: A Foundation Model on Multimodal Tokens

ICLR 2025 Rejected
6.8
4

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models

ICLR 2025 Poster
5.0
3

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

ICLR 2025 Rejected
4.0
5

Can MLLMs Understand the Deep Implication Behind Chinese Images?

ICLR 2025 Withdrawn
6.0
4

LIME: Less Is More for MLLM Evaluation

ICLR 2025 Rejected
6.5
4

MuPT: A Generative Symbolic Music Pretrained Transformer

ICLR 2025 Poster
9.1
4

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

NeurIPS 2025 Spotlight

2024 (11 papers)