Ge Zhang
~Ge_Zhang5
31
论文总数
15.5
年均投稿
平均评分
接收情况19/31
会议分布
ICLR
23
NeurIPS
6
COLM
2
发表论文 (31 篇)
202520 篇
4
OmniBench: Towards The Future of Universal Omni-Language Models
ICLR 2025Rejected
3
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text
ICLR 2025Poster
5
General-Reasoner: Advancing LLM Reasoning Across All Domains
NeurIPS 2025Poster
4
FlexWorld: Progressively Expanding 3D Scenes for Flexible-View Exploration
NeurIPS 2025Poster
5
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
ICLR 2025Poster
5
KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation
ICLR 2025withdrawn
4
ING-VP: MLLMs Cannot Play Easy Vision-based Games Yet
ICLR 2025Rejected
4
McEval: Massively Multilingual Code Evaluation
ICLR 2025Poster
5
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
ICLR 2025Rejected
4
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
ICLR 2025withdrawn
4
M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
ICLR 2025Rejected
4
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
ICLR 2025Poster
4
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks
ICLR 2025Poster
4
MIO: A Foundation Model on Multimodal Tokens
ICLR 2025Rejected
4
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models
ICLR 2025Poster
3
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
ICLR 2025Rejected
5
Can MLLMs Understand the Deep Implication Behind Chinese Images?
ICLR 2025withdrawn
4
LIME: LESS IS MORE FOR MLLM EVALUATION
ICLR 2025Rejected
4
MuPT: A Generative Symbolic Music Pretrained Transformer
ICLR 2025Poster
4
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
NeurIPS 2025Spotlight
202411 篇
4
Massive Editing for Large Language Models via Meta Learning
ICLR 2024Poster
5
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
COLM 2024Poster
4
TIGERScore: Building Explainable Metric for All Text Generation Task
ICLR 2024Rejected
4
MAmmoTH2: Scaling Instructions from the Web
NeurIPS 2024Poster
3
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models
NeurIPS 2024Poster
5
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
ICLR 2024Spotlight
4
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
ICLR 2024Poster
5
Training Socially Aligned Language Models on Simulated Social Interactions
ICLR 2024Poster
4
AutoAgents: A Framework for Automatic Agent Generation
ICLR 2024Rejected
4
DDK: Distilling Domain Knowledge for Efficient Large Language Models
NeurIPS 2024Poster
4
Chinese Tiny LLM: Pretraining a Chinese-Centered Large Language Model
COLM 2024Poster