PaperHub

Caiming Xiong

~Caiming_Xiong1

43
论文总数
21.5
年均投稿
5.8
平均评分
接收情况24/43
会议分布
ICLR
34
ICML
4
NeurIPS
3
COLM
2

发表论文 (43 篇)

202531

6.8
4

DyMU: Dynamic Merging and Virtual Unmerging for Efficient Variable-Length VLMs

NeurIPS 2025Poster
5.3
4

JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking

ICLR 2025Rejected
5.0
4

Trust but Verify: Programmatic VLM Evaluation in the Wild

ICLR 2025Rejected
6.1
4

Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators

ICML 2025Poster
6.4
4

Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

NeurIPS 2025Poster
5.0
4

Direct Judgement Preference Optimization

ICLR 2025Rejected
3.7
3

Expanding the Web, Smaller Is Better: A Comprehensive Study in Post-training

ICLR 2025withdrawn
7.0
4

Automatic Curriculum Expert Iteration for Reliable LLM Reasoning

ICLR 2025Poster
7.5
4

ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement

ICLR 2025Oral
6.7
3

GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

ICLR 2025Poster
6.5
4

CodeXEmbed: A Generalist Embedding Model Family for Multilingual and Multi-task Code Retrieval

COLM 2025Poster
5.0
4

UniTST: Effectively Modeling Inter-Series and Intra-Series Dependencies for Multivariate Time Series Forecasting

ICLR 2025Rejected
5.8
5

FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"

ICLR 2025Poster
6.8
4

SiReRAG: Indexing Similar and Related Information for Multihop Reasoning

ICLR 2025Poster
5.3
4

GIFT-Eval: A Benchmark for General Time Series Forecasting Model Evaluation

ICLR 2025Rejected
7.3
3

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

ICLR 2025Spotlight
6.8
5

ThinK: Thinner Key Cache by Query-Driven Pruning

ICLR 2025Spotlight
4.2
6

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

ICLR 2025withdrawn
6.3
3

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

ICML 2025Poster
5.5
4

Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts

ICLR 2025Rejected
6.6
4

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

ICML 2025Poster
6.5
4

BingoGuard: LLM Content Moderation Tools with Risk Levels

ICLR 2025Poster
5.5
4

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

ICLR 2025Rejected
6.3
4

BLIP-3-Video: You Only Need 32 Tokens to Represent a Video Even in VLMs

ICLR 2025Rejected
3.8
5

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

ICLR 2025Rejected
4.9
4

Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts

ICML 2025Poster
8.0
4

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

ICLR 2025Oral
6.0
3

LAM Simulator: Advancing Large Action Model Training for Agent via Online Exploration and Feedback Simulation

ICLR 2025Rejected
6.3
4

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

ICLR 2025Poster
4.5
4

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

ICLR 2025Rejected
6.5
4

Bridging the Data Provenance Gap Across Text, Speech, and Video

ICLR 2025Poster

202412