Caiming Xiong
~Caiming_Xiong1
43
论文总数
21.5
年均投稿
平均评分
接收情况24/43
会议分布
ICLR
34
ICML
4
NeurIPS
3
COLM
2
发表论文 (43 篇)
202531 篇
4
DyMU: Dynamic Merging and Virtual Unmerging for Efficient Variable-Length VLMs
NeurIPS 2025Poster
4
JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking
ICLR 2025Rejected
4
Trust but Verify: Programmatic VLM Evaluation in the Wild
ICLR 2025Rejected
4
Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators
ICML 2025Poster
4
Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
NeurIPS 2025Poster
4
Direct Judgement Preference Optimization
ICLR 2025Rejected
3
Expanding the Web, Smaller Is Better: A Comprehensive Study in Post-training
ICLR 2025withdrawn
4
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning
ICLR 2025Poster
4
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement
ICLR 2025Oral
3
GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
ICLR 2025Poster
4
CodeXEmbed: A Generalist Embedding Model Family for Multilingual and Multi-task Code Retrieval
COLM 2025Poster
4
UniTST: Effectively Modeling Inter-Series and Intra-Series Dependencies for Multivariate Time Series Forecasting
ICLR 2025Rejected
5
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
ICLR 2025Poster
4
SiReRAG: Indexing Similar and Related Information for Multihop Reasoning
ICLR 2025Poster
4
GIFT-Eval: A Benchmark for General Time Series Forecasting Model Evaluation
ICLR 2025Rejected
3
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
ICLR 2025Spotlight
5
ThinK: Thinner Key Cache by Query-Driven Pruning
ICLR 2025Spotlight
6
MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs
ICLR 2025withdrawn
3
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
ICML 2025Poster
4
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
ICLR 2025Rejected
4
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
ICML 2025Poster
4
BingoGuard: LLM Content Moderation Tools with Risk Levels
ICLR 2025Poster
4
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
ICLR 2025Rejected
4
BLIP-3-Video: You Only Need 32 Tokens to Represent a Video Even in VLMs
ICLR 2025Rejected
5
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
ICLR 2025Rejected
4
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
ICML 2025Poster
4
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
ICLR 2025Oral
3
LAM Simulator: Advancing Large Action Model Training for Agent via Online Exploration and Feedback Simulation
ICLR 2025Rejected
4
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
ICLR 2025Poster
4
MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases
ICLR 2025Rejected
4
Bridging the Data Provenance Gap Across Text, Speech, and Video
ICLR 2025Poster
202412 篇
4
Parameter-Efficient Detoxification with Contrastive Decoding
ICLR 2024Rejected
4
INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness
NeurIPS 2024Poster
4
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight
ICLR 2024Poster
4
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations
ICLR 2024Poster
4
Unlocking Anticipatory Text Generation: A Constrained Approach for Faithful Decoding with Large Language Models
ICLR 2024Rejected
4
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
ICLR 2024Rejected
4
Text2Data: Low-Resource Data Generation with Textual Control
ICLR 2024Rejected
5
X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning
ICLR 2024withdrawn
3
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
ICLR 2024Spotlight
4
REX: Rapid Exploration and eXploitation for AI agents
ICLR 2024Rejected
3
OpenAgents: An Open Platform for Language Agents in the Wild
COLM 2024Poster
4
Lemur: Harmonizing Natural Language and Code for Language Agents
ICLR 2024Spotlight