PaperHub

Zhiyuan Liu

~Zhiyuan_Liu1

43
论文总数
21.5
年均投稿
6.1
平均评分
接收情况33/43
会议分布
ICLR
25
NeurIPS
9
COLM
7
ICML
2

发表论文 (43 篇)

202525

5.5
4

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System

ICLR 2025Rejected
3.6
5

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling

ICLR 2025Rejected
5.8
5

Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads

ICLR 2025Rejected
6.4
4

Learning to Focus: Causal Attention Distillation via Gradient‐Guided Token Pruning

NeurIPS 2025Poster
6.0
5

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

ICLR 2025Poster
6.0
4

Stuffed Mamba: Oversized States Lead to the Inability to Forget

COLM 2025Poster
5.3
4

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

ICLR 2025Rejected
7.0
3

The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training

NeurIPS 2025Poster
5.7
3

Rational Decision-Making Agent with Learning Internal Utility Judgment

ICLR 2025Poster
7.3
4

A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

NeurIPS 2025Poster
6.8
4

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

NeurIPS 2025Poster
7.0
3

BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

COLM 2025Poster
5.5
5

Free Process Rewards without Process Labels

ICML 2025Poster
4.8
5

AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems

ICLR 2025withdrawn
6.3
4

WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models

ICLR 2025Poster
7.2
5

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

ICLR 2025Spotlight
7.8
3

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

ICML 2025Poster
6.0
5

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards

ICLR 2025Poster
6.0
4

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

ICLR 2025Poster
7.0
4

Scaling Large Language Model-based Multi-Agent Collaboration

ICLR 2025Poster
4.6
5

Improving Zero-Shot Generalization of Instruction Tuning by Data Arrangement

ICLR 2025withdrawn
6.4
4

Multi-Agent Collaboration via Evolving Orchestration

NeurIPS 2025Poster
7.0
3

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

COLM 2025Poster
5.5
4

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

ICLR 2025Poster
6.5
4

Advancing LLM Reasoning Generalists with Preference Trees

ICLR 2025Poster

202418

4.5
4

LLM-Oriented Retrieval Tuner

ICLR 2024Rejected
6.0
3

CA-LoRA: Adapting Existing LoRA for Compressed LLMs to Enable Efficient Multi-Tasking on Personal Devices

COLM 2024Poster
7.0
4

Unified View of Grokking, Double Descent and Emergent Abilities: A Comprehensive Study on Algorithm Task

COLM 2024Poster
5.6
5

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

ICLR 2024Rejected
5.8
4

OneBit: Towards Extremely Low-bit Large Language Models

NeurIPS 2024Poster
5.8
4

ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis

NeurIPS 2024Poster
5.0
4

InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory

NeurIPS 2024Poster
6.3
4

Rational Decision-Making Agent with Internalized Utility Judgment

ICLR 2024Rejected
5.6
5

ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate

ICLR 2024Poster
6.3
4

UltraFeedback: Boosting Language Models with High-quality Feedback

ICLR 2024Rejected
6.0
4

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models

NeurIPS 2024Poster
6.0
4

Predicting Emergent Abilities with Infinite Resolution Evaluation

ICLR 2024Poster
6.0
4

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

ICLR 2024Poster
6.5
4

UniMem: Towards a Unified View of Long-Context Large Language Models

COLM 2024Poster
5.5
4

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages

ICLR 2024Spotlight
7.0
4

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

ICLR 2024Spotlight
7.5
4

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

COLM 2024Poster
6.8
4

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

ICLR 2024Poster