PaperHub

Maosong Sun

~Maosong_Sun1

37
论文总数
18.5
年均投稿
6.1
平均评分
接收情况27/37
会议分布
ICLR
23
COLM
7
NeurIPS
6
ICML
1

发表论文 (37 篇)

202523

6.0
4

Stuffed Mamba: Oversized States Lead to the Inability to Forget

COLM 2025Poster
6.0
5

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

ICLR 2025Poster
5.5
4

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System

ICLR 2025Rejected
3.6
5

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling

ICLR 2025Rejected
7.0
3

The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training

NeurIPS 2025Poster
5.3
4

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

ICLR 2025Rejected
7.3
4

A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

NeurIPS 2025Poster
5.7
3

Rational Decision-Making Agent with Learning Internal Utility Judgment

ICLR 2025Poster
7.0
3

BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

COLM 2025Poster
5.8
4

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

ICLR 2025Rejected
6.8
4

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

NeurIPS 2025Poster
7.8
3

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

ICML 2025Poster
5.0
4

Selecting Influential Samples for Long Context Alignment via Homologous Models’ Guidance and Contextual Awareness Measurement

ICLR 2025withdrawn
6.3
4

WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models

ICLR 2025Poster
7.2
5

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

ICLR 2025Spotlight
6.0
4

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

ICLR 2025Poster
6.0
5

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards

ICLR 2025Poster
7.0
4

Scaling Large Language Model-based Multi-Agent Collaboration

ICLR 2025Poster
4.6
5

Improving Zero-Shot Generalization of Instruction Tuning by Data Arrangement

ICLR 2025withdrawn
6.4
4

Multi-Agent Collaboration via Evolving Orchestration

NeurIPS 2025Poster
7.0
3

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

COLM 2025Poster
5.5
4

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

ICLR 2025Poster
6.5
4

Advancing LLM Reasoning Generalists with Preference Trees

ICLR 2025Poster

202414