影响力指数

96.32/100

前 0.2%

全站排名 #122

发表论文55 篇

平均评分5.4

年均产出18.3 篇/年

Maosong Sun

Full Professor@Tsinghua University·中国·OpenReview

研究方向

natural language processing

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

ICLR 2026Poster

KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs

ICLR 2026Rejected

CPMöbius: Iterative Coach–Player Reasoning for Data-Free Reinforcement Learning

ICLR 2026Rejected

Test-Time Exploration in Unknown Environments

ICLR 2026Rejected

Process Reinforcement through Implicit Rewards

ICLR 2026Rejected

Exploration for Building Next-Generation Foundation MLLMs via Self-Learning

ICLR 2026Withdrawn

Evidence-Guided Multi-Image Reasoning in Visual Retrieval-Augmented Generation

ICLR 2026Withdrawn

LLaVA-UHD v3: Progressive Visual Compression for Efficient Naive-Resolution Encoding in MLLMs

ICLR 2026Withdrawn

KARE-RAG: Knowledge-Aware Refinement and Enhancement for RAG

ICLR 2026Rejected

Listens like Mel: Boosting Latent Audio Diffusion with Channel Locality

ICLR 2026Rejected

Query Routing over Multimodal Knowledge Bases for Retrieval-Augmented Reasoning

ICLR 2026Withdrawn

AUTOTRITON: Automatic Triton Programming with Reinforcement Learning in LLMs

ICLR 2026Rejected

RLPR: Extrapolating RLVR to General Domains without Verifiers

ICLR 2026Rejected

Musical Score Understanding Benchmark: Evaluating Large Language Models’ Comprehension of Complete Musical Scores

ICLR 2026Withdrawn

StateX: Enhancing RNN Recall via Post-training State Expansion

ICLR 2026Withdrawn

EMind: A Foundation Model for Multi-task Electromagnetic Signals Understanding

ICLR 2026Withdrawn

Diversity-aware Training for Test-time Scaling

ICLR 2026Rejected

Reflective Reinforcement Tool Learning

ICLR 2026Withdrawn

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

ICML 2025Poster

A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

NeurIPS 2025Poster

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

ICLR 2025Spotlight

The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training

NeurIPS 2025Poster

BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

COLM 2025Poster

Scaling Large Language Model-based Multi-Agent Collaboration

ICLR 2025Poster

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

COLM 2025Poster

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

NeurIPS 2025Poster

Advancing LLM Reasoning Generalists with Preference Trees

ICLR 2025Poster

Multi-Agent Collaboration via Evolving Orchestration

NeurIPS 2025Poster

WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models

ICLR 2025Poster

Stuffed Mamba: Oversized States Lead to the Inability to Forget

COLM 2025Poster

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

ICLR 2025Poster

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

ICLR 2025Poster

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards

ICLR 2025Poster

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

ICLR 2025Rejected

Rational Decision-Making Agent with Learning Internal Utility Judgment

ICLR 2025Poster

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System

ICLR 2025Rejected

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

ICLR 2025Poster

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

ICLR 2025Rejected

Selecting Influential Samples for Long Context Alignment via Homologous Models’ Guidance and Contextual Awareness Measurement

ICLR 2025Withdrawn

Improving Zero-Shot Generalization of Instruction Tuning by Data Arrangement

ICLR 2025Withdrawn

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling

ICLR 2025Rejected

合作者 (20)

PhD Advisee48 篇