Impact Index

98.36/100

Top 0.1%

Site-wide rank: #40

Papers published: 70

Average rating: 5.2

Average annual output: 23.3 papers/year

Rongrong Ji

Full Professor @ Xiamen University · China · OpenReview

Research Areas

Model Compression

Out of the Memory Barrier: A Highly Memory-Efficient Training System for LLMs with Million-Token Contexts

ICLR 2026 Poster

SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence

ICLR 2026 Poster

Flow Caching for Autoregressive Video Generation

ICLR 2026 Poster

Mix-Ecom: Towards Mixed-Type E-Commerce Dialogues with Complex Domain Rules

ICLR 2026 Poster

RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning

ICLR 2026 Poster

FlexibleLLM: Making Low-Bit Quantization for Large Language Models More Flexible and Efficient

ICLR 2026 Rejected

Event-Anchored Frame Selection for Efficient Long-Video Understanding

ICLR 2026 Withdrawn

HS-SFT: Hybrid Sparse Supervised Fine-tuning for Offline LLM KV Cache Eviction

ICLR 2026 Withdrawn

RoPK: A Head-Level Key Cache Channel Pruning Method for Efficient Long-Context LLM Inference

ICLR 2026 Rejected

Towards Efficient Chain-of-Thought Reasoning via Adaptive-Budgeting based Policy Optimization

ICLR 2026 Withdrawn

LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation

ICLR 2026 Withdrawn

HGT-UCOD: A Hint-Guided Teacher Framework for Unsupervised Camouflaged Object Detection

ICLR 2026 Rejected

Breaking the Bias: Quantifying the Attention of Industrial Anomaly Detection

ICLR 2026 Withdrawn

FOLD: Fast Correct Speculative Decoding

ICLR 2026 Withdrawn

Modality-Aware Quantization: Balancing Visual and Textual Fidelity in Multimodal Compression

ICLR 2026 Rejected

Continuous Test-Time Adaptation of Vision-Language Models

ICLR 2026 Withdrawn

Beyond Proxy Metrics: A New Evaluation Framework for LLM Compression by Directly Measuring Generative Faithfulness

ICLR 2026 Rejected

Pseudo-Label Supervision in Unsupervised Industrial Anomaly Detection

ICLR 2026 Withdrawn

SplitQuant: Efficient Low-Bit Quantization for Diffusion Transformers via In-Channel Dimension Splitting

ICLR 2026 Rejected

DAMamba: Vision State Space Model with Dynamic Adaptive Scan

NeurIPS 2025 Poster

Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval

NeurIPS 2025 Poster

Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective

ICML 2025 Spotlight

EasyInv: Toward Fast and Better DDIM Inversion

ICML 2025 Poster

BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training

ICML 2025 Poster

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

NeurIPS 2025 Poster

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

NeurIPS 2025 Spotlight

$\gamma$-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models

ICLR 2025 Poster

Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models

ICLR 2025 Poster

Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

ICLR 2025 Poster

Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings

NeurIPS 2025 Poster

FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification

ICML 2025 Poster

Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs

NeurIPS 2025 Poster

VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model

NeurIPS 2025 Poster

GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models

ICML 2025 Poster

Polybasic Speculative Decoding Through a Theoretical Perspective

ICML 2025 Poster

CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models

NeurIPS 2025 Poster

Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models

ICLR 2025 Poster

Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective

ICML 2025 Poster

Discovering Important Experts for Mixture-of-Experts Models Pruning Through a Theoretical Perspective

NeurIPS 2025 Poster

AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection

ICLR 2025 Rejected

DS-VLM: Diffusion Supervision Vision Language Model

ICML 2025 Poster

Weakly-supervised 3D Referring Expression Segmentation

ICLR 2025 Rejected

Dynamic Low-Rank Sparse Adaptation for Large Language Models

ICLR 2025 Poster

Hierarchy-Aided Sparse Attention For Fast LLMs Prefilling Inference

ICLR 2025 Rejected

BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training

ICLR 2025 Rejected

Low Rank Quantization Adaptation for Large Language Model

ICLR 2025 Rejected

SOLOS: Sparse Optimization For Long Sequence In Context Compression Enhanced LLMs

ICLR 2025 Rejected

Tradiffusion++: Hierarchical Guidance for Fine-Grained Trajectory-Based Image Generation

ICLR 2025 Withdrawn

Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model

ICLR 2025 Rejected

Automated Fine-Grained Mixture-of-Experts Quantization

ICLR 2025 Withdrawn

Mitigating Gradient Interference for Efficient Sparse Fine-Tuning of Large Language Models

ICLR 2025 Rejected

Exploring the Collaborative Advantage of Low-level Information on Generalizable AI-generated Image Detection

ICLR 2025 Withdrawn

Extreme composite compression of large language models through joint optimization

ICLR 2025 Rejected

Less is More: Learning Reference Knowledge Using No-Reference Image Quality Assessment

ICLR 2025 Rejected

Revisiting Adversarial Examples from the Perspective of Asymptotic Equipartition Property

ICLR 2025 Rejected

Multi-Perspective Test-Time Prompt Tuning for Global, Local Visuals, and Language

ICLR 2025 Withdrawn

DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On

ICLR 2025 Withdrawn

Towards Efficient Automatic Self-Pruning of Large Language Models

ICLR 2025 Rejected

Polybasic Speculative Decoding Under a Theoretical Perspective

ICLR 2025 Withdrawn

Collaborators (20)

Collaborator: 12 papers