PaperHub

Rongrong Ji

~Rongrong_Ji5

51
论文总数
25.5
年均投稿
5.6
平均评分
接收情况32/51
会议分布
ICLR
27
NeurIPS
16
ICML
8

发表论文 (51 篇)

202540

3.8
4

Revisiting Adversarial Examples from the Perspective of Asymptotic Equipartition Property

ICLR 2025Rejected
5.5
4

Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective

ICML 2025Poster
7.0
3

EasyInv: Toward Fast and Better DDIM Inversion

ICML 2025Poster
6.0
4

CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models

NeurIPS 2025Poster
3.5
4

DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On

ICLR 2025withdrawn
4.8
4

SOLOS: Sparse Optimization For Long Sequence In Context Compression Enhanced LLMs

ICLR 2025Rejected
3.5
4

Towards Efficient Automatic Self-Pruning of Large Language Models

ICLR 2025Rejected
5.0
3

Hierarchy-Aided Sparse Attention For Fast LLMs Prefilling Inference

ICLR 2025Rejected
6.5
4

Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models

ICLR 2025Poster
4.5
4

Automated Fine-Grained Mixture-of-Experts Quantization

ICLR 2025withdrawn
3.7
3

Multi-Perspective Test-Time Prompt Tuning for Global, Local Visuals, and Language

ICLR 2025withdrawn
7.2
4

Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective

ICML 2025Spotlight
6.3
3

GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models

ICML 2025Poster
5.5
4

Discovering Important Experts for Mixture-of-Experts Models Pruning Through a Theoretical Perspective

NeurIPS 2025Poster
5.8
5

Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models

ICLR 2025Poster
4.5
4

Mitigating Gradient Interference for Efficient Sparse Fine-Tuning of Large Language Models

ICLR 2025Rejected
5.5
4

AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection

ICLR 2025Rejected
7.5
5

Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval

NeurIPS 2025Poster
6.4
4

Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings

NeurIPS 2025Poster
7.8
4

DAMamba: Vision State Space Model with Dynamic Adaptive Scan

NeurIPS 2025Poster
7.0
3

BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training

ICML 2025Poster
5.5
4

DS-VLM: Diffusion Supervision Vision Language Model

ICML 2025Poster
6.1
4

polybasic Speculative Decoding Through a Theoretical Perspective

ICML 2025Poster
4.8
4

Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model

ICLR 2025Rejected
4.8
4

Tradiffusion++:Hierarchical Guidance for Fine-Grained Trajectory-Based Image Generation

ICLR 2025withdrawn
4.0
4

Less is More: Learning Reference Knowledge Using No-Reference Image Quality Assessment

ICLR 2025Rejected
6.7
3

$\gamma-$MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models

ICLR 2025Poster
5.2
5

Dynamic Low-Rank Sparse Adaptation for Large Language Models

ICLR 2025Poster
3.0
4

Polybasic Speculative Decoding Under a Theoretical Perspective

ICLR 2025withdrawn
5.5
4

Weakly-supervised 3D Referring Expression Segmentation

ICLR 2025Rejected
6.5
4

Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

ICLR 2025Poster
5.0
4

BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training

ICLR 2025Rejected
6.4
5

FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification

ICML 2025Poster
4.5
4

Exploring the Collaborative Advantage of Low-level Information on Generalizable AI-generateted Image Detection

ICLR 2025withdrawn
4.3
4

Extreme composite compression of large language models through joint optimization

ICLR 2025Rejected
5.0
5

Low Rank Quantization Adaptation for Large Language Model

ICLR 2025Rejected
6.4
4

Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs

NeurIPS 2025Poster
6.8
4

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

NeurIPS 2025Poster
6.8
4

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

NeurIPS 2025Spotlight
6.4
4

VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model

NeurIPS 2025Poster

202411