Limin Wang
~Limin_Wang1
29
论文总数
14.5
年均投稿
平均评分
接收情况19/29
会议分布
ICLR
17
NeurIPS
10
ICML
2
发表论文 (29 篇)
202521 篇
3
RotPruner: Large Language Model Pruning in Rotated Space
ICLR 2025Rejected
4
MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation
NeurIPS 2025Poster
4
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
ICLR 2025Poster
4
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
ICLR 2025withdrawn
4
Tra-MoE: Scaling Trajectory Prediction Models for Adaptive Policy Conditioning
ICLR 2025withdrawn
3
TrackMamba: Mamba-Transformer Tracking
ICLR 2025withdrawn
4
Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training
ICLR 2025withdrawn
4
Efficient Test-Time Prompt Tuning for Vision-Language Models
ICLR 2025Rejected
5
Stochastic Layer-Wise Shuffle for Improving Vision Mamba Training
ICML 2025Poster
4
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
ICLR 2025Rejected
3
Differentiable Solver Search for Fast Diffusion Sampling
ICML 2025Poster
3
Differentiable Solver Search for fast diffusion sampling
ICLR 2025Rejected
4
VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception
NeurIPS 2025Poster
5
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
ICLR 2025Poster
4
LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization
NeurIPS 2025Poster
4
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
ICLR 2025Poster
4
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
ICLR 2025Poster
4
StreamForest: Efficient Online Video Understanding with Persistent Event Memory
NeurIPS 2025Spotlight
4
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
NeurIPS 2025Poster
5
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
ICLR 2025Poster
4
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
ICLR 2025Spotlight
20248 篇
4
ZeroI2V: Zero-Cost Adaptation of Pre-Trained Transformers from Image to Video
ICLR 2024Rejected
4
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens
ICLR 2024Poster
4
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
NeurIPS 2024Poster
4
Spatiotemporal Predictive Pre-training for Robotic Motor Control
NeurIPS 2024Rejected
4
Does Video-Text Pretraining Help Open-Vocabulary Online Action Detection?
NeurIPS 2024Poster
4
VFIMamba: Video Frame Interpolation with State Space Models
NeurIPS 2024Poster
4
Exploring DCN-like architecture for fast image generation with arbitrary resolution
NeurIPS 2024Poster
4
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
ICLR 2024Spotlight