Wenqi Shao
~Wenqi_Shao2
28
Total papers
14.0
Average submissions per year
Average rating
Acceptance: 13/28
Venue distribution
ICLR
23
NeurIPS
4
ICML
1
Papers (28)
2025: 18 papers
4
LLaMA Decoder As Vision Transformer
ICLR 2025 Rejected
3
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
ICLR 2025 Rejected
5
Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing
ICLR 2025 Withdrawn
4
MatchMask: Mask-Centric Generative Data Augmentation for Label-Scarce Semantic Segmentation
ICLR 2025 Withdrawn
3
TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts
ICLR 2025 Rejected
4
SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement
ICLR 2025 Poster
4
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
ICLR 2025 Rejected
4
HRVMamba: High-Resolution Visual State Space Model for Dense Prediction
ICLR 2025 Withdrawn
4
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression
ICLR 2025 Withdrawn
4
TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception
NeurIPS 2025 Spotlight
5
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
ICLR 2025 Rejected
3
PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
ICLR 2025 Rejected
4
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
ICML 2025 Poster
4
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
ICLR 2025 Oral
4
EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents
ICLR 2025 Poster
4
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis
NeurIPS 2025 Poster
5
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation
ICLR 2025 Spotlight
5
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
ICLR 2025 Poster
2024: 10 papers
5
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models
ICLR 2024 Spotlight
4
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation
ICLR 2024 Poster
4
Simple CNN for Vision
ICLR 2024 Rejected
3
SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge
NeurIPS 2024 Poster
4
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
ICLR 2024 Rejected
5
Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face
ICLR 2024 Withdrawn
4
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models
ICLR 2024 Poster
3
SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving
ICLR 2024 Rejected
4
CTRL: Graph condensation via crafting rational trajectory matching
ICLR 2024 Rejected
4
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality
NeurIPS 2024 Poster