Wenhai Wang
~Wenhai_Wang2
15
论文总数
7.5
年均投稿
平均评分
接收情况14/15
会议分布
NeurIPS
8
ICLR
5
ICML
2
发表论文 (15 篇)
202510 篇
4
CycleVTON: Improving Diffusion-Based Virtual Try-On with Cycle-Consistent Training
ICLR 2025withdrawn
4
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
ICLR 2025Spotlight
4
CoMemo: LVLMs Need Image Context with Image Memory
ICML 2025Poster
4
OPMapper: Enhancing Open-Vocabulary Semantic Segmentation with Multi-Guidance Information
NeurIPS 2025Poster
4
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis
NeurIPS 2025Poster
3
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost
ICML 2025Poster
4
Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings
NeurIPS 2025Poster
3
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
ICLR 2025Spotlight
4
ArchCAD-400K: A Large-Scale CAD drawings Dataset and New Baseline for Panoptic Symbol Spotting
NeurIPS 2025Poster
3
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints
NeurIPS 2025Poster
20245 篇
4
Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments
ICLR 2024Spotlight
3
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
ICLR 2024Poster
3
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
NeurIPS 2024Poster
4
Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
NeurIPS 2024Poster
3
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
NeurIPS 2024Poster