Xizhou Zhu
~Xizhou_Zhu1
13
Total papers
6.5
Submissions per year
Average rating
Accepted: 11/13
Venue distribution
ICLR
7
NeurIPS
5
ICML
1
Publications (13 papers)
2025 (6 papers)
4
CoMemo: LVLMs Need Image Context with Image Memory
ICML 2025 Poster
3
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints
NeurIPS 2025 Poster
3
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
ICLR 2025 Spotlight
4
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
ICLR 2025 Withdrawn
5
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
ICLR 2025 Poster
4
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
ICLR 2025 Spotlight
2024 (7 papers)
3
Parameter-Inverted Image Pyramid Networks
NeurIPS 2024 Spotlight
3
Ghost in the Minecraft: Hierarchical Agents for Minecraft via Large Language Models with Text-based Knowledge and Memory
ICLR 2024 Rejected
4
Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
NeurIPS 2024 Poster
4
Learning 1D Causal Visual Representation with De-focus Attention Networks
NeurIPS 2024 Poster
3
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
NeurIPS 2024 Poster
3
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
ICLR 2024 Poster
4
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
ICLR 2024 Poster