PaperHub

No rating data available

ICLR 2025

EVLM: An Efficient Vision-Language Model for Visual Understanding

Submitted: 2024-09-24; Updated: 2024-10-26
TL;DR

An efficient vision-language model for visual understanding.

Abstract

Keywords
multi-modal language models, cross-attention, MoE

Reviews and Discussion

Desk Rejection

Reason for Desk Rejection

violation of formatting guideline (margin, length)