暂无评分数据
ICLR 2025
EVLM: An Efficient Vision-Language Model for Visual Understanding
TL;DR
an efficient vision-language model for visual understanding.
摘要
关键词
multi-modal language modelscross-attentionmoe
评审与讨论
PC编辑台拒稿
直接拒稿原因
violation of formatting guideline (margin, length)