Can Huang
~Can_Huang1
10
论文总数
5.0
年均投稿
平均评分
接收情况4/10
会议分布
ICLR
6
NeurIPS
4
发表论文 (10 篇)
20256 篇
4
Video Q-Former: Multimodal Large Language Model with Spatio-Temporal Querying Transformer Towards Video Understanding
ICLR 2025Rejected
4
GLOMA: Global Video Text Spotting with Morphological Association
ICLR 2025Poster
4
MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark
ICLR 2025Rejected
4
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
ICLR 2025Rejected
5
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering
ICLR 2025withdrawn
4
TextSquare: Scaling up Text-Centric Visual Instruction Tuning
ICLR 2025withdrawn
20244 篇
4
PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition
NeurIPS 2024Poster
4
Harmonizing Visual Text Comprehension and Generation
NeurIPS 2024Poster
3
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
NeurIPS 2024Poster
4
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
NeurIPS 2024Rejected