暂无评分数据
ICLR 2024
Overcoming Weak Visual-Textual Alignment for Video Moment Retrieval
TL;DR
We propose BM-DETR to tackle weak visual-textual alignment, effectively identifying relevant visual features via proposed background moment dection. BM-DETR has demonstrated superior performance and generalization ability on various VMR datasets..
摘要
关键词
Video Moment Retrieval; Video and Language;
评审与讨论
暂无评审记录