Ruiyi Zhang
~Ruiyi_Zhang3
18
论文总数
9.0
年均投稿
平均评分
接收情况7/18
会议分布
ICLR
15
COLM
3
发表论文 (18 篇)
20259 篇
3
LLaVA-Read: Enhancing Reading Ability of Multimodal Large Language Models
ICLR 2025Rejected
5
VaQuitA: Enhancing Alignment in LLM-Assisted Zero-Shot Video Understanding
ICLR 2025withdrawn
4
SV-RAG: LoRA-Contextualizing Adaptation of MLLMs for Long Document Understanding
ICLR 2025Poster
4
ADOPD-Instruct: A Large-Scale Multimodal Dataset for Document Editing
ICLR 2025Rejected
5
Taipan: Efficient and Expressive State Space Language Models with Selective Attention
ICLR 2025Rejected
4
Enhancing Diffusion Posterior Sampling for Inverse Problems by Integrating Crafted Measurements
ICLR 2025withdrawn
3
VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use
ICLR 2025Rejected
4
OIDA-QA: A Multimodal Benchmark for Analyzing the Opioid Industry Document Archive
ICLR 2025Rejected
4
DynaSaur: Large Language Agents Beyond Predefined Actions
COLM 2025Poster
20249 篇
4
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints
ICLR 2024Poster
4
Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach
ICLR 2024Rejected
4
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models
COLM 2024Poster
4
Enhanced Visual Instruction Tuning for Text-Rich Image Understanding
ICLR 2024Rejected
4
AutoDAN: Automatic and Interpretable Adversarial Attacks on Large Language Models
ICLR 2024Rejected
4
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models
COLM 2024Poster
4
ADOPD: A Large-Scale Document Page Decomposition Dataset
ICLR 2024Poster
3
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
ICLR 2024Poster
4
ARTIST: Towards Disentangled Text Painter with Diffusion Models
ICLR 2024withdrawn