PaperHub

Jiaqi Wang

~Jiaqi_Wang1

23
论文总数
11.5
年均投稿
5.6
平均评分
接收情况12/23
会议分布
ICLR
14
NeurIPS
7
ICML
2

发表论文 (23 篇)

202516

4.7
3

DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models

ICLR 2025withdrawn
6.6
5

IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations

ICLR 2025Poster
5.3
4

Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate

ICLR 2025withdrawn
6.4
4

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

NeurIPS 2025Poster
6.0
4

MotionClone: Training-Free Motion Cloning for Controllable Video Generation

ICLR 2025Poster
3.8
4

Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data

ICLR 2025withdrawn
4.8
4

Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images

ICLR 2025withdrawn
3.0
4

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

ICLR 2025withdrawn
4.6
5

BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way

ICLR 2025withdrawn
5.3
4

SAM2Long: Enhancing SAM2 for Long Video Segmentation with a Training-Free Memory Tree

ICLR 2025withdrawn
4.6
5

SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation

ICLR 2025Rejected
5.5
3

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

ICML 2025Poster
5.5
4

RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition

ICLR 2025Rejected
8.2
5

HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance

NeurIPS 2025Poster
5.5
4

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

ICLR 2025Poster
8.3
4

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

ICML 2025Oral