Ying Shan
~Ying_Shan2
30
论文总数
15.0
年均投稿
平均评分
接收情况13/30
会议分布
ICLR
22
NeurIPS
5
ICML
3
发表论文 (30 篇)
202513 篇
5
FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
ICLR 2025Rejected
5
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance
ICLR 2025Rejected
5
Self-Conditioned Diffusion Model for Consistent Human Image and Video Synthesis
ICLR 2025withdrawn
4
UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning
NeurIPS 2025Poster
5
SEED-Story: Multimodal Long Story Generation with Large Language Model
ICLR 2025withdrawn
4
HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding
ICML 2025Poster
3
LoRA-Gen: Specializing Large Language Model via Online LoRA Generation
ICML 2025Poster
4
GPT4LoRA: Optimizing LoRA Combination via MLLM Self-Reflection
ICLR 2025Rejected
4
LoRA-Gen: Specializing Language Model via Online LoRA Generation
ICLR 2025withdrawn
5
SEED-X: Multimodal Models in Real World
ICLR 2025Rejected
5
Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
ICLR 2025withdrawn
4
Taming Rectified Flow for Inversion and Editing
ICML 2025Poster
4
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
NeurIPS 2025Poster
202417 篇
4
TaCA: Hot-Plugging Upgrades for Foundation Model with Task-agnostic Compatible Adapter
ICLR 2024Rejected
5
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
ICLR 2024Spotlight
3
ReVideo: Remake a Video with Motion and Content Control
NeurIPS 2024Poster
3
What Makes for Good Visual Tokenizers for Large Language Models
ICLR 2024Rejected
5
ReBaR: Reference-Based Reasoning for Robust Human Pose and Shape Estimation from Monocular Images
ICLR 2024Rejected
3
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
ICLR 2024withdrawn
4
SemanticBoost: Elevating Motion Generation with Augmented Textual Cues
ICLR 2024withdrawn
4
StyleAdapter: A Unified Stylized Image Generation Model without Test-Time Fine-Tuning
ICLR 2024withdrawn
4
FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling
ICLR 2024Poster
3
CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models
ICLR 2024Rejected
4
TapMo: Shape-aware Motion Generation of Skeleton-free Characters
ICLR 2024Poster
3
Making LLaMA SEE and Draw with SEED Tokenizer
ICLR 2024Poster
4
HiFi-123: Towards High-fidelity One Image to 3D Content Generation
ICLR 2024withdrawn
4
MambaTree: Tree Topology is All You Need in State Space Model
NeurIPS 2024Spotlight
3
DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing
ICLR 2024withdrawn
4
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
NeurIPS 2024Poster
4
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
ICLR 2024Spotlight