Yu-Gang Jiang
~Yu-Gang_Jiang1
24
论文总数
12.0
年均投稿
平均评分
接收情况16/24
会议分布
NeurIPS
13
ICLR
10
COLM
1
发表论文 (24 篇)
202517 篇
5
GeoGS3D: Single-view 3D Reconstruction via Geometric-aware Diffusion Model and Gaussian Splatting
ICLR 2025withdrawn
4
Adaptive Retention & Correction: Test-Time Training for Continual Learning
ICLR 2025Poster
4
Towards a Theoretical Understanding of Memorization in Diffusion Models
ICLR 2025withdrawn
4
IDEATOR: Jailbreaking VLMs Using VLMs
ICLR 2025withdrawn
4
HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning
NeurIPS 2025Poster
4
Identity Lock: Locking API Fine-tuned LLMs With Identity-based Wake Words
ICLR 2025withdrawn
4
TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making
NeurIPS 2025Poster
4
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
ICLR 2025withdrawn
4
SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models
NeurIPS 2025Poster
4
BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
ICLR 2025Poster
4
Object Fusion via Diffusion Time-step for Customized Image Editing with Single Example
ICLR 2025withdrawn
4
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection
NeurIPS 2025Poster
4
OmniGen-AR: AutoRegressive Any-to-Image Generation
NeurIPS 2025Poster
4
INST-IT: Boosting Instance Understanding via Explicit Visual Prompt Instruction Tuning
NeurIPS 2025Poster
4
OmniSVG: A Unified Scalable Vector Graphics Generation Model
NeurIPS 2025Poster
4
Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection
NeurIPS 2025Poster
4
AgentGym: Evaluating and Evolving Large Language Model-based Agents across Diverse Envronments
ICLR 2025Rejected
20247 篇
4
UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation
NeurIPS 2024Poster
5
GenRec: Unifying Video Generation and Recognition with Diffusion Models
NeurIPS 2024Poster
5
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
NeurIPS 2024Poster
4
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
NeurIPS 2024Poster
3
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs
NeurIPS 2024Poster
4
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
ICLR 2024withdrawn
4
Poly-Visual-Expert Vision-Language Models
COLM 2024Poster