Wei Xue
~Wei_Xue5
25
论文总数
12.5
年均投稿
平均评分
接收情况15/25
会议分布
ICLR
16
NeurIPS
4
ICML
4
COLM
1
发表论文 (25 篇)
202520 篇
4
Co$^{\mathbf{3}}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
ICLR 2025Spotlight
5
Foundation Cures Personalization: Improving Personalized Models’ Prompt Consistency via Hidden Foundation Knowledge
NeurIPS 2025Poster
4
You Know What I'm Saying: Jailbreak Attack via Implicit Reference
ICLR 2025Rejected
4
GuideEdit: Enhancing Face Video Editing with Fine-grained Control
ICLR 2025withdrawn
3
Structured Mixture-of-Experts LLMs Compression via Singular Value Decomposition
ICLR 2025Rejected
4
ThinkSound: Chain-of-Thought Reasoning in Multimodal LLMs for Audio Generation and Editing
NeurIPS 2025Poster
5
NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
ICLR 2025withdrawn
4
Delta Decompression for MoE-based LLMs Compression
ICML 2025Poster
4
MoE-SVD: Structured Mixture-of-Experts LLMs Compression via Singular Value Decomposition
ICML 2025Poster
4
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
ICLR 2025Spotlight
4
Empowering World Models with Reflection for Embodied Video Prediction
ICML 2025Poster
4
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
ICLR 2025withdrawn
4
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
ICLR 2025Poster
5
AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems
ICLR 2025withdrawn
4
EVA: An Embodied World Model for Future Video Anticipation
ICLR 2025Rejected
3
$\textbf{CoCoGesture}$: Towards Coherent Co-speech 3D Gesture Generation in the Wild
ICLR 2025withdrawn
4
PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
ICLR 2025withdrawn
4
ViML: A Video, Music, Language Unified Dataset for Understanding and Generation
ICLR 2025withdrawn
4
OmniAudio: Generating Spatial Audio from 360-Degree Video
ICML 2025Poster
4
MuPT: A Generative Symbolic Music Pretrained Transformer
ICLR 2025Poster
20245 篇
5
ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate
ICLR 2024Poster
4
RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation
COLM 2024Poster
5
Discovering Sparsity Allocation for Layer-wise Pruning of Large Language Models
NeurIPS 2024Poster
4
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
ICLR 2024Poster
3
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
NeurIPS 2024Poster