Impact Index

99.39/100

Top 0.1%

Site-wide rank #10

Papers published: 86

Average rating: 5.5

Average annual output: 28.7 papers/year

Zhangyang Wang

Associate Professor @ University of Texas at Austin · United States · OpenReview

Research Areas

Machine Learning · Computer Vision · Optimization

Neon: Negative Extrapolation From Self-Training Improves Image Generation

$\nabla$-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space

ICLR 2026 · Poster

E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models

ICLR 2026 · Desk Rejected

RAPID$^3$: Tri-Level Reinforced Acceleration Policies for Diffusion Transformer

ICLR 2026 · Poster

DD-Ranking: Rethinking the Evaluation of Dataset Distillation

ICLR 2026 · Rejected

LLMs Can Get "Brain Rot"!

ICLR 2026 · Rejected

Baleen: Self-Interpretable, Robust SSMs with Stochastic Selective Memory

ICLR 2026 · Rejected

Self-Discriminative Optimization for Video Diffusion Models

ICLR 2026 · Desk Rejected

Why RL Updates Look Sparse: An Implicit Compass Drives Optimization Bias

ICLR 2026 · Rejected

Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

ICLR 2026 · Rejected

Data Efficient Any Transformer-to-Mamba Distillation via Attention Bridge

ICLR 2026 · Withdrawn

Enhance-A-Video: Better Generated Video for Free

ICLR 2026 · Withdrawn

COPER: Agentic Context Significantly Improves and Stabilizes LLM in Multi-Player Game

ICLR 2026 · Rejected

Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel

ICLR 2026 · Poster

Revisiting Spectral Representations in Generative Diffusion Models

ICLR 2026 · Rejected

SAGE: Fast, Generalizable and Photorealistic 3D Human Reconstruction from a Single Image

ICLR 2026 · Withdrawn

Stable-SPAM: How to Stably Train Large Language Models in 4-Bit

ICLR 2026 · Withdrawn

Towards Personalized Parameter Generation via Data-Conditioned Mapping

ICLR 2026 · Rejected

POPS: Recovering Unlearned Multi-Modality Knowledge in MLLMs with Fine-tuning and Prompt-based Attacks

ICLR 2026 · Rejected

Make Optimization Once and for All with Fine-grained Guidance

ICLR 2026 · Withdrawn

ResearchGPT: Benchmarking and Training LLMs for End-to-End Computer Science Research Workflows

ICLR 2026 · Withdrawn

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

NeurIPS 2025 · Poster

4K4DGen: Panoramic 4D Generation at 4K Resolution

ICLR 2025 · Spotlight

SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially?

COLM 2025 · Poster

LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning

COLM 2025 · Poster

Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models

NeurIPS 2025 · Poster

HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model Training

ICML 2025 · Poster

On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention for Long-Context LLM Serving

ICML 2025 · Poster

Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding

ICML 2025 · Poster

Transformers Provably Learn Two-Mixture of Linear Classification via Gradient Flow

ICLR 2025 · Poster

LLaMaFlex: Many-in-one LLMs via Generalized Pruning and Weight Sharing

ICLR 2025 · Poster

SEAL: Steerable Reasoning Calibration of Large Language Models for Free

COLM 2025 · Poster

Scaling Up Parameter Generation: A Recurrent Diffusion Approach

NeurIPS 2025 · Poster

REPA Works Until It Doesn’t: Early-Stopped, Holistic Alignment Supercharges Diffusion Training

NeurIPS 2025 · Poster

SAS: Simulated Attention Score

NeurIPS 2025 · Poster

Can Test-Time Scaling Improve World Foundation Model?

COLM 2025 · Poster

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment

COLM 2025 · Poster

From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications

ICML 2025 · Poster

HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models

ICLR 2025 · Poster

SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

ICLR 2025 · Poster

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

ICLR 2025 · Poster

Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding

ICLR 2025 · Rejected

One Token Embedding Is Enough to Deadlock Your Large Reasoning Model

NeurIPS 2025 · Poster

Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

ICLR 2025 · Rejected

PIPA: Preference Alignment as Prior-Informed Statistical Estimation

ICML 2025 · Poster

On the Provable Separation of Scales in Maximal Update Parameterization

ICML 2025 · Poster

R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference

ICLR 2025 · Poster

Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

ICML 2025 · Poster

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

ICLR 2025 · Withdrawn

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

ICLR 2025 · Rejected

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

ICLR 2025 · Withdrawn

CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation

ICLR 2025 · Rejected

Fantastic Experts and How to Find Them: A Multi-Dimensional Study for Experts-Level Sparsification in Mixture-of-Experts

ICLR 2025 · Rejected

OscillationInversion: Understand the structure of Large Flow Model through the Lens of Inversion Method

ICLR 2025 · Withdrawn

Collaborators (20)