Impact Index

98.68/100

Top 0.1%

Site-wide rank #27

Papers published: 66

Average rating: 5.4

Average annual output: 22.0 papers/year

Furu Wei

Distinguished Scientist @ Microsoft Research · China · OpenReview

Research areas

general ai · foundation model · deep learning · natural language processing

VibeVoice: Expressive Podcast Generation with Next-Token Diffusion

Synergizing Understanding and Generation with Interleaved Analyzing-Drafting Thinking

ICLR 2026 Poster

VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models

ICLR 2026 Poster

From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics

ICLR 2026 Poster

Code Aesthetics with Agentic Reward Feedback

ICLR 2026 Poster

Multimodal Latent Language Modeling with Next-Token Diffusion

ICLR 2026 Rejected

11Plus-Bench: Demystifying Multimodal LLM Spatial Reasoning with Cognitive-Inspired Analysis

ICLR 2026 Rejected

Learning To Draft: Adaptive Speculative Decoding with Reinforcement Learning

ICLR 2026 Poster

Geometric-Mean Policy Optimization

ICLR 2026 Poster

AlignDiff: Exploiting Model-Intrinsic Information for Better Preference Data Selection

ICLR 2026 Rejected

BitNet Distillation

ICLR 2026 Rejected

Scaling Laws for Fully Sparsely-Activated Large Language Models

ICLR 2026 Rejected

QueST: Incentivizing LLMs to Generate Difficult Problems

ICLR 2026 Rejected

Breaking Training Bottlenecks: Effective Reinforcement Learning for Modern Coding Models

ICLR 2026 Rejected

Rectified Sparse Attention for Efficient Long-Sequence Generation

ICLR 2026 Withdrawn

WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale

ICLR 2026 Rejected

Two Pathways to Truthfulness: On the Intrinsic Encoding of LLM Hallucinations

ICLR 2026 Withdrawn

Towards Stable and Effective Reinforcement learning for Mixture-of-Experts

ICLR 2026 Withdrawn

DocReward: A Document Reward Model for Structuring and Stylizing

ICLR 2026 Withdrawn

Thinking Augmented Pre-training

ICLR 2026 Rejected

Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs

ICLR 2026 Withdrawn

Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs

ICLR 2026 Rejected

Information-Preserving Reformulation of Reasoning Traces for Antidistillation

ICLR 2026 Withdrawn

On-Policy RL with Optimal Reward Baseline

ICLR 2026 Withdrawn

Efficient RL Training for Reasoning Models via Length-Aware Optimization

ICLR 2026 Withdrawn

Data Selection via Optimal Control for Language Models

Differential Transformer

Think Only When You Need with Large Hybrid-Reasoning Models

NeurIPS 2025 Poster

Preference Optimization for Reasoning with Pseudo Feedback

ICLR 2025 Spotlight

Semi-Parametric Retrieval via Binary Bag-of-Tokens Index

ICLR 2025 Poster

Generative Representational Instruction Tuning

ICLR 2025 Poster

Chain-of-Retrieval Augmented Generation

NeurIPS 2025 Poster

Self-Boosting Large Language Models with Synthetic Preference Data

ICLR 2025 Poster

Scaling Laws of Synthetic Data for Language Model

COLM 2025 Poster

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

NeurIPS 2025 Poster

Reward Reasoning Models

NeurIPS 2025 Poster

ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation

ICLR 2025 Poster

Imagine While Reasoning in Space: Multimodal Visualization-of-Thought

ICML 2025 Poster

Scaling Optimal LR Across Token Horizons

ICLR 2025 Poster

E5-V: Universal Embeddings with Multimodal Large Language Models

ICLR 2025 Rejected

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

ICLR 2025 Rejected

VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers

ICLR 2025 Rejected

Q-Sparse: All Large Language Models can be Fully Sparsely-Activated

ICLR 2025 Rejected

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

ICLR 2025 Withdrawn

Textual Aesthetics in Large Language Models

ICLR 2025 Withdrawn

One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks

ICLR 2025 Withdrawn

Next Block Prediction: Video Generation via Semi-Auto-Regressive Modeling

ICLR 2025 Rejected

Collaborators (20)