影响力指数

98.62/100

前 0.1%

全站排名 #33

发表论文57 篇

平均评分5.8

年均产出19.0 篇/年

Jianfeng Gao

Principal Researcher@Microsoft Research·美国·OpenReview

研究方向

foundation model · deep learning · natural language processing

FlowRL: Matching Reward Distributions for LLM Reasoning

ICLR 2026Poster

Dyna-Mind: Learning to Simulate from Experience for Better AI Agents

ICLR 2026Poster

Training Large Reasoning Models Efficiently via Progressive Thought Encoding

ICLR 2026Poster

Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents

ICLR 2026Rejected

SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks

ICLR 2026Poster

WidgetEval: Benchmarking Foundation Models on Dynamic Widget Generation for Apps

ICLR 2026Rejected

DenseMixer: Improving MoE Post-Training with Precise Router Gradient

ICLR 2026Rejected

TrustGen: A Platform of Dynamic Benchmarking on the Trustworthiness of Generative Foundation Models

ICLR 2026Poster

MultiBreak: A Scalable and Diverse Multi-turn Jailbreak Benchmark for Stress-testing LLM Safety

ICLR 2026Rejected

EfficientLLM: Evaluating Large Language Models Efficiency

ICLR 2026Desk Rejected

TemporalBench: Evaluating Fine-Grained Temporal Dynamics Understanding for Multimodal Models

ICLR 2026Withdrawn

Coupling Attention and Memory: A Dynamic Memory Module for Efficient Adapation with Pretrained LLMs

ICLR 2026Rejected

Mixture of Inputs: Text Generation Beyond Discrete Token Sampling

NeurIPS 2025Poster

CollabLLM: From Passive Responders to Active Collaborators

Interpretable Next-token Prediction via the Generalized Induction Head

NeurIPS 2025Poster

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

NeurIPS 2025Poster

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

NeurIPS 2025Poster

TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies

ICLR 2025Poster

MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention

ICML 2025Poster

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

NeurIPS 2025Poster

Interpretable Language Modeling via Induction-head Ngram Models

ICLR 2025Rejected

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

ICLR 2025Poster

Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation

NeurIPS 2025Poster

Training Language Models to Generate Quality Code with Program Analysis Feedback

NeurIPS 2025Poster

SAS: Simulated Attention Score

NeurIPS 2025Poster

GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding

ICLR 2025Poster

Simplifying DINO via Coding Rate Regularization

ICML 2025Poster

Matryoshka Multimodal Models

ICLR 2025Poster

Vector-ICL: In-context Learning with Continuous Vector Representations

ICLR 2025Poster

DataGen: Unified Synthetic Dataset Generation via Large Language Models

ICLR 2025Poster

Latent Action Pretraining from Videos

ICLR 2025Poster

ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning

ICLR 2025Poster

Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass

ICLR 2025Poster

SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents

ICLR 2025Poster

Model Tells Itself Where to Attend: Steerable Prompting for Reliable Reading Comprehension of LLM

ICLR 2025Withdrawn

Riemannian Low-Rank Adaptation for Federated Fine-Tuning of Foundation Models

ICLR 2025Withdrawn

TemporalBench: Towards Fine-grained Temporal Understanding for Multimodal Video Models

ICLR 2025Withdrawn

Pixelated Instructions: Can Multimodal Large Language Models Follow Printed Instructions in Images?

ICLR 2025Rejected

Evaluating Graphical Perception of Large Multimodal Models

ICLR 2025Withdrawn

合作者 (20)

合作者14 篇