影响力指数

97.16/100

前 0.1%

全站排名 #88

发表论文61 篇

平均评分5.4

年均产出20.3 篇/年

Jun Wang

Professor@University College London·英国·OpenReview

研究方向

machine learning · information retrieval · data science

Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

ICLR 2026Poster

ViMo: A Generative Visual GUI World Model for App Agents

ICLR 2026Poster

SpatialViz-Bench: A Cognitively-Grounded Benchmark for Diagnosing Spatial Visualization in MLLMs

ICLR 2026Poster

CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs

ICLR 2026Poster

AccidentBench: Benchmarking Multimodal Understanding and Reasoning in Vehicle Accidents and Beyond

ICLR 2026Rejected

VoG: Enhancing LLM Reasoning through Stepwise Verification on Knowledge Graphs

ICLR 2026Poster

A Benchmark for Deep Information Synthesis

ICLR 2026Poster

From Experience to Strategy: Empowering LLM Agents with Trainable Graph Memory

ICLR 2026Desk Rejected

SuRe: Surprise-Driven Prioritised Replay for Continual LLM Learning

ICLR 2026Rejected

Paying Attention to Hybrid Attention: Untangling the Issues with Conversion Methods

ICLR 2026Rejected

Grouped-head latenT Attention

ICLR 2026Withdrawn

LoopServe: An Adaptive Dual-phase LLM Inference Acceleration System for Multi-Turn Dialogues

ICLR 2026Rejected

Hi-Agent: Hierarchical Vision-Language Agents for Mobile Device Control

ICLR 2026Withdrawn

Memory-Driven Self-Improvement with Large Language Models

ICLR 2026Withdrawn

Evolving LLMs' Self-Refinement Capability via Synergistic Training-Inference Optimization

ICLR 2026Withdrawn

Emergent Bayesian Behaviour and Optimal Cue Combination in LLMs

ICLR 2026Rejected

BEYOND SYNTAX: ACTION SEMANTICS LEARNING FOR APP AGENTS

ICLR 2026Withdrawn

Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference

ICLR 2026Withdrawn

Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning

ICLR 2026Withdrawn

Subjective Depth and Timescale Transformers: Learning Where and When to Compute

ICLR 2026Withdrawn

Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?

ICLR 2026Rejected

Causal Discovery under Changing Mechanisms: A Unified Graphical Approach

ICLR 2026Rejected

MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework

NeurIPS 2025Poster

Lightweight Neural App Control

ICLR 2025Spotlight

SPA-BENCH: A COMPREHENSIVE BENCHMARK FOR SMARTPHONE AGENT EVALUATION

ICLR 2025Spotlight

Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control

NeurIPS 2025Poster

Mixture of Attentions For Speculative Decoding

ICLR 2025Poster

Risk-aware Direct Preference Optimization under Nested Risk Measure

NeurIPS 2025Poster

GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

NeurIPS 2025Poster

Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning

NeurIPS 2025Poster

Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Video Temporal Grounding

NeurIPS 2025Poster

Self-Verifying Reflection Helps Transformers with CoT Reasoning

NeurIPS 2025Poster

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent

ICLR 2025Poster

Circuit Transformer: A Transformer That Preserves Logical Equivalence

ICLR 2025Poster

A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning

NeurIPS 2025Poster

Curious Causality-Seeking Agents Learn Meta Causal World

NeurIPS 2025Poster

ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning

NeurIPS 2025Poster

On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models

ICLR 2025Poster

Efficient Reinforcement Learning with Large Language Model Priors

ICLR 2025Poster

Human-inspired Episodic Memory for Infinite Context LLMs

ICLR 2025Poster

Direct Preference Optimization Using Sparse Feature-level Constraints

ICLR 2025Rejected

Attaining Human's Desirable Outcomes in Indirect Human-AI Interaction via Multi-Agent Influence Diagrams

ICLR 2025Rejected

Constrain Alignment with Sparse Autoencoders

ICML 2025Poster

Large Language Models are Demonstration Pre-Selectors for Themselves

ICLR 2025Rejected

SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

ICLR 2025Rejected

Risk-aware Direct Preference Optimization under Nested Risk Measure

ICML 2025Rejected

Large Language Models are Demonstration Pre-Selectors for Themselves

ICML 2025Poster

Mitigating Unobserved Confounding via Diffusion Probabilistic Models

ICLR 2025Rejected

Active Causal Learning for Conditional Average Treatment Effect Estimation

ICLR 2025Rejected

Temporal Visiting-Monitoring Feature Interaction Learning for Modelling Structured Electronic Health Records

ICLR 2025Rejected

FedADM: Adaptive Federated Learning via Dissimilarity Measure

ICLR 2025Withdrawn

合作者 (20)

合作者11 篇

Haitham Bou Ammar