Jun Wang
~Jun_Wang2
39
论文总数
19.5
年均投稿
平均评分
接收情况24/39
会议分布
ICLR
22
NeurIPS
14
ICML
3
发表论文 (39 篇)
202529 篇
6
On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models
ICLR 2025Poster
4
Temporal Visiting-Monitoring Feature Interaction Learning for Modelling Structured Electronic Health Records
ICLR 2025Rejected
4
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
NeurIPS 2025Poster
4
Curious Causality-Seeking Agents Learn Meta Causal World
NeurIPS 2025Poster
4
Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control
NeurIPS 2025Poster
4
Mixture of Attentions For Speculative Decoding
ICLR 2025Poster
4
SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks
ICLR 2025Rejected
4
FedADM: Adaptive Federated Learning via Dissimilarity Measure
ICLR 2025withdrawn
4
Lightweight Neural App Control
ICLR 2025Spotlight
4
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent
ICLR 2025Poster
4
Active Causal Learning for Conditional Average Treatment Effect Estimation
ICLR 2025Rejected
4
Risk-aware Direct Preference Optimization under Nested Risk Measure
ICML 2025Rejected
4
Mitigating Unobserved Confounding via Diffusion Probabilistic Models
ICLR 2025Rejected
5
Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Video Temporal Grounding
NeurIPS 2025Poster
3
Risk-aware Direct Preference Optimization under Nested Risk Measure
NeurIPS 2025Poster
3
Circuit Transformer: A Transformer That Preserves Logical Equivalence
ICLR 2025Poster
4
Human-inspired Episodic Memory for Infinite Context LLMs
ICLR 2025Poster
5
Self-Verifying Reflection Helps Transformers with CoT Reasoning
NeurIPS 2025Poster
4
Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning
NeurIPS 2025Poster
3
GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning
NeurIPS 2025Poster
4
Efficient Reinforcement Learning with Large Language Model Priors
ICLR 2025Poster
4
Attaining Human's Desirable Outcomes in Indirect Human-AI Interaction via Multi-Agent Influence Diagrams
ICLR 2025Rejected
4
ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning
NeurIPS 2025Poster
4
Large Language Models are Demonstration Pre-Selectors for Themselves
ICLR 2025Rejected
4
Large Language Models are Demonstration Pre-Selectors for Themselves
ICML 2025Poster
4
Constrain Alignment with Sparse Autoencoders
ICML 2025Poster
4
MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework
NeurIPS 2025Poster
3
Direct Preference Optimization Using Sparse Feature-level Constraints
ICLR 2025Rejected
3
SPA-BENCH: A COMPREHENSIVE BENCHMARK FOR SMARTPHONE AGENT EVALUATION
ICLR 2025Spotlight
202410 篇
4
Reinforcing LLM Agents via Policy Optimization with Action Decomposition
NeurIPS 2024Poster
4
Leveraging Large Language Models for Optimised Coordination in Textual Multi-Agent Reinforcement Learning
ICLR 2024Rejected
4
Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf
NeurIPS 2024Poster
4
Alphazero-like Tree-Search can guide large language model decoding and training
ICLR 2024Rejected
3
A Generative Model for Game Theory with Flow Equilibrium
ICLR 2024withdrawn
4
Augmented Policy Optimization for Safe Reinforcement Learning
ICLR 2024withdrawn
3
Large Language Models Play StarCraft II:Benchmarks and A Chain of Summarization Approach
NeurIPS 2024Poster
4
Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting
NeurIPS 2024Oral
4
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models
ICLR 2024Rejected
4
Parsimonious Demonstrations and Fine-Tuning for Large Language Models
ICLR 2024withdrawn