PaperHub

Jun Wang

~Jun_Wang2

39
Total papers
19.5
Submissions per year
5.8
Average rating
Accepted: 24/39
Venue distribution
ICLR
22
NeurIPS
14
ICML
3

Published papers (39)

2025 (29)

6.3
6

On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models

ICLR 2025 Poster
4.0
4

Temporal Visiting-Monitoring Feature Interaction Learning for Modelling Structured Electronic Health Records

ICLR 2025 Rejected
6.4
4

A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning

NeurIPS 2025 Poster
6.4
4

Curious Causality-Seeking Agents Learn Meta Causal World

NeurIPS 2025 Poster
7.3
4

Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control

NeurIPS 2025 Poster
7.0
4

Mixture of Attentions For Speculative Decoding

ICLR 2025 Poster
5.0
4

SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

ICLR 2025 Rejected
3.0
4

FedADM: Adaptive Federated Learning via Dissimilarity Measure

ICLR 2025 Withdrawn
7.5
4

Lightweight Neural App Control

ICLR 2025 Spotlight
6.8
4

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent

ICLR 2025 Poster
4.3
4

Active Causal Learning for Conditional Average Treatment Effect Estimation

ICLR 2025 Rejected
4.9
4

Risk-aware Direct Preference Optimization under Nested Risk Measure

ICML 2025 Rejected
4.3
4

Mitigating Unobserved Confounding via Diffusion Probabilistic Models

ICLR 2025 Rejected
6.8
5

Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Video Temporal Grounding

NeurIPS 2025 Poster
7.0
3

Risk-aware Direct Preference Optimization under Nested Risk Measure

NeurIPS 2025 Poster
6.7
3

Circuit Transformer: A Transformer That Preserves Logical Equivalence

ICLR 2025 Poster
5.8
4

Human-inspired Episodic Memory for Infinite Context LLMs

ICLR 2025 Poster
6.8
5

Self-Verifying Reflection Helps Transformers with CoT Reasoning

NeurIPS 2025 Poster
6.8
4

Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning

NeurIPS 2025 Poster
7.0
3

GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

NeurIPS 2025 Poster
6.3
4

Efficient Reinforcement Learning with Large Language Model Priors

ICLR 2025 Poster
5.5
4

Attaining Human's Desirable Outcomes in Indirect Human-AI Interaction via Multi-Agent Influence Diagrams

ICLR 2025 Rejected
6.4
4

ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning

NeurIPS 2025 Poster
5.3
4

Large Language Models are Demonstration Pre-Selectors for Themselves

ICLR 2025 Rejected
4.9
4

Large Language Models are Demonstration Pre-Selectors for Themselves

ICML 2025 Poster
5.5
4

Constrain Alignment with Sparse Autoencoders

ICML 2025 Poster
7.8
4

MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework

NeurIPS 2025 Poster
5.7
3

Direct Preference Optimization Using Sparse Feature-level Constraints

ICLR 2025 Rejected
7.3
3

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation

ICLR 2025 Spotlight

2024 (10)