PaperHub

Jianye HAO

~Jianye_HAO1

57
论文总数
28.5
年均投稿
5.9
平均评分
接收情况40/57
会议分布
ICLR
30
NeurIPS
20
ICML
7

发表论文 (57 篇)

202539

5.5
4

R*: Efficient Reward Design via Reward Structure Evolution and Parameter Alignment Optimization with Large Language Models

ICML 2025Poster
7.3
4

Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control

NeurIPS 2025Poster
6.4
4

CORE: Collaborative Optimization with Reinforcement Learning and Evolutionary Algorithm for Floorplanning

NeurIPS 2025Poster
6.8
4

COLA: Towards Efficient Multi-Objective Reinforcement Learning with Conflict Objective Regularization in Latent Space

NeurIPS 2025Poster
7.0
3

MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning

ICML 2025Poster
7.5
4

Lightweight Neural App Control

ICLR 2025Spotlight
6.8
4

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent

ICLR 2025Poster
7.2
5

Differentiable Integer Linear Programming

ICLR 2025Spotlight
6.4
4

LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search

NeurIPS 2025Poster
6.8
5

Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Video Temporal Grounding

NeurIPS 2025Poster
7.8
4

Conditioning Matters: Training Diffusion Policies is Faster Than You Think

NeurIPS 2025Poster
3.7
3

A Theory of Multi-Agent Generative Flow Networks

ICLR 2025Rejected
4.0
4

Can Symbolic Regression of Boolean Functions Boost Logic Synthesis?

ICLR 2025withdrawn
6.4
4

OptiTree: Hierarchical Thoughts Generation with Tree Search for LLM Optimization Modeling

NeurIPS 2025Poster
3.7
3

Actra: Optimized Transformer Architecture for Vision-Language-Action Models in Robot Learning

ICLR 2025withdrawn
7.0
3

STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

ICML 2025Spotlight
7.6
3

High-Performance Arithmetic Circuit Optimization via Differentiable Architecture Search

NeurIPS 2025Spotlight
5.8
5

3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds

ICLR 2025Poster
5.5
4

Trajectory World Models for Heterogeneous Environments

ICML 2025Poster
5.5
4

Reinforced In-Context Black-Box Optimization

ICLR 2025Rejected
4.0
4

Towards LLM4Floorplan: Agents Can Do What Engineers Do in Chip Design

ICLR 2025withdrawn
3.4
5

LLM4Solver: Large Language Model for Efficient Algorithm Design of Combinatorial Optimization Solver

ICLR 2025withdrawn
3.5
4

Searching Strengthens Large Language Models in Finding Bugs of Deep Learning Libraries

ICLR 2025withdrawn
6.7
3

A Graph Enhanced Symbolic Discovery Framework For Efficient Logic Optimization

ICLR 2025Poster
6.3
4

Apollo-MILP: An Alternating Prediction-Correction Neural Solving Framework for Mixed-Integer Linear Programming

ICLR 2025Poster
6.1
4

Accelerating Large Language Model Reasoning via Speculative Search

ICML 2025Poster
6.1
4

HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking

ICML 2025Poster
5.5
3

Boosting Multi-Domain Fine-Tuning of Large Language Models through Evolving Interactions between Samples

ICML 2025Poster
6.7
3

Computing Circuits Optimization via Model-Based Circuit Genetic Evolution

ICLR 2025Poster
4.3
4

SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation

ICLR 2025withdrawn
8.2
4

LogicTree: Improving Complex Reasoning of LLMs via Instantiated Multi-step Synthetic Logical Data

NeurIPS 2025Spotlight
6.8
5

Accurate KV Cache Eviction via Anchor Direction Projection for Efficient LLM Inference

NeurIPS 2025Poster
5.0
4

The Graph's Apprentice: Teaching an LLM Low-Level Knowledge for Circuit Quality Estimation

ICLR 2025Rejected
7.5
4

LaMPlace: Learning to Optimize Cross-Stage Metrics in Macro Placement

ICLR 2025Oral
6.8
4

Dynamic Configuration for Cutting Plane Separators via Reinforcement Learning on Incremental Graph

NeurIPS 2025Poster
6.4
4

AttentionPredictor: Temporal Patterns Matter for KV Cache Compression

NeurIPS 2025Poster
3.8
5

Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms

ICLR 2025withdrawn
4.5
4

ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models

ICLR 2025Rejected
7.3
3

SPA-BENCH: A COMPREHENSIVE BENCHMARK FOR SMARTPHONE AGENT EVALUATION

ICLR 2025Spotlight

202418

6.5
4

DiffuserLite: Towards Real-time Diffusion Planning

NeurIPS 2024Poster
6.3
4

PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation

NeurIPS 2024Poster
6.3
3

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

ICLR 2024Poster
6.0
4

Iteratively Refined Behavior Regularization for Offline Reinforcement Learning

NeurIPS 2024Poster
3.5
4

Improving Sample Efficiency in Off-policy RL with Low-dimensional Policy Representation

ICLR 2024Rejected
5.0
4

HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach

ICLR 2024Rejected
-

Value-Evolutionary-Based Reinforcement Learning

ICLR 2024withdrawn
6.0
3

Unlock the Intermittent Control Ability of Model Free Reinforcement Learning

NeurIPS 2024Poster
7.0
4

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model

ICLR 2024Poster
5.5
4

Addressing Real-Time Fragmentary Interaction Control Problems via Muti-step Representation Reinforcement Learning

ICLR 2024Rejected
5.3
6

The Ladder in Chaos: Improving Policy Learning by Harnessing the Parameter Evolving Path in A Low-dimensional Space

NeurIPS 2024Poster
5.3
4

Rethinking Decision Transformer via Hierarchical Reinforcement Learning

ICLR 2024Rejected
4.0
3

Iteratively Refined Behavior Regularization for Offline Reinforcement Learning

ICLR 2024Rejected
8.0
3

Sample-Efficient Quality-Diversity by Cooperative Coevolution

ICLR 2024Spotlight
6.0
4

iVideoGPT: Interactive VideoGPTs are Scalable World Models

NeurIPS 2024Poster
5.5
4

FlexPlanner: Flexible 3D Floorplanning via Deep Reinforcement Learning in Hybrid Action Space with Multi-Modality Representation

NeurIPS 2024Poster
6.7
3

Rethinking Branching on Exact Combinatorial Optimization Solver: The First Deep Symbolic Discovery Framework

ICLR 2024Poster
5.8
4

Towards Next-Generation Logic Synthesis: A Scalable Neural Circuit Generation Framework

NeurIPS 2024Poster