影响力指数

96.06/100

前 0.2%

全站排名 #138

发表论文66 篇

平均评分5.3

年均产出22.0 篇/年

Kai Chen

Researcher@Shanghai AI Laboratory·中国·OpenReview

研究方向

large language model · object detction · recognition

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

ICLR 2026Poster

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

ICLR 2026Poster

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

ICLR 2026Poster

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

ICLR 2026Poster

ExpVid: A Benchmark for Experiment Video Understanding & Reasoning

ICLR 2026Poster

Rectifying LLM Thought from Lens of Optimization

ICLR 2026Poster

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

ICLR 2026Rejected

InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

ICLR 2026Poster

Context-level Language Modeling by Learning Predictive Context Embeddings

ICLR 2026Desk Rejected

CharacterShot: Controllable and Consistent 4D Character Animation

ICLR 2026Rejected

CompassJudger-2: A Holistic Approach Towards Generalist Judge Model

ICLR 2026Rejected

Smooth Reading: Bridging the Gap of Recurrent LLM to Self-Attention LLM on Long-Context Understanding

ICLR 2026Poster

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

ICLR 2026Rejected

Code-driven Number Sequence Calculation: Enhancing the Inductive Reasoning Abilities of Large Language Models

ICLR 2026Rejected

IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

ICLR 2026Rejected

LATTE: Learner-Adaptive Teacher-Forced Reflection for Advancing Deep Search

ICLR 2026Withdrawn

Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective

ICLR 2026Poster

Sliding Window Attention for Reinforced Reasoning

ICLR 2026Desk Rejected

Lost in Real-World Scenarios: Concretization Disrupts LLM Logical Reasoning

ICLR 2026Rejected

A Benchmark for Self-Evolving Agents via Experience-Driven Lifelong Learning

ICLR 2026Rejected

SciTS: Scientific Time Series Understanding and Generation with LLMs

ICLR 2026Poster

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

ICLR 2026Rejected

InteractScience: Programmatic and Visually-Grounded Evaluation of Interactive Scientific Demonstration Code Generation

ICLR 2026Rejected

EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning

ICLR 2026Withdrawn

Coding Triangle: How Does Large Language Model Understand Code?

ICLR 2026Rejected

NP-ENGINE: EMPOWERING OPTIMIZATION REASON- ING IN LARGE LANGUAGE MODELS WITH VERIFIABLE SYNTHETIC NP PROBLEMS

ICLR 2026Withdrawn

Reasoning as Meta-Learning: An Optimization Perspective to Decipher Long CoT Reasoning in LLMs

ICLR 2026Rejected

LONG-HORIZON REASONING AGENT FOR OLYMPIAD- LEVEL MATHEMATICAL PROBLEM SOLVING

ICLR 2026Rejected

The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner

ICLR 2026Poster

UnifiedVerifier: Unifying Paradigms in Automated LLM Evaluation

ICLR 2026Desk Rejected

OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems

ICLR 2026Withdrawn

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

ICLR 2026Withdrawn

RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models

NeurIPS 2025Poster

Pre-Trained Policy Discriminators are General Reward Models

NeurIPS 2025Poster

Rethinking Verification for LLM Code Generation: From Generation to Testing

NeurIPS 2025Poster

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning

NeurIPS 2025Poster

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

COLM 2025Poster

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

ICLR 2025Poster

FaceShot: Bring Any Character into Life

ICLR 2025Poster

LLaMA Decoder As Vision Transformer

ICLR 2025Rejected

Training Language Models to Critique with Multi-Agent Feedback

ICLR 2025Rejected

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

NeurIPS 2025Poster

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

ICLR 2025Poster

FOLEYCRAFTER: BRING SILENT VIDEOS TO LIFE WITH LIFELIKE AND SYNCHRONIZED SOUNDS

ICLR 2025Rejected

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices

ICLR 2025Rejected

StyleShot: A snapshot on any style

ICLR 2025Rejected

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

ICLR 2025Withdrawn

Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models

ICLR 2025Withdrawn

MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning

ICLR 2025Withdrawn

合作者 (20)

博士导师18 篇