Kai Chen
~Kai_Chen4
Total papers: 33
Submissions per year: 16.5
Average rating
Accepted: 17/33
Venue distribution
ICLR: 20
NeurIPS: 12
COLM: 1
Papers (33)
2025 (18 papers)
5
Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs
ICLR 2025 (Poster)
4
Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models
NeurIPS 2025 (Poster)
3
FaceShot: Bring Any Character into Life
ICLR 2025 (Poster)
4
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
ICLR 2025 (Poster)
5
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning
ICLR 2025 (Withdrawn)
4
StyleShot: A snapshot on any style
ICLR 2025 (Rejected)
3
Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models
ICLR 2025 (Withdrawn)
3
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
ICLR 2025 (Rejected)
4
Rethinking Verification for LLM Code Generation: From Generation to Testing
NeurIPS 2025 (Poster)
4
Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go
NeurIPS 2025 (Poster)
4
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything
ICLR 2025 (Oral)
3
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices
ICLR 2025 (Rejected)
6
Training Language Models to Critique with Multi-Agent Feedback
ICLR 2025 (Rejected)
4
LLaMA Decoder As Vision Transformer
ICLR 2025 (Rejected)
4
Semi-off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning
NeurIPS 2025 (Poster)
4
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
ICLR 2025 (Withdrawn)
4
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
COLM 2025 (Poster)
4
Pre-Trained Policy Discriminators are General Reward Models
NeurIPS 2025 (Poster)
2024 (15 papers)
5
Differential Model Scaling using Differential Topk
ICLR 2024 (Rejected)
4
Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection
ICLR 2024 (Rejected)
4
CriticEval: Evaluating Large-scale Language Model as Critic
NeurIPS 2024 (Poster)
4
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
NeurIPS 2024 (Poster)
4
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
ICLR 2024 (Rejected)
5
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection
ICLR 2024 (Withdrawn)
5
MotionBooth: Motion-Aware Customized Text-to-Video Generation
NeurIPS 2024 (Spotlight)
3
RTMPose: Real-Time Models for Multi-Person Pose Estimation
ICLR 2024 (Rejected)
3
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs
NeurIPS 2024 (Poster)
4
Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization
NeurIPS 2024 (Poster)
4
Convolution on Your 12× Wide Feature: A ConvNet with Nested Design
ICLR 2024 (Withdrawn)
4
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data
NeurIPS 2024 (Poster)
3
Evaluating Hallucinations in Chinese Large Language Models
ICLR 2024 (Rejected)
4
MMBench: Is Your Multi-modal Model an All-around Player?
ICLR 2024 (Rejected)
3
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
NeurIPS 2024 (Poster)