Yuejie Chi
~Yuejie_Chi1
22
论文总数
11.0
年均投稿
平均评分
接收情况18/22
会议分布
ICLR
9
NeurIPS
8
ICML
3
COLM
2
发表论文 (22 篇)
202514 篇
4
LoRe: Personalizing LLMs via Low-Rank Reward Modeling
COLM 2025Poster
4
A Theoretical Analysis of Self-Supervised Learning for Vision Transformers
ICLR 2025Poster
5
Vertical Federated Learning with Missing Features During Training and Inference
ICLR 2025Poster
4
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games
ICML 2025Poster
4
Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning
ICML 2025Poster
4
Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning
ICLR 2025Rejected
5
Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent
NeurIPS 2025Poster
4
Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL
NeurIPS 2025Poster
4
Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization
NeurIPS 2025Poster
4
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
ICLR 2025Poster
4
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
ICLR 2025Rejected
4
Feynman: Knowledge-Infused Diagramming Agent for Scaling Visual Reasoning Data
ICLR 2025Rejected
4
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
ICLR 2025Poster
4
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
ICML 2025Spotlight
20248 篇
3
The Sample-Communication Complexity Trade-off in Federated Q-Learning
NeurIPS 2024Oral
4
Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction
NeurIPS 2024Poster
4
Prompt-prompted Adaptive Structured Pruning for Efficient LLM Generation
COLM 2024Poster
4
In-Context Learning with Representations: Contextual Generalization of Trained Transformers
NeurIPS 2024Poster
4
Towards Non-Asymptotic Convergence for Diffusion-Based Generative Models
ICLR 2024Poster
4
Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning
NeurIPS 2024Poster
4
Learning Discrete Concepts in Latent Hierarchical Models
NeurIPS 2024Poster
4
Federated Natural Policy Gradient Methods for Multi-task Reinforcement Learning
ICLR 2024Rejected