Tianyi Zhou
~Tianyi_Zhou1
36
论文总数
18.0
年均投稿
平均评分
接收情况15/36
会议分布
ICLR
32
ICML
3
COLM
1
发表论文 (36 篇)
202522 篇
-
WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents
ICLR 2025desk_rejected
5
Task-Adaptation Curriculum Learning
ICLR 2025withdrawn
4
Reweighting Local Mimina with Tilted SAM
ICLR 2025Rejected
4
Customizing Reinforcement Learning Agent with Multi-Objective Preference Control
ICLR 2025Rejected
4
Tilted Sharpness-Aware Minimization
ICML 2025Poster
3
Your Mixture-of-Experts LLM Is Secretly an Embedding Model for Free
ICLR 2025Oral
4
Preference Controllable Reinforcement Learning with Advanced Multi-Objective Optimization
ICML 2025Poster
5
One Model for All: Multi-Objective Controllable Language Models
ICLR 2025withdrawn
4
Diffusion Curriculum: Synthetic-to-Real Data Curriculum via Image-Guided Diffusion
ICLR 2025withdrawn
4
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts
ICML 2025Poster
3
Federated Adapter on Foundation Models: An Out-Of-Distribution Approach
ICLR 2025Rejected
4
The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise
ICLR 2025Poster
4
BenTo: Benchmark Reduction with In-Context Transferability
ICLR 2025Poster
4
Is Your Multimodal Language Model Oversensitive to Safe Queries?
ICLR 2025Poster
4
MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion
ICLR 2025Rejected
3
Hybrid Memory Replay: Blending Real and Distilled Data for Class Incremental Learning
ICLR 2025withdrawn
6
Many-Objective Multi-Solution Transport
ICLR 2025Poster
4
GenQA: An Instruction Dataset of LLM Generated Questions and Answers
ICLR 2025withdrawn
4
DataGen: Unified Synthetic Dataset Generation via Large Language Models
ICLR 2025Poster
4
Mosaic-IT: Free Compositional Data Augmentation Improves Instruction Tuning
ICLR 2025Rejected
4
OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities
ICLR 2025Poster
5
OPTune: Efficient Online Preference Tuning
ICLR 2025withdrawn
202414 篇
3
Curriculum Reinforcement Learning via Morphology-Environment Co-Evolution
ICLR 2024Rejected
4
Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
ICLR 2024Poster
4
Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning
ICLR 2024Spotlight
3
Curiosity Driven Protein Sequence Generation via Reinforcement Learning
ICLR 2024Rejected
4
YoooP: You Only Optimize One Prototype per Class for Non-Exemplar Incremental Learning
ICLR 2024Rejected
4
BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers
COLM 2024Poster
3
Federated Recommendation with Additive Personalization
ICLR 2024Poster
4
Learning UI-to-Code Reverse Generator Using Visual Critic Without Rendering
ICLR 2024withdrawn
3
Controllable Pareto Trade-off between Fairness and Accuracy
ICLR 2024withdrawn
4
Mixture-of-Experts in Prompt Optimization
ICLR 2024Rejected
5
Multi-Objective Multi-Solution Transport
ICLR 2024Rejected
3
InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models
ICLR 2024Rejected
3
NLPBench: Evaluating Large Language Models on Solving NLP Problems
ICLR 2024Rejected
3
AlpaGasus: Training a Better Alpaca with Fewer Data
ICLR 2024Poster