Yuandong Tian
~Yuandong_Tian1
Total papers: 40
Submissions per year: 20.0
Average rating
Accepted: 25/40
Venue distribution
ICLR: 26
NeurIPS: 6
ICML: 5
COLM: 3
Papers (40)
2025 (28 papers)
4
Composing Global Solutions to Reasoning Tasks via Algebraic Objects in Neural Nets
NeurIPS 2025 Poster
3
Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets
ICLR 2025 Rejected
4
Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking
ICLR 2025 Poster
4
AdvPrefix: An Objective for Nuanced LLM Jailbreaks
NeurIPS 2025 Poster
4
R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
ICLR 2025 Poster
4
Towards General-Purpose Model-Free Reinforcement Learning
ICLR 2025 Spotlight
5
GSM-$\infty$: How Do your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length?
ICML 2025 Poster
4
Param$\Delta$ for Direct Mixing: Post-Train Large Language Model At Zero Cost
ICLR 2025 Poster
4
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
ICLR 2025 Poster
4
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
ICLR 2025 Rejected
4
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
ICLR 2025 Rejected
3
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
ICML 2025 Poster
4
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
ICML 2025 Poster
4
SHARP: Accelerating Language Model Inference by SHaring Adjacent layers with Recovery Parameters
ICLR 2025 Rejected
4
Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
ICLR 2025 Rejected
4
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients
ICLR 2025 Rejected
4
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients
ICLR 2025 Withdrawn
4
Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought
NeurIPS 2025 Poster
4
Training Large Language Models to Reason in a Continuous Latent Space
COLM 2025 Poster
5
MagicPIG: LSH Sampling for Efficient LLM Generation
ICLR 2025 Spotlight
4
Training Large Language Model to Reason in a Continuous Latent Space
ICLR 2025 Rejected
5
SpinQuant: LLM Quantization with Learned Rotations
ICLR 2025 Poster
3
Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning
ICLR 2025 Rejected
4
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications
ICML 2025 Poster
4
Agent-as-a-Judge: Evaluate Agents with Agents
ICML 2025 Poster
3
Agent-as-a-Judge: Evaluating Agents with Agents
ICLR 2025 Rejected
4
ParetoQ: Improving Scaling Laws in Extremely Low-bit LLM Quantization
NeurIPS 2025 Poster
3
The Perfect Blend: Redefining RLHF with Mixture of Judges
ICLR 2025 Withdrawn
2024 (12 papers)
4
JoMA: Demystifying Multilayer Transformers via Joint Dynamics of MLP and Attention
ICLR 2024 Poster
4
Efficient Streaming Language Models with Attention Sinks
ICLR 2024 Poster
4
On the Surprising Effectiveness of Attention Transfer for Vision Transformers
NeurIPS 2024 Poster
4
Contrastive Predict-and-Search for Mixed Integer Linear Programs
ICLR 2024 Rejected
3
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
COLM 2024 Poster
4
GenCO: Generating Diverse Solutions to Design Problems with Combinatorial Nature
ICLR 2024 Rejected
5
RLCD: Reinforcement Learning from Contrastive Distillation for LM Alignment
ICLR 2024 Poster
4
Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics
NeurIPS 2024 Poster
4
Learning Personalized Story Evaluation
ICLR 2024 Rejected
3
End-to-end Story Plot Generator
ICLR 2024 Rejected
4
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
COLM 2024 Poster
3
H-GAP: Humanoid Control with a Generalist Planner
ICLR 2024 Spotlight