PaperHub

Yuandong Tian

~Yuandong_Tian1

40
论文总数
20.0
年均投稿
6.0
平均评分
接收情况25/40
会议分布
ICLR
26
NeurIPS
6
ICML
5
COLM
3

发表论文 (40 篇)

202528

6.8
4

Composing Global Solutions to Reasoning Tasks via Algebraic Objects in Neural Nets

NeurIPS 2025Poster
5.7
3

Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets

ICLR 2025Rejected
6.8
4

Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking

ICLR 2025Poster
6.4
4

AdvPrefix: An Objective for Nuanced LLM Jailbreaks

NeurIPS 2025Poster
5.5
4

R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference

ICLR 2025Poster
7.5
4

Towards General-Purpose Model-Free Reinforcement Learning

ICLR 2025Spotlight
6.4
5

GSM-$\infty$: How Do your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length?

ICML 2025Poster
6.5
4

Param$\Delta$ for Direct Mixing: Post-Train Large Language Model At Zero Cost

ICLR 2025Poster
4.8
4

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

ICLR 2025Poster
5.0
4

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

ICLR 2025Rejected
5.0
4

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

ICLR 2025Rejected
6.3
3

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

ICML 2025Poster
6.1
4

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

ICML 2025Poster
4.8
4

SHARP: Accelerating Language Model Inference by SHaring Adjacent layers with Recovery Parameters

ICLR 2025Rejected
5.3
4

Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition

ICLR 2025Rejected
5.0
4

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

ICLR 2025Rejected
5.0
4

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

ICLR 2025withdrawn
7.3
4

Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought

NeurIPS 2025Poster
6.3
4

Training Large Language Models to Reason in a Continuous Latent Space

COLM 2025Poster
7.2
5

MagicPIG: LSH Sampling for Efficient LLM Generation

ICLR 2025Spotlight
5.8
4

Training Large Language Model to Reason in a Continuous Latent Space

ICLR 2025Rejected
5.8
5

SpinQuant: LLM Quantization with Learned Rotations

ICLR 2025Poster
5.0
3

Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning

ICLR 2025Rejected
6.1
4

From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications

ICML 2025Poster
7.2
4

Agent-as-a-Judge: Evaluate Agents with Agents

ICML 2025Poster
5.7
3

Agent-as-a-Judge: Evaluating Agents with Agents

ICLR 2025Rejected
7.3
4

ParetoQ: Improving Scaling Laws in Extremely Low-bit LLM Quantization

NeurIPS 2025Poster
5.0
3

The Perfect Blend: Redefining RLHF with Mixture of Judges

ICLR 2025withdrawn

202412