Sainbayar Sukhbaatar
~Sainbayar_Sukhbaatar1
15
论文总数
7.5
年均投稿
平均评分
接收情况11/15
会议分布
COLM
5
ICLR
5
ICML
3
NeurIPS
2
发表论文 (15 篇)
202510 篇
4
Training Large Language Model to Reason in a Continuous Latent Space
ICLR 2025Rejected
4
Training Large Language Models to Reason in a Continuous Latent Space
COLM 2025Poster
4
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
ICLR 2025Poster
4
Contextual Position Encoding: Learning to Count What’s Important
ICLR 2025Rejected
4
Multi-Token Attention
COLM 2025Poster
4
R.I.P.: Better Models by Survival of the Fittest Prompts
ICML 2025Poster
4
Self-Challenging Language Model Agents
NeurIPS 2025Poster
4
Thinking LLMs: General Instruction Following with Thought Generation
ICML 2025Poster
4
Self-Consistency Preference Optimization
ICML 2025Poster
4
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
ICLR 2025Rejected
20245 篇
3
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
COLM 2024Poster
4
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
COLM 2024Poster
4
Compositional Interfaces for Compositional Generalization
ICLR 2024Rejected
3
Reverse Training to Nurse the Reversal Curse
COLM 2024Poster
4
Iterative Reasoning Preference Optimization
NeurIPS 2024Poster