Yikang Shen
~Yikang_Shen1
19
Total papers
9.5
Submissions per year
Average rating
Accepted: 11/19
Venue distribution
ICLR
13
NeurIPS
4
COLM
1
ICML
1
Published papers (19)
2025 (6 papers)
4
Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler
ICLR 2025, Withdrawn
4
PaTH Attention: Position Encoding via Accumulating Householder Transformations
NeurIPS 2025, Poster
4
LaMAGIC2: Advanced Circuit Formulations for Language Model-Based Analog Topology Generation
ICML 2025, Poster
4
API Pack: A Massive Multi-Programming Language Dataset for API Call Generation
ICLR 2025, Poster
4
Towards Efficient and No Forgetting Domain Continual Pretraining by Mitigating the Stability Gap
ICLR 2025, Withdrawn
4
Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study
ICLR 2025, Poster
2024 (13 papers)
4
Learning to Select In-context Examples from Reward
ICLR 2024, Withdrawn
3
Scattered Mixture-of-Experts Implementation
COLM 2024, Poster
4
The Consensus Game: Language Model Generation via Equilibrium Search
ICLR 2024, Spotlight
5
Structured Fine-Tuning Enables Data-Efficient Adaptation of Code Language Models
ICLR 2024, Withdrawn
4
SALMON: Self-Alignment with Instructable Reward Models
ICLR 2024, Poster
4
Autonomous Tree-search Ability of Large Language Models
ICLR 2024, Withdrawn
4
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
NeurIPS 2024, Poster
3
LegoNet: Piecing Together and Breaking Apart Sub-Networks for Scalable Multi-task Learning
ICLR 2024, Withdrawn
4
GraphText: Graph Learning in Text Space
ICLR 2024, Withdrawn
3
Parallelizing Linear Transformers with the Delta Rule over Sequence Length
NeurIPS 2024, Poster
4
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
NeurIPS 2024, Spotlight
4
Aligning Large Multimodal Models with Factually Augmented RLHF
ICLR 2024, Rejected
3
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
ICLR 2024, Poster