Furu Wei
~Furu_Wei1
Total papers: 41
Submissions per year: 20.5
Average rating
Acceptance: 28/41
Venue distribution
ICLR: 26
NeurIPS: 12
COLM: 2
ICML: 1
Published papers (41)
2025 (22 papers)
4
Semi-Parametric Retrieval via Binary Bag-of-Tokens Index
ICLR 2025 Poster
4
Scaling Optimal LR Across Token Horizons
ICLR 2025 Poster
4
Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
ICLR 2025 Rejected
4
Next Block Prediction: Video Generation via Semi-Auto-Regressive Modeling
ICLR 2025 Rejected
4
Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning
NeurIPS 2025 Poster
3
Textual Aesthetics in Large Language Models
ICLR 2025 Withdrawn
4
Generative Representational Instruction Tuning
ICLR 2025 Poster
5
Self-Boosting Large Language Models with Synthetic Preference Data
ICLR 2025 Poster
4
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
ICLR 2025 Withdrawn
3
Preference Optimization for Reasoning with Pseudo Feedback
ICLR 2025 Spotlight
4
Chain-of-Retrieval Augmented Generation
NeurIPS 2025 Poster
5
Data Selection via Optimal Control for Language Models
ICLR 2025 Oral
4
Differential Transformer
ICLR 2025 Oral
4
Reward Reasoning Models
NeurIPS 2025 Poster
4
E5-V: Universal Embeddings with Multimodal Large Language Models
ICLR 2025 Rejected
4
Imagine While Reasoning in Space: Multimodal Visualization-of-Thought
ICML 2025 Poster
4
One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks
ICLR 2025 Withdrawn
4
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers
ICLR 2025 Rejected
4
Think Only When You Need with Large Hybrid-Reasoning Models
NeurIPS 2025 Poster
4
ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation
ICLR 2025 Poster
4
Scaling Laws of Synthetic Data for Language Model
COLM 2025 Poster
4
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
ICLR 2025 Rejected
2024 (19 papers)
4
MiniLLM: Knowledge Distillation of Large Language Models
ICLR 2024 Poster
4
Mixture of LoRA Experts
ICLR 2024 Poster
4
Adapting Large Language Models via Reading Comprehension
ICLR 2024 Poster
-
SupMem: Support Memorization for Semiparametric Language Models
ICLR 2024 Withdrawn
3
Multimodal Large Language Models Make Text-to-Image Generative Models Align Better
NeurIPS 2024 Poster
4
Boosting Text-to-Video Generative Model with MLLMs Feedback
NeurIPS 2024 Poster
3
SCALE: Synergized Collaboration of Asymmetric Language Translation Engines
ICLR 2024 Rejected
4
In-context Autoencoder for Context Compression in a Large Language Model
ICLR 2024 Poster
3
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
ICLR 2024 Poster
4
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
NeurIPS 2024 Poster
4
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
ICLR 2024 Poster
4
Multi-Head Mixture-of-Experts
NeurIPS 2024 Poster
4
Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models
NeurIPS 2024 Poster
4
Grounding Multimodal Large Language Models to the World
ICLR 2024 Poster
3
Retentive Network
NeurIPS 2024 Rejected
4
Retentive Network: A Successor to Transformer for Large Language Models
ICLR 2024 Rejected
4
LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models
COLM 2024 Poster
4
You Only Cache Once: Decoder-Decoder Architectures for Language Models
NeurIPS 2024 Oral
4
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
NeurIPS 2024 Rejected