PaperHub

Furu Wei

~Furu_Wei1

Total papers: 41
Average submissions per year: 20.5
Average rating: 6.0
Accepted: 28/41
Venue distribution
ICLR: 26
NeurIPS: 12
COLM: 2
ICML: 1

Published papers (41)

2025 (22 papers)

7.0
4

Semi-Parametric Retrieval via Binary Bag-of-Tokens Index

ICLR 2025 Poster
6.0
4

Scaling Optimal LR Across Token Horizons

ICLR 2025 Poster
4.8
4

Q-Sparse: All Large Language Models can be Fully Sparsely-Activated

ICLR 2025 Rejected
3.5
4

Next Block Prediction: Video Generation via Semi-Auto-Regressive Modeling

ICLR 2025 Rejected
6.4
4

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

NeurIPS 2025 Poster
4.7
3

Textual Aesthetics in Large Language Models

ICLR 2025 Withdrawn
7.0
4

Generative Representational Instruction Tuning

ICLR 2025 Poster
6.6
5

Self-Boosting Large Language Models with Synthetic Preference Data

ICLR 2025 Poster
4.8
4

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

ICLR 2025 Withdrawn
7.3
3

Preference Optimization for Reasoning with Pseudo Feedback

ICLR 2025 Spotlight
6.8
4

Chain-of-Retrieval Augmented Generation

NeurIPS 2025 Poster
8.0
5

Data Selection via Optimal Control for Language Models

ICLR 2025 Oral
8.0
4

Differential Transformer

ICLR 2025 Oral
6.4
4

Reward Reasoning Models

NeurIPS 2025 Poster
5.8
4

E5-V: Universal Embeddings with Multimodal Large Language Models

ICLR 2025 Rejected
6.1
4

Imagine While Reasoning in Space: Multimodal Visualization-of-Thought

ICML 2025 Poster
4.3
4

One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks

ICLR 2025 Withdrawn
5.0
4

VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers

ICLR 2025 Rejected
7.8
4

Think Only When You Need with Large Hybrid-Reasoning Models

NeurIPS 2025 Poster
6.3
4

ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation

ICLR 2025 Poster
6.5
4

Scaling Laws of Synthetic Data for Language Model

COLM 2025 Poster
5.3
4

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

ICLR 2025 Rejected

2024 (19 papers)

6.3
4

MiniLLM: Knowledge Distillation of Large Language Models

ICLR 2024 Poster
5.0
4

Mixture of LoRA Experts

ICLR 2024 Poster
6.5
4

Adapting Large Language Models via Reading Comprehension

ICLR 2024 Poster
-

SupMem: Support Memorization for Semiparametric Language Models

ICLR 2024 Withdrawn
5.7
3

Multimodal Large Language Models Make Text-to-Image Generative Models Align Better

NeurIPS 2024 Poster
6.0
4

Boosting Text-to-Video Generative Model with MLLMs Feedback

NeurIPS 2024 Poster
5.7
3

SCALE: Synergized Collaboration of Asymmetric Language Translation Engines

ICLR 2024 Rejected
6.8
4

In-context Autoencoder for Context Compression in a Large Language Model

ICLR 2024 Poster
6.0
3

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

ICLR 2024 Poster
4.5
4

xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

NeurIPS 2024 Poster
6.0
4

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

ICLR 2024 Poster
5.0
4

Multi-Head Mixture-of-Experts

NeurIPS 2024 Poster
6.0
4

Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models

NeurIPS 2024 Poster
7.0
4

Grounding Multimodal Large Language Models to the World

ICLR 2024 Poster
5.0
3

Retentive Network

NeurIPS 2024 Rejected
4.8
4

Retentive Network: A Successor to Transformer for Large Language Models

ICLR 2024 Rejected
6.3
4

LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models

COLM 2024 Poster
7.0
4

You Only Cache Once: Decoder-Decoder Architectures for Language Models

NeurIPS 2024 Oral
5.5
4

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

NeurIPS 2024 Rejected