PaperHub

Jianfeng Gao

~Jianfeng_Gao1

Total papers: 45
Submissions per year: 22.5
Average rating: 6.1
Acceptance: 31/45
Venue distribution
ICLR: 29
NeurIPS: 11
ICML: 3
COLM: 2

Published papers (45)

2025 (27)

6.0
4

Matryoshka Multimodal Models

ICLR 2025 Poster
6.4
4

Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation

NeurIPS 2025 Poster
7.0
4

TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies

ICLR 2025 Poster
4.0
4

Pixelated Instructions: Can Multimodal Large Language Models Follow Printed Instructions in Images?

ICLR 2025 Rejected
7.8
4

Mixture of Inputs: Text Generation Beyond Discrete Token Sampling

NeurIPS 2025 Poster
4.0
5

Evaluating Graphical Perception of Large Multimodal Models

ICLR 2025 Withdrawn
6.0
4

Vector-ICL: In-context Learning with Continuous Vector Representations

ICLR 2025 Poster
4.6
5

Model Tells Itself Where to Attend: Steerable Prompting for Reliable Reading Comprehension of LLM

ICLR 2025 Withdrawn
5.8
4

ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning

ICLR 2025 Poster
7.3
4

Interpretable Next-token Prediction via the Generalized Induction Head

NeurIPS 2025 Poster
6.8
4

Interpretable Language Modeling via Induction-head Ngram Models

ICLR 2025 Rejected
5.8
4

Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass

ICLR 2025 Poster
6.1
4

Simplifying DINO via Coding Rate Regularization

ICML 2025 Poster
6.0
4

DataGen: Unified Synthetic Dataset Generation via Large Language Models

ICLR 2025 Poster
7.0
3

MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention

ICML 2025 Poster
6.5
6

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

ICLR 2025 Poster
6.4
4

Training Language Models to Generate Quality Code with Program Analysis Feedback

NeurIPS 2025 Poster
4.3
4

Riemannian Low-Rank Adaptation for Federated Fine-Tuning of Foundation Models

ICLR 2025 Withdrawn
7.8
5

CollabLLM: From Passive Responders to Active Collaborators

ICML 2025 Oral
7.3
4

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

NeurIPS 2025 Poster
5.3
4

SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents

ICLR 2025 Poster
6.8
4

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

NeurIPS 2025 Poster
5.8
6

Latent Action Pretraining from Videos

ICLR 2025 Poster
4.2
5

TemporalBench: Towards Fine-grained Temporal Understanding for Multimodal Video Models

ICLR 2025 Withdrawn
6.4
4

SAS: Simulated Attention Score

NeurIPS 2025 Poster
7.3
4

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

NeurIPS 2025 Poster
6.3
4

GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding

ICLR 2025 Poster

2024 (18)

4.8
4

Sparse Backpropagation for MoE Training

ICLR 2024 Rejected
7.3
3

Is Self-Repair a Silver Bullet for Code Generation?

ICLR 2024 Poster
6.0
3

Fast-ELECTRA for Efficient Pre-training

ICLR 2024 Poster
7.0
3

Compositional Generalization Across Distributional Shifts with Sparse Tree Operations

NeurIPS 2024 Spotlight
5.5
4

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

ICLR 2024 Rejected
4.0
4

Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks

ICLR 2024 Withdrawn
6.7
3

DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs

NeurIPS 2024 Poster
5.8
4

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

ICLR 2024 Poster
8.0
6

Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

ICLR 2024 Oral
6.3
4

Efficient Hybrid Long Sequence Modeling with State Space Augmented Transformers

COLM 2024 Poster
5.0
4

Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs Questions

NeurIPS 2024 Poster
5.7
3

MedJourney: Counterfactual Medical Image Generation by Instruction-Learning from Multimodal Patient Journeys

ICLR 2024 Rejected
6.3
3

Explaining black box text modules in natural language with language models

ICLR 2024 Rejected
5.4
5

Efficient Long Sequence Modeling via State Space Augmented Transformer

ICLR 2024 Rejected
5.0
5

MindAgent: Emergent Gaming Interaction

ICLR 2024 Withdrawn
6.8
6

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

COLM 2024 Poster
7.3
4

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

ICLR 2024 Oral
5.5
4

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

ICLR 2024 Rejected