Jianfeng Gao
~Jianfeng_Gao1
45
论文总数
22.5
年均投稿
平均评分
接收情况31/45
会议分布
ICLR
29
NeurIPS
11
ICML
3
COLM
2
发表论文 (45 篇)
202527 篇
4
Matryoshka Multimodal Models
ICLR 2025Poster
4
Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation
NeurIPS 2025Poster
4
TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies
ICLR 2025Poster
4
Pixelated Instructions: Can Multimodal Large Language Models Follow Printed Instructions in Images?
ICLR 2025Rejected
4
Mixture of Inputs: Text Generation Beyond Discrete Token Sampling
NeurIPS 2025Poster
5
Evaluating Graphical Perception of Large Multimodal Models
ICLR 2025withdrawn
4
Vector-ICL: In-context Learning with Continuous Vector Representations
ICLR 2025Poster
5
Model Tells Itself Where to Attend: Steerable Prompting for Reliable Reading Comprehension of LLM
ICLR 2025withdrawn
4
ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning
ICLR 2025Poster
4
Interpretable Next-token Prediction via the Generalized Induction Head
NeurIPS 2025Poster
4
Interpretable Language Modeling via Induction-head Ngram Models
ICLR 2025Rejected
4
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
ICLR 2025Poster
4
Simplifying DINO via Coding Rate Regularization
ICML 2025Poster
4
DataGen: Unified Synthetic Dataset Generation via Large Language Models
ICLR 2025Poster
3
MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention
ICML 2025Poster
6
SCBench: A KV Cache-Centric Analysis of Long-Context Methods
ICLR 2025Poster
4
Training Language Models to Generate Quality Code with Program Analysis Feedback
NeurIPS 2025Poster
4
Riemannian Low-Rank Adaptation for Federated Fine-Tuning of Foundation Models
ICLR 2025withdrawn
5
CollabLLM: From Passive Responders to Active Collaborators
ICML 2025Oral
4
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
NeurIPS 2025Poster
4
SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents
ICLR 2025Poster
4
Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
NeurIPS 2025Poster
6
Latent Action Pretraining from Videos
ICLR 2025Poster
5
TemporalBench: Towards Fine-grained Temporal Understanding for Multimodal Video Models
ICLR 2025withdrawn
4
SAS: Simulated Attention Score
NeurIPS 2025Poster
4
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
NeurIPS 2025Poster
4
GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding
ICLR 2025Poster
202418 篇
4
Sparse Backpropagation for MoE Training
ICLR 2024Rejected
3
Is Self-Repair a Silver Bullet for Code Generation?
ICLR 2024Poster
3
Fast-ELECTRA for Efficient Pre-training
ICLR 2024Poster
3
Compositional Generalization Across Distributional Shifts with Sparse Tree Operations
NeurIPS 2024Spotlight
4
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
ICLR 2024Rejected
4
Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks
ICLR 2024withdrawn
3
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs
NeurIPS 2024Poster
4
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
ICLR 2024Poster
6
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
ICLR 2024Oral
4
Efficient Hybrid Long Sequence Modeling with State Space Augmented Transformers
COLM 2024Poster
4
Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs Questions
NeurIPS 2024Poster
3
MedJourney: Counterfactual Medical Image Generation by Instruction-Learning from Multimodal Patient Journeys
ICLR 2024Rejected
3
Explaining black box text modules in natural language with language models
ICLR 2024Rejected
5
Efficient Long Sequence Modeling via State Space Augmented Transformer
ICLR 2024Rejected
5
MindAgent: Emergent Gaming Interaction
ICLR 2024withdrawn
6
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
COLM 2024Poster
4
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
ICLR 2024Oral
4
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
ICLR 2024Rejected