Jan Kautz
~Jan_Kautz1
34
论文总数
17.0
年均投稿
平均评分
接收情况21/34
会议分布
ICLR
23
NeurIPS
10
ICML
1
发表论文 (34 篇)
202524 篇
4
Gated Delta Networks: Improving Mamba2 with Delta Rule
ICLR 2025Poster
4
Minifinetuning: Low-Data Generation Domain Adaptation through Corrective Self-Distillation
ICLR 2025Rejected
4
PHI-S: Distribution Balancing for Agglomerative Models
ICLR 2025Rejected
4
LLaMaFlex: Many-in-one LLMs via Generalized Pruning and Weight Sharing
ICLR 2025Poster
4
NaVILA: Legged Robot Vision-Language-Action Model for Navigation
ICLR 2025Rejected
4
Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction
ICLR 2025Rejected
5
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
ICLR 2025Rejected
4
UNAST: Unified framework for Neural Architecture Search for Transformers
ICLR 2025withdrawn
4
VILA^2: VLM Augmented VLM with Self-Improvement
ICLR 2025withdrawn
4
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
NeurIPS 2025Poster
4
X-VILA: Cross-Modality Alignment for Large Language Models
ICLR 2025withdrawn
3
LLM Pruning and Distillation in Practice
ICLR 2025Rejected
4
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement
ICLR 2025Poster
4
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
ICML 2025Poster
3
ZoomVLM: A Tuning-Free Framework for Efficient Video Understanding via Adaptive Zooming in Vision-Language Models
ICLR 2025Rejected
4
Scaling RL to Long Videos
NeurIPS 2025Poster
4
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models
NeurIPS 2025Poster
4
Hymba: A Hybrid-head Architecture for Small Language Models
ICLR 2025Spotlight
4
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
NeurIPS 2025Poster
5
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
ICLR 2025Spotlight
4
GSPN-2: Efficient Parallel Sequence Modeling
NeurIPS 2025Poster
3
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
ICLR 2025Poster
4
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
NeurIPS 2025Poster
4
Wolf: Accurate Video Captioning with a World Summarization Framework
ICLR 2025withdrawn
202410 篇
4
CosAE: Learnable Fourier Series for Image Restoration
NeurIPS 2024Poster
4
ViR: Vision Retention Networks
ICLR 2024withdrawn
4
A Variational Perspective on Solving Inverse Problems with Diffusion Models
ICLR 2024Poster
4
DiffiT: Diffusion Vision Transformers for Image Generation
ICLR 2024withdrawn
5
3D Reconstruction with Generalizable Neural Fields using Scene Priors
ICLR 2024Poster
4
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
NeurIPS 2024Spotlight
4
SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models
NeurIPS 2024Poster
4
FasterViT: Fast Vision Transformers with Hierarchical Attention
ICLR 2024Poster
3
Compact Language Models via Pruning and Knowledge Distillation
NeurIPS 2024Poster
4
Learning to Jointly Understand Visual and Tactile Signals
ICLR 2024Poster