Bryan Catanzaro
~Bryan_Catanzaro1
29
Total papers
14.5
Submissions per year
Average rating
Accepted: 22/29
Venue distribution
ICLR
16
NeurIPS
8
ICML
4
COLM
1
Papers (29)
2025: 22 papers
4
FeatSharp: Your Vision Model Features, Sharper
ICML 2025 Poster
4
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
ICLR 2025 Poster
4
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
ICLR 2025 Poster
4
ETTA: Elucidating the Design Space of Text-to-Audio Models
ICML 2025 Poster
4
OMCAT: Omni Context Aware Transformer
ICLR 2025 Rejected
4
PHI-S: Distribution Balancing for Agglomerative Models
ICLR 2025 Rejected
4
Elucidating the Design Space of Text-to-Audio Models
ICLR 2025 Rejected
4
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
ICLR 2025 Spotlight
3
Nemotron-CORTEXA: Enhancing LLM Agents for Software Engineering Tasks via Improved Localization and Solution Diversity
ICML 2025 Poster
4
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning
NeurIPS 2025 Poster
5
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
ICLR 2025 Poster
4
A$^2$-Flow: Alignment-Aware Pre-training for Speech Synthesis with Flow Matching
ICLR 2025 Rejected
3
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs
ICLR 2025 Poster
5
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation
ICLR 2025 Poster
4
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
ICML 2025 Poster
3
Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning
NeurIPS 2025 Spotlight
4
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
NeurIPS 2025 Spotlight
5
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
ICLR 2025 Spotlight
4
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
NeurIPS 2025 Poster
4
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
NeurIPS 2025 Poster
3
Fugatto 1: Foundational Generative Audio Transformer Opus 1
ICLR 2025 Poster
3
LLM Pruning and Distillation in Practice
ICLR 2025 Rejected
2024: 7 papers
5
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
COLM 2024 Poster
3
In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models
ICLR 2024 Withdrawn
4
ChatQA: Surpassing GPT-4 on Conversational QA and RAG
NeurIPS 2024 Poster
3
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining
ICLR 2024 Rejected
3
Compact Language Models via Pruning and Knowledge Distillation
NeurIPS 2024 Poster
3
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
NeurIPS 2024 Poster
6
Retrieval meets Long Context Large Language Models
ICLR 2024 Poster