PaperHub

Bryan Catanzaro

~Bryan_Catanzaro1

29
论文总数
14.5
年均投稿
6.6
平均评分
接收情况22/29
会议分布
ICLR
16
NeurIPS
8
ICML
4
COLM
1

发表论文 (29 篇)

202522

6.6
4

FeatSharp: Your Vision Model Features, Sharper

ICML 2025Poster
6.5
4

MM-EMBED: UNIVERSAL MULTIMODAL RETRIEVAL WITH MULTIMODAL LLMS

ICLR 2025Poster
6.8
4

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

ICLR 2025Poster
7.2
4

ETTA: Elucidating the Design Space of Text-to-Audio Models

ICML 2025Poster
6.0
4

OMCAT: Omni Context Aware Transformer

ICLR 2025Rejected
5.3
4

PHI-S: Distribution Balancing for Agglomerative Models

ICLR 2025Rejected
6.0
4

Elucidating the Design Space of Text-to-Audio Models

ICLR 2025Rejected
7.5
4

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

ICLR 2025Spotlight
4.8
3

Nemotron-CORTEXA: Enhancing LLM Agents for Software Engineering Tasks via Improved Localization and Solution Diversity

ICML 2025Poster
7.3
4

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

NeurIPS 2025Poster
5.8
5

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

ICLR 2025Poster
5.5
4

A$^2$-Flow: Alignment-Aware Pre-training for Speech Synthesis with Flow Matching

ICLR 2025Rejected
6.0
3

MIND: Math Informed syNthetic Dialogues for Pretraining LLMs

ICLR 2025Poster
6.0
5

UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation

ICLR 2025Poster
7.8
4

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

ICML 2025Poster
8.2
3

Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning

NeurIPS 2025Spotlight
8.7
4

Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models

NeurIPS 2025Spotlight
7.2
5

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

ICLR 2025Spotlight
6.8
4

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

NeurIPS 2025Poster
8.2
4

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

NeurIPS 2025Poster
6.7
3

Fugatto 1: Foundational Generative Audio Transformer Opus 1

ICLR 2025Poster
5.0
3

LLM Pruning and Distillation in Practice

ICLR 2025Rejected