Zejun MA
~Zejun_MA1
Total papers: 17
Submissions per year: 8.5
Average rating
Accepted: 13/17
Venue distribution
ICLR: 9
NeurIPS: 5
ICML: 2
COLM: 1
Publications (17 papers)
2025: 11 papers
4
Robust SuperAlignment: Weak-to-Strong Robustness Generalization for Vision-Language Models
NeurIPS 2025 Spotlight
4
Video Instruction Tuning with Synthetic Data
ICLR 2025 Withdrawn
5
General-Reasoner: Advancing LLM Reasoning Across All Domains
NeurIPS 2025 Poster
5
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning
NeurIPS 2025 Poster
4
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild
COLM 2025 Poster
3
Improving LLM Video Understanding with 16 Frames Per Second
ICML 2025 Poster
3
LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
ICLR 2025 Spotlight
4
Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization
ICLR 2025 Rejected
4
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model
ICML 2025 Poster
4
Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
NeurIPS 2025 Poster
4
ZeCO: Zero-Communication Overhead Sequence Parallelism for Linear Attention
NeurIPS 2025 Poster
2024: 6 papers
3
SALMONN: Towards Generic Hearing Abilities for Large Language Models
ICLR 2024 Poster
4
Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models
ICLR 2024 Rejected
4
TETA: Temporal-Enhanced Text-to-Audio Generation
ICLR 2024 Withdrawn
4
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis
ICLR 2024 Poster
4
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
ICLR 2024 Spotlight
4
PolyVoice: Language Models for Speech to Speech Translation
ICLR 2024 Poster