Impact Index

99.37/100

Top 0.1%

Site-wide rank #11

Papers published: 69

Average rating: 5.4

Average output: 23.0 papers/year

Mohit Bansal

Full Professor @ University of North Carolina at Chapel Hill · United States · OpenReview

Research areas

natural language processing · multimodal ML

OpenThoughts: Data Recipes for Reasoning Models

Gistify: Codebase-Level Understanding via Runtime Execution

ICLR 2026 · Poster

VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing

ICLR 2026 · Poster

PoSh: Using Scene Graphs to Guide LLMs-as-a-Judge for Detailed Image Descriptions

ICLR 2026 · Poster

UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning

ICLR 2026 · Rejected

The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration

ICLR 2026 · Rejected

SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models

ICLR 2026 · Desk Rejected

CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval

ICLR 2026 · Rejected

Nudging the Boundaries of LLM Reasoning

ICLR 2026 · Poster

Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding

ICLR 2026 · Rejected

One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration

ICLR 2026 · Poster

TrustGen: A Platform of Dynamic Benchmarking on the Trustworthiness of Generative Foundation Models

ICLR 2026 · Poster

Proof-Verifier: Enabling Reinforcement Learning from Verifiable Rewards for Mathematical Theorem Proving

ICLR 2026 · Rejected

Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization

ICLR 2026 · Rejected

EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

ICLR 2026 · Rejected

Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale

ICLR 2026 · Rejected

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

ICLR 2026 · Rejected

Self-Knowledge Without a Self? Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns

ICLR 2026 · Rejected

Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning

ICLR 2026 · Rejected

Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails

ICLR 2026 · Rejected

Think Right: Learning to Mitigate Under-Over Thinking via Adaptive, Attentive Compression

ICLR 2026 · Rejected

How Should We Evaluate LLM Reasoning Quality For Fact Verification?

ICLR 2026 · Rejected

Self-Correcting Text-to-Video Generation with Misalignment Detection and Localized Refinement

ICLR 2026 · Withdrawn

DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback

ICLR 2025 · Spotlight

Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits

NeurIPS 2025 · Poster

Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression

COLM 2025 · Poster

ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding

NeurIPS 2025 · Poster

CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion

ICLR 2025 · Poster

Adapt-$\infty$: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection

ICLR 2025 · Poster

Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel

ICLR 2025 · Poster

Anyprefer: An Agentic Framework for Preference Data Synthesis

ICLR 2025 · Poster

Retrieval-Augmented Generation with Conflicting Evidence

COLM 2025 · Poster

Self-Consistency Preference Optimization

ICML 2025 · Poster

See It from My Perspective: How Language Affects Cultural Bias in Image Understanding

ICLR 2025 · Poster

SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation

ICLR 2025 · Poster

QAPyramid: Fine-grained Evaluation of Content Selection for Text Summarization

COLM 2025 · Poster

VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning

ICLR 2025 · Poster

Unbounded: A Generative Infinite Game of Character Life Simulation

ICLR 2025 · Poster

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

NeurIPS 2025 · Poster

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

NeurIPS 2025 · Poster

GenerationPrograms: Fine-grained Attribution with Executable Programs

COLM 2025 · Poster

Learning to Generate Unit Tests for Automated Debugging

COLM 2025 · Poster

What Matters for Model Merging at Scale?

ICLR 2025 · Rejected

RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives

ICLR 2025 · Rejected

LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits

ICLR 2025 · Rejected

Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level

ICLR 2025 · Withdrawn

MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning

ICLR 2025 · Rejected

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos

ICLR 2025 · Withdrawn

Glider: Global and Local Instruction-Driven Expert Router

ICLR 2025 · Withdrawn

System 1.x: Learning to Balance Fast and Slow Planning with Language Models

ICLR 2025 · Poster

Collaborators (20)

Elias Stengel-Eskin