Mohit Bansal
~Mohit_Bansal2
46
论文总数
23.0
年均投稿
平均评分
接收情况34/46
会议分布
ICLR
30
COLM
8
NeurIPS
7
ICML
1
发表论文 (46 篇)
202528 篇
5
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
ICLR 2025Poster
4
RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives
ICLR 2025Rejected
4
See It from My Perspective: How Language Affects Cultural Bias in Image Understanding
ICLR 2025Poster
4
LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits
ICLR 2025Rejected
3
LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits
NeurIPS 2025Poster
4
Adapt-$\infty$: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection
ICLR 2025Poster
4
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
ICLR 2025Spotlight
4
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
ICLR 2025Oral
4
Retrieval-Augmented Generation with Conflicting Evidence
COLM 2025Poster
4
Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression
COLM 2025Poster
4
Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents
NeurIPS 2025Poster
3
Learning to Generate Unit Tests for Automated Debugging
COLM 2025Poster
3
GenerationPrograms: Fine-grained Attribution with Executable Programs
COLM 2025Poster
5
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning
ICLR 2025Rejected
3
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
ICLR 2025Poster
4
QAPyramid: Fine-grained Evaluation of Content Selection for Text Summarization
COLM 2025Poster
3
What Matters for Model Merging at Scale?
ICLR 2025Rejected
5
VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
ICLR 2025Poster
4
Glider: Global and Local Instruction-Driven Expert Router
ICLR 2025withdrawn
4
System 1.x: Learning to Balance Fast and Slow Planning with Language Models
ICLR 2025Poster
4
Self-Consistency Preference Optimization
ICML 2025Poster
4
ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding
NeurIPS 2025Poster
5
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
ICLR 2025withdrawn
2
Unbounded: A Generative Infinite Game of Character Life Simulation
ICLR 2025Poster
4
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
ICLR 2025withdrawn
4
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
ICLR 2025Poster
4
4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time
NeurIPS 2025Poster
4
Anyprefer: An Agentic Framework for Preference Data Synthesis
ICLR 2025Poster
202418 篇
5
$\mathbb{D}^2$ Pruning: Message Passing for Balancing Diversity & Difficulty in Data Pruning
ICLR 2024Poster
4
Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks
ICLR 2024Spotlight
4
LACIE: Listener-Aware Finetuning for Calibration in Large Language Models
NeurIPS 2024Poster
5
Multimodal Representation Learning by Alternating Unimodal Adaptation
ICLR 2024withdrawn
3
Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
ICLR 2024Poster
4
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models
ICLR 2024Poster
4
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs
ICLR 2024Rejected
4
VideoDirectorGPT: Consistent Multi-Scene Video Generation via LLM-Guided Planning
ICLR 2024withdrawn
5
VideoDirectorGPT: Consistent Multi-Scene Video Generation via LLM-Guided Planning
COLM 2024Poster
4
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning
COLM 2024Poster
4
Efficient Low-Rank Diffusion Model Training for Text-to-Image Generation
ICLR 2024withdrawn
4
Dynamic Adapter Merging for Continual Video Question-Answering Learning
ICLR 2024Rejected
3
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
COLM 2024Poster
4
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
NeurIPS 2024Poster
3
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
ICLR 2024Spotlight
3
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation
ICLR 2024Poster
4
GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations
NeurIPS 2024Poster
4
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
ICLR 2024Poster