Mohamed Elhoseiny
~Mohamed_Elhoseiny1
20
论文总数
10.0
年均投稿
平均评分
接收情况9/20
会议分布
ICLR
17
NeurIPS
2
ICML
1
发表论文 (20 篇)
202513 篇
4
ReferPix2Pix: Guiding Multi-Modal LLMs for Image Editing with Referential Pixel Grounding
ICLR 2025withdrawn
4
StoryGPT-V: Large Language Models as Consistent Story Visualizers
ICLR 2025withdrawn
4
Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
ICLR 2025Spotlight
4
Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding
NeurIPS 2025Spotlight
5
Query-based Knowledge Transfer for Heterogeneous Learning Environments
ICLR 2025Poster
5
InfiniBench: A Comprehensive Benchmark for Large Multimodal Models in Very Long Video Understanding
ICLR 2025Rejected
4
HuMouS: Human Motion Synthesis with Fine-Grained Control using Latent Space Manipulation of Cycle-Consistent Diffusion Models
ICLR 2025Rejected
4
ToddlerDiffusion: Interactive Structured Image Generation with Cascaded Schrödinger Bridge
ICLR 2025Poster
6
iMotion-LLM: Motion Prediction Instruction Tuning
ICLR 2025withdrawn
4
AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?
ICLR 2025withdrawn
4
MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks
NeurIPS 2025Poster
5
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
ICLR 2025Rejected
4
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
ICML 2025Poster
20247 篇
4
Overcoming Generic Knowledge Loss with Selective Parameter Update
ICLR 2024withdrawn
4
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
ICLR 2024Poster
4
Modeling Annotation Delay In Continual Learning
ICLR 2024Rejected
3
On the Relation between Gradient Directions and Systematic Generalization
ICLR 2024withdrawn
4
CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding
ICLR 2024Poster
4
Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation
ICLR 2024Poster
4
MiniGPT-v2: Large Language Model as a Unified Interface for Vision-Language Multi-task Learning
ICLR 2024Rejected