Adel Bibi
~Adel_Bibi1
24
论文总数
12.0
年均投稿
平均评分
接收情况16/24
会议分布
ICLR
15
NeurIPS
7
ICML
1
COLM
1
发表论文 (24 篇)
202512 篇
4
Questioning Simplicity Bias Assumptions
ICLR 2025withdrawn
-
Language Models' Internal Conflicts: Layer-wise Usable Information For Detecting Model (Un)answerability
ICLR 2025withdrawn
4
Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
ICLR 2025Spotlight
5
On the Coexistence and Ensembling of Watermarks
NeurIPS 2025Poster
4
On the Coexistence and Ensembling of Watermarks
ICLR 2025Rejected
4
Towards Certification of Uncertainty Calibration under Adversarial Attacks
ICLR 2025Poster
4
SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?
ICLR 2025Rejected
4
Do as I do (Safely): Mitigating Task-Specific Fine-tuning Risks in Large Language Models
ICLR 2025Poster
4
MIP against Agent: Malicious Image Patches Hijacking Multimodal OS Agents
NeurIPS 2025Poster
4
Mixture of Experts Made Intrinsically Interpretable
ICML 2025Poster
4
Shh, don't say that! Domain Certification in LLMs
ICLR 2025Poster
3
Rethinking Safety in LLM Fine-tuning: An Optimization Perspective
COLM 2025Poster
202412 篇
4
Efficient Certification of Physics-Informed Neural Networks
ICLR 2024Rejected
4
Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation
ICLR 2024Rejected
4
When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations
ICLR 2024Poster
4
Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation
ICLR 2024Poster
5
Efficient Lifelong Model Evaluation in an Era of Rapid Progress
NeurIPS 2024Poster
4
Universal In-Context Approximation By Prompting Fully Recurrent Models
NeurIPS 2024Poster
5
From Categories to Classifier: Name-Only Continual Learning by Exploring the Web
ICLR 2024withdrawn
4
Label Delay in Online Continual Learning
NeurIPS 2024Poster
4
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
NeurIPS 2024Poster
3
Illusory Attacks: Information-theoretic detectability matters in adversarial attacks
ICLR 2024Spotlight
4
Modeling Annotation Delay In Continual Learning
ICLR 2024Rejected
4
Can Large Language Model Agents Simulate Human Trust Behavior?
NeurIPS 2024Poster