Krishnamurthy Dj Dvijotham
~Krishnamurthy_Dj_Dvijotham1
8
论文总数
4.0
年均投稿
平均评分
接收情况6/8
会议分布
ICLR
5
COLM
2
NeurIPS
1
发表论文 (8 篇)
20253 篇
4
No, of Course I Can! Deeper Fine-Tuning Attacks That Bypass Token-Level Safety Mechanisms
NeurIPS 2025Rejected
3
DoomArena: A framework for Testing AI Agents Against Evolving Security Threats
COLM 2025Poster
4
BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks
ICLR 2025Poster
20245 篇
3
Correlated Noise Provably Beats Independent Noise for Differentially Private Learning
ICLR 2024Poster
4
Expressive Losses for Verified Robustness via Convex Combinations
ICLR 2024Poster
4
Efficient Certification of Physics-Informed Neural Networks
ICLR 2024Rejected
3
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models
ICLR 2024Poster
4
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
COLM 2024Poster