Ruoxi Jia
~Ruoxi_Jia1
25
论文总数
12.5
年均投稿
平均评分
接收情况17/25
会议分布
ICLR
16
NeurIPS
4
COLM
3
ICML
2
发表论文 (25 篇)
202518 篇
5
LLMs Can Plan Only If We Tell Them
ICLR 2025Poster
4
CONCORD: Concept-informed Diffusion for Dataset Distillation
ICLR 2025withdrawn
4
SCOPE: Scalable and Adaptive Evaluation of Misguided Safety Refusal in LLMs
ICLR 2025Rejected
4
LLM Spark: Critical Thinking Evaluation of Large Language Models
ICLR 2025Rejected
4
LLMs Can Reason Faster Only If We Let Them
ICML 2025Poster
4
Fast and Noise-Robust Diffusion Solvers for Inverse Problems: A Frequentist Approach
ICLR 2025Rejected
4
Data Shapley in One Training Run
ICLR 2025Oral
4
Data-Centric Human Preference with Rationales for Direct Preference Alignment
COLM 2025Poster
4
Probing Hidden Knowledge Holes in Unlearned LLMs
NeurIPS 2025Poster
4
Data-Centric Human Preference Optimization with Rationales
ICLR 2025Rejected
4
Just Enough Shifts: Mitigating Over-Refusal in Aligned Language Models with Targeted Representation Fine-Tuning
ICML 2025Poster
4
Capturing the Temporal Dependence of Training Data Influence
ICLR 2025Oral
4
AutoScale: Automatic Prediction of Compute-optimal Data Compositions for Training LLMs
ICLR 2025Rejected
3
AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs
COLM 2025Poster
4
Mind Control through Causal Inference: Predicting Clean Images from Poisoned Data
ICLR 2025Poster
4
AIR-BENCH 2024: A Safety Benchmark based on Regulation and Policies Specified Risk Categories
ICLR 2025Spotlight
4
LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models
COLM 2025Poster
4
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal
ICLR 2025Poster
20247 篇
5
GREATS: Online Selection of High-Quality Data for LLM Training in Every Iteration
NeurIPS 2024Spotlight
4
Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
ICLR 2024Oral
3
Who Leaked the Model? Tracking IP Infringers in Accountable Federated Learning
ICLR 2024Rejected
4
Data-Centric Defense: Shaping Loss Landscape with Augmentations to Counter Model Inversion
ICLR 2024withdrawn
4
Fairness-Aware Meta-Learning via Nash Bargaining
NeurIPS 2024Poster
4
Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs
ICLR 2024Poster
4
Boosting Alignment for Post-Unlearning Text-to-Image Generative Models
NeurIPS 2024Poster