全部论文

为您找到 3,422 篇相关研究

An analytic theory of creativity in convolutional diffusion models

We obtain an analytic, interpretable and predictive theory of creativity in convolutional diffusion models. Indeed, score-matching diffusion models can generate highly original images that lie far from their training data. However, optimal score-matching theory suggests that these models should only be able to produce memorized training examples. To reconcile this theory-experiment gap, we identify two simple inductive biases, locality and equivariance, that: (1) induce a form of combinatorial creativity by preventing optimal score-matching; (2) result in fully analytic, completely mechanistically interpretable, local score (LS) and equivariant local score (ELS) machines that, (3) after calibrating a single time-dependent hyperparameter can quantitatively predict the outputs of trained convolution only diffusion models (like ResNets and UNets) with high accuracy (median $r^2$ of $0.95, 0.94, 0.94, 0.96$ for our top model on CIFAR10, FashionMNIST, MNIST, and CelebA). Our model reveals a {\it locally consistent patch mosaic} mechanism of creativity, in which diffusion models create exponentially many novel images by mixing and matching different local training set patches at different scales and image locations. Our theory also partially predicts the outputs of pre-trained self-attention enabled UNets (median $r^2 \sim 0.77$ on CIFAR10), revealing an intriguing role for attention in carving out semantic coherence from local patch mosaics.

全部论文

An analytic theory of creativity in convolutional diffusion models

Model Immunization from a Condition Number Perspective

Learning with Expected Signatures: Theory and Applications

Strategy Coopetition Explains the Emergence and Transience of In-Context Learning

Flowing Datasets with Wasserstein over Wasserstein Gradient Flows

AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML

Multi-agent Architecture Search via Agentic Supernet

Learning Smooth and Expressive Interatomic Potentials for Physical Property Prediction

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

FedSSI: Rehearsal-Free Continual Federated Learning with Synergistic Synaptic Intelligence

Towards Universal Offline Black-Box Optimization via Learning Language Model Embeddings

AutoGFM: Automated Graph Foundation Model with Adaptive Architecture Customization

Beyond Self-Repellent Kernels: History-Driven Target Towards Efficient Nonlinear MCMC on General Graphs

Conformal Anomaly Detection in Event Sequences

Suitability Filter: A Statistical Framework for Classifier Evaluation in Real-World Deployment Settings

ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via $\alpha$-$\beta$-Divergence

Towards flexible perception with visual memory

The dark side of the forces: assessing non-conservative force models for atomistic machine learning

Expected Variational Inequalities

Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

筛选条件

会议

年份

状态