Efficient Training Image Extraction from Diffusion Models Ryan Webs
A Google TechTalk, presented by Ryan Webster, 2023-09-13
Abstract: The recent demonstration of Carlini et al. shows highly duplicated training images can be copied by diffusion models during generation, which is problematic in terms of data privacy and copyright. Known as an extraction attack, this method reconstructs training images using only a model’s generated samples. As the original work requires on the order of gpu-years to perform, we provide a pipeline that can run in gpu-days and can extract a similar number of images. We first de-duplicate the public dataset LAION-2B and demonstrate a high level of duplicated images. We then provide whitebox and blackbox extraction attacks on par with the original attack, whilst requiring significantly less network evaluations. As we can evaluate more samples, we expose the phenomenon of template copies, wherein a diffusion model copies a fixed image region and varies another. We demonstrate that new diffusion models that deduplicate their training set do not generate exact copies as in Carlini et al., but do generate templates. We conclude with several insights into copied images from a data perspective.
1 view
0
0
3 months ago 00:18:50 1
The Power of Diesel: Inside the Engine | Shell Historical Film Archive
3 months ago 00:00:26 9
Rowing on a Rowing Machine: Strength and Cardio Training on TYTAX
3 months ago 00:13:43 14
Волновой Редуктор с ПТК 1:17 | Лучший редуктор на 3D принтере?
3 months ago 00:26:12 1
How to Remember Everything You Read
3 months ago 00:30:04 1
Spectacular WW2 shell impacts at Gouda! Explore with me, ride along from Gouda to The Hague! 8/10/24
3 months ago 00:01:22 3
Creating Efficient Digital Doubles for VFX with Pat Imrie
3 months ago 00:05:58 1
Skinny Strong: How it Happens and a Technique (.) for achieving it
3 months ago 00:06:35 1
How the World’s Largest Shipyard Is Challenging China’s Dominance | WSJ
3 months ago 01:07:08 1
🇯🇵 TOKYO JAPAN, FUTURISTIC JAPAN UPSIDE-DOWN TRAIN, RIDING THE WORLD’S LONGEST SKY TRAIN, CHIBA
3 months ago 00:06:05 1
Quickly Modify Schematic Symbols with Ultra Librarian and EDABuilder!
3 months ago 00:31:17 1
АЭРОБИКА для занятия дома под ритмичную музыку🔥 Aerobics dance exercise | aerobics for beginners