What We Need From Watermarking�Research to Scale Image Provenance��Pierre Fernandez,
Meta, FAIR
APAI @ CVPR2026
Outline
2
- Part 1 -
Introduction to Watermarking
Watermarking
010100 … 100
Watermark
Watermarked content�(Transformed)
Watermarked content�(Slightly modified)
010100 … 100
Watermarking
Transformed when �transmitted from user to user
Reveal 🔍
Content
Traditionally used for IP protection.
4
Watermarking
010100 … 100
Watermark
Watermarked content�(Transformed)
Watermarked content�(Slightly modified)
010100 … 100
Watermarking
Transformed when �transmitted from user to user
Reveal 🔍
Content
Traditionally used for IP protection.�Can it be used to for Generative AI? Why? How?
5
GenAI comes with risks
Opinion swaying, scam, fraud, internet pollution
6
GenAI comes with mushrooms
Opinion swaying, scam, fraud, internet pollution
7
GenAI comes with mushrooms
Opinion swaying, scam, fraud, internet pollution
8
Forensics / Deepfake detection
Passive detection: hard, and will get harder
‘AI generated?’ ✔ / ✗
🔍
Midjourney
Classifier
9
Embedding provenance in metadata
Metadata:
Gen AI
Photography
Social platforms
Author: …
Date: …�Location: …
AI-generated: …
🔗
10
Fingerprinting
Map each data point to a vector representation, that serves as “fingerprint”
1/ Save the fingerprint of every generated image
2/ Detect if there is a match
�Database
of Gen-AI
content
Match?
→ Needs to store everything
→ Does not scale well
11
Watermarking
1/ Embed a watermark in all generated content
2/ Detect the watermark
Watermarked ✔ / ✗
🔍
Midjourney
WM Extractor
Watermarked image
12
Regulation
[White House Executive order, June 2023] → commit to label AI-generated content. Canceled
[EU AI Act, March 2024] → compulsory watermarking for general purpose AI (GPAI) by May 2025� Currently: Code of Practice
[California Provenance, Authenticity and Watermarking Standards, June 2024] → “requires a GenAI provider to place an imperceptible and maximally indelible watermark into synthetic content”
13
Watermarking can be done at different stages
Digital forensics
Training data
Model training
Model inference
Output data
✗
AI generated?�✔ / ✗
Detect
Generator
14
Watermarking can be done at different stages
Training data
Model training
Model inference
Output data
AI generated?�✔ / ✗
Post-hoc watermarking
Metadata
✗
✗
Embed
Extract
Generator
15
Watermarking can be done at different stages
Training data
Model training
Model inference
Output data
✗
✗
Generation-time WM�Out-of-model
AI generated?�✔ / ✗
Extract
Generator
16
Watermarking can be done at different stages
Training data
Model training
Model inference
Output data
✗
Generation-time WM�In-model
AI generated?�✔ / ✗
Extract
Generator�with WM
Generator
17
Watermarking can be done at different stages
Training data
Model training
Model inference
Output data
✗
AI generated?�✔ / ✗
Extract
Generator
with WM
18
Watermarking can be done at different stages
Digital forensics
Training data
Model training
Model inference
Output data
Post-hoc watermarking
✗
✗
Metadata
✗
✗
✗
Generation-time WM�Out-of-model
Generation-time WM�In-model
19
- Part 2 -
Watermarking AI-Generated Content
Post-hoc
Watermarking with deep neural networks
[📄 Zhu, Jiren, Russell Kaplan, Justin Johnson, et Li Fei-Fei. “HiDDeN: Hiding Data with Deep Networks”. In ECCV, 2018.]
[📄Ahmadi, Mahdi, Alireza Norouzi, Nader Karimi, Shadrokh Samavi, and Ali Emami. "ReDMark: Framework for residual diffusion watermarking based on deep networks." Expert Systems with Applications (2020).]
Jointly trains 2 deep neural networks to embed/extract watermarks:
Embedder
Watermarked
Original
0100101001
Augmented
Random transform
0100101001
Extractor
22
Watermarking with deep neural networks
[📄 Zhu, Jiren, Russell Kaplan, Justin Johnson, et Li Fei-Fei. « HiDDeN: Hiding Data with Deep Networks ». In ECCV, 2018.]
[📄Ahmadi, Mahdi, Alireza Norouzi, Nader Karimi, Shadrokh Samavi, and Ali Emami. "ReDMark: Framework for residual diffusion watermarking based on deep networks." Expert Systems with Applications (2020).]
Jointly trains 2 deep neural networks to embed/extract watermarks:
Embedder
Watermarked
Original
0100101001
ℓwatermark
Augmented
Random transform
ℓpercep
0100101001
Extractor
Imperceptibility
Robustness
23
Embedder-extractor based approaches
A lot of works expand this setup:
24
PixelSeal training and inference pipeline
[📄 Soucek et al. “Pixel Seal: Adversarial-only training for invisible image and video watermarking”]
Embedder
Resize�to 256x256
Resize �to original
JND
Extractor
Resize�to 256x256
0101..001
0101..001
Augment
ℓmsg
Discriminator
ℓadv
25
Example
256 bits - target PSNR ≈ 48dB
26
How do we do detection?
Generated by our model
Ext.
m : 11111…01
Hidden message �m’ : 11111…11
→ 95 /100
27
How do we do detection?
Generated by our model
Ext.
m : 11111…01
Random m: 10100…11
Natural image
Ext.
Hidden message �m’ : 11111…11
→ 95 /100
→ 51 / 100
28
How do we do detection?
Ext.
Test: bit accuracy (m,m’) > 𝜏 ex: 𝜏 = 80%
Generated by our model
Ext.
m : 11111…01
Random m: 10100…11
Natural image
Hidden message �m’ : 11111…11
→ 95 /100
→ 51 / 100
𝜏
bit accuracy
FPR
29
Video watermarking with neural nets
Video Seal
30
Audio watermarking with neural nets
Same!�[📄 WavMark] [📄 SilentCipher] [📄 Maskmark]
Embedder
Watermarked
Original
0100101001
ℓwatermark
Attacked
Random transform
ℓpercep
0100101001
Extractor
31
AudioSeal: Localized audio watermarking
Extractor
110110001
WM?
✔ or ✗
1.0
time steps
watermark detection�probability
0.0
✔
✗
Detector
📄 San Roman et al., Proactive Detection of Voice Cloning with Localized Watermarking , ICML 2024
32
Aside: Attacks on the watermark extractor
Watermark
Extractor
White-box: we know everything
Extremely easy to attack
Black-box: we know nothing
Hard to attack �(but not impossible)
33
Example of attacks
Diffusion model
+
DiffPure
Auto-encoder
VAE Regeneration�/ Neural compression
34
Semantic watermarking
Post-hoc encoder/decoder:�→ high PSNR regimes,�→ limited in the amount of pixel change,�→ harder for adversarial attacks
What if we could semantically watermark the image?��Low PSNR - harder to remove
35
Generation-time
Watermarking for LLMs
← WM happens here
FA(-5)IR (-4) is (-3) a (-2) great (-1)
Sample
Context (tokens)
lab (0)
research(0)
WM �Sample
Generation with LLMs
LLM
logits l = ( l1 …lV )
…
lab research
…
banana
37
Watermarking for LLMs
📄 Kirchenbauer et al., A Watermark for Large Language Models, ICML 2023
📄 Aaronson et al., Watermarking GPT Outputs, 2022
38
Stable Signature
Diffusion
Model
WM�Extractor
“Fine-tune LDM decoder s.t. �every generated image is directly watermarked”
‘Tahiti mountains, in the style of Gauguin’
…
WM�Decoder
AI generated?
✔ / ✗
Original�Decoder
❄️
Fine-tuned �before distribution
Watermarked
[📄 Fernandez et al. , The stable signature: Rooting watermarks in latent diffusion models, ICCV23]
39
Generation quality
Quantitative results:��10k generated �512x512 images
Original model
Watermarked
Difference
40
DistSeal
Diffusion
Model
WM�Extractor
‘Tahiti mountains, in the style of Gauguin’
AI generated?
✔ / ✗
Watermarked
WM�Embedder
Decoder
…
0100101001
[📄 Rebuffi et al. , Learning to Watermark in the Latent Space of Generative Models , ICML26]
41
Tree-Ring Watermarks
📄 Wen et al., Tree-Rings Watermarks: Invisible Fingerprints for Diffusion Images, �NeurIPS 2023
‘Tahiti mountains, in the style of Gauguin’
…
Decoder
AI generated?
✔ / ✗
Watermarked
WM
Invert diffusion process (iterative)
42
Same thing for auto-regressive models
Watermark Detection
Detokenizer
2
9
1
4
7
?
8
Autoregressive
Model
Watermarked�Sampling
Tokenizer
Watermarked!
p-value: 1.2 x 10-7
Generated Content
…
The Black Minorca is a rare and historic breed originating from the Balearic Islands of Spain. Here's an image of a Black Minorca hen, showcasing its signature green sheen on its feathers:
What
is
Black
Minorca
?
2
9
1
4
7
8
📄 Jovanovic et al., Watermarking AR Image Generation, NeurIPS 2025�📄 Wu et al., Robust Distortion-Free Watermark for Autoregressive Audio Generation Models, NeurIPS 2025�📄 Tong et al., Training-Free Watermarking for Autoregressive Image Generation, arXiv 2025�📄 Hui et al., Autoregressive Images Watermarking through Lexical Biasing: An Approach Resistant to Regeneration Attack, arXiv 2025�📄 Müller et al., On the Robustness of Watermarking for Autoregressive Image Generation, arXiv 2026�📄 Yilmaz et al., UniMark: Unified Adaptive Multi-bit Watermarking for Autoregressive Image Generators, arXiv 2026
43
Qualitative
Without watermarking
With watermarking
📄 Jovanovic et al., Watermarking AR Image Generation, NeurIPS 2025
44
Qualitative: Semantic Watermarking
Without watermarking
With watermarking
45
Qualitative: Semantic Watermarking
Without watermarking
With watermarking
→ More green tokens
46
Robustness (vs SOTA post-hoc)
→ Robust to removal attacks (Adv.) and neural compression (NC)� (SOTA post-hoc watermarks often break here)
Autoencoder
NC
Diffusion steps
Adv.�(DiffPure)
+
47
- Part 3 -
Roadblocks and Discussions
Invisible watermarking in prod.
✔
✗
✔
✔
✔
✔
✔
✔
✗
Other companies:
49
Invisible watermarking in prod.
✔
✗
✔
✔
✔
✔
✔
✔
✗
Other companies:
→ Post-hoc almost all the time (except for text). Why?
50
Revisiting the 3 criteria
Imperceptibility
Payload
Robustness
51
Revisiting the 3 criteria
Imperceptibility
Payload
Robustness
?
?
?
52
Example of Tree-Ring
⚠️ Meta updates the generative model.� → x2 detection compute
Develops a generative model and want to watermark the outputs.�⚠️ WM team convinces that it’s better to tweak GenAI model/ sampling. � → Uses Tree-Ring.
⚠️ Detection? iterative, ≈10s � → Scaling detection to FB/IG is impossible� → Giving API access to detection very costly
⚠️ Meta owns photo devices ( ). � Will have to develop post-hoc models anyway
…
Generative model
Generative model 2
Generative model
53
Example of Tree-Ring
⚠️ Meta updates the generative model.� → x2 detection compute
Develops a generative model and want to watermark the outputs.�⚠️ WM team convinces that it’s better to tweak GenAI model/ sampling. � → Uses Tree-Ring.
⚠️ Detection? iterative, ≈10s � → Scaling detection to FB/IG is impossible� → Giving API access to detection very costly
⚠️ Meta owns photo devices ( ). � Will have to develop post-hoc models anyway
Organizational
Efficiency
Adaptability (backward compatibility)
Adaptability (accross use-cases)
…
Generative model
Generative model 2
Generative model
54
Adaptability: Organizational roadblocks
The company must be able to change the generative model.
�� → any fine-tuning diffusion or autoregressive model needs to be done � at the end adds complexity to the generative pipeline��� → changing generative model is harder: � people doing the generative models do not like when you touch the sampling
→ Post-hoc watermarking
55
Adaptability: Modularity
In companies, constraints are not the same depending on where the WM is embedded.�→ on device, e.g. cameras, phones, for marking real content�→ on GPUs for marking AI-generated content ��Could we have different embedders trained with a single extractor?
Embedder�(Real)�(small, on device)
Embedder�(GenAI)
Watermarked
Original
0100101001
0100101001
Single Extractor
Watermarked
AI Gen
0100101001
?
56
Adaptability: Updates and backward compatibility
Company updates the embedder:
Could it keep the same extractor as before?��
57
Security issues
Watermark
Extractor
White-box: we know everything
Extremely easy to attack
Black-box: we know nothing
Hard to attack �(but not impossible)
58
Example of attacks
Regeneration attacks�e.g. DiffPure
Diffusion model
+
Gradient-based�Adversarial attack
Watermark
Extractor
+
optimize such that:�- the message is removed�- the message is forged
59
Robustness ≠ Security
Robustness
“Degradation is due to a classical content processing (compression, low-pass filtering, noise addition, geometric attack… ).”
📄 Cayre et al.: Watermarking Security: Theory and Practice, 2005
Security
“the inability by unauthorized users to access [i.e. to remove, to read, or to write the hidden message] the communication channel”
60
What about white-box attacks?
→ Interoperability��Detection of 3rd party content requires cross-industry collaboration
61
Public facing detectors
How to communicate detection results:
→ API. What are the best ways to protect?
→ OSS extractor. Is this even possible?
62
Main takeaways
Watermarking:
THANKS!
slides are on my website
MetaSeal website
63