Sampled from AudioSet
WAKE is the first key-controllable audio watermark framework, which aims to embed watermarks into audio and decode the embedded watermarks using specific keys, as shown in the following Figure. If an incorrect key is used, it will be impossible to decode the correct watermark, substantially enhancing the watermarking system's security and scalability while also fulfilling personalized watermarks. Notably, WAKE can achieve multiple watermark embeddings and corresponding watermark decoding based on the key used during embedding. WAKE outperforms the current state-of-the-art audio watermarking models in watermarked audio quality and decoding performance.
Sampled from AudioSet
| Sample 1 | Sample 2 | Sample 3 | Sample 4 | Sample 5 | Sample 6 | Sample 7 | Sample 8 | |
|---|---|---|---|---|---|---|---|---|
| Origin Audio | ||||||||
| AudioSeal (single watermark) | ||||||||
| WavMark (single watermark) | ||||||||
| WAKE (single watermark) | ||||||||
| AudioSeal (double watermark) | ||||||||
| WavMark (double watermark) | ||||||||
| WAKE (double watermark) |
Sampled from LibriSpeech.
| Sample 1 | Sample 2 | Sample 3 | Sample 4 | Sample 5 | Sample 6 | Sample 7 | Sample 8 | |
|---|---|---|---|---|---|---|---|---|
| Origin Audio | ||||||||
| AudioSeal (single watermark) | ||||||||
| WavMark (single watermark) | ||||||||
| WAKE (single watermark) | ||||||||
| AudioSeal (double watermark) | ||||||||
| WavMark (double watermark) | ||||||||
| WAKE (double watermark) |
Sampled from CommonVoice.
| Sample 1 | Sample 2 | Sample 3 | Sample 4 | Sample 5 | Sample 6 | Sample 7 | Sample 8 | |
|---|---|---|---|---|---|---|---|---|
| Origin Audio | ||||||||
| AudioSeal (single watermark) | ||||||||
| WavMark (single watermark) | ||||||||
| WAKE (single watermark) | ||||||||
| AudioSeal (double watermark) | ||||||||
| WavMark (double watermark) | ||||||||
| WAKE (double watermark) |
Sampled from FMA.
| Sample 1 | Sample 2 | Sample 3 | Sample 4 | Sample 5 | Sample 6 | Sample 7 | Sample 8 | |
|---|---|---|---|---|---|---|---|---|
| Origin Audio | ||||||||
| AudioSeal (single watermark) | ||||||||
| WavMark (single watermark) | ||||||||
| WAKE (single watermark) | ||||||||
| AudioSeal (double watermark) | ||||||||
| WavMark (double watermark) | ||||||||
| WAKE (double watermark) |
Sampled from outside the train/test dataset.
| Sample 1 | Sample 2 | Sample 3 | Sample 4 | Sample 5 | Sample 6 | Sample 7 | Sample 8 | |
|---|---|---|---|---|---|---|---|---|
| Origin Audio | ||||||||
| AudioSeal (single watermark) | ||||||||
| WavMark (single watermark) | ||||||||
| WAKE (single watermark) | ||||||||
| AudioSeal (double watermark) | ||||||||
| WavMark (double watermark) | ||||||||
| WAKE (double watermark) |
Sampled Randomly
| Sample 1 | Sample 2 | Sample 3 | |
|---|---|---|---|
| Origin Audio |
|
|
|
| AudioSeal (single watermark) |
|
|
|
| WavMark (single watermark) |
|
|
|
| WAKE (single watermark) |
|
|
|
| AudioSeal (double watermark) |
|
|
|
| WavMark (double watermark) |
|
|
|
| WAKE (double watermark) |
|
|
|
Sampled from AudioSet.
| Origin Audio | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AudioSeal | |||||||||||
| WavMark | |||||||||||
| WAKE |
Sampled from LibriSpeech.
| Origin Audio | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AudioSeal | |||||||||||
| WavMark | |||||||||||
| WAKE |
Sampled from FMA.
| Origin Audio | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AudioSeal | |||||||||||
| WavMark | |||||||||||
| WAKE |
Sampled from CommonVoice.
| Origin Audio | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AudioSeal | |||||||||||
| WavMark | |||||||||||
| WAKE |
Sampled Randomly
| AudioSeal | WavMark | WAKE | |
|---|---|---|---|
| Origin Audio |
|
|
|
| Watermark 1 time |
|
|
|
| Watermark 2 times |
|
|
|
| Watermark 3 times |
|
|
|
| Watermark 4 times |
|
|
|
| Watermark 5 times |
|
|
|
| Watermark 6 times |
|
|
|
| Watermark 7 times |
|
|
|
| Watermark 8 times |
|
|
|
| Watermark 9 times |
|
|
|
| Watermark 10 times |
|
|
|