Google is embedding inaudible watermarks right into its AI generated music

L4sBot@lemmy.world · 8 months ago

Google is embedding inaudible watermarks right into its AI generated music

SuckMyWang@lemmy.world · edit-2 8 months ago

it does this by converting the audio into a 2d visualisation that shows how the spectrum of frequencies evolves in a sound over time

Old school windows media player has entered the chat

Seriously fuck off with this jargon, it doesn’t explain anything

Terminarchs@slrpnk.net · 8 months ago

That’s actually an accurate description of what is happening: an audio file turned into a 2d image with the x axis being time, the y axis being frequency and color being amplitude.

RufusLoacker@feddit.it · 8 months ago

That’s literally a spectrograph

Terminarchs@slrpnk.net · 8 months ago

Spectrogram*

Viking_Hippie@lemmy.world · 8 months ago

Your mom’s literally a spectrograph.

SuckMyWang@lemmy.world · 8 months ago

I know, it’s like the old windows media player visualisations.

FishFace@lemmy.world · 8 months ago

Sounds like a bad journalist hasn’t understood the explanation. A spectrogram contains all the same data as was originally encoded. I guess all it means is that the watermark is applied in the frequency domain.

datavoid@lemmy.ml · edit-2 8 months ago

Also this isn’t new by any stretch… Aphex Twin would like a word

FishFace@lemmy.world · 8 months ago

Well, encoding stuff in the spectrogram isn’t new, sure. But encoding stuff into an audio file that is inaudible but robust to incidental modifications to the file is much harder. Aphex Twin’s stuff is audible!

SuckMyWang@lemmy.world · edit-2 8 months ago

I would like to know what it is that makes it so robust. The article explains very little. Is it in the high frequencies? Higher than the human ear can hear? Compression will effect that plus that’s going to piss dogs off. Could be something with the phasing too. Filters and effects might be able to get rid of the water mark

FishFace@lemmy.world · 8 months ago

I don’t know what frequencies are annoying for dogs but I’m guessing it’s above 24kHz so no sound file or sound system is going to be able to store or produce it anyway.

There will certainly be some way to get rid of the watermark. But it might nevertheless persist through common filters.