Analytic Catalog
Synthetic Audio Detection using Spectrogram Transformer With 18 sec Windowing
This component reads audio files, formats them as spectrograms, and then detects whether they are synthesized or authentic audio with a trained Patchout faSt Spectrogram Transformer (PaSST). This component implements a windowing approach with window length 18 seconds and takes maximum to fuse decision of all windows.
Supported media types: Audio
Contact
Kratika Bhagtani kbhagtan@purdue.edu
Edward J. Delp ace@purdue.edu
Resource Links
Source code: To be published soon
License: Apache-2.0