Analytic Catalog

Synthetic Audio Detection using Spectrogram Transformer With 18 sec Windowing

This component reads audio files, formats them as spectrograms, and then detects whether they are synthesized or authentic audio with a trained Patchout faSt Spectrogram Transformer (PaSST). This component implements a windowing approach with window length 18 seconds and takes maximum to fuse decision of all windows.

Supported media types: Audio

Contact

Kratika Bhagtani kbhagtan@purdue.edu

Edward J. Delp ace@purdue.edu

Resource Links

Source code: To be published soon

License: Apache-2.0