Analytic Catalog

J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News

J-Guard is capable of steering existing supervised AI text detectors for detecting AI-generated news while boosting adversarial robustness. By incorporating stylistic cues inspired by the unique journalistic attributes, J-Guard effectively distinguishes between real-world journalism and AI-generated news articles. Our experiments on news articles generated by a vast array of AI models, including ChatGPT (GPT3.5), demonstrate the effectiveness of J-Guard in enhancing detection capabilities while maintaining an average performance decrease of as low as 7% when faced with adversarial attacks.

Supported media types: Text

Contact

Tharindu Kumarage (Arizona State University) kskumara@asu.edu

Arslan Basharat (Kitware) semafor-sid-software@kitware.com

Resource Links

Source code: https://github.com/TSKumarage/J-Guard

License: MIT License