Analytic Catalog
J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News
J-Guard is capable of steering existing supervised AI text detectors for detecting AI-generated news while boosting adversarial robustness. By incorporating stylistic cues inspired by the unique journalistic attributes, J-Guard effectively distinguishes between real-world journalism and AI-generated news articles. Our experiments on news articles generated by a vast array of AI models, including ChatGPT (GPT3.5), demonstrate the effectiveness of J-Guard in enhancing detection capabilities while maintaining an average performance decrease of as low as 7% when faced with adversarial attacks.
Supported media types: Text
Contact
Tharindu Kumarage (Arizona State University) kskumara@asu.edu
Arslan Basharat (Kitware) semafor-sid-software@kitware.com
Resource Links
Source code: https://github.com/TSKumarage/J-Guard
License: MIT License