FreqMark: Frequency-Based Watermark for Sentence-Level Detection of LLM-Generated Text

Xu, Zhenyu; Zhang, Kun; Sheng, Victor S.

Computer Science > Computation and Language

arXiv:2410.10876 (cs)

[Submitted on 9 Oct 2024]

Title:FreqMark: Frequency-Based Watermark for Sentence-Level Detection of LLM-Generated Text

Authors:Zhenyu Xu, Kun Zhang, Victor S. Sheng

View PDF HTML (experimental)

Abstract:The increasing use of Large Language Models (LLMs) for generating highly coherent and contextually relevant text introduces new risks, including misuse for unethical purposes such as disinformation or academic dishonesty. To address these challenges, we propose FreqMark, a novel watermarking technique that embeds detectable frequency-based watermarks in LLM-generated text during the token sampling process. The method leverages periodic signals to guide token selection, creating a watermark that can be detected with Short-Time Fourier Transform (STFT) analysis. This approach enables accurate identification of LLM-generated content, even in mixed-text scenarios with both human-authored and LLM-generated segments. Our experiments demonstrate the robustness and precision of FreqMark, showing strong detection capabilities against various attack scenarios such as paraphrasing and token substitution. Results show that FreqMark achieves an AUC improvement of up to 0.98, significantly outperforming existing detection methods.

Subjects:	Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2410.10876 [cs.CL]
	(or arXiv:2410.10876v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.10876

Submission history

From: Zhenyu Xu [view email]
[v1] Wed, 9 Oct 2024 05:01:48 UTC (1,547 KB)

Computer Science > Computation and Language

Title:FreqMark: Frequency-Based Watermark for Sentence-Level Detection of LLM-Generated Text

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:FreqMark: Frequency-Based Watermark for Sentence-Level Detection of LLM-Generated Text

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators