Burger menu

Text Watermarking: Definition, General Approaches, Application

Digital text watermarking is an effective technique against copyright infringement, plagiarism, and dishonest GenAI usage.

What Is Text Watermarking?

Overview of the digital watermarking process

Text watermarking is a technology, which allows copyright protection with the help of special metadata inserted into a copy of a text. This metadata contains details on authorship, owner status, publication date, and other info that can be extracted to avoid unauthorised distribution, counterfeiting, intellectual theft, and so on.

An early digital watermarking algorithm

While the technology initially emerged in 13th century Italy, it wasn’t until 1992 when the concept of digital watermark was introduced by Andrew Tirkel and Charles Osborne. The advent of Large Language Models (LLMs) made it necessary to develop failproof watermarking techniques to prevent, among all else, academic fraud and dishonesty.

The concept of digital watermarking visualised

What Is the Difference between Text Watermarking and Natural Language Watermarking?

Text watermarking techniques taxonomy

Digital watermarking refers to embedding a hidden sequence of bits that can be extracted with a key later to identify origins of the media. However, with the natural language it requires additional manipulations on the levels of:

  • Surface. Manipulation of how the text looks.
  • Syntax. Reordering words and phrases in the text.
  • Semantics. Replacing words with synonyms, etc.

 With that in mind, a protected text shouldn’t lose its original structure, meaning, and language.

Text Watermarking Approaches

To address the issue, a number of methods are suggested.

Text watermarking techniques overview
  1. Structural-Based Approach

The method manipulates either structure or features of the text to insert a watermark. It can include repeating letters in a word, sentence or word shifting upwards/downwards, word distance analysis, analyzing the length of preceding and following words that surround a specific keyword, and so on.

  1. Linguistic-Based Approach

This family of techniques focuses on syntactic and semantic components of the text that are manipulated to hide a watermark. They include grammatical changes that don’t impact the initial meaning of a writing, reshuffling of the word order, changing active clause into passive, usage of typos or acronyms, noun-verb transformation, and others.

Noun-verb sentence tree
  1. Image-Based Approach
Image-based watermarking technique

If a watermark is represented as a certain symbol — a logo for example — it can be secretly inserted inside the cover text. The image of the watermark needs to be converted into text string. Then, it is aligned with the visual parameters of the text in question, so the watermark can be generated. 

Digital watermarks are basically invisible to the human eye
  1. LLM-Based Approach

This is a subset of methods that rely on a Large Language Model (LLM). Watermark creation includes threes stages in this case: 

  1. Training an LLM on a specific dataset that suits the task.
  2. Logits generation to predict vocabulary’s probability distribution.
  3. Token sampling for generating a watermark token.

They include token-level, sentence-level, trigger-based, and other types of watermarking, 

Text Watermarking and Natural Language Watermarking Application

Real-life application of text watermarking
  1. Copyright Protection

Watermarks help avoid copyright infringement since it’s easy to track down the source text. For that purpose, mostly format-based techniques are used: text color, word spacing, font manipulations, and others as they don’t alter the written content. Backdoor watermarking and word substitution are used for protecting LLMs and training datasets respectively.

  1. AI Generated Text Detection

It is suggested that detecting AI-produced texts may become virtually impossible in the foreseeable future. In turn, this can negatively affect academic integrity, as students and authors may automatically generate content and pass it as an original work. Watermarks covertly embedded by LLM models, can signal about the true origin of a synthetic text and prevent academic or other types of fraud.

Text Watermarking in Different Languages

The only flexible watermarking technique for multilingual content is format-based — it doesn't focus on linguistic/structural properties of a text, as they can radically differ from one language to another. For instance, an English language-based watermarking method can be adjusted to Portuguese or Spanish fairly easily as their alphabets are based on Latin.  

Try our AI Text Detector

Avatar Antispoofing

1 Followers

Editors at Antispoofing Wiki thoroughly review all featured materials before publishing to ensure accuracy and relevance.

Article contents

Hide

More from AI Generated Content

avatar Antispoofing Bias against Non-Native English Writers in AI-Detectors

The problem of bias — related to age, ethnicity, or race — has been discussed in relation to AI since…

avatar Antispoofing Plagiarism in AI-Generated Texts

Can AI-Generated Text Be Detected by Plagiarism Checkers? Plagiarism concerns in regard to GenAI have been raised repeatedly, especially since…

avatar Antispoofing Methods of Natural Language Watermarking

What Are the Main Methods of Natural Language Watermarking (NLW)? NLW is a technique that protects a writing with a…

avatar Antispoofing Natural Language Watermarking: General Principles, Techniques, Applications, Challenges

What Is Natural Language Watermarking (NLW)? Natural Language Watermarking (NLW) is a technique, which helps labeling a text with a…