Reversible natural language watermarking using synonym substitution and arithmetic coding
For protecting the copyright of a text and recovering its original content harmlessly, this paper proposes a novel reversible natural language watermarking method that combines arithmetic coding and synonym substitution operations. By analyzing relative frequencies of synonymous words, synonyms empl...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/106752 http://hdl.handle.net/10220/48952 https://doi.org/10.3970/cmc.2018.03510 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-106752 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1067522019-12-06T22:17:38Z Reversible natural language watermarking using synonym substitution and arithmetic coding Hao, Wei Xiang, Lingyun Li, Yan Yang, Peng Shen, Xiaobo School of Computer Science and Engineering Arithmetic Coding DRNTU::Engineering::Computer science and engineering Synonym Substitution For protecting the copyright of a text and recovering its original content harmlessly, this paper proposes a novel reversible natural language watermarking method that combines arithmetic coding and synonym substitution operations. By analyzing relative frequencies of synonymous words, synonyms employed for carrying payload are quantized into an unbalanced and redundant binary sequence. The quantized binary sequence is compressed by adaptive binary arithmetic coding losslessly to provide a spare for accommodating additional data. Then, the compressed data appended with the watermark are embedded into the cover text via synonym substitutions in an invertible manner. On the receiver side, the watermark and compressed data can be extracted by decoding the values of synonyms in the watermarked text, as a result of which the original context can be perfectly recovered by decompressing the extracted compressed data and substituting the replaced synonyms with their original synonyms. Experimental results demonstrate that the proposed method can extract the watermark successfully and achieve a lossless recovery of the original text. Additionally, it achieves a high embedding capacity. Published version 2019-06-26T05:16:58Z 2019-12-06T22:17:38Z 2019-06-26T05:16:58Z 2019-12-06T22:17:38Z 2018 Journal Article Xiang, L., Li, Y., Hao, W., Yang, P., & Shen, X. (2018). Reversible natural language watermarking using synonym substitution and arithmetic coding. Computers, Materials & Continua, 55(3), 541-559. doi:10.3970/cmc.2018.03510 1546-2218 https://hdl.handle.net/10356/106752 http://hdl.handle.net/10220/48952 https://doi.org/10.3970/cmc.2018.03510 en Computers, Materials & Continua © 2018 Tech Science Press. All rights reserved. This paper was published in Computers, Materials & Continua and is made available with permission of Tech Science Press. 19 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Arithmetic Coding DRNTU::Engineering::Computer science and engineering Synonym Substitution |
spellingShingle |
Arithmetic Coding DRNTU::Engineering::Computer science and engineering Synonym Substitution Hao, Wei Xiang, Lingyun Li, Yan Yang, Peng Shen, Xiaobo Reversible natural language watermarking using synonym substitution and arithmetic coding |
description |
For protecting the copyright of a text and recovering its original content harmlessly, this paper proposes a novel reversible natural language watermarking method that combines arithmetic coding and synonym substitution operations. By analyzing relative frequencies of synonymous words, synonyms employed for carrying payload are quantized into an unbalanced and redundant binary sequence. The quantized binary sequence is compressed by adaptive binary arithmetic coding losslessly to provide a spare for accommodating additional data. Then, the compressed data appended with the watermark are embedded into the cover text via synonym substitutions in an invertible manner. On the receiver side, the watermark and compressed data can be extracted by decoding the values of synonyms in the watermarked text, as a result of which the original context can be perfectly recovered by decompressing the extracted compressed data and substituting the replaced synonyms with their original synonyms. Experimental results demonstrate that the proposed method can extract the watermark successfully and achieve a lossless recovery of the original text. Additionally, it achieves a high embedding capacity. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Hao, Wei Xiang, Lingyun Li, Yan Yang, Peng Shen, Xiaobo |
format |
Article |
author |
Hao, Wei Xiang, Lingyun Li, Yan Yang, Peng Shen, Xiaobo |
author_sort |
Hao, Wei |
title |
Reversible natural language watermarking using synonym substitution and arithmetic coding |
title_short |
Reversible natural language watermarking using synonym substitution and arithmetic coding |
title_full |
Reversible natural language watermarking using synonym substitution and arithmetic coding |
title_fullStr |
Reversible natural language watermarking using synonym substitution and arithmetic coding |
title_full_unstemmed |
Reversible natural language watermarking using synonym substitution and arithmetic coding |
title_sort |
reversible natural language watermarking using synonym substitution and arithmetic coding |
publishDate |
2019 |
url |
https://hdl.handle.net/10356/106752 http://hdl.handle.net/10220/48952 https://doi.org/10.3970/cmc.2018.03510 |
_version_ |
1681048094430461952 |