Rights Protection for Natural Language Text
Related Papers by various authors (in reverse chronological order, and classified by topic)
Overview
Internet has become one of the main sources of knowledge acquisition
harboring online newspapers, advertisements, web portals for
scientific documents, personal blogs, encyclopedias etc. Even though
being able to search and access immense amount of knowledge online has
become a part of everyday life, it is still an open question as to how
the authors or owners of digital text will have control on how their
data is distributed or re-used. Rights management problems are more
serious for text than they are for image, video and audio since it is
much easier for users to download and manipulate copyrighted text and
re-use it free from control.
What is needed is a rights protection system that ``travels with the
content'', i.e. a technology that can protect the content even after
it is decrypted or an existing digital signature is separated from the
document in another way. Digital watermarking is an information hiding
mechanism that embeds the copyright information in the
document. Besides traveling with the content of the documents, digital
watermarks are also imperceptible, making the process of removing it
from the document challenging.
Our focus is on using the linguistic features of the sentence
constituents in natural language text in order to insert information
(i.e. watermark, meta-data, fingerprint etc.). This approach is
different from techniques, collectively referred to as ``text
watermarking,'' which embed information by modifying the appearance of
text elements, such as lines, words, or characters.
Natural Language Watermaking
o H. M. Meral, E. Sevinc, E. Unkar, B. Sankur, A. S. Ozsoy, T. Gungor, "Syntactic Tools for Text Watermarking", Proceedings of the SPIE International Conference on Security, Steganography, and Watermarking of Multimedia Contents, January 29- February 1, 2007, San Jose, CA.
o M. Topkara, U. Topkara, M. J. Atallah, "Information Hiding through Errors: A Confusing Approach", Proceedings of the SPIE International Conference on Security, Steganography, and Watermarking of Multimedia Contents, January 29- February 1, 2007, San Jose, CA.
o M. Topkara, U. Topkara, M. J. Atallah, "Words Are Not Enough: Sentence Level Natural Language Watermarking", Proceedings of ACM Workshop on Content Protection and Security (in conjuction with ACM Multimedia), Santa Barbara, CA, October 27, 2006.
o U. Topkara, M. Topkara, M. J. Atallah, "The Hiding Virtues of Ambiguity: Quantifiably Resilient Watermarking of Natural Language Text through Synonym Substitutions" , Proceedings of ACM Multimedia and Security Workshop, Geneva, Switzerland, September 26-27, 2006.
o M. Topkara, G. Riccardi, D. Hakkani-Tur, M. J. Atallah, "Natural Language Watermarking: Challenges in Building a Practical System", Proceedings of the SPIE International Conference on Security, Steganography, and Watermarking of Multimedia Contents, January 15 - 19, 2006, San Jose, CA. Slides
o Yuling Liu, Xingming Sun, Yong Wu. "A Natural Language Watermarking Based on Chinese Syntax", Lecture Notes in Computer Science (LNCS) 3612: 968-997, August 2005.
o M. Topkara, C. Taskiran and E. J. Delp, "Natural Language Watermarking", Proceedings of the SPIE International Conference on Security, Steganography, and Watermarking of Multimedia Contents, January 17 - 21, 2005, San Jose, CA. Slides
o Xingming Sun , Gang Luo , Huajun Huang, "Component-based digital watermarking of Chinese texts", Proceedings of the 3rd international conference on Information security, November 14-16, 2004, Shanghai, China
o M. Atallah, V. Raskin, C. F. Hempelmann, M. Karahan, R. Sion, U. Topkara, K. E. Triezenberg, "Natural Language Watermarking and Tamperproofing", Fifth Information Hiding Workshop, IHW 2002, LNCS 2578, Springer Verlag, October 2002, Noordwijkerhout, The Netherlands
o M. Atallah, V. Raskin, M. C. Crogan, C. F. Hempelmann, F. Kerschbaum, D. Mohamed, S. Naik, "Natural Language Watermarking: Design, Analysis, and a Proof-of-Concept Implementation ", Fourth Information Hiding Workshop, IHW 2001, Pittsburgh, PA
Natural Language Text Steganography and Steganalysis
o Liu Yuling, Sun Xingming, Gan Can, Wang Hong. "An Efficient Linguistic Steganography for Chinese Text", Proceedings of 2007 IEEE International Conference on Multimedia & Expro, pages 2094-2097, Beijing, China, July 2007.
o C. Taskiran, U. Topkara, M. Topkara, E. J. Delp, "Attacks on Lexical Natural Language Steganography Systems", Proceedings of the SPIE International Conference on Security, Steganography, and Watermarking of Multimedia Contents, January 15 - 19, 2006, San Jose, CA.
o M. J. Atallah, C. Grothoff, K. Grothoff, L. Alkhutova, R. Stutsman, "Translation-Based Steganography", Proc. 7th International Information Hiding Workshop (IHW 05), Barcelona, Spain, June 2005, pp. 219-233.
o A brief version of Peter Wayner Mimic functions paper Mimic Functions and Tractability. Real paper appeared in Cryptologia, July 1992.
o NICETEXT by Davida and Chapman "Hiding The Hidden: A Software System For Concealing Ciphertext As Innocuous Text" (1997)
Watermarking Image of the Text
o Brassil, Low, Maxemchuk, O'Gorman, "Hiding Information in Document Images" (1994)
Related Web Pages on Natural Language Watermarking and Linguistic Steganography
o A Comprehensive Bibliography of Linguistic Steganography by Richard Bergmair
o Spam Mimic (Steganography Tool)
o TLEX (Steganography Tool)
o Towards Linguistic Steganography: A Systematic Investigation of Approaches, Systems, and Issues
o Miscellanous Bookmarks for Natural Language Processing and Information Hiding
My personal homepage