An Improved Web Page Watermarking 英语论文.doc
《An Improved Web Page Watermarking 英语论文.doc》由会员分享,可在线阅读,更多相关《An Improved Web Page Watermarking 英语论文.doc(8页珍藏版)》请在三一文库上搜索。
1、An Improved Web Page Watermarking Zhu Ping1, Ding Wei2, Lu Ming2 1. College of Information and Security Engineering, Wuhan, 430070 2. College of Computer Science and Technology, Wuhan, 430070 Abstract: Web Page Watermarking is a research branch of the text watermarking. It is relatively difficult to
2、 embed the watermarking into the Web Page. For the particularity of the Web Page, this paper proposes an effectively improved Web Page Watermarking program which can not only protect the Web Pages completeness from tampering, but also can protect the whole Web sites copyright, integrity and consiste
3、ncy. This program, which can be effectively used for copyright protection, is able to test whether the Web Page suffers tampering, and to locate tampering. Keywords: Text Digit Watermarking, Web Page Watermarking, Fragile Watermarking, Robust Watermarking 1. Introduction The Digital Watermarking Tec
4、hnology, the basic idea of which derives from the early steganography technique, secretly embeds the specific markers into the digital content. This kind of markers usually is invisible, and only can be seen through the special detector or reader. According to the different types of the carrier, the
5、 digital watermarking technology can be divided into Text Digit Watermarking, Image Watermarking, and Video Watermarking. The text usually consists of words, sentences, paragraphs, punctuations and other regular structures. It is not easy to embed the watermarking into the text and not to be found b
6、y invaders. With little redundant information in the text, the common text digital technology includes: the shift coding, word shift coding and feature coding and so on. The Web Page Watermarking in this paper is one of the text digital watermarking. Web Page is different from the ordinary plain tex
7、t document. HTML document is a non-formatted text with labels and web page information constituted. Labels, not case-sensitive, is used to control the format and the display effect of web page information, and can be divided into single-label and dual-label. The single-label can be used alone with t
8、he format as ; the dual-label contains a start label and an end one with the format as web page information . At present, the program of embedding the text digital watermarking into the web page is based on characteristics of HTML grammar and labels. There are: method based on the invisible characte
9、rs (for example, the web browser is non-sensitive to the extra Tab characters and Space characters in the HTML documents.), method based on the non-sensitive HTML labels and one based on the order of the label attributes. 2. Word Watermarking Word Watermarking is to generate a label string of every
10、word by the Hash algorithm, to calculate the accumulated value of all the characters ASDII value in the label string, and finally with it as the seed of the pseudo-random function to generate the corresponding 0-1 coding of the word. However, in the Web Page, it is to extract all the words of the bo
11、dy parts between the HTML labels and generate the corresponding watermarking information of every word. First, it is to encode each word by the SHA-1 security Hash algorithm and generate the 160 bit-length binary sequence (lets suppose the key is Key1), which means Hex string with the length of 40 a
12、nd from 0 to F; then to make the accumulative operation of the ASCII value of each character in the Hexadecimal string sequence to get the sum; with the sum as the seed of the pseudo-random sequence algorithm to generate the six binary 0-1 sequence that is the word watermarking of the word. Later th
13、ese 0-1 sequences should be converted into the corresponding “Spaces-Tabs” sequences, and embed them into the web page through the browsers invisibility. The generation process is shown below and expressed by the formula 1: WW(Wi) = Random(sum(Hash(Wi, Key1), i = 1,2,M (1) Each word SHA-1encryption
14、algorithm, Hex string c1,c2,c3, to get the cs code value and make accumulative operation Key1 Word watermarking six 0-1 sequence, pseudo-random sequence generates algorithm Random, accumulative sum. Figure 1 the word watermarking generation process of the web page Among these, Wi means the No.i word
15、 in the body part of the web page; and WW(Wi) indicates its corresponding watermarking information; Key1 is the key in the Hash algorithm; M is the number of the total words in the body parts; sum is the accumulative algorithm process. 3. Line Watermarking Line Watermarking is to generate the charac
16、ter string of each line through the Hash algorithm and calculate the accumulative value of all the ASCII value in the character string, then with it as the seed of the pseudo-random function to generate the corresponding 0-1 coding of the line. The below is the line watermarking generation process o
17、f the Web Page: First, all the words in the line should be extracted, and to generate the respective word watermarking by the above method, next to operate on these word watermarking, and finally the line watermarking of the line is generated. The generation process is shown below and expressed by t
18、he formula2: LW(Li) = WW(Wi1)WW(Wi2)WW(Wij)WW(WiN) (2) Among these, Li means the No.i line; and LW(Li) indicates the line watermarking of the line; WW(Wij) denotes the No.i line, No.j words word watermarking. M is the number of the total lines and N is the number of the total words. Figure 2: the li
19、ne watermarking generation process of the web page 4. Improved Watermarking Algorithm 4.1 The generation of the watermarking The specific applications and the purposes of the text digital watermarking are different in each web page, so the demand on the function of the Robust Watermarking is differe
20、nt and the generation program of embedding the watermarking information into the web page is also different. Since the word watermarking and the line watermarking is used for protecting the web page from tampering, they and the specific contents of the web page should be closely connected and are ra
21、ndomly embedded into the web page. The watermarking distributed in the Web navigator and corresponded with the web page is used to identify the copyright information, thus the watermarking information is generated by the owner information of the web page, the serial number and icon and others. In or
22、der to facilitate description, so just the English web page is taken into consideration and this program can be easily applied to the web page with other character formats. The watermarking generation process of the web page is described as follows: Each character in Java can be expressed through th
23、e form of the binary, and the binary sequence can be generated by the method of String to get the Bytes. To embed the adscription of the watermarking proof copyright, the author information or the serial number can be showed by the Bytes, and then be encoded into the binary sequence, next the waterm
24、arking sequence is achieved: Wm=Wi(i=1,2,m), finally the encrypted watermarking information by operating through the key sequence Key2=Kj(i=1,2,n) is Wm=Wi(i=1,2,m). The specific operation process can be expressed by the equation 3: Wi = Wi Kj, i = 1 m; j = i % n (3) Among these, m is the length of
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- An Improved Web Page Watermarking 英语论文
链接地址:https://www.31doc.com/p-26340.html