|
|
Document Relevance Calculation Based on Lexical Cohesion |
Zhao Yu-ming Xu Zhi-ming Wang Xiao-long Zhu Kun-peng |
School of Computer Science and Technolog, Harbin Institute of Technolog, Harbin 150001, China |
|
|
Abstract A new document relevance calculating method based on lexical cohesion is presented in this paper. The main principle is: documents are formalized with lexicon chains which are constructed by extracting semantic-relative word clusters according to the lexicon cohesion principle under the help of semantic dictionary HowNet; then weight of each lexical chain is evaluated; finally relevance of documents is calculated with their representations. Experiments are conducted on corpus of Chinese Library Classification, and precision about 85.4% is achieved. The experimental results show that the method describes the semantic feature of documents to a certain extent, and it is an effective method for relevance calculating of documents.
|
Received: 02 April 2007
|
|
Corresponding Authors:
Zhao Yu-ming
|
|
|
|
|
|
|