|
|
A Novel Composite Kernel and Application to Question Retrieval |
Wang Jun① Li Zhou-jun① Hu Xia② Hu Bi-yun① |
①School of Computer Science and Engineering, Beihang University, Beijing 100191, China ②School of Computing, National University of Singapore, Singapore 117590 |
|
|
Abstract Question retrieval plays important role in question and answering systems. The main problem is how to measure the similarity between candidate questions and query question. This paper presents a tree kernel based method, named weighted tree kernel, to calculate the similarity of sentences’ structures and proposes improvements to the original tree kernel algorithm. In order to reduce the effect on tree kernel bringing by syntactic parsing, a composite kernel is proposed based on the weighted tree kernel and two other string kernels, which can capture syntax, part-of-speech and lexical level information of a sentence, to calculate the semantic similarity between question sentences. Experimental results on Yahoo!Answers dataset show that the proposed method outperforms traditional vector space model based methods by 24.02% in question retrieval accuacry.
|
Received: 23 March 2010
|
|
Corresponding Authors:
Wang Jun
E-mail: wangjun0706149@cse.buaa.edu.cn
|
|
|
|
|
|
|