A Method of Automatic Recognition for Chinese Organization Name Based on SVM/RS
Yu Ying; Wang Xiao-long; Liu Bing-quan
School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
Abstract This paper presents a method for recognizing Chinese organization names that combines support vector machines (SVM) and rough sets (RS). Forming rules for organization names are defined in terms of semantic collocation relations, and a non-redundant set of rough forming rules is learned automatically through attribute reduction in RS. During recognition, a chain of words that matches a forming rule is first selected as a candidate; an SVM classifier then decides whether the candidate is a real organization name, using the semantics of the candidate and of its context. In an open test on a news corpus of 16.17 million words, the method achieves an F-measure of 82.06%.
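The two-stage recognition described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy semantic lexicon, the two forming rules, the feature names, and the hand-set weights. The second stage uses a linear decision function of the form sign(w·x + b), which is the form a trained linear SVM takes; no training or rough-set attribute reduction is performed.

```python
from typing import Dict, List, Tuple

# Toy semantic lexicon (hypothetical; a real system would use a semantic dictionary).
SEMANTIC_CLASS: Dict[str, str] = {
    "哈尔滨": "LOC",         # Harbin
    "工业": "FIELD",         # industry
    "大学": "ORG_SUFFIX",    # university
    "参观": "VERB",          # to visit
    "他": "PRON",            # he
}

# Toy forming rules (hypothetical): semantic-class sequences that may form a name.
FORMING_RULES = {("LOC", "FIELD", "ORG_SUFFIX"), ("FIELD", "ORG_SUFFIX")}


def find_candidates(words: List[str]) -> List[Tuple[int, int]]:
    """Stage 1: return (start, end) spans whose class sequence matches a rule."""
    classes = [SEMANTIC_CLASS.get(w, "OTHER") for w in words]
    spans = []
    for start in range(len(words)):
        for end in range(start + 1, len(words) + 1):
            if tuple(classes[start:end]) in FORMING_RULES:
                spans.append((start, end))
    return spans


# Stage 2: hand-set weights standing in for a trained linear SVM (invented values).
WEIGHTS: Dict[str, float] = {"first=LOC": 1.0, "left=VERB": 0.5, "first=FIELD": -0.8}
BIAS = -0.2


def features(words: List[str], span: Tuple[int, int]) -> List[str]:
    """Binary features from the candidate's first word and its left context word."""
    start, _ = span
    first = SEMANTIC_CLASS.get(words[start], "OTHER")
    left = SEMANTIC_CLASS.get(words[start - 1], "OTHER") if start > 0 else "BOS"
    return [f"first={first}", f"left={left}"]


def is_organization(words: List[str], span: Tuple[int, int]) -> bool:
    """Accept the candidate when the linear score w.x + b is positive."""
    score = BIAS + sum(WEIGHTS.get(f, 0.0) for f in features(words, span))
    return score > 0


def recognize(words: List[str]) -> List[str]:
    """Run both stages and return the accepted candidate strings."""
    return [
        "".join(words[s:e])
        for s, e in find_candidates(words)
        if is_organization(words, (s, e))
    ]
```

Called on the segmented sentence `["他", "参观", "哈尔滨", "工业", "大学"]`, stage 1 proposes two overlapping candidates and stage 2 keeps only the longer one, since the shorter span starts with a FIELD word and scores below zero under these invented weights.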
Received: 10 September 2004