Abstract:A method to identify Chinese organization names by utilizing SVM (Support Vector Machines) and RS (Rough Set) is provided. Forming rule of organization name is defined based on semanteme collocation relation, and then the un-redundancy set of rough forming rules can be learned by employing attribute reduction in RS automatically. A chain of words matching forming rule is selected first as candidate, then a SVM classifier discern whether a candidate is real organization name according to candidate semanteme and its contextual semanteme while recognizing. Results of open testing achieve F-measure 82.06% in 16.17 million words news based on this project separately.
宇 缨; 王晓龙;刘秉权. 一种基于SVM/RS的中文机构名称自动识别方法[J]. 电子与信息学报, 2006, 28(5): 895-900 .
Yu Ying;Wang Xiao-long; Liu Bing-quan. A Method of Automatic Recognition for Chinese
Organization Name Based on SVM/RS. , 2006, 28(5): 895-900 .