Zong Yu①② Jin Ping② Chen En-hong① Li Hong① Liu Ren-jin②
①(Department of Computer Science and Technology, University of Science and Technology of China, Hefei 230036, China) ②(Department of Information and Engineering, West Anhui University, Lu’an 237012, China)
Abstract: Weblog co-clustering is an important topic in Weblog mining because it finds user clusters and page clusters simultaneously. Most existing Weblog co-clustering algorithms use a hard partition method to assign each user to a single cluster. However, hard partitioning prevents these algorithms from handling users that lie on cluster boundaries, which significantly affects the quality of the clustering result. In this paper, a Fuzzy CO-clustering for Weblog (FCOW) algorithm is proposed to overcome this drawback of hard partitioning and to improve the quality of Weblog co-clustering results. Specifically, the underlying user model set PA = {pa1, …, paK} is first found by using the Hadamard product; then, the remaining users are assigned to their corresponding models pak based on page subsets to generate the co-clustering result {CSk, CPk}; finally, the fuzzy membership of each user to page cluster CPk is calculated, and this information is used for recommendation. Experimental results on five real-world datasets show that FCOW improves the clustering quality of Weblog co-clustering.
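As a small illustration of the two operations named in the abstract, the sketch below applies the Hadamard (element-wise) product to binary user-by-page visit vectors and then computes one possible fuzzy membership value. It is not the FCOW algorithm itself: the toy vectors, the function names `hadamard` and `fuzzy_membership`, and the membership definition (the fraction of a page subset that a user visited) are assumptions made only for this example.

```python
def hadamard(u, v):
    """Element-wise (Hadamard) product of two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return [a * b for a, b in zip(u, v)]


def fuzzy_membership(user, page_cluster):
    """Fraction of the cluster's pages that the user visited.

    This definition is an illustrative assumption, not the paper's formula.
    """
    size = sum(page_cluster)
    if size == 0:
        return 0.0
    return sum(hadamard(user, page_cluster)) / size


# Hypothetical binary visit vectors over five pages: 1 = page visited.
u1 = [1, 1, 0, 1, 0]
u2 = [1, 1, 0, 0, 1]

# For binary vectors, the Hadamard product keeps only the pages both users visited.
common_pages = hadamard(u1, u2)

# Degree to which a third user belongs to that shared page subset.
u3 = [1, 0, 0, 1, 1]
membership = fuzzy_membership(u3, common_pages)
```

For binary vectors the product acts as a set intersection, which is one plausible reason it suits finding page subsets shared by several users; a graded membership value, unlike a hard assignment, lets a boundary user belong partially to more than one cluster.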