Using a new relational concept to improve the clustering performance of search engines |
| |
Authors: | Lin-Chih Chen |
| |
Institution: | Department of Information Management, National Dong Hwa University, No. 1, Sec. 2, Da Hsueh Road, Shou-Feng, Hualien 97401, Taiwan |
| |
Abstract: | In this paper, we present a novel clustering algorithm to generate a number of candidate clusters from other web search results. The candidate clusters generate a connective relation among the clusters and the relation is semantic. Moreover, the algorithm also contains the following attractive properties: (1) it can be applied to multilingual web documents, (2) it improves the clustering performance of any search engine, (3) its unsupervised learning can automatically identify potentially relevant knowledge without using any corpus, and (4) clustering results are generated on the fly and fitted into search engines. |
| |
Keywords: | Document clustering Semantic relation Relational concept Web search engines Web documents |
本文献已被 ScienceDirect 等数据库收录! |
|