A fast procedure for the calculation of similarity coefficients in automatic classification |
| |
Authors: | Peter Willett |
| |
Institution: | Postgraduate School of Librarianship and Information Science, University of Sheffield, Western Bank, Sheffield S10 2TN, England |
| |
Abstract: | A fast algorithm is described for comparing the lists of terms representing documents in automatic classification experiments. The speed of the procedure arises from the fact that all of the non-zero-valued coefficients for a given document are identified together, using an inverted file to the terms in the document collection. The complexity and running time of the algorithm are compared with previously described procedures. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|