首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于Ontology的Web文本分类法
引用本文:凌云,魏贵义,刘军.基于Ontology的Web文本分类法[J].情报学报,2006,25(2):202-207.
作者姓名:凌云  魏贵义  刘军
作者单位:浙江工商大学计算机与信息工程学院,杭州,310035
摘    要:传统方法处理文本分类时都需要进行文本训练,并且在文本表示时需要抽取特征项。搜集训练文本的过程需要费时费力的人工参与,而且中文信息的特征项抽取工作难度较大。为了解决这些问题,本文探讨了一种新的文本分类法———基于Ontology的Web文本分类法。该方法首先通过“知网”建立一个Ontology,然后根据分类体系建立每个类的Ontology,最后根据每个类的Ontology对文本进行分类。试验表明这种分类法与KNN分类法在准确率上相当,但比KNN方法稳定,在召回率上优于KNN方法。

关 键 词:文本分类  知网
修稿时间:2005年4月29日

An Ontology-based Approach of Web Texts Classification
Ling Yun,Wei Guiyi,Liu Jun.An Ontology-based Approach of Web Texts Classification[J].Journal of the China Society for Scientific andTechnical Information,2006,25(2):202-207.
Authors:Ling Yun  Wei Guiyi  Liu Jun
Abstract:Traditional methods for text classification need text-training and characteristic abstracting in the step of text-expression.The work of collecting training texts is laborious and time-consuming.Additionally,it is difficult to abstract the characteristics from Chinese texts.In order to resolve above problems,this paper puts forward an approach of ontology-based web text classification.Firstly,the approach establishes an ontology model based on Hownet theory.Then,it creates ontologies for each subclass of the classification system.The web texts classification is performed using these ontologies.Comparing with the method of KNN,the results of our experiments indicate that the accuracy of ontology-based approach is close to KNN's,its algorithms is more steady than KNN's,and its recalling rate is more eminent than KNN's.
Keywords:Ontology
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号