文本分类器稳定性评估研究 Research on Stability Evaluation of Text Classifier期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

文本分类器稳定性评估研究

引用本文：	程泽凯,林士敏.文本分类器稳定性评估研究[J].情报学报,2005,24(1):64-68.

作者姓名：	程泽凯林士敏

作者单位：	1. 广西师范大学计算机科学系,桂林,541004;安徽工业大学计算机学院,马鞍山,243002 2. 广西师范大学计算机科学系,桂林,541004

基金项目：	清华大学智能技术与系统国家重点实验室开放课题资助 (99002)

摘要：	文本分类是文本挖掘的基础和核心。构建一个分类准确而且稳定的文本分类器是文本分类的关键,很多学者提出了不同的文本分类器模型和算法。在现有的分类器评估方法中,关心的只是分类准确率,而对稳定性这个重要的评价标准却没有涉及。本文提出使用开放测试和封闭测试的准确性指标的比值作为衡量文本分类器稳定性的评估标准。通过文献数据验证以及在所建构的贝叶斯分类器实验平台MBNC上进行的检验表明,用这种标准评价文本分类器具有其合理性。
关键词：	文本分类器稳定性评估数据挖掘
修稿时间：	2004年4月12日
Research on Stability Evaluation of Text Classifier

Cheng Zekai , and Lin Shimin.Research on Stability Evaluation of Text Classifier[J].Journal of the China Society for Scientific andTechnical Information,2005,24(1):64-68.

Authors:	Cheng Zekai and Lin Shimin

Institution:	Cheng Zekai 1,2 and Lin Shimin 1

Abstract:	Text categorization is the base and core of text mining. Constructing an accurate and stable text classifier is a key to text categorization. Many researchers put forward various text classifier models and algorithms. The important criterion, namely stability evaluation, has not been involved in the existing evaluation methods, which concerns classification accuracy. This paper purposes stability evaluation criterion for text classifier. It uses accuracy evaluation criterion ratio of open test and close test to measure text classifier stability criterion. Literature data and test result in the experiment platform MBNC(Bayesian Networks Classifier using Matlab) indicate this criterion is reasonable.

Keywords:	text categorization evaluation text classifier stability evaluation data mining
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏