首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于群体增量学习算法的文本特征选择方法
引用本文:罗毅辉,熊曙初.一种基于群体增量学习算法的文本特征选择方法[J].图书情报工作,2011,55(24):102-125.
作者姓名:罗毅辉  熊曙初
作者单位:湖南商学院信息学院信息管理工程研究所
基金项目:湖南省自然科学基金项目“电子商务环境下信任演化模型的构建与应用研究”(项目编号:10JJ6111)研究成果之一
摘    要:尽管目前存在许多文本特征选择方法,但是它们都有着一定的局限性。提出一种新的基于群体增量学习(Population Based Incremental Learning)算法的文本特征选择方法,其特点是无需特征集的先验知识和容易实现,并且由于使用了简单分类器性能作为评价准则,计算复杂度很低。对Reuters-21578文本集的分类实验结果表明,该方法平均分类性能要优于卡方统计量、信息增益和简单遗传算法三种常用的特征选择方法。

关 键 词:群体增量学习  特征选择  文本分类  遗传算法  
收稿时间:2011-03-23

A Text Feature Selection Method Using the Population Based Incremental Learning Algorithm
Luo Yihui Xiong Shuchu.A Text Feature Selection Method Using the Population Based Incremental Learning Algorithm[J].Library and Information Service,2011,55(24):102-125.
Authors:Luo Yihui Xiong Shuchu
Institution:1. Institute of Management Engineering, Information College of Hunan University of Commerce,;2. Institute of Management Engineering, Information College of Hunan University of Commerce,;
Abstract:At present there are many methods to deal with text feature selection,but each of them has certain disadvantages.A novel text feature selection method using the population based incremental learning algorithm is introduced in this paper.Advantages of the proposed method are that it needs no priori knowledge of features,is easily implemented and its computational complexity is very low due to using a simple classifier.Experimental results obtained from the Reuters-21578 dataset show that the method is better than chi-square,information gain and genetic algorithm in the performance of text categorization.
Keywords:population based incremental learning feature selection text categorization genetic algorithm
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《图书情报工作》浏览原始摘要信息
点击此处可从《图书情报工作》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号