用于co-training的特征选择技术 Feature selection for co-training期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

用于co-training的特征选择技术

引用本文：	李国正,刘天羽.用于co-training的特征选择技术[J].上海大学学报(英文版),2008,12(1):47-51.

作者姓名：	李国正刘天羽

作者单位：	[1]School of Computer Engineering and Science, Shanghai University, Shanghai 200072, P. R. China [2]School of Electronic, Shanghai Dianji University, Shanghai 200240, P. R. China

摘要：	Co-training is a semi-supervised learning method, which employs two complementary learners to label the unlabeled data for each other and to predict the test sample together. Previous studies show that redundant information can help improve the ratio of prediction accuracy between semi-supervised learning methods and supervised learning methods. However, redundant information often practically hurts the performance of learning machines. This paper investigates what redundant features have effect on the semi-supervised learning methods, e.g. co-training, and how to remove the redundant features as well as the irrelevant features. Here, FESCOT （feature selection for co-training） is proposed to improve the generalization performance of co-training with feature selection. Experimental results on artificial and real world data sets show that FESCOT helps to remove irrelevant and redundant features that hurt the performance of the co-training method.
关键词：	CO—training 特征选择技术电子技术
文章编号：	10.1007/s11741-008-0110-2
收稿时间：	2006-05-31
修稿时间：	2006-07-17
Feature selection for co-training

Guo-zheng Li,Tian-yu Liu.Feature selection for co-training[J].Journal of Shanghai University(English Edition),2008,12(1):47-51.

Authors:	Guo-zheng Li Tian-yu Liu

Institution:	1. School of Computer Engineering and Science, Shanghai University, Shanghai 200072, P. R. China 2. School of Electronic, Shanghai Dianji University, Shanghai 200240, P. R. China

Abstract:	Co-training is a semi-supervised learning method, which employs two complementary learners to label the unlabeleddata for each other and to predict the test sample together. Previous studies show that redundant information can helpimprove the ratio of prediction accuracy between semi-supervised learning methods and supervised learning methods. However,redundant information often practically hurts the performance of learning machines. This paper investigates what redundantfeatures have effect on the semi-supervised learning methods, e.g. co-training, and how to remove the redundant features aswell as the irrelevant features. Here, FESCOT (feature selection for co-training) is proposed to improve the generalizationperformance of co-training with feature selection. Experimental results on artificial and real world data sets show that FESCOThelps to remove irrelevant and redundant features that hurt the performance of the co-training method.

Keywords:	feature selection semi-supervised learning co-training
本文献已被维普万方数据 SpringerLink 等数据库收录！
	点击此处可从《上海大学学报(英文版)》浏览原始摘要信息
	点击此处可从《上海大学学报(英文版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏