首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A weighted ML-KNN based on discernibility of attributes to heterogeneous sample pairs
Institution:1. West China Biomedical Big Data Center, West China Hospital, Sichuan University, No.37 Guoxue Alley, Chengdu 610041, China;2. Department of Radiology, West China Hospital, Sichuan University, No.37 Guoxue Alley, Chengdu 610041, China;3. West China Periodicals, West China Hospital, Sichuan University, No.37 Guoxue Alley, Chengdu 610041, China;4. Department of Bile Duct Surgery, West China Hospital, Sichuan University, No.37 Guoxue Alley, Chengdu 610041, China;1. School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan 430073, China;2. School of Information Management, Wuhan University, Wuhan 430072, China
Abstract:As a well-known multi-label classification method, the performance of ML-KNN may be affected by the uncertainty knowledge from samples. The rough set theory acts as an effective tool for data uncertainty analysis, which can identify the samples easy to cause misclassification in the learning process. In this paper, a hybrid framework by fusing rough sets with ML-KNN for multi-label learning is proposed, whose main idea is to depict easy misclassified samples by rough sets and to measure the discernibility of attributes for such samples. First, a rough set model titled NRFD_RS based on neighborhood relations and fuzzy decisions is proposed for multi-label data to find the heterogeneous sample pairs generated from the boundary regions of each label. Then, the weight of an attribute is defined by evaluating its discernibility to those heterogeneous sample pairs. Finally, a weighted HEOM distance is reconstructed and utilized to ML-KNN. Comprehensive experimental results with fourteen public multi-label data sets, including ten regular-scale and four larger-scale data sets, verify the effectiveness of the proposed framework relative to several state-of-the-art multi-label classification methods.
Keywords:Multi-label classification  Rough set  Attribute weight  Boundary region  Heterogeneous sample pair
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号