首页 | 本学科首页   官方微博 | 高级检索  
     检索      

An efficient algorithm for mining closed itemsets
作者姓名:刘君强  潘云鹤
作者单位:Institute of Artificial Intelligence,Institute of Artificial Intelligence Zhejiang University,Hangzhou 310027,China. Hangzhou University of Commerce,Hangzhou 310035,China,Zhejiang University,Hangzhou 310027,China.
基金项目:ProjectsupportedbytheMinistryofEducation (No .11110 1 G10 110 )andtheZhejiangProvinceNaturalScienceFoundation (No .60 2 14 0 ),China
摘    要:INTRODUCTIONMiningfrequentitemsetsisafundamentalandessentialprobleminmanydataminingapplica tionsincludingthediscoveryofassociationrules,strongrules,correlations,sequentialrules,epi sodes,multi dimensionalpatterns,andmanyoth erimportantdiscoverytasks (AgarwalandSri kant,1994;Wangetal.,2 0 0 2 ) .Mostalgo rithmsproposedsofarworkwellondatasetswherethesizesofitemsetsarerelativelysmall.Howev er,theyusuallycrashwithdensedatasetswheretheitemsetsizesarelarge.Suchdatasetsincludethosecomposedofque…


An efficient algorithm for mining closed itemsets
Abstract:This paper presents a new efficient algorithm for mining frequent closed itemsets. It enumerates the closed set of frequent itemsets by using a novel compound frequent itemset tree that facilitates fast growth and efficient pruning of search space. It also employs a hybrid approach that adapts search strategies, representations of projected transaction subsets, and projecting methods to the characteristics of the dataset. Efficient local pruning, global subsumption checking, and fast hashing methods are detailed in this paper. The principle that balances the overheads of search space growth and pruning is also discussed. Extensive experimental evaluations on real world and artificial datasets showed that our algorithm outperforms CHARM by a factor of five and is one to three orders of magnitude more efficient than CLOSET and MAFIA.
Keywords:Knowledge discovery  Data mining  Frequent closed patterns  Association rules
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号