首页 | 本学科首页   官方微博 | 高级检索  
     检索      


User k-anonymity for privacy preserving data mining of query logs
Authors:Guillermo Navarro-Arribas  Vicenç Torra  Arnau Erola  Jordi Castellà-Roca
Institution:1. IIIA, Institut d’Investigació en Intel·ligència Artificial – CSIC, Consejo Superior de Investigaciones Científicas, Campus UAB s/n, 08193 Bellaterra, Catalonia, Spain;2. Departament d’Enginyeria Informàtica i Matemàtiques, UNESCO Chair in Data Privacy, Universitat Rovira i Virgili, Av. Països Catalans 26, E-43007 Tarragona, Spain
Abstract:The anonymization of query logs is an important process that needs to be performed prior to the publication of such sensitive data. This ensures the anonymity of the users in the logs, a problem that has been already found in released logs from well known companies. This paper presents the anonymization of query logs using microaggregation. Our proposal ensures the k-anonymity of the users in the query log, while preserving its utility. We provide the evaluation of our proposal in real query logs, showing the privacy and utility achieved, as well as providing estimations for the use of such data in data mining processes based on clustering.
Keywords:Privacy  Query log  Microaggregation  k-Anonymity  Clustering  Web search
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号