首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Chat mining: Predicting user and message attributes in computer-mediated communication
Authors:Tayfun Kucukyilmaz  B Barla Cambazoglu  Cevdet Aykanat  Fazli Can
Institution:1. Computer Engineering Department, Bilkent University, TR 06800 Bilkent, Ankara, Turkey;2. Yahoo! Research Barcelona, Barcelona, Spain
Abstract:The focus of this paper is to investigate the possibility of predicting several user and message attributes in text-based, real-time, online messaging services. For this purpose, a large collection of chat messages is examined. The applicability of various supervised classification techniques for extracting information from the chat messages is evaluated. Two competing models are used for defining the chat mining problem. A term-based approach is used to investigate the user and message attributes in the context of vocabulary use while a style-based approach is used to examine the chat messages according to the variations in the authors’ writing styles. Among 100 authors, the identity of an author is correctly predicted with 99.7% accuracy. Moreover, the reverse problem is exploited, and the effect of author attributes on computer-mediated communications is discussed.
Keywords:Authorship analysis  Chat mining  Computer-mediated communication  Machine learning  Stylistics  Text classification
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号