首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于行为-内容融合模型的用户画像研究
引用本文:余传明,田鑫,郭亚静,安璐.基于行为-内容融合模型的用户画像研究[J].图书情报工作,2018,62(13):54-63.
作者姓名:余传明  田鑫  郭亚静  安璐
作者单位:1. 中南财经政法大学信息与安全工程学院 武汉 430073; 2. 武汉大学信息管理学院 武汉 430072
基金项目:本文系国家自然科学基金面上项目"大数据环境下基于领域知识获取与对齐的观点检索研究"(项目编号:71373286)和教育部哲学社会科学研究重大课题攻关项目"提高反恐怖主义情报信息工作能力对策研究"(项目编号:17JZD034)研究成果之一。
摘    要:目的/意义]为识别并去除非理性投资者的网络评论,提升评论的专业程度与质量,促进理性投资,本文以识别股吧中的用户是否属于噪声投资者为研究任务,进行用户画像。方法/过程]对股吧的用户发文内容进行深度用户表示学习(deep user representation learning),结合股吧用户的粉丝数量、影响力、关注量、自选股、吧龄、发帖量、评论量、访问量等行为特征,提出一种行为-内容融合模型(behaviour and content combined model,BCCM),并在标注数据集上进行实证与对比研究。结果/结论]实验结果显示,该模型对噪声投资者识别的F1值为79.47%,优于决策树方法(69.90%)、SVM方法(75.61%)、KNN方法(73.21%)和ANN方法(74.83%)。在噪声投资者识别这一特定用户画像研究任务中,通过利用深度用户表示学习引入文本内容特征,能够显著提升用户画像的各种评价指标。

关 键 词:用户画像  情感分析  用户表示学习  特征融合  
收稿时间:2018-01-04

User Profiling Based on the Behaviour and Content Combined Model
Yu Chuanming,Tian Xin,Guo Yajing,An Lu.User Profiling Based on the Behaviour and Content Combined Model[J].Library and Information Service,2018,62(13):54-63.
Authors:Yu Chuanming  Tian Xin  Guo Yajing  An Lu
Institution:1. School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan 430073; 2. School of Information Management, Wuhan University, Wuhan 430072
Abstract:Purpose/significance] To identify and remove online reviews from irrational investors, enhance the professional degree and quality of comments, and to promote rational investment, this article takes identifying whether the users on the Guba website belong to the noise investors as an example, and carries out a user profiling study.Method/process] Deep user representation learning method was used to learn text information such as users'posts, then a behavior and content combined model was proposed with respect to behavior characteristics such as fans number, influence, bar age, post number and so on, and an empirical and comparative study was done on the annotated data set.Result/conclusion] Experiment result showed that the BCCM model got the F1 score of 79.47%, which is superior to Decision Tree model(69.90%), SVM model(75.61%), KNN model(73.21%) and ANN model(74.83%). In the specific user profiling task of identifying noise traders, by using deep user representation learning method to obtain text content characteristics, the various evaluation metrics of use profiling can be remarkably improved.
Keywords:user modelling  emotional analysis  user representation learning  characteristic fusion  
点击此处可从《图书情报工作》浏览原始摘要信息
点击此处可从《图书情报工作》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号