首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于聚类的网络舆情热点发现及分析*
引用本文:王伟,许鑫.基于聚类的网络舆情热点发现及分析*[J].现代图书情报技术,2009,3(3):74-79.
作者姓名:王伟  许鑫
作者单位:华东师范大学信息学系,上海,200241
基金项目:教育部人文社会科学研究项目 
摘    要:根据对网络舆情分析的需求,构建出基于聚类的网络舆情热点发现及分析系统。通过对样本网页文本的特征提取,构建向量空间模型,使用OPTICS算法获取网页热点簇,根据热点簇特征向量对网页进行二次聚类,从而获取关于舆情的时间演变模式,为相关领域研究提供决策支持。通过二次聚类,提高舆情网页相关度的质量,使网络舆情分析更为准确可靠。

关 键 词:网络舆情  热点发现  舆情分析  文本聚类
收稿时间:2009-01-12
修稿时间:2009-02-02

Online Public Opinion Hotspot Detection and Analysis Based on Document Clustering
Wang Wei,Xu Xin.Online Public Opinion Hotspot Detection and Analysis Based on Document Clustering[J].New Technology of Library and Information Service,2009,3(3):74-79.
Authors:Wang Wei  Xu Xin
Institution:(Department of InformaticsEast China Normal University,Shanghai 200241,China)
Abstract:According to the requirement of online public opinion analysis, this paper builds an online public opinion hotspot detection and analysis system based on document clustering. It builds vector space model by abstracting document features from sample Web pages, and get the hot-spot cluster by OPTICS algorithm. According the vector of hot-spot cluster, the Web pages are clustered for the second time. At last, it gets the time evolution mode about the public opinion to afford decision support for specific field,and improves the quality of page correlation and analyze the public opinion more accurately.
Keywords:Online public opinion  Hotspot Detection  Public opinion analysis  Document clustering
本文献已被 万方数据 等数据库收录!
点击此处可从《现代图书情报技术》浏览原始摘要信息
点击此处可从《现代图书情报技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号