首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于盘古分词的新闻行业垂直搜索引擎
引用本文:李敏.基于盘古分词的新闻行业垂直搜索引擎[J].丽水学院学报,2012,34(5):66-69.
作者姓名:李敏
作者单位:丽水学院商学院,浙江丽水,323000
摘    要:通过对新闻行业进行分析,针对新闻网站对信息要求的特征,研究相关的中文分词算法以及全文检索框架,并设计了一个能够多线程进行数据采集和检索的垂直搜索引擎,然后通过盘古分词组件与Lucene搭建了一个高效的检索系统。系统通过中小型新闻网站的测试运行能够达到搜索引擎对信息查询准确性以及高效响应速度的要求,有较强的处理,改善了用户体验。

关 键 词:新闻网站  盘古分词  检索系统  垂直搜索引擎

A Vertical Search Engine of News Industry Based on Pangu Segment
LI Min.A Vertical Search Engine of News Industry Based on Pangu Segment[J].Journal of Lishui University,2012,34(5):66-69.
Authors:LI Min
Institution:LI Min (School of Business,Lishui University,Lishui 323000,Zhejiang)
Abstract:A multi-threaded vertical search engine for data collection and retrieval is designed through analysing news industry and the characteristics of the news websites’ requirements towards news Information and by studying Chinese word segmentation algorithm and the full-text retrieval framework.An efficient full-text retrieval system is built as well utilizing Lucene and Pangu sub-word components.Through the test runs involving small news sites,it is indicated that having relatively power capacity,the system can meet the requirements regarding both the accuracy of information inquiries and the efficiency in response speed and help to improve user experience.
Keywords:news websites  Pangu segment  search system  vertical search engine
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号