Automatic Extraction of Metadata from Back Issues of Online Journals Using the Bazhuayu (Octopus) Collector

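Cite this article:	CUI Yujie, LIAO Kun. Automatic extraction of metadata from back issues of online journals using the Bazhuayu (Octopus) collector [J]. Acta Editologica (编辑学报), 2016, 28(5): 485-487.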

Authors:	CUI Yujie, LIAO Kun

Affiliation:	Journal Press of Southwest University, Chongqing 400715, China (both authors)

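Funding:	2015 Special Project of the Society of China University Journals (CUJS2015-010); Fundamental Research Funds for the Central Universities (SWU1609165); 2016 Fund Project of the national association of social science journals of science, engineering, agricultural, and medical universities (LGNY16B8)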

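Abstract:	Existing metadata extraction methods depend on cumbersome extraction rules and adapt poorly to new sources. To address this problem, the article proposes a new method that uses the Bazhuayu (Octopus) collector to extract metadata from the online versions of back issues. The method takes the web pages of large databases as its object, builds a flowchart for metadata extraction, sets the corresponding rules through that flowchart, and configures the data-capture module; it is then applied to the automatic extraction of online-journal metadata. Practical use shows that the method improves metadata extraction performance and adapts well to different sources.
Keywords:	collector; online journal; metadata; automatic extraction
Received:	2016-03-06
Revised:	2016-03-06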
Realization of automatic extraction of metadata in back issues of network journals by octopus collector

CUI Yujie and LIAO Kun. Realization of automatic extraction of metadata in back issues of network journals by octopus collector[J]. Acta Editologica, 2016, 28(5): 485-487.

Authors:	CUI Yujie and LIAO Kun

Institution:	Journal Press of Southwest University, Chongqing 400715, China (both authors)

Abstract:	Existing metadata extraction methods suffer from cumbersome rules and poor adaptability. To solve this problem, we propose using the octopus collector to extract metadata from the online versions of back issues. In this method, the web pages of a large database are taken as the object, a flowchart for extracting metadata is established, the corresponding rules are set through the flowchart, and the data capture module is configured. The method is then applied to the automatic extraction of webzine metadata. Practical application shows that the method improves the performance of metadata extraction and has strong adaptability.

Keywords:	collector; webzine; metadata; automatic extraction
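
The workflow summarized in the abstract (take a database web page as the object, define extraction rules, then run a capture step) can be illustrated with a minimal Python sketch. This is not the authors' configuration: the collector is a visual tool set up through a flowchart, and the paper's actual rules are not given here. The rule table, the `MetaRuleExtractor` class, the `extract_metadata` function, and the sample page are all hypothetical, and the sketch assumes the target page exposes Highwire-style `citation_*` meta tags.

```python
from html.parser import HTMLParser

# Extraction rules: output field -> <meta name="..."> to read.
# The Highwire-style names are an assumption about the target page.
RULES = {
    "title": "citation_title",
    "authors": "citation_author",
    "journal": "citation_journal_title",
    "volume": "citation_volume",
    "first_page": "citation_firstpage",
}


class MetaRuleExtractor(HTMLParser):
    """Collects the content of <meta> tags whose name matches a rule."""

    def __init__(self, rules):
        super().__init__()
        # Invert the rules so a meta name maps to its output field.
        self._field_for = {meta: field for field, meta in rules.items()}
        self.record = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        field = self._field_for.get(attr.get("name"))
        if field is None or attr.get("content") is None:
            return
        # Keep every match in a list, since authors can repeat.
        self.record.setdefault(field, []).append(attr["content"])


def extract_metadata(html, rules=RULES):
    """Apply the rule table to one page and return the captured record."""
    parser = MetaRuleExtractor(rules)
    parser.feed(html)
    parser.close()
    return parser.record


# Hypothetical article page used only for illustration.
SAMPLE_PAGE = """
<html><head>
<meta name="citation_title" content="An example article">
<meta name="citation_author" content="Author One">
<meta name="citation_author" content="Author Two">
<meta name="citation_journal_title" content="Example Journal">
<meta name="citation_volume" content="28">
<meta name="citation_firstpage" content="485">
</head><body></body></html>
"""

if __name__ == "__main__":
    for field, values in extract_metadata(SAMPLE_PAGE).items():
        print(field, values)
```

Keeping the rules in a separate table means a new source can be handled by editing `RULES` alone, which loosely mirrors the adaptability the abstract attributes to rule configuration.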
This article is indexed in Wanfang Data and other databases.