Deep Web信息抽取研究 On Deep Web Information Extraction期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

Deep Web信息抽取研究

引用本文：	董旻,方曙.Deep Web信息抽取研究[J].图书情报工作,2007,51(10):25-28.

作者姓名：	董旻方曙

作者单位：	1. 中国科学院研究生院,北京,100049 2. 中国科学院国家科学图书馆成都分馆,成都,610041

摘要：	针对Deep Web信息资源的利用问题，指出对其进行信息抽取的意义，分析对比在信息抽取过程中处理查询接口和抽取结构化数据这两个主要步骤所使用的技术，采用基于关键词查询和建立文档对象模型的方法对专利数据库进行抽取实验。通过分析实验结果，验证抽取方法的准确性，指出不足之处和解决的途径，以期达到充分利用Deep Web信息资源的目的。
关键词：	信息抽取查询接口命名实体识别文档对象模型
修稿时间：	2007-04-18
On Deep Web Information Extraction

Dong Min,Fang Shu.On Deep Web Information Extraction[J].Library and Information Service,2007,51(10):25-28.

Authors:	Dong Min Fang Shu

Institution:	Graduate UniversityofChineseAcademyofSciences, Beijing 100049;Chengdu Library of Chinese Academy of Sciences, Chengdu 610041

Abstract:	Aiming at solving the problem of how to utilize the information resources in the Deep Web, this paper indicates the approach by information extraction, and through analyses and compares the technologies used in two major processes of handling database searching interface and extracting structured data, does information extraction experiment on patent databases by using the approach based on keywords search and document object modeling technologies. The results of experiment verify the precision of extraction approach and the author lastly points out the disadvantages and the ways to improve, so as to provide references for the full use of Deep Web information resources.

Keywords:	Deep Web
本文献已被维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏