Information Extraction Techniques for Web Resources

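Cite this article:	Guo Zhihong. Information Extraction Techniques for Web Resources [J]. Information Science (情报科学), 2002, 20(12): 1282-1284.

Author:	Guo Zhihong (郭志红)

Affiliation:	Institute of Information Research, Shanghai Jiao Tong University, Shanghai 200030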

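Abstract:	Web resources contain a large amount of useful information, but because they are poorly structured, traditional database-style query systems cannot use them. How to extract this information and convert it into structured information for use by other information integration systems has become a research hotspot in the field. This paper introduces a simple Web information extraction model, discusses wrapper induction techniques based on that model, and describes a prototype of an automatic wrapper generation system.
Keywords:	Web resources; information extraction; wrapper induction; automatic generation; prototype system
Revised manuscript received:	March 27, 2002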
The Technology of Information Extraction for Web Resource

Guo Zhihong. The Technology of Information Extraction for Web Resource [J]. Information Science, 2002, 20(12): 1282-1284.

Authors:	Guo Zhihong

Abstract:	Web resources contain plenty of useful information, but because it is not well structured, traditional database query systems cannot use it. How to extract this information from web resources and convert it into structured information usable by other information integration systems has recently received considerable attention. This paper presents a simple web information extraction model, discusses wrapper induction techniques based on that model, and describes a prototype system for automatic wrapper generation.

Keywords:	Information extraction; Wrapper induction; Automatic generation; Prototype system
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.