Similar literature
 20 similar documents found (search time: 437 ms)
1.
2.
This article proposes a process to retrieve the URL of a document for which metadata records exist in a digital library catalog but a pointer to the full text of the document is not available. The process uses results from queries submitted to Web search engines for finding the URL of the corresponding full text or any related material. We present a comprehensive study of this process in different situations by investigating different query strategies applied to three general purpose search engines (Google, Yahoo!, MSN) and two specialized ones (Scholar and CiteSeer), considering five user scenarios. Specifically, we have conducted experiments with metadata records taken from the Brazilian Digital Library of Computing (BDBComp) and The DBLP Computer Science Bibliography (DBLP). We found that Scholar was the most effective search engine for this task in all considered scenarios and that simple strategies for combining and re-ranking results from Scholar and Google significantly improve the retrieval quality. Moreover, we study the influence of the number of query results on the effectiveness of finding missing information as well as the coverage of the proposed scenarios.

3.
A Design Example of a DC Metadata Standard for Chinese Digital Journals   Cited by: 2 (self-citations: 0, citations by others: 2)
刘廷元 《情报科学》2003,21(6):609-612
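The article discusses the use of metadata standards as a feasible way to share resources across different digital journal repositories. The study focuses on three aspects: first, the necessity of adopting metadata standards for digital journals; second, the definition and qualification of DC metadata for digital journals; and finally, an example of a Chinese digital journal metadata standard designed with DC 1.1 metadata and HTML 4.0 syntax.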

4.
The name ambiguity problem is especially challenging in the field of bibliographic digital libraries. The problem is amplified when names are collected from heterogeneous sources. This is the case in the Scholarometer system, which performs bibliometric analysis by cross-correlating author names in user queries with those retrieved from digital libraries. The uncontrolled nature of user-generated annotations is very valuable, but creates the need to detect ambiguous names. Our goal is to detect ambiguous names at query time by mining digital library annotation data, thereby decreasing noise in the bibliometric analysis. We explore three kinds of heuristic features based on citations, metadata, and crowdsourced topics in a supervised learning framework. The proposed approach achieves almost 80% accuracy. Finally, we compare the performance of ambiguous author detection in Scholarometer using Google Scholar against a baseline based on Microsoft Academic Search.

5.
安艳杰 《现代情报》2007,27(9):172-173
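Internet information is increasingly cited by scholars, but citing it accurately is not easy. The article analyzes the relevant international and domestic citation standards and introduces some preliminary progress in automating the citation of electronic documents. Finally, it proposes some ideas for obtaining electronic reference citations by automatically extracting the bibliographic metadata embedded in electronic resources.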

6.
李文生  李超  杨吉江 《现代情报》2009,29(12):35-39
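Government e-mail is increasingly becoming an important component of government information resources. Research on e-mail metadata is one of the important topics in the field of government electronic resource management. By analyzing and drawing on existing research on government e-mail management and metadata standards, and in light of the overall goals of government e-mail management, this paper summarizes the design rules and methods for a government e-mail metadata standard.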

7.
陆小辉 《科技广场》2007,(9):219-221
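Metadata is a basic tool for organizing and processing digital information, providing a standardized, general basis and method of description for digital information in all its forms. Electronic records management metadata is a core component of an electronic records management system.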

8.
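A Comparative Analysis of Metadata Standards for Digital Documents   Cited by: 4 (self-citations: 0, citations by others: 4)
Digital documents are the part of digital resources closest to traditional documents. The term generally refers to digital text information, which has a relatively stable structure; compared with digital images, digital music, and digital animation, the issues involved in its content characteristics, use, management, and long-term preservation are relatively close to those of traditional practice. The application of new technologies and methods in this field is also relatively mature. Abroad, a sound set of processing methods already exists: it borrows the cataloguing and description methods long used by libraries and has essentially not departed from MARC (Machine-Readable Cataloging) and ISBD. It includes the TEI Header as well as the ONIX standard, a tool that integrates modern publishing, distribution, sub-distribution, and users. Marked by the MARC record…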

9.
To organise and manage geospatial and georeferenced information on the Web and make it convenient to search and browse, a digital portal known as G-Portal has been designed and implemented. Compared to other digital libraries, G-Portal is unique in several of its features. It maintains metadata resources in XML with flexible resource schemas. Metadata resources can be logically grouped into projects and layers, so that the entire metadata collection can be partitioned differently for users with different information needs. These metadata resources can be displayed in both the classification-based and map-based interfaces provided by G-Portal. G-Portal further incorporates a query module for users to search metadata and an annotation module for them to create additional knowledge for sharing. G-Portal also includes a resource classification module that categorizes resources into one or more hierarchical category trees based on user-defined classification schemas. This paper gives an overview of the G-Portal design and implementation. The portal features are illustrated using a collection of high school geography examination-related resources.

10.
裘江南  刘丽丽  许晶  王延章 《情报杂志》2012,31(6):149-155,161
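At present there are many kinds of metadata standards in the emergency management field, but they lack cross-domain characteristics; there is no general metadata standard, so they cannot support the description of information on multiple types of unconventional emergencies or integrated emergency management. To address this problem, the paper compares and analyzes existing metadata standards in the emergency field, extracts the elements common to them, and, building on a summary of the structure and elements of existing standards, refines them to construct a general, extensible metadata standard suitable for describing emergency information.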

11.
This article describes the results of our analysis of the data from the CiteSeer digital library. First, we examined the data from the point of view of source top-level Internet domains from which the data were collected. Second, we measured country shares in publications indexed by CiteSeer and compared them to those based on mainstream bibliographic data from the Web of Science and Scopus. And third, we concentrated on analyzing publications and their citations aggregated by countries. This way, we generated rankings of the most influential countries in computer science using several non-recursive as well as recursive methods such as citation counts or PageRank. We conclude that even if East Asian countries are underrepresented in CiteSeer, its data may well be used along with other conventional bibliographic databases for comparing the computer science research productivity and performance of countries.

12.
The Future of MARC and Quality Control   Cited by: 1 (self-citations: 0, citations by others: 1)
罗军 《现代情报》2009,29(3):216-218
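This paper first introduces the background and basic functions of the MARC format. It points out that current description practice and the MARC format must first follow the "Paris Principles" and should look to the future development of description and description formats, building a more reliable, user-oriented platform. Finally, it discusses quality control indicators for the future MARC format and the general outline that future descriptive practice and bibliographic records may adopt.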

13.
李建伟 《现代情报》2017,37(2):57-62
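Based on international metadata standards, the paper designs a semantic organization and metadata model for distinctive local cultural resources in the 世界客都 (World Hakka Capital) "Ancient Dwellings Digital Memory Project". It sets up resource classification oriented toward intelligent knowledge services, designs an open-access mechanism for resources based on associations among content features, achieves orderly organization and efficient use of the project's massive, unordered digital cultural resources, and enables international exchange and sharing of the metadata.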

14.
Research on Full-Text Acquisition within the OAI-PMH Framework   Cited by: 4 (self-citations: 0, citations by others: 4)
郭少友 《情报理论与实践》2006,29(3):353-354,379
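OAI-PMH is a metadata harvesting protocol. Although it does not directly support acquiring the full text described by the metadata, the URL of the full text can be found through certain metadata fields, so a full-text acquisition program can be used to fetch the full text. This paper discusses methods and steps for full-text acquisition for the purpose of long-term preservation, as well as for the purpose of implementing full-text search or building a citation index.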
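The idea in the abstract above, finding a full-text URL through metadata fields of a harvested record, can be illustrated with a minimal sketch. It is not the paper's method: it assumes the record is in the common `oai_dc` format and that a full-text link, when present, appears as a `dc:identifier` value beginning with `http://` or `https://`. The sample record, the `example.org` URL, and the function name `candidate_fulltext_urls` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core element set used inside oai_dc records.
DC_NS = "http://purl.org/dc/elements/1.1/"

# Hypothetical harvested record, made up for illustration only.
SAMPLE_RECORD = """<record xmlns="http://www.openarchives.org/OAI/2.0/">
  <header>
    <identifier>oai:example.org:1234</identifier>
  </header>
  <metadata>
    <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
               xmlns:dc="http://purl.org/dc/elements/1.1/">
      <dc:title>Sample article</dc:title>
      <dc:identifier>https://example.org/papers/1234.pdf</dc:identifier>
      <dc:identifier>ISSN 0000-0000</dc:identifier>
    </oai_dc:dc>
  </metadata>
</record>"""


def candidate_fulltext_urls(record_xml):
    """Return dc:identifier values that look like HTTP(S) URLs.

    Assumption: URL-shaped dc:identifier values are candidate full-text
    links. Real repositories may use other fields or non-URL identifiers.
    """
    root = ET.fromstring(record_xml)
    urls = []
    # Only dc:identifier is matched; the OAI header <identifier> lives in
    # a different namespace and is therefore skipped.
    for element in root.iter("{%s}identifier" % DC_NS):
        value = (element.text or "").strip()
        if value.startswith(("http://", "https://")):
            urls.append(value)
    return urls


print(candidate_fulltext_urls(SAMPLE_RECORD))
```

A harvester would typically pass each URL found this way to a separate download step; whether the link points at the full text itself or at a landing page has to be checked per repository.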

15.
16.
In this work, we elaborate on the meaning of metadata quality by surveying efforts and experiences matured in the digital library domain. In particular, an overview of the frameworks developed to characterize such a multi-faceted concept is presented. Moreover, the most common quality-related problems affecting metadata both during the creation and the aggregation phase are discussed together with the approaches, technologies and tools developed to mitigate them. This survey on digital library developments is expected to contribute to the ongoing discussion on data and metadata quality occurring in the emerging yet more general framework of data infrastructures.

17.
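To explore the characteristics of existing map metadata specifications, the paper compares and analyzes map metadata specifications based on DC, CDLS, and FRBR. It finds that the three show similarities and differences in the objects described, the way relationships among described objects are revealed, and element composition, and that each has its own emphasis in describing and revealing maps. It argues that each map metadata specification should itself be improved, that interoperability among different specifications should be strengthened, and that a mechanism for evaluating and improving existing specifications should be established.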

18.
19.
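Metadata entry is a necessary step in digitizing print books, but manual entry is labor-intensive and inefficient. To address this problem, the paper proposes a machine-learning-based method for automatically acquiring metadata from scanned books. It first defines descriptive, administrative, and structural metadata elements. Then, taking the DjVu XML documents of the scanned pages as the data source, it analyzes the format, structure, and other features of the pages, uses lines as the initial feature vectors, and extracts metadata with a supervised machine learning method. Experiments show that the algorithm achieves high precision and recall and can significantly improve the efficiency of book digitization.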

20.
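As the country subject to the most foreign anti-dumping investigations worldwide, China has much at stake in an anti-dumping early-warning mechanism for its export trade. Given the shortcomings of China's existing anti-dumping early-warning mechanism, building one based on competitive intelligence theory helps in recognizing, warning of, and responding to risk. Constructing three systems, covering the competitive environment, competitors, and competitive strategy formulation, ensures that the mechanism is systematic and complete.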
