共查询到20条相似文献,搜索用时 156 毫秒
1.
情报语言的兼容与互换问题是21世纪情报语言学的重要研究课题之一.互联网时代,当代情报语言的兼容化具有重要意义.在研究、比较国内外情报检索语言兼容互换方式方法的基础上,认为可以基于大规模已标引好的语料,对多个不同的分类体系进行自动转换,并且设计了一种基于大语料库的多分类体系自动转换方案. 相似文献
2.
信息检索系统中情报语言的一体化 总被引:6,自引:1,他引:6
从信息检索系统出发论述情报语言珠一体化。其中,手工信息检索系统主要采用若干情报语言组织文献记录,若干情报语言编制同一索引或若干索引的相互交织三种方式实现情报语言的一体化;计算机信息检索系统主要采用多种语言编制混合索引和一体化语言系统两种方式实现情报语言的一体化。 相似文献
3.
检索语言在网络检索中的应用 总被引:4,自引:0,他引:4
在网络检索中,检索语言因其各自不同的特点有着不同的用途:分类法仍是进行信息组织的重要方法:主题法是广泛利用关键词组织网络信息;分类主题一体化则直接用于超文本系统,并为用户提供经济有效的多种检索途径;自然语言的文本检索以其综合优势和方便性将在未来的检索中成为主流;受控语言——自然语言一体化是最适合于网络检索的检索语言。 相似文献
4.
近年来我国情报检索语言研究的热点问题 总被引:10,自引:1,他引:9
80年代以来,我国情报检索语言理论和实践研究取得了丰硕的成果,情报语言学从初创逐步走向成熟,情报语言学系列课程不断完善,并取得丰富的经验.特别是对一些热点问题,如分类主题一体化、自然语言应用等问题在理论上进行了探索,在技术上进行了实践.90年代以来,随着国际互联网络的发展与延伸,情报检索语言如何适应新的检索环境是一个非常值得研究的课题,也是面向21世纪情报语言学的发展方向. 相似文献
5.
中国近现代分类法与主题法发展轨迹及展望 总被引:6,自引:0,他引:6
本文分阶段分析了中国近现代分类法与主题法研究和发展的状况及特征,预测了21世纪分类法、主题法、情报检索语言的发展前景。 相似文献
6.
对己有的几种内容标示方案(分类法、主题法、分类主题一体化方法、组面组配法)作了简要的评价。在吸收组面标示法的组面组合思想的基础上,结合语言学的视野,提出句法标引的概念,并且探讨其作为新的检索语言的可能性。 相似文献
7.
8.
Biological Abstracts的生物分类索引在检索语言的应用方面颇具特色,即自然分类法与文献分类法的有机融合;在分类索引中渗入主题法因素,力图实现分类主题一体化,具有直观、简洁等优点,检索效率要高于普通分类索引。 相似文献
9.
搜索引擎对网络信息资源的文献控制方法研究 总被引:6,自引:0,他引:6
搜索引擎是二次文献在环境下的应用与发展,它融合了传统的文献控制理论与计算机情报检索技术。本文主要从文献控制的角度介绍了搜索引擎对分类法、主题法以及分类主题一体化的应用与发展,并提出了搜索引擎功能优化的几种途径。 相似文献
10.
网络环境下信息资源组织方式整合新探 总被引:8,自引:2,他引:8
从分析信息资源组织面临的新形势入手,介绍了信息资源组织主要方式(分类法与主题法)的运用现状,着重分析了分类法与主题法在运用过程中的优势和不足,并在网络信息资源组织方式整合、完善创新方面作了探讨。 相似文献
11.
利用分类法和主题法改善搜索引擎的性能 总被引:6,自引:0,他引:6
本文对Internet上的检索工具搜索引擎的工作机理和性能进行了全方位的探讨,指出了常见搜索引擎信息检索缺点。同时还指出了Meta搜索引擎、智能搜索引擎和代理搜索引擎虽然提高了网络信息检索的质量,但由于分类体系不统一,类目划分标准模糊,因而仍然未能从根本上改变搜索引擎主要以关键词(自然语言)作为检索入口的现状,不能实现分类检索与主题检索的一体化。要实现搜索引擎信息检索的突破,笔者认为有必要运用情报检索语言的理论和方法来完善因特网搜索引擎的性能,实现分类、主题一体化的检索机制,克服分类检索语言单纯以学科聚类、主题语言单纯以事物聚类的局限性。 相似文献
12.
13.
《Information processing & management》2022,59(5):103041
Named Entity Recognition (NER) aims to automatically extract specific entities from the unstructured text. Compared with performing NER in English, Chinese NER is more challenging in recognizing entity boundaries because there are no explicit delimiters between Chinese characters. However, most previous researches focused on the semantic information of the Chinese language on the character level but ignored the importance of the phonetic characteristics. To address these issues, we integrated phonetic features of Chinese characters with the lexicon information to help disambiguate the entity boundary recognition by fully exploring the potential of Chinese as a pictophonetic language. In addition, a novel multi-tagging-scheme learning method was proposed, based on the multi-task learning paradigm, to alleviate the data sparsity and error propagation problems that occurred in the previous tagging schemes, by separately annotating the segmentation information of entities and their corresponding entity types. Extensive experiments performed on four Chinese NER benchmark datasets: OntoNotes4.0, MSRA, Resume, and Weibo, show that our proposed method consistently outperforms the existing state-of-the-art baseline models. The ablation experiments further demonstrated that the introduction of the phonetic feature and the multi-tagging-scheme has a significant positive effect on the improvement of the Chinese NER task. 相似文献
14.
In Mongolian, two different alphabets are used, Cyrillic and Mongolian. In this paper, we focus solely on the Mongolian language using the Cyrillic alphabet, in which a content word can be inflected when concatenated with one or more suffixes. Identifying the original form of content words is crucial for natural language processing and information retrieval. We propose a lemmatization method for Mongolian. The advantage of our lemmatization method is that it does not rely on noun dictionaries, enabling us to lemmatize out-of-dictionary words. We also apply our method to indexing for information retrieval. We use newspaper articles and technical abstracts in experiments that show the effectiveness of our method. Our research is the first significant exploration of the effectiveness of lemmatization for information retrieval in Mongolian. 相似文献
15.
F.J. Devadason 《Information processing & management》1985,21(1):11-26
Classaurus is a faceted hierarchic scheme of terms with vocabulary control features. It is a system of terms having separate hierarchic schedules of the Elementary Categories: Discipline, Entity, Property, and Action, together with their respective Species/Types, Parts and Special Modifiers. Also there are separate schedules for the Common Modifiers: Form, Time, Environment, and Place. Each of the terms in these hierarchic schedules is enriched with synonyms, quasi synonyms etc. The hierarchic schedules constituting the systematic part is supplemented by an alphabetical index of chain entries. Classaurus is used in the formulation of subject headings in general, and in particular, subject headings according to the Postulate based Permuted Subject Indexing (POPSI) language. For the construction of classaurus the POPSI language itself provides guidelines. A set of programs have been developed to construct a classaurus using as input, subject headings formulated according to POPSI language which are enriched with certain codes to denote the different Elementary Categories, their Species, Parts, Special Modifiers and other Common Modifiers of different kinds. The resulting classaurus has hierarchic schedules but terms in an array are arranged only alphabetically. The hierarchic schedules constitute the Systematic part of the classaurus. The system generates an alphabetic Index Part to the Systematic Part, in which for each term its broader terms are kept to its right hand side successively along with a code to denote the schedule to which the term belongs. To find out the position of a term in the Systematic Part, the whole entry for the term in the Alphabetic Part is taken and the sequence of the terms in it is reversed. Using the code for the schedule in the entry, the appropriate hierarchic schedule is selected. The schedule is then searched using the broader terms successively as keys until the term in question is reached, wherein all the hierarchically related terms could be found, including synonyms, quasi-synonyms etc. Both the Systematic Part and the Alphabetical Index Part are printed out for manual reference and also kept as direct access files for ondashline access and ondashth-spot updating and building up of the classaurus while inputting new subject headings formulated for this purpose. 相似文献
16.
17.
18.
国外知名大学信息管理学院科研现状分析 总被引:2,自引:0,他引:2
本文选择了国内知名大学的图书馆信息科学专业,对他们的师资力量、科研领域、科研项目及经济来源、科研成果进行了分析,以期对我国的信息管理学科建设有参考价值。 相似文献
19.
When public catalog users enter queries that exactly match the catalog's controlled vocabulary, online systems should respond with browsing lists of alphabetically arranged subject headings, because such displays guide users to retrievals based on the assignment of the matched subject headings to bibliographic records. Unfortunately, studies of online catalog searching demonstrate that alphabetical displays are no longer capable of managing large numbers of subdivided forms of subject headings, because searchers exhibit low levels of perseverance when faced with large numbers of retrievals. This paper introduces a new approach to displaying retrieved subject headings in subject searching—the exact-display approach—designed to encourage users to browse bibliographic information. The purpose of this paper is to emphasize the importance of the exact-display approach by showing how many user queries would be candidates for this approach, demonstrate an implementation of the exact-display approach in an experimental online catalog, and feature end-user experiences with this approach as implemented in the experimental catalog. End-user experiences gave the authors the opportunity to make several recommendations for enhancing the original design of the exact-display approach so that future implementations of this approach in operational online catalogs are responsive to the needs of online catalog users. 相似文献