首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到18条相似文献,搜索用时 140 毫秒
1.
社会化标注系统中标签的语义模糊性和形式不规范使得资源管理与共享越来越困难,为准确定位标签语义,文章从扩展标签语义与涌现标签语义两个方面,对标签语义检索研究现状进行了综述,分析了社会化标注系统中标签语义检索的研究动态和不足,并总结得出可计算性高、可操作性强、能智能获取标签的语义关系是社会化标注系统标签语义检索的未来研究方向。  相似文献   

2.
通过实验采集用户的图像标注结果,对3种图像语义标注模式——基于标签打分的图像标注模式、单标签下基于图像比较的标注模式以及多标签下基于图像比较的标注模式的标注效果进行对比研究。研究发现:1基于标签打分的图像标注模式和单标签下基于图像比较的标注模式能够帮助用户对图像各标签的语义强度进行有效标注;2多标签下基于图像比较的标注模式可以帮助用户对图像各标签语义强度的比例关系进行有效标注;3标注界面中是否同时显示图像的所有标签,可能影响到用户对图像在各标签上语义强度比例关系的判断。  相似文献   

3.
协同标注系统的语义丰富   总被引:1,自引:0,他引:1  
提出利用语义网技术语义丰富协同标注系统的方法,通过对协同标注系统的标签进行标准化处理,利用标签的共现分析出标签的意思组,并将其与相关本体的SWTS(概念、属性、实例)映射,从而丰富标签的语义,以改善协同标注系统的检索结果.  相似文献   

4.
目前,国内外许多学者借助语义词典Word Net进行标签间语义关系挖掘方面研究,并取得了一定的进展,但却很少有专门针对中文语义词典与标签结合的研究。文章通过选取豆瓣读书上的标签数据,充分分析并利用中文语义词典《同义词词林》的分类体系和编码特点,利用一种基于《同义词词林》的词汇语义相似度计算系统Word Similar计算标签数据的语义相似度,进而挖掘标签间的语义关系,该方法得到的结果与我们思维中的词汇语义关系基本一致,有比较高的准确性。  相似文献   

5.
易明  秦涵  蒋武轩 《情报科学》2020,38(2):29-38
【目的/意义】基于标签系统所蕴含的语义信息与隐性社会网络,构建融合标签概念空间及用户网络的语义社 团发现模型,提高社团发现的质量。【方法/过程】通过构建标签的概念空间挖掘标签间的语义关系,并根据标签包 含的隐性社会网络发现用户网络,进而将两者结合融入到社团发现算法中,并以豆瓣网数据对模型进行实证。【结 果/结论】标签概念空间及用户网络能够提升语义社团发现算法效果。  相似文献   

6.
国外标签本体研究进展   总被引:1,自引:0,他引:1  
吴芬 《现代情报》2009,29(11):16-20
为解决folksonomies的问题,提出给标签、标注行为增加语义的标签本体,并利用语义网本体建模标注行为和folksonomies。标签本体的发展从关注标注活动发展到关注folksonomy(协同标注活动),并从标签含义的角度,创建MOAT跨越标注行为与语义检索的鸿沟。标签本体正走向统一、共享的新阶段。  相似文献   

7.
用户标注是web2.0主要的资源标引和组织方法,由于用户标签组织的平面结构导致标签之间缺乏语义关系,使之很难适应语义信息组织的要求.针对这一问题,探讨在用户标注优化基础上,用户标签的词语网络构建和语义关系处理与控制,简要讨论了用户标注的模型化或本体化.  相似文献   

8.
社会标注系统中用户生成的标签具有随意性和弱关联性,这将导致标签推荐的精确性降低。本文基于加权元组潜在语义的三维张量结构模型,引入社会网络的结构化分析方法对相关元组进行量化加权,以构建加权的三维张量结构模型,并通过元组的潜在语义分析,得到能体现用户兴趣度的加权元组集。最后,通过典型标注网站Delicious中的用户标注数据集,验证了基于加权元组潜在语义分析的三维张量模型具有较好的标签推荐效果。  相似文献   

9.
基于标签的个性化推荐应用越来越普遍,但是标签带有的语义模糊、时序动态性等问题影响着个性化推荐质量,现有研究仅从数量和结构上考虑用户与标签的关系。基于社会化标注系统的个性化推荐首先对融合社会关系的标签进行潜在语义主题挖掘,然后构建多层、多维度用户兴趣模型,提出模型更新策略,最后实现个性化推荐。采集CiteUlike站点数据进行实验分析,结果表明改进算法比传统算法更准确表达用户兴趣偏好,有效提高了个性化推荐准确率。  相似文献   

10.
占泚  熊回香  蒋武轩  李琰 《情报科学》2022,39(1):121-129
【目的/意义】在线健康信息的有效组织对提升全民身体素质具有重要的社会价值。【方法/过程】在分析健 康信息主题、关联关系和资源标引的基础上,构建基于主题图的在线健康信息标签语义挖掘模型,从而构建了健康 信息标签主题图并实现了其可视化导航、浏览和检索等功能。【结果/结论】基于主题图的在线健康信息标签语义挖 掘模型能够准确的发现在线健康信息与信息标签间的深层关系,可以更好地揭示在线健康信息标签的语义关联, 为用户提供信息的可视化浏览和导航功能、提升健康信息的组织效果,帮助用户健康信息获取。【创新/局限】本文 将主题图与健康信息标签相结合,提高了健康信息的检索效率和利用效率,但本文也存在着不足,例如标签样本量 和样本范围较小,缺乏专业医学研究者的参与。  相似文献   

11.
Social tagging systems enable users to assign arbitrary tags to various digital resources. However, they face vague-meaning problems when users retrieve or present resources with the keyword-based tags. In order to solve these problems, this study takes advantage of Semantic Web technology and the topological characteristics of knowledge maps to develop a system that comprises a semantic tagging mechanism and triple-pattern and visual searching mechanisms. A field experiment was conducted to evaluate the effectiveness and user acceptance of these mechanisms in a knowledge sharing context. The results show that the semantic social tagging system is more effective than a keyword-based system. The visualized knowledge map helps users capture an overview of the knowledge domain, reduce cognitive effort for the search, and obtain more enjoyment. Traditional keyword tagging with a keyword search still has the advantage of ease of use and the users had higher intention to use it. This study also proposes directions for future development of semantic social tagging systems.  相似文献   

12.
13.
王凯 《现代情报》2021,41(1):39-49
[目的/意义] 构建基于用户兴趣标签的网络社团识别模型(Fuzzy Interests and User Hybrid Model,FIUHM),揭示用户兴趣与社团形式概念间的模糊层级关系,实现多粒度属性与社团拓扑结构的层次聚类。[方法/过程] 通过抽取豆瓣电影社区数据,实现基于用户标签的兴趣强度语义标注,利用用户相似度,获取社区用户间兴趣语义距离;将网络社区的领接矩阵映射为社团形式背景,构建社团模糊概念格,建立社团形式概念及其偏序关系集,完成社团形式概念建模;通过计算社团稳定指数,识别网络社团边界,并聚类最大独立社团,实现兴趣社团的在线检测。[结果/结论] 通过对比实验,验证了FIUHM模型的有效性,实验表明将模糊形式概念分析引入网络社团识别研究,利用模糊概念格的偏序关系建模用户节点间的兴趣相似度,有利于提高社团识别的分辨率。  相似文献   

14.
非物质文化遗产数据库对非遗资源的分类,必须遵从《非物质文化遗产数字化保护专业标准》,但这种分类方式存在一些问题。为了充分揭示非遗资源的特性和非遗资源之间的文化关联性,以建设承德地区非遗数据库为例,引入大众标注标引资源的方式来补充专家分类的不足:用户协作建设基础标签库,使用基础标签作为推荐标签,使用时间、空间、文化场所标签作为文化空间标签,由推荐标签等形成的高频标签作为专家分类的二级类目的备选,构建基于用户协作的非遗数字资源混合分类模式。  相似文献   

15.
Recently, social network has been paid more and more attention by people. Inaccurate community detection in social network can provide better product designs, accurate information recommendation and public services. Thus, the community detection (CD) algorithm based on network topology and user interests is proposed in this paper. This paper mainly includes two parts. In first part, the focused crawler algorithm is used to acquire the personal tags from the tags posted by other users. Then, the tags are selected from the tag set based on the TFIDF weighting scheme, the semantic extension of tags and the user semantic model. In addition, the tag vector of user interests is derived with the respective tag weight calculated by the improved PageRank algorithm. In second part, for detecting communities, an initial social network, which consists of the direct and unweighted edges and the vertexes with interest vectors, is constructed by considering the following/follower relationship. Furthermore, initial social network is converted into a new social network including the undirected and weighted edges. Then, the weights are calculated by the direction and the interest vectors in the initial social network and the similarity between edges is calculated by the edge weights. The communities are detected by the hierarchical clustering algorithm based on the edge-weighted similarity. Finally, the number of detected communities is detected by the partition density. Also, the extensively experimental study shows that the performance of the proposed user interest detection (PUID) algorithm is better than that of CF algorithm and TFIDF algorithm with respect to F-measure, Precision and Recall. Moreover, Precision of the proposed community detection (PCD) algorithm is improved, on average, up to 8.21% comparing with that of Newman algorithm and up to 41.17% comparing with that of CPM algorithm.  相似文献   

16.
The number of patent documents is currently rising rapidly worldwide, creating the need for an automatic categorization system to replace time-consuming and labor-intensive manual categorization. Because accurate patent classification is crucial to search for relevant existing patents in a certain field, patent categorization is a very important and useful field. As patent documents are structural documents with their own characteristics distinguished from general documents, these unique traits should be considered in the patent categorization process. In this paper, we categorize Japanese patent documents automatically, focusing on their characteristics: patents are structured by claims, purposes, effects, embodiments of the invention, and so on. We propose a patent document categorization method that uses the k-NN (k-Nearest Neighbour) approach. In order to retrieve similar documents from a training document set, some specific components to denote the so-called semantic elements, such as claim, purpose, and application field, are compared instead of the whole texts. Because those specific components are identified by various user-defined tags, first all of the components are clustered into several semantic elements. Such semantically clustered structural components are the basic features of patent categorization. We can achieve a 74% improvement of categorization performance over a baseline system that does not use the structural information of the patent.  相似文献   

17.
Semantic representation reflects the meaning of the text as it may be understood by humans. Thus, it contributes to facilitating various automated language processing applications. Although semantic representation is very useful for several applications, a few models were proposed for the Arabic language. In that context, this paper proposes a graph-based semantic representation model for Arabic text. The proposed model aims to extract the semantic relations between Arabic words. Several tools and concepts have been employed such as dependency relations, part-of-speech tags, name entities, patterns, and Arabic language predefined linguistic rules. The core idea of the proposed model is to represent the meaning of Arabic sentences as a rooted acyclic graph. Textual entailment recognition challenge is considered in order to evaluate the ability of the proposed model to enhance other Arabic NLP applications. The experiments have been conducted using a benchmark Arabic textual entailment dataset, namely, ArbTED. The results proved that the proposed graph-based model is able to enhance the performance of the textual entailment recognition task in comparison to other baseline models. On average, the proposed model achieved 8.6%, 30.2%, 5.3% and 16.2% improvement in terms of accuracy, recall, precision, and F-score results, respectively.  相似文献   

18.
A growing number of tagging applications have begun to provide users the ability to socialise their own keywords. Tagging, which assigns a set of keywords to resources, has become a powerful way for organising, browsing, and publicly sharing personal collections of resources on the Web. It is called folksonomies. These systems on current social websites, however, have deficiencies in defining tag's meaning, and are often blocked to users in order to reuse, share, and exchange the tags across heterogeneous websites. In this paper, we describe a semantic model for expressing folksonomies in social websites. This model, called Social Semantic Cloud of Tags, aims to provide a consistent format of representing folksonomies and some features in terms of tagging activities. We describe core concepts and relevant properties such as a popularity and usage of tags, along with deduced relationships between tags. We will discuss how this model helps to reduce drawbacks regarding tag sharing between users, applications, or folksonomies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号