首页 | 本学科首页   官方微博 | 高级检索  
     检索      

从社会性标签中进行语义关系抽取——一种元数据生成方法
引用本文:Miao Chen,Xiaozhong Liu,Jian Qin,崔晓莉,张春景.从社会性标签中进行语义关系抽取——一种元数据生成方法[J].现代图书情报技术,2009,3(3):38-45.
作者姓名:Miao Chen  Xiaozhong Liu  Jian Qin  崔晓莉  张春景
作者单位:1. 美国雪城大学,美国
2. 华东师范大学信息学系
3. 上海图书馆
摘    要:标签形式的社会性语义越来越占据主导地位,使元数据界在这种新形式的信息内容表达和检索方面面临机遇和挑战。其中,主要的挑战是与标签相关的语境信息的缺失。以Flickr标签为例,对如何利用社会性语义资源来丰富主题元数据进行了实验。实验过程包含4个步骤:收集Flickr标签样本;通过共有信息计算标签间的同现情况;通过Google检索结果来追踪标签对的语境信息;用自然语言处理和机器学习技术来抽取标签间的语义关系。本实验能够利用Google搜索结果构建语境库,并且以自然语言处理和机器学习算法对这些语句进行处理。这种新方法对于赋予标签对以一定语义关系有相当高的准确率。也探讨该方法在利用社会性语义丰富的主题元数据方面的意义。

关 键 词:关系抽取  标签  搜索引擎  社会性语义  元数据
收稿时间:2009-02-09

Semantic Relation Extraction from Socially-generated Tags:A Methodology for Metadata Generation
Miao Chen Xiaozhong Liu Jian Qin.Semantic Relation Extraction from Socially-generated Tags:A Methodology for Metadata Generation[J].New Technology of Library and Information Service,2009,3(3):38-45.
Authors:Miao Chen Xiaozhong Liu Jian Qin
Institution:(Syracuse University, USA)
Abstract:The growing predominance of social semantics in the form of tagging presents the metadata community with both opportunities and challenges as for leveraging this new form of information content representation and for retrieval. One key challenge is the absence of contextual information associated with these tags. This paper presents an experiment working with Flickr tags as an example of utilizing social semantics sources for enriching subject metadata. The procedure included four steps:1) Collecting a sample of Flickr tags, 2) Calculating cooccurrences between tags through mutual information, 3) Tracing contextual information of tag pairs via Google search results,4) Applying natural language processing and machine learning techniques to extract semantic relations between tags. The experiment helped us to build a context sentence collection from the Google search results, which was then processed by natural language processing and machine learning algorithms. This new approach achieved a reasonably good rate of accuracy in assigning semantic relations to tag pairs. This paper also explores the implications of this approach for using social semantics to enrich subject metadata.
Keywords:Relation extraction  Tags  Search engine  Social semantics  Metadata
本文献已被 万方数据 等数据库收录!
点击此处可从《现代图书情报技术》浏览原始摘要信息
点击此处可从《现代图书情报技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号