首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于中文学术文献的领域本体概念层次关系抽取研究
引用本文:唐琳,郭崇慧,陈静锋,孙磊磊.基于中文学术文献的领域本体概念层次关系抽取研究[J].情报学报,2020,39(4):387-398.
作者姓名:唐琳  郭崇慧  陈静锋  孙磊磊
作者单位:大连理工大学系统工程研究所,大连 116024;北京航空航天大学软件开发环境国家重点实验室,北京 100083
基金项目:国家自然科学基金项目“电子病历挖掘中的聚类模型与算法研究”(71771034);揭阳市科技计划项目“大数据驱动的中药材产业发展决策支持系统”(2017xm041)。
摘    要:基于学术文献构建领域本体对促进领域学科发展具有重要的意义。本文提出了一种以中文学术文献为数据源,半自动化抽取领域本体层次关系的框架方法。首先,构建了一个通用的领域本体层次关系的细粒度研究框架。其次,设计了一种新的概念表示方法,融合了深度学习方法得到的概念语义特征和上下文的时间序列词频。进一步结合了AP聚类、Prim算法和Web搜索引擎的查询数据,提出了基于规则推理的本体概念层次关系抽取算法(RROCHE),实现了半自动化概念层次关系抽取。最后,基于中文分词领域的中文学术文献数据,通过数值实验方法讨论了方法的可行性和有效性。本文提出的框架方法也非常容易推广并应用到各领域本体层次关系任务中。

关 键 词:概念层次关系  本体构建  学术文献  深度学习  时间序列

Learning Concept Hierarchies from Chinese Academic Literature for Domain Ontology Construction
Tang Lin,Guo Chonghui,Chen Jingfeng,Sun Leilei.Learning Concept Hierarchies from Chinese Academic Literature for Domain Ontology Construction[J].Journal of the China Society for Scientific andTechnical Information,2020,39(4):387-398.
Authors:Tang Lin  Guo Chonghui  Chen Jingfeng  Sun Leilei
Institution:(Institute of Systems Engineering,Dalian University of Technology,Dalian 116024;SKLSDE Lab and BDBC,Beihang University,Beijing 100083)
Abstract:Constructing domain ontology from academic literature has great significance in promoting discipline development. Taking Chinese academic literature as a data source, this study proposed a semi-automatic method for extracting concept hierarchy. First, a fine-grained universal research framework for constructing hierarchical relations of the domain ontology was proposed. Then, a novel concept representation fusion method was developed, considering concepts semantic features based on deep learning and concept frequency in time series. Combined with an affinity propagation(AP) clustering algorithm, Prims algorithm, and data from a Web search engine, the ontology concept hierarchy extraction algorithm was proposed via rule-based reasoning(RROCHE). Concept hierarchy relations are learned semi-automatically. The algorithm was then applied to the academic literature on Chinese word segmentation. Numerical experiments examined the feasibility and effectiveness of the proposed methods. The proposed method can also be applied effectively and widely to other domains.
Keywords:concept hierarchies  ontology construction  academic literature  deep learning  time series
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号