首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于依存句法分析的中文专利候选术语选取研究
引用本文:俞琰,陈磊,姜金德,赵乃瑄.基于依存句法分析的中文专利候选术语选取研究[J].图书情报工作,2019,63(18):109-118.
作者姓名:俞琰  陈磊  姜金德  赵乃瑄
作者单位:1. 南京工业大学信息服务部 南京 210009; 2. 东南大学成贤学院计算机工程系 南京 211816; 3. 南京晓庄学院商学院 南京 211171
基金项目:本文系教育部人文社会科学规划项目 "大数据时代技能知识图谱构建研究"(项目编号:16YJAZH073)和国家社会科学基金一般规划项目"大数据时代支持创新设计的多维度多层次专利文本挖掘研究"(项目编号:17BTQ059)研究成果之一。
摘    要:目的/意义]针对中文专利候选术语选取方法存在需要对不同的数据集分别制定不同的模式匹配规则、专利术语抽取准确性不高等问题,本文提出基于依存句法分析的中文专利术语选取方法,以提高中文专利术语抽取准确性。方法/过程]主要包括依存句法分析、剪枝、生成依存子树等三个主要步骤。首先对中文专利进行依存句法分析,得到依存树,对依存树进行剪枝,去除不符合要求的依存关系,生成依存子树,从中选取连续词串作为候选术语,以抽取中文专利术语。结果/结论]实验结果表明,与已有的中文专利候选术语选取方法相比,本文提出的基于依存句法分析的中文候选术语选取方法能够有效地提高中文专利术语抽取的准确性。

关 键 词:术语抽取  依存句法分析  中文候选术语选取  
收稿时间:2019-01-22

Research on the Selection of Chinese Patent Candidate Term Based on Dependency Syntax Parsing
Yu Yan,Chen lei,Jiang Jinde,Zhao Naixuan.Research on the Selection of Chinese Patent Candidate Term Based on Dependency Syntax Parsing[J].Library and Information Service,2019,63(18):109-118.
Authors:Yu Yan  Chen lei  Jiang Jinde  Zhao Naixuan
Institution:1. Information Service Department, Nanjing Tech University, Nanjing 210009; 2. Computer Science Department, Chengxian College, Southeast University, Nanjing 211816; 3. School of Business, Nanjing Xiaozhuang University, Nanjing 211171
Abstract:Purpose/significance] Aiming at the difficulties in making different pattern matching rules for different data sets and the low accuracy of Chinese patent term extraction, this paper proposes a selection method of Chinese patent candidate term based on dependency syntax parsing to improve the accuracy of Chinese patent term extraction.Method/process] The method mainly includes three main steps:dependency syntax parsing, pruning and dependency subtree generation. Firstly, dependency syntax analysis was carried out on the Chinese patent text, from which dependency tree were obtained. Then, the dependency subtrees were generated by removing dependency relations which do not meet requirements. At last, the continuous word strings were selected as candidate terms to extract Chinese patent terms.Result/conclusion] The experimental results show that compared with the existing related methods, the proposed method based on dependency syntax parsing can effectively improve the accuracy of Chinese patent term extraction.
Keywords:term extraction  dependency syntax parsing  Chinese patent candidate term selection  
点击此处可从《图书情报工作》浏览原始摘要信息
点击此处可从《图书情报工作》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号