首页 | 本学科首页   官方微博 | 高级检索  
     检索      

政务领域本体术语的自动抽取*
引用本文:翟笃风,刘柏嵩.政务领域本体术语的自动抽取*[J].现代图书情报技术,2010,26(4):59-65.
作者姓名:翟笃风  刘柏嵩
作者单位:(宁波大学商学院宁波 315211) (宁波大学网络中心宁波  315211)
基金项目:*本文系国家社会科学基金项目“领域本体的自动构建和应用研究”(项目编号:08CTQ014)的研究成果之一。
摘    要:提出一种新的政务本体术语自动抽取的方法。首先通过中文分词技术和单字合并法提取政务文本中的词作为候选术语;通过C-value求解法和TF-IDF算法对候选术语进行过滤抽取,从而实现政务领域术语的自动抽取。通过实验比较,发现该方法在不影响领域术语抽取召回率的同时可以提高抽取术语的正确率。

关 键 词:政务领域本体  术语  单字合并法  C-value  TFIDF算法
收稿时间:2010-03-22
修稿时间:2010-04-10

Automatic Domain-specific Term Extraction in Administrative-domain Ontology
Zhai Dufeng,Liu Baisong.Automatic Domain-specific Term Extraction in Administrative-domain Ontology[J].New Technology of Library and Information Service,2010,26(4):59-65.
Authors:Zhai Dufeng  Liu Baisong
Institution:(School of Business, Ningbo University, Ningbo 315211, China) (Ningbo University Network Center, Ningbo 315211, China)
Abstract:This paper introduces a new method to extract the administrative-domain Ontology term automatically. Firstly, some words that are representative of the candidate terms should be extracted through the technology of word segmentation and the characters merger method. Secondly, the candidate terms are filtered by the way of C-value method and TF-IDF algorithm to achieve the automatic domain-specific term extraction in administrative-domain Ontology. Finally,the experiment shows that this method can improve the accuracy of the extracted terms and do not affect the recall-rate.
Keywords:
点击此处可从《现代图书情报技术》浏览原始摘要信息
点击此处可从《现代图书情报技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号