Statistical Chinese Four-Character Name Recognition (基于统计的中文四字姓名识别方法)



Cite this article:	刘兴义, 李成城. 基于统计的中文四字姓名识别方法[J]. 山东商业职业技术学院学报, 2012, 12(4): 88-92.

Authors:	刘兴义 (LIU Xing-yi), 李成城 (LI Cheng-cheng)

Institution:	College of Computer and Information Engineering, Inner Mongolia Normal University, Hohhot, Inner Mongolia 010022

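Abstract:	A statistical method is used to recognize Chinese four-character names. The method divides recognition into two stages: candidate name extraction and name refinement. A bigram hidden Markov model extracts candidate names from text that has already been segmented. Boundary rules are then applied to refine the candidates. Experimental results show that the method achieves a recall of 82.9% and a precision of 87.3%.
Keywords:	artificial intelligence; natural language processing; Chinese four-character name recognition; hidden Markov model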
Statistical Chinese Four-Character Name Recognition

LIU Xing-yi, LI Cheng-cheng. Statistical Chinese Four-Character Name Recognition[J]. Journal of Shandong Institute of Commerce and Technology, 2012, 12(4): 88-92.

Authors:	LIU Xing-yi, LI Cheng-cheng

Institution:	(Computer and Information Engineering College, Inner Mongolia Normal University, Hohhot, Inner Mongolia 010022, China)

Abstract:	Automatic recognition of Chinese four-character names is an important part of Chinese named entity recognition. The recognition process is divided into two stages: candidate name extraction and name refinement. A hidden Markov model (HMM) is applied to extract candidate names from text. Boundary rules are then applied to the candidate names for refinement. Experiments show that precision and recall reach 87.3% and 82.9%, respectively.

Keywords:	artificial intelligence; natural language processing; name recognition; HMM
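
The two-stage pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the two-state tag set (`N` for name, `O` for other), the hand-set probabilities, the example sentence, and the cue-word lists are all hypothetical, chosen only to show Viterbi decoding over already-segmented tokens followed by a simple boundary check. The input is assumed to be non-empty.

```python
import math

def viterbi(obs, states, log_start, log_trans, log_emit, unk):
    """Most likely tag sequence for obs under a first-order (bigram) HMM."""
    # best[t][s] = (log-prob of the best path ending in state s at step t, back-pointer)
    best = [{s: (log_start[s] + log_emit[s].get(obs[0], unk), None) for s in states}]
    for t in range(1, len(obs)):
        row = {}
        for s in states:
            prev, score = max(
                ((p, best[t - 1][p][0] + log_trans[p][s]) for p in states),
                key=lambda x: x[1],
            )
            row[s] = (score + log_emit[s].get(obs[t], unk), prev)
        best.append(row)
    # Trace back from the best final state.
    path = [max(states, key=lambda s: best[-1][s][0])]
    for t in range(len(obs) - 1, 0, -1):
        path.append(best[t][path[-1]][1])
    return path[::-1]

def candidates(tokens, tags, length=4):
    """Stage 1: maximal runs of 'N' tags that span exactly `length` characters."""
    out, i = [], 0
    while i < len(tokens):
        if tags[i] != "N":
            i += 1
            continue
        j = i
        while j < len(tokens) and tags[j] == "N":
            j += 1
        text = "".join(tokens[i:j])
        if len(text) == length:
            out.append((i, j, text))
        i = j
    return out

def refine(tokens, cands, left_cues, right_cues):
    """Stage 2: keep candidates with a cue word directly before or after them."""
    kept = []
    for start, end, text in cands:
        left = tokens[start - 1] if start > 0 else None
        right = tokens[end] if end < len(tokens) else None
        if left in left_cues or right in right_cues:
            kept.append(text)
    return kept

def logs(table):
    return {k: math.log(v) for k, v in table.items()}

# Hypothetical toy parameters; a real model would estimate these from a tagged corpus.
STATES = ("N", "O")
LOG_START = logs({"N": 0.2, "O": 0.8})
LOG_TRANS = {"N": logs({"N": 0.7, "O": 0.3}), "O": logs({"N": 0.2, "O": 0.8})}
LOG_EMIT = {
    "N": logs({"欧阳": 0.4, "建": 0.3, "国": 0.3}),
    "O": logs({"记者": 0.5, "报道": 0.4, "国": 0.1}),
}
UNK = math.log(1e-6)  # penalty for tokens a state has never emitted

tokens = ["记者", "欧阳", "建", "国", "报道"]  # illustrative segmented sentence
tags = viterbi(tokens, STATES, LOG_START, LOG_TRANS, LOG_EMIT, UNK)
names = refine(tokens, candidates(tokens, tags), left_cues={"记者"}, right_cues={"说"})
print(tags)   # ['O', 'N', 'N', 'N', 'O']
print(names)  # ['欧阳建国']
```

Working in log space avoids numerical underflow on longer sentences, and keeping refinement as a separate function mirrors the abstract's split between candidate extraction and rule-based filtering.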
This article is indexed in databases including CNKI, VIP, and Wanfang Data.
