基于汉字聚类特征的中文字符串相似度计算研究 Research Towards Chinese String Similarity Based on the Clustering Feature of Chinese Characters期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于汉字聚类特征的中文字符串相似度计算研究

引用本文：	王静婷.基于汉字聚类特征的中文字符串相似度计算研究[J].现代图书情报技术,2011,27(2):48-53.

作者姓名：	王静婷

作者单位：	南京政治学院上海分院军事信息管理系上海 200433

摘要：	采用聚类分析的方法,对汉字的特征进行研究和分析,找出其内在规律,根据汉字具有“成簇性”的特点,对中文字符串进行精细化匹配,给出基于改进编辑距离的相似度计算模型。实验结果表明,该模型对中文字符串的相似度具有更为精细的体现。
关键词：	中文字符串匹配汉字成簇性相似度
收稿时间：	2010-10-18
修稿时间：	2011-01-28
Research Towards Chinese String Similarity Based on the Clustering Feature of Chinese Characters

Wang Jingting.Research Towards Chinese String Similarity Based on the Clustering Feature of Chinese Characters[J].New Technology of Library and Information Service,2011,27(2):48-53.

Authors:	Wang Jingting

Institution:	Department of Military Information Management, Shanghai Branch of Nanjing Institute of Politics, Shanghai 200433,China

Abstract:	This paper adopts cluster analysis method to discuss and analyze the features of Chinese characters,in order to discover the internal rules. Based on the clustering feature of Chinese characters,it refines the matching result of string matching,and advances a 2-level similarity model. The experiment result shows that this model can reflect the similarity better.

Keywords:	Chinese string matching Clustering of Chinese character Similarity
本文献已被 CNKI 等数据库收录！
	点击此处可从《现代图书情报技术》浏览原始摘要信息
	点击此处可从《现代图书情报技术》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏