首页 | 本学科首页   官方微博 | 高级检索  
     检索      

巧用Clementine简化数据处理
引用本文:郑慧霞.巧用Clementine简化数据处理[J].中华医学图书馆杂志,2011(4):59-62.
作者姓名:郑慧霞
作者单位:中国协和医科大学图书馆网络技术服务部,北京100005
基金项目:中国医学科学院医学信息研究所基本科研业务费支持项目:基于Web挖掘的读者行为分析(编号R0830)
摘    要:用著名的数据挖掘工具Clementine处理数据有些大材小用,但它的确比Excel更易用、更高效,处理数据时不需要翻看复杂的编程手册、在Excel表中拉滚动条、选择各种函数等。以国家科技文献中心(NSTL)签到数据上传处理为研究实例,涉及数据查重、规范、筛选、映射、比对、频次统计等各种常见任务,介绍了如何根据不同处理需求定制相应Clementine数据流和Clementine工具在海量数据处理中的优势。

关 键 词:Clementine  数据处理  映射  比对

Simplifying data processing by making use of Clementine in a clever way
ZHENG Hui-xia.Simplifying data processing by making use of Clementine in a clever way[J].Chinese Journal of Medical Library,2011(4):59-62.
Authors:ZHENG Hui-xia
Institution:ZHENG Hui-xia (Network Technology Service Division, Library of Chinese Union Medical University, Beijing 100005, China)
Abstract:It is to put a large material to a small use when Clementine, a well-known data mining tool is used to process da. However, it is easier to use with a higher efficacy in processing data than Excel because it does not need to read the complex programming manual, to pull the scroll bar in Excel, and to select the different functions. How to build the corresponding data flow according to the requirements of different data processing and bring Clementine into full play was described by taking the uploading of registered attendance data in National Science and Technology Literature Center as an example, including duplicate data check, data standardization, data screening, data mapping, comparison and frequency.
Keywords:Clementine  data processing  mapping  comparison
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号