首页 | 本学科首页   官方微博 | 高级检索  
     检索      

利用迁移学习精准识别领域信息之探讨
引用本文:陆泉,郝志同,陈静,陈仕,朱安琪.利用迁移学习精准识别领域信息之探讨[J].图书情报工作,2021(5):110-117.
作者姓名:陆泉  郝志同  陈静  陈仕  朱安琪
作者单位:武汉大学信息资源研究中心;国土资源部城市土地资源监测与仿真重点实验室;华中师范大学信息管理学院
基金项目:国家社会科学基金重点项目“心理账户理论视角下在线健康社区精准信息服务研究”(项目编号:2070008)研究成果之一。
摘    要:目的/意义]将从互联网大数据中无监督学习的结果迁移到目标领城,解决目标领城因学习样本有限而信息识别效果难以提升的问题。方法/过程]使用以中文维基百科等数据预训练的RoBERTa模型进行迁移学习,将学习结果映射到目标领城后使用DPCNN对其进行聚合凝练,然后结合部分标注数据微调模型完成领域信息的精准识别。结果/结论]在10个领城内与未进行迁移学习的模型及经典模型TextCNN对比,提出的模型均较大幅度优于对比模型,平均后的精确率绝对提高4.15%、3.43%,召回率绝对提高4.55%、3.44%,F1分数绝对提高4.52%.3.44%,表明利用网络大数据迁移学习可以显著提升目标领城的信息识别效果。

关 键 词:迁移学习  信息识别  RoBERTa

Discussion on Using Transfer Learning to Accurately Identify Domain Information
Lu Quan,Hao Zhitong,Chen Jing,Chen Shi,Zhu Anqi.Discussion on Using Transfer Learning to Accurately Identify Domain Information[J].Library and Information Service,2021(5):110-117.
Authors:Lu Quan  Hao Zhitong  Chen Jing  Chen Shi  Zhu Anqi
Institution:(Center for Studies of Information Resources,Wuhan University,Wuhan 430072;Key Laboratory of Urban Land Resources Monitoring and Simulation,Ministry of Land and Resources,Shenzhen 5180343;School of Information Management,Central China Normal University,Wuhan 430079)
Abstract:Purpose/significance]To solve the problem that the identification effect of the target domain infor-mation is difcult to improve because of not enough samples,we will transfer the results of unsupervised learning from big data to the feature space of the target domain.Method/process]Used the RoBERTa model,which was pre-trained with Chinese Wikipedia and other data,for transfer learning.After mapping the learming results to the target domain,DPCNN was used to aggregate and condense it,and then fine-tuned the model with part of the labeled data to complete the accurate recognition of domain information.Result/conclusion]Compared with the model without transfer learning and the classic model TextCNN in 10 fields,the model in this paper is much better than the compar-ison models.After average,the precision is increased by 4.15% and 3.43%,the recall is increased by 4.55% and 3.44%,and the F1 score is increased by 4.52%and 3.44%.It shows that knowledge transfer using big data can effectively improve the information recognition effect in the target field.
Keywords:transfer learning  information recogmition  RoBERTa
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号