首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Local word vectors guiding keyphrase extraction
Authors:Eirini Papagiannopoulou  Grigorios Tsoumakas
Institution:School of Informatics, Aristotle University of Thessaloniki, 54124, Greece
Abstract:Automated keyphrase extraction is a fundamental textual information processing task concerned with the selection of representative phrases from a document that summarize its content. This work presents a novel unsupervised method for keyphrase extraction, whose main innovation is the use of local word embeddings (in particular GloVe vectors), i.e., embeddings trained from the single document under consideration. We argue that such local representation of words and keyphrases are able to accurately capture their semantics in the context of the document they are part of, and therefore can help in improving keyphrase extraction quality. Empirical results offer evidence that indeed local representations lead to better keyphrase extraction results compared to both embeddings trained on very large third corpora or larger corpora consisting of several documents of the same scientific field and to other state-of-the-art unsupervised keyphrase extraction methods.
Keywords:Keyphrase extraction  Unsupervised method  GloVe  Local word vectors  Reference vector algorithm  68T50
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号