首页 | 本学科首页   官方微博 | 高级检索  
     检索      


HTSS: A novel hybrid text summarisation and simplification architecture
Institution:1. Information Technology University, 346-B, Ferozepur Road, Lahore, Pakistan;2. Manchester Metropolitan University, Manchester, United Kingdom;3. Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia;1. Computer Engineering Department, Middle East Technical University, Ankara, Turkey;2. School of Computing Science, University of Glasgow, Glasgow, UK;3. Computer Engineering Department, Bilkent University, Ankara, Turkey;1. Beijing University of Posts and Telecommunications, Beijing, China;2. Singapore Management University, Singapore;3. Worcester Polytechnic Institute, USA;4. Alibaba Group, Hangzhou, China
Abstract:Text simplification and text summarisation are related, but different sub-tasks in Natural Language Generation. Whereas summarisation attempts to reduce the length of a document, whilst keeping the original meaning, simplification attempts to reduce the complexity of a document. In this work, we combine both tasks of summarisation and simplification using a novel hybrid architecture of abstractive and extractive summarisation called HTSS. We extend the well-known pointer generator model for the combined task of summarisation and simplification. We have collected our parallel corpus from the simplified summaries written by domain experts published on the science news website EurekaAlert (www.eurekalert.org). Our results show that our proposed HTSS model outperforms neural text simplification (NTS) on SARI score and abstractive text summarisation (ATS) on the ROUGE score. We further introduce a new metric (CSS1) which combines SARI and Rouge and demonstrates that our proposed HTSS model outperforms NTS and ATS on the joint task of simplification and summarisation by 38.94% and 53.40%, respectively. We provide all code, models and corpora to the scientific community for future research at the following URL: https://github.com/slab-itu/HTSS/.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号