Automatic text summarization: A comprehensive survey

El-Kassas, WS; Salama, Cherif; Rafea, AA; Mohamed, HK;

Abstract


Automatic Text Summarization (ATS) is becoming much more important because of the huge amount of textual content that grows exponentially on the Internet and the various archives of news articles, scientific papers, legal documents, etc. Manual text summarization consumes a lot of time, effort, cost, and even becomes impractical with the gigantic amount of textual content. Researchers have been trying to improve ATS techniques since the 1950s. ATS approaches are either extractive, abstractive, or hybrid. The extractive approach selects the most important sentences in the input document(s) then concatenates them to form the summary. The abstractive approach represents the input document(s) in an intermediate representation then generates the summary with sentences that are different than the original sentences. The hybrid approach combines both the extractive and abstractive approaches. Despite all the proposed methods, the generated summaries are still far away from the human-generated summaries. Most researches focus on the extractive approach. It is required to focus more on the abstractive and hybrid approaches. This research provides a comprehensive survey for the researchers by presenting the different aspects of ATS: approaches, methods, building blocks, techniques, datasets, evaluation methods, and future research directions.


Other data

Title Automatic text summarization: A comprehensive survey
Authors El-Kassas, WS; Salama, Cherif ; Rafea, AA; Mohamed, HK
Keywords Automatic text summarization; Text summarization approaches; Text summarization techniques; Text summarization evaluation; PARTICLE SWARM OPTIMIZATION; MODEL; ARTICLES
Issue Date 1-Mar-2021
Publisher PERGAMON-ELSEVIER SCIENCE LTD
Journal EXPERT SYSTEMS WITH APPLICATIONS 
Volume 165
ISSN 0957-4174
DOI 10.1016/j.eswa.2020.113679
Scopus ID 2-s2.0-85089417054
Web of science ID WOS:000608479600001

Recommend this item

Similar Items from Core Recommender Database

Google ScholarTM

Check

Citations 366 in scopus


Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.