“An Extensive study of Symantic and Syntatic Approaches to Automatic Text Summarization”

Main Article Content

Manisha Gaikwad, Gitanjali Shinde, Parikshit Mahalle, Nilesh Sable, Namrata Kharate

Abstract

Automatic text summarization (ATS) has emerged as a crucial research domain in the discipline of natural language processing (NLP) and information retrieval. The exponential growth of digital content has necessitated the need for efficient techniques that can automatically generate concise and informative summaries from lengthy documents. This article provided a comprehensive recap of automatic text summarization, covering both abstractive and extractive methods. Using extractive techniques, prime phrases or keywords from the original text are identified and chosen, while abstractive methods involve producing summaries by paraphrasing and synthesizing content in a more human-like manner. Discussed the advantages and limitations of each approach, including the challenge of ATS, which arises when summarizing content from external sources. Furthermore, reviews common evaluation metrics used for assessing the quality of summaries and discusses recent advancements in neural network-based approaches for text summarization. This survey aims to provide an overview of automatic text summarization which acts as a useful resource for researchers and practitioners in the fields of information retrieval and NLP.

Article Details

Section
Articles
Author Biography

Manisha Gaikwad, Gitanjali Shinde, Parikshit Mahalle, Nilesh Sable, Namrata Kharate

[1]Manisha Gaikwad

2Gitanjali Shinde

3Parikshit Mahalle

4Nilesh Sable

5Namrata Kharate

 

[1] Department of Computer Engineering, Vishwakarma Institute of Information Technology, Savitribai Phule Pune University, India. Corresponding Author manisha.221p0072@viit.ac.in

   2Department of Computer Science and Engineering (AI&ML), Vishwakarma Institute of Information Technology, Savitribai Phule Pune University, India

3Department of Artificial Intelligence and Data Science, Vishwakarma Institute of Information Technology, Savitribai Phule Pune University, India

4Department of Computer Science and Engineering (AI), Vishwakarma Institute of Information Technology, Savitribai Phule Pune University, India.

5Department of Computer Engineering, Vishwakarma Institute of Information Technology, Savitribai Phule Pune University, India

manisha.221p0072@viit.ac.in, gitanjali.shinde@viit.ac.in, parikshit.mahalle@viit.ac.in, nilesh.sable@viit.ac.in, namrata.kharate@viit.ac.in

 

References

Chatterjee, Niladri, and Raksha Agarwal. "Studying the effect of syntactic simplification on text summarization." IETE Technical Review 40, no. 2 155-166, 4 mar. 2023.

Agarwal, Raksha, and Niladri Chatterjee. "Votesumm: A multi-document summarization scheme using influential nodes of multilayer weighted sentence network." IETE Technical Review 40, no. 4,535-548 jully 2023.

Yao, Kaichun, et al. "Dual encoding for abstractive text summarization." IEEE transactions on cybernetics 50.3 985-996, 2018.

Sun, Xiaoping, and Hai Zhuge. "Summarization of scientific paper through reinforcement ranking on semantic link network." IEEE Access 6, 40611-40625, 2018.

Al-Sabahi, Kamal, Zhang Zuping, and Mohammed Nadher. "A hierarchical structured self-attentive model for extractive document summarization (HSSAS)." IEEE Access 6 (2018): 24205-24212.

Li, Sheng, et al. "Visual to text: Survey of image and video captioning." IEEE Transactions on Emerging Topics in Computational Intelligence 3.4 (2019): 297-312.

Zhuang, Haojie, and Weibin Zhang. "Generating semantically similar and human-readable summaries with generative adversarial networks." IEEE Access 7 (2019): 169426-169433.

Saini, Naveen, et al. "Extractive single document summarization using binary differential evolution: Optimization of different sentence quality measures." PloS one 14.11 (2019): e0223477.

Li, Wei, and Hai Zhuge. "Abstractive multi-document summarization based on semantic link network." IEEE Transactions on Knowledge and Data Engineering 33.1 (2019): 43-54.

Liu, Yang, and Mirella Lapata. "Text summarization with pretrained encoders." arXiv preprint arXiv:1908.08345 (2019).

9. W. Kai and Z. Lingyu, "Research on Text Summary Generation Based on Bidirectional Encoder Representation from Transformers," 2020 2nd International Conference on Information Technology and Computer Application (ITCA), Guangzhou, China, 2020, pp. 317-321, doi: 10.1109/ITCA52113.2020.00074.

10. Stein, Aviel J., et al. "News article text classification and summary for authors and topics." Comput. Sci. Inf. Technol.(CS & IT) 10 (2020): 1-12.

11. Hernández-Castañeda, Ángel, et al. "Extractive automatic text summarization based on lexical-semantic keywords." IEEE Access 8 (2020): 49896-49907.

You, Fucheng, Shuai Zhao, and Jingjing Chen. "A topic information fusion and semantic relevance for text summarization." IEEE Access 8 (2020): 178946-178953.

Yang, Min, et al. "An Effective Hybrid Learning Model for Real-Time Event Summarization." IEEE Transactions on Neural Networks and Learning Systems 32.10 (2020): 4419-4431.

Yang, Min, et al. "Hierarchical human-like deep neural networks for abstractive text summarization." IEEE Transactions on Neural Networks and Learning Systems 32.6 (2020): 2744-2757

Bhargava, Rupal, Gargi Sharma, and Yashvardhan Sharma. "Deep text summarization using generative adversarial networks in Indian languages." Procedia Computer Science 167 (2020): 147-153.

Zaheer, Manzil, et al. "Big bird: Transformers for longer sequences." Advances in neural information processing systems 33 (2020): 17283-17297.

Ma, Tinghuai, et al. "T-bertsum: Topic-aware text summarization based on bert." IEEE Transactions on Computational Social Systems 9.3 (2021): 879-890.

Y. Chen, C. Chang and J. Gan, "A Template Approach for Summarizing Restaurant Reviews," in IEEE Access, vol. 9, pp. 115548-115562, 2021, doi: 10.1109/ACCESS.2021.3103512.

Jang, Heewon, and Wooju Kim. "Reinforced Abstractive Text Summarization With Semantic Added Reward." IEEE Access 9 (2021): 103804-103810.

Jiang, Jiawen, et al. "Enhancements of attention-based bidirectional lstm for hybrid automatic text summarization." IEEE Access 9 (2021): 123660-123671.

Paharia, Naman, Muhammad Syafiq Mohd Pozi, and Adam Jatowt. "Change-Oriented Summarization of Temporal Scholarly Document Collections by Semantic Evolution Analysis." IEEE Access 10 (2021): 76401-76415.

Abdel-Salam, S.; Rafea, A. Performance Study on Extractive Text Summarization Using BERT Models. Information 2022, 13, 67. https://doi.org/10.3390/info13020067.

Alomari, Ayham, et al. "Deep reinforcement and transfer learning for abstractive text summarization: A review." Computer Speech & Language 71 (2022): 101276.

Pang, Bo, et al. "Long document summarization with top-down and bottom-up inference." arXiv preprint arXiv:2203.07586 (2022).

Yadav, Divakar, Jalpa Desai, and Arun Kumar Yadav. "Automatic Text Summarization Methods: A Comprehensive Review." arXiv preprint arXiv:2204.01849 (2022).

Zhu, Haichao, et al. "Transforming wikipedia into augmented data for query-focused summarization." IEEE/ACM Transactions on Audio, Speech, and Language Processing 30 (2022): 2357-2367.

Aliakbarpour, Hassan, Mohammad Taghi Manzuri, and Amir Masoud Rahmani. "Improving the readability and saliency of abstractive text summarization using combination of deep neural networks equipped with auxiliary attention mechanism." The Journal of Supercomputing (2022): 1-28.

Park, Hyun, and Yanggon Kim. "Finding the Optimal Ratio of Summaries to NEWS Articles and Establishing a Hybrid NEWS Summary System." 2022 IEEE/ACIS 7th International Conference on Big Data, Cloud Computing, and Data Science (BCD). IEEE, 2022.

Chitty-Venkata, Krishna Teja, et al. "Neural architecture search for transformers: A survey." IEEE Access 10 (2022): 108374-108412.

Mao, Qianren, et al. "Fact-Driven Abstractive Summarization by Utilizing Multi-Granular Multi-Relational Knowledge." IEEE/ACM Transactions on Audio, Speech, and Language Processing 30 (2022): 1665-1678.

Laban, Philippe, et al. "SummaC: Re-visiting NLI-based models for inconsistency detection in summarization." Transactions of the Association for Computational Linguistics 10 (2022): 163-177.

Aote, Shailendra, Anjusha Pimpalshende, and Archana Potnurwar. "Multi-Document Extractive Text Summarizer for Hindi Documents Using Swarm Intelligence Based Hybrid Approach." Available at SSRN 4019476.

Srivastava, Ridam, et al. "A topic modeled unsupervised approach to single document extractive text summarization." Knowledge-Based Systems 246 (2022): 108636.

Shi, Kaile, et al. "StarSum: A Star Architecture Based Model for Extractive Summarization." IEEE/ACM Transactions on Audio, Speech, and Language Processing 30 (2022): 3020-3031.

Biswas, Pratik K., and Aleksandr Iakubovich. "Extractive summarization of call transcripts." IEEE Access 10 (2022): 119826-119840.

Ahmed, Rowanda DA, Mansoor AbdulHak, and Omar Hesham ELNabrawy. "Text Summarization Clustered Transformer (TSCT)." 2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML). IEEE, 2022.

Kong, Yuqi, Fanchao Meng, and Benjamin Carterette. "A Topological Method for Comparing Document Semantics." arXiv preprint arXiv:2012.04203 (2020).

Joshi, Akanksha, et al. "RankSum—An unsupervised extractive text summarization based on rank fusion." Expert Systems with Applications 200 (2022): 116846.

Mahalakshmi, P., and N. Sabiyath Fatima. " Summarization of Text and Image Captioning in Information Retrieval Using Deep Learning Techniques." IEEE Access 10 (2022): 18289-18297.

Koh, Huan Yee, Jiaxin Ju, Ming Liu, and Shirui Pan. "An Empirical Survey on Long Document Summarization: Datasets, Models, and Metrics." ACM computing surveys 55, no. 8 (2022): 1-35.

Asad Abdi, Norisma Idris, Rasim M. Alguliev, Ramiz M. Aliguliyev, “Automatic summarization assessment through a combination of semantic and syntactic information for intelligent educational systems”,Information Processing & Management,Volume 51, Issue 4,2015,Pages 340-358,ISSN 0306-4573,https://doi.org/10.1016/j.ipm.2015.02.001.

Mihalcea, R., & Tarau, P. (2004, July). Textrank: Bringing order into text. In Proceedings of the 2004 conference on empirical methods in natural language processing (pp. 404-411).

See, A., Liu, P. J., & Manning, C. D. (2017). Get to the point: Summarization with pointer-generator networks. arXiv preprint arXiv:1704.04368.