Advancements in Text Summarization Through Machine Learning: A Comprehensive Survey and Analysis

Main Article Content

Vandana Jagtap, Pallavi Parlewar, Sheetal Dhande, Anagha Langhe, Harsh Choudhary, Ashutosh Mishra

Abstract

Due to the extensive amount of text available for any given task, for instance, a research project, it has become a need to have the gist of these documents in a succinct format. In this review paper, we discussed various methods used for single and multi-document summarization. It explores extractive, abstractive, and hybrid methods, along with the role of deep learning models like RNNs, CNNs, and transformers. The survey examines datasets, evaluation metrics, recent advancements, and future scopes in this field. A comparative analysis of methodologies and approaches is also presented.

Article Details

Section
Articles
Author Biography

Vandana Jagtap, Pallavi Parlewar, Sheetal Dhande, Anagha Langhe, Harsh Choudhary, Ashutosh Mishra

[1]Vandana Jagtap

2Pallavi Parlewar 

3Sheetal Dhande

4Anagha Langhe

5Harsh Choudhary

6Ashutosh Mishra

 

[1],4,5,6School of Computer Engineering and Technology, Dr. Vishwanath Karad MIT World Peace University, Pune.

2Department of Electronics and Communication Engineering, Shri Ramdeobaba College of Engineering and Management, Nagpur, India

3Department of Computer Science, Sipna College of Enginnering and Technology, Amravati, India

 

References

Ma, C., Zhang, W.E., Guo, M., Wang, H., Sheng, Q.Z.: Multi-document Summarization via Deep Learning Techniques: A Survey. ACM Comput. Surv. 55, 5, 37 pages (2023). https://doi.org/10.1145/3529754

Sable, N.P., Rathod, V.U. (2023). Rethinking Blockchain and Machine Learning for Resource-Constrained WSN. In: Neustein, A., Mahalle, P.N., Joshi, P., Shinde, G.R. (eds) AI, IoT, Big Data and Cloud Computing for Industry 4.0. Sig-nals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-031-29713-7_17.

Liu, Y., Zhong, S.H., Li, W.: Query-Oriented Multi-Document Summarization via Unsupervised Deep Learning. AAAI, 26(1), 1699-1705 (2021).

Liu, Y., Lapata, M.: Hierarchical Transformers for Multi-Document Summarization. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 5070–5081, Florence, Italy (2019). Association for Computa-tional Linguistics.

Song, S., Huang, H., Ruan, T.: Abstractive text summarization using LSTM-CNN based deep learning. Multimed Tools Appl 78, 857–875 (2019). https://doi.org/10.1007/s11042-018-5749-3

Nilesh P. Sable, Vijay U. Rathod, Parikshit N. Mahalle, Jayashri Bagade, Rajesh Phursule ; Internet of Things-based Smart Sensing Mechanism for Mining Applications, Industry 4.0 Convergence with AI, IoT, Big Data and Cloud Computing: Fundamentals, Challenges and Applications IoT and Big Data Analytics (2023) 4: 132. https://doi.org/10.2174/9789815179187123040012.

Day, M.Y., Chen, C.Y.: Artificial Intelligence for Automatic Text Summarization. In: 2018 IEEE International Conference on Information Reuse and Integration (IRI), 478-484, Salt Lake City, UT, USA (2018). https://doi.org/10.1109/IRI.2018.00076

V. U. Rathod and S. V. Gumaste, “Role of Deep Learning in Mobile Ad-hoc Networks”, IJRITCC, vol. 10, no. 2s, pp. 237–246, Dec. 2022.

Rezaei, A., Dami, S., Daneshjoo, P.: Multi-Document Extractive Text Summarization via Deep Learning Approach. In: 2019 5th Conference on Knowledge Based Engineering and Innovation (KBEI), 680-685, Tehran, Iran (2019). https://doi.org/10.1109/KBEI.2019.8735084

N. P. Sable, V. U. Rathod, P. N. Mahalle, and P. N. Railkar, “An Efficient and Reliable Data Transmission Service using Network Coding Algorithms in Peer-to-Peer Network”, IJRITCC, vol. 10, no. 1s, pp. 144–154, Dec. 2022.

Munot, N., Govilkar, S.: Comparative Study of Text Summarization Methods. International Journal of Computer Applica-tions 102, 33-37 (2014). https://doi: 10.5120/17870-8810.

N. P. Sable, R. Sonkamble, V. U. Rathod, S. Shirke, J. Y. Deshmukh, and G. T. Chavan, “Web3 Chain Authentication and Authorization Security Standard (CAA)”, IJRITCC, vol. 11, no. 5, pp. 70–76, May 2023.

Shirwandkar, N.S., Kulkarni, S.: Extractive Text Summarization Using Deep Learning. In: 2018 Fourth International Con-ference on Computing Communication Control and Automation (ICCUBEA), 1-5, Pune, India (2018). https://doi.org/10.1109/ICCUBEA.2018.8697465

Chopade, H.A., Narvekar, M.: Hybrid Auto Text Summarization Using Deep Neural Network and Fuzzy Logic System. In: 2017 International Conference on Inventive Computing and Informatics (ICICI), 52-56, Coimbatore, India (2017). https://doi.org/10.1109/ICICI.2017.8365192

Vijay U. Rathod* & Shyamrao V. Gumaste, “Effect Of Deep Channel To Improve Performance On Mobile Ad-Hoc Net-works”, J. Optoelectron. Laser, vol. 41, no. 7, pp. 754–756, Jul. 2022.

Allahyari, M., Pouriyeh, S., et al.: Text Summarization Techniques: A Brief Survey. International Journal of Advanced Computer Science and Applications, 8(10) (2017).

Rathod, V.U. and Gumaste, S.V., 2022. Role of Neural Network in Mobile Ad Hoc Networks for Mobility Predic-tion. International Journal of Communication Networks and Information Security, 14(1s), pp.153-166.

MalarSelvi, G., Pandian, A.: Analysis of Different Approaches for Automatic Text Summarization. In: 2022 6th Interna-tional Conference on Computing Methodologies and Communication (ICCMC), 812-816, Erode, India (2022). https://doi.org/10.1109/ICCMC53470.2022.9753732

Li, S., Lei, D., Qin, P., Wang, W.Y.: Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 6038-6044, Hong Kong, China (2019). Association for Computational Linguistics.

Y. Mali, Vijay U. Rathod “A Comparative Analysis of Machine Learning Models for Soil Health Prediction and Crop Selection”, International Journal of Intelligent Systems and Applications in Engineering (IJISAE), vol. 11, no. 10s, pp. 811–828, Aug. 2023.

Malik, R., et al.: Maximal gSpan: Multi-Document Summarization through Frequent Subgraph Mining. In: Proceedings of the 2023 17th International Conference on Ubiquitous Information Management and Communication (IMCOM).

Walunjkar, G., Birari, D., Desai, R., Gholap, Y., Gadekar, D. P., & Dandavate, A. (2024). Energy Efficient Routing in Group and Disaster Mobility: Comparative Analysis. International Journal of Intelligent Systems and Applications in Engi-neering, 12(4s), 85-94.

Awasthi, I., et al.: Natural Language Processing (NLP) based Text Summarization - A Survey. In: Proceedings of the Sixth International Conference on Inventive Computation Technologies [ICICT 2021], IEEE Xplore Part Number: CFP21F70-ART; ISBN: 978-1-7281-8501-9

Padmakumar, A., Saran, A.: Unsupervised Text Summarization Using Sentence Embeddings (2016).

Fang, X., Xiao, T., Liu, S.: Research on Text Summarization Generation and Classification Method Based on Deep Learn-ing. In: 2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA), 757-762, Da-lian, China (2022). https://doi.org/10.1109/ICAICA54878.2022.9844483

Janjanam, P., Reddy, C.P.: Text Summarization: An Essential Study. In: 2019 International Conference on Computational Intelligence in Data Science (ICCIDS), 1-6, Chennai, India (2019). https://doi.org/10.1109/ICCIDS.2019.8862030

Siddiqui, H., Siddiqui, S., Rawat, M., Maan, A., Dhiman, S., Asad, M.: Text Summarization using Extractive Techniques. In: 2021 3rd International Conference on Advances in Computing, Communication Control and Networking (ICAC3N), 28-31, Greater Noida, India (2021). https://doi.org/10.1109/ICAC3N53548.2021.9725501

N. P. Sable, V. U. Rathod, M. D. Salunke, H. B. Jadhav, R. S. Tambe, and S. R. Kothavle, “Enhancing Routing Perfor-mance in Software-Defined Wireless Sensor Networks through Reinforcement Learning”, International Journal of Intelli-gent Systems and Applications in Engineering (IJISAE), vol. 11, no. 10s, pp. 73–83, Aug. 2023.

Ramirez-Orta, J., Milios, E.E.: Unsupervised Document Summarization Using Pre-trained Sentence Embedding’s and Graph Centrality. In: Proceedings of the Second Workshop on Scholarly Document Processing (2021)

Ma, C., et al.: Abstractive Multi-Document Summarization via Phrase Selection and Merging. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 4810-4819 (2019).

Vijay U. Rathod, Yogesh Mali, Nilesh Sable, Deepika Ajalkar, M. Bharathi, and N. Padmaja,” A Network-Centred Opti-mization Technique for Operative Target Selection”, Journal of Electrical Systems (JEs), vol. 19, no. 2s, pp. 87–96, 2023.

Fabbri, A.R., et al.: Multi-news: A Large-scale Multi-document Summarization Dataset and Abstractive Hierarchical Mod-el. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 1075-1085 (2019).

Narayan, S., Papasarantopoulos, N., Lapata, M., Cohen, S.B.: Neural Extractive Summarization with Side Information. ArXiv abs/1704.04530 (2017).

Vaswani, A., et al.: Attention Is All You Need. In: Advances in Neural Information Processing Systems, 30 (2017).

Zhang, Y., et al.: Abstractive Text Summarization Using Sequence-to-Sequence RNNs and Beyond. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natu-ral Language Processing (Volume 1: Long Papers), 4055-4065 (2019).

N. P. Sable, V. U. Rathod, R. Sable and G. R. Shinde, "The Secure E-Wallet Powered by Blockchain and Distributed Ledger Technology," 2022 IEEE Pune Section International Conference (PuneCon), Pune, India, 2022, pp. 1-5, doi: 10.1109/PuneCon55413.2022.10014893.

Liu, Y., et al.: Single Document Summarization as Tree Induction. In: Proceedings of NAACL-HLT 2019, 1745–1755, Minneapolis, Minnesota (2019).

Gokhan, T., et al.: GUSUM: Graph-Based Unsupervised Summarization Using Sentence Features Scoring and Sentence-BERT. In: Text-Graphs 16: Graph-based Methods for Natural Language Processing, 44–53, October 16, 2022.

Villavicencio, P., Watanabe, T.: Text Summarization of Single Documents Based on Syntactic Sequences. In: Intelligent Interactive Multimedia Systems and Services. Smart Innovation, Systems and Technologies, 11 (2011). https://doi.org/10.1007/978-3-642-22158-3_31

. U. Rathod and S. V. Gumaste, "Role of Routing Protocol in Mobile Ad-Hoc Network for Performance of Mobility Mod-els," 2023 IEEE 8th International Conference for Convergence in Technology (I2CT), Lonavla, India, 2023, pp. 1-6, doi: 10.1109/I2CT57861.2023.10126390.

V. U. Rathod and S. V. Gumaste, "An Effect on Mobile Ad-Hoc Networks for Load Balancing Through Adaptive Con-gestion Routing," 2023 International Conference on Integration of Computational Intelligent System (ICICIS), Pune, India, 2023, pp. 1-5, doi: 10.1109/ICICIS56802.2023.10430257.

Rahul, et al.: NLP based Machine Learning Approaches for Text Summarization. In: Proceedings of the Fourth Internation-al Conference on Computing Methodologies and Communication (ICCMC 2020), IEEE Xplore Part Number: CFP20K25-ART; ISBN: 978-1-7281-4889-2

Mali, Y., Rathod, V. U., Kulkarni, M. M., Mokal, P., Patil, S., Dhamdhere, V., & Birari, D. R. (2023). A Comparative Analysis of Machine Learning Models for Soil Health Prediction and Crop Selection. International Journal of Intelligent Systems and Applications in Engineering, 11(10s), 811-828.

Lee, S., et al.: A Survey on Evaluation Metrics for Machine Translation. Mathematics, 11(4), 1006 (2023). https://doi.org/10.3390/math11041006

Wang, C., Long, L.: HowNet-based Evaluation for Chinese Text Summarization. In: 2008 International Conference on Natural Language Processing and Knowledge Engineering (2008). https://doi.org/10.1109/nlpke.2008.4906789

Devlin, J., et al.: BERT: Bidirectional Encoder Representations from Transformers. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 4171-4186 (2019).

Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: BLEU: A Method for Automatic Evaluation of Machine Translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics (ACL '02), 311–318 (2002). https://doi.org/10.3115/1073083.1073135

Birari, D., Hiran, D., & Narawade, V. (2022). Constraint and Descriptor Based Image Retrieval through Sketches with Data Retrieval using Reversible Data Hiding. International Journal of Intelligent Systems and Applications in Engineer-ing, 10(1s), 409-417

Post, M.: A Call for Clarity in Reporting BLEU Scores. In: Proceedings of the Third Conference on Machine Translation: Research Papers, 186–191 (2018).

Wolf, T., et al.: Transformers: State-of-the-Art Natural Language Processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 38–45 (2020).

Radford, A., et al.: Language Models are Unsupervised Multitask Learners. (2019).

Raffel, C., et al.: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. J. Mach. Learn. Res. 21, 1, Article 140 (2020).

Lewis, M., et al.: BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 7871–7880 (2020).

Zhang, J., et al.: PEGASUS: Pre-training with Extracted Gap-Sentences for Abstractive Summarization. In: Proceedings of the 37th International Conference on Machine Learning (ICML'20), Vol. 119. JMLR.org, Article 1051, 11328–11339 (2020).

N. P. Sable, V. U. Rathod, P. N. Mahalle and D. R. Birari, "A Multiple Stage Deep Learning Model for NID in MA-NETs," 2022 International Conference on Emerging Smart Computing and Informatics (ESCI), Pune, India, 2022, pp. 1-6, doi: 10.1109/ESCI53509.2022.9758191.

Sorte, B.W., Joshi, P.P., & Jagtap, V.S. (2015). Use of Artificial Intelligence in Software Development Life Cycle: A state of the Art Review.

Jagtap, V. S., & Pawar, K. (2013). Analysis of different approaches to sentence-level sentiment classification. International Journal of Scientific Engineering and Technology, 2(3), 164-170.

V. U. Rathod, N. P. Sable, N. N. Thorat and S. N. Ajani, "Deep Learning Techniques Using Lightweight Cryptography for IoT Based E-Healthcare System," 2023 3rd International Conference on Intelligent Technologies (CONIT), Hubli, India, 2023, pp. 1-5, doi: 10.1109/CONIT59222.2023.10205808.

Pallavi Parlewar,et.al, “Depth Estimation using FCNN”, Biosc.Biotech.Res.Comm. Special Issue Vol 13 No 14 (2020) pp-372-375.

Kumar, S. ., Parlewar, P. ., & Jagtap, V. . (2023). Design of an Ensemble learning Model to Recognize Multilingual Hand Gestures via Interlinked Key Point analysis. International Journal of Intelligent Systems and Applications in Engineering, 11(10s), 103–112. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3238