Effect of Supervised Sense Disambiguation Model Using Machine Learning Technique and Word Embedding in Word Sense Disambiguation

Main Article Content

Rupesh Mahajan, Chandrakant Kokane, Kishor Pathak, Manohar Kodmelwar, Kapil Wagh, Mahesh Bhandari

Abstract

Natural language processing includes a subfield called word sense disambiguation, which focuses mostly on words that might have several meanings. Polysemous terms are also referred to as confusing phrases in some circles. The performance of word sense disambiguation depends on how effectively the ambiguous word is recognized by the machine. The discussed word embedding model for the ambiguous words represents the words from the document space to vector space with no data loss. The most identified challenge of ambiguous word representation is the features. The selection and representation of ambiguous words with respect to the features is the tedious task of word embedding. The discussed word embedding model uses countable features of available context for disambiguation. The proposed model is implemented for ambiguous words with context information. The available context of ambiguous/polysemous words is used for disambiguation. The unavailability of the context is the challenge in this model. The Recurrent Neural Network with Large Small Term Memory is used for the classification. The output of the RNN-LSTM is the sense values which are further mapped with the freely available lexical resource WordNet for retrieving the correct sense(meaning).

Article Details

Section
Articles
Author Biography

Rupesh Mahajan, Chandrakant Kokane, Kishor Pathak, Manohar Kodmelwar, Kapil Wagh, Mahesh Bhandari

[1]Dr. Rupesh Mahajan

2Dr. Chandrakant Kokane

3Kishor Pathak

4Dr. Manohar Kodmelwar

5Kapil Wagh

6Mahesh Bhandari

 

[1] Dr. D.Y. Patil Institute of Technology, Pimpri, Pune, Maharashtra, India, mhjn.rpsh@gmail.com 0009-0004-5371-8080

2Nutan Maharashtra Institute of Engineering and Technology, Talegaon, Pune, Maharashtra, India, cdkokane1992@gmail.com 0000-0001-7957-3933

3Vishwakarma Institute of Information Technology, Pune, Maharashtra, India, kishor.pathak@viit.ac.in 0000-0001-8409-7433

4Vishwakarma Institute of Information Technology, Pune, Maharashtra, India, manohar.kodmelwar@viit.ac.in 0000-0001-5248-528X

5Nutan Maharashtra Institute of Engineering and Technology, Talegaon, Pune, Maharashtra, India, kapilwagh2686@gmail.com 0000-0002-7741-6050

6Vishwakarma Institute of Information Technology Pune, Maharashtra, India, mahesh.bhandari@viit.ac.in 0000-0002-4235-1832  

*Correspondence: cdkokane1992@gmail.com

 

References

Correa Jr, E. A.; Lopes, A. A.; Amancio, D. R. Word sense disambiguation: A complex network approach. Information Sciences,2018, 442, 103-113.

M. Y. Kang; T. H. Min; J. S. Lee. Sense Space for Word Sense Disambiguation, IEEE International Conference on Big Data and Smart Computing (BigComp), Shanghai, China, 2018, pp. 669-672

Myung Yun Kang; Tae Hong Min; Jae Sung Lee. Sense Space for Word Sense Disambiguation, 2018 IEEE International Conference on Big Data and Smart Computing.

Alian, M.; Awajan, A.; Al-Kouz, A. Word sense disambiguation for Arabic text using Wikipedia and Vector Space Model. Int J Speech Technol,2016,19, 857–867.

Nguyen, Q. P.; Vo, A. D., Shin, J. C.; Ock, C. Y. Effect of word sense disambiguation on neural machine translation: A case study in Korean. IEEE,2018, Access, 6, 38512-38523.

Rahman, Mohammad Marufur; Saeed Anwar Khan; KM Azharul Hasan. Word Sense Disambiguation by Context Detection. 4th International Conference on Electrical Information and Communication Technology (EICT),2019, IEEE.

Collobert, R.; Weston, J.; Bottou, L.; Karlen, M.; Kavukcuoglu, K.; Kuksa, P. Natural language processing (almost) from scratch, Journal of Machine Learning Research,2011, 12(1),2493–2537.

Chun-Xiang Zhang; Rui Liu, Xue-Yao Gao; Bo Yu. Graph Convolutional Network for Word Sense Disambiguation, Discrete Dynamics in Nature and Society, vol. 2021, Article ID 2822126, 12 pages, 2021.

Tang, D.; Wei, F.; Yang, N.; Zhou, M.; Liu, T.; Qin, B. Learning sentiment-specific word embedding for Twitter sentiment classification. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics,2014, vol. 1, pp. 1555–1565.

Kokane, C. D.; Babar, S. D.; Mahalle, P. N. Word Sense Disambiguation for Large Documents Using Neural Network Model. In 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT),2021, (pp. 1-5).

Kokane, C.; Babar, S.; Mahalle, P.; Patil, S. Word Sense Disambiguation: A Supervised Semantic Similarity based Complex Network Approach. International Journal of Intelligent Systems and Applications in Engineering,2022, 10(1s), 90-94.

Kokane, C.D.; Babar, S.D.; Mahalle, P.N.; Patil, S.P. Word Sense Disambiguation: Adaptive Word Embedding with Adaptive-Lexical Resource. In: Chaki, N., Roy, N.D., Debnath, P., Saeed, K. (eds) Proceedings of International Conference on Data Analytics and Insights, 2023, ICDAI 2023. Lecture Notes in Networks and Systems, vol 727.

Mikolov; Tomas, et al. Efficient estimation of word representations in vector space. 2013,arXiv preprint arXiv:1301.3781.

Pennington, J.; Socher, R.; Manning, C. D. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP),2014, (pp. 1532-1543).

Khetani, V. ., Gandhi, Y. ., Bhattacharya, S. ., Ajani, S. N. ., & Limkar, S. . (2023). Cross-Domain Analysis of ML and DL: Evaluating their Impact in Diverse Domains. International Journal of Intelligent Systems and Applications in Engineering, 11(7s), 253–262.