Homonym Detection Using WordNet and Modified Lesk Approach for English Language

Main Article Content

Madhuri P. Karnik D. V. Kodavade

Abstract

Word sense disambiguation (WSD) is a basic and persistent problem that has existed since its inception in the natural language processing (NLP) area. The process of determining the accurate meaning of a word within a specific context is referred to as word sense disambiguation, commonly known as WSD. In NLP, a single word can have two or more meanings, with each meaning being distinguished by its context. This is known as word polysemy. Its applications span a wide range of fields, such as question answering systems, machine translation, information retrieval (IR) etc. Ontology and NLP are still struggling with ambiguity. Homonyms, which are ubiquitous in most languages, are words that have the same spelling but a different meaning. This method's fundamental premise is to select the appropriate sense by comparing a word's context in a sentence to contexts generated from WordNet.  The primary goal of this study is to employ WordNet and the Lesk algorithm for WSD. After the algorithm was put into practice and tested on a collection of sentences that included ambiguous words, the synset was able to determine the proper interpretation for most of the sentences. The Lesk algorithm relies on finding the highest number of shared words (maximum overlap) between a word’s context, prepositions and the definitions for its different meanings(glosses). This approach helps in identifying the most accurate interpretation for a given word within a specific context. According to experimental findings, the suggested strategy considerably boosts performance while identifying homonyms.

Article Details

Section
Articles
Author Biography

Madhuri P. Karnik D. V. Kodavade

[1]Madhuri P. Karnik

2Dr. D. V. Kodavade  

 

[1]Ph.D Scholar, Shivaji university, Kolhapur, Maharashtra, India

madhuri.chavan@viit.ac.in

2Professor, D.K.T.E. Society’s Textile & Engineering, Ichalkaranji, Maharashtra, India

dvkodavade@gmail.com

 

 

References

Lesk, Michael. "Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone." In Proceedings of the 5th annual international conference on Systems documentation, pp. 24-26. 1986.

Banerjee, Satanjeev, and Ted Pedersen. "An adapted Lesk algorithm for word sense disambiguation using WordNet." In International conference on intelligent text processing and computational linguistics, pp. 136-145. Berlin, Heidelberg: Springer Berlin Heidelberg, 2002.

J Garg, Mr Sandy, and Er Anand Kumar Mittal. "A Comparative Study of Svm and New Lesk Algorithm for Word Sense Disambiguation in Hindi Language.", International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 2, Issue 5, May 2015, PP 24-28.

B. Surekha. Dr. K. Vijaya kumar and S. Siva skandha, “WORD SENSE DISAMBIGUATION USING LESK”, International Journal of Latest Trends in Engineering and Technology IJLTET Special Issue- ICRACSC-2016 , pp.063-066.

Kågebäck, Mikael, and Hans Salomonsson. "Word sense disambiguation using a bidirectional lstm." arXiv preprint arXiv:1606.03568 (2016).

Ayetiran, Eniafe Festus, and Kehinde Agbele. "An optimized Lesk-based algorithm for word sense disambiguation." Open Computer Science 8, no. 1 (2016): 165-172.

Gautam, Chandra Bhal Singh, and Dilip Kumar Sharma. "Hindi word sense disambiguation using Lesk approach on bigram and trigram words." In Proceedings of the International Conference on Advances in Information Communication Technology & Computing, pp. 1-5. 2016.

van den Beukel, Sven, and Lora Aroyo. "Homonym detection for humor recognition in short text." In Proceedings of the 9th workshop on computational approaches to subjectivity, sentiment and social media analysis, pp. 286-291. 2018.

Ackermann, Marcel R., and Florian Reitz. "Homonym detection in curated bibliographies: learning from dblp’s experience." In Digital Libraries for Open Knowledge: 22nd International Conference on Theory and Practice of Digital Libraries, TPDL 2018, Porto, Portugal, September 10–13, 2018, Proceedings 22, pp. 59-65. Springer International Publishing, 2018.

Roll, Uri, Ricardo A. Correia, and Oded Berger‐Tal. "Using machine learning to disentangle homonyms in large text corpora." Conservation Biology 32, no. 3 (2018): 716-724.

Kumar, Manish, Prasenjit Mukherjee, Manik Hendre, Manish Godse, and Baisakhi Chakraborty. "Adapted lesk algorithm based word sense disambiguation using the context information." International Journal of Advanced Computer Science and Applications 11, no. 3 (2020): 254-260.

Bhattacharjee, Krishnanjan, S. ShivaKarthik, Swati Mehta, Ajai Kumar, Snehal Phatangare, Kirti Pawar, Sneha Ukarande, Disha Wankhede, and Devika Verma. "Survey and gap analysis of word sense disambiguation approaches on unstructured texts." In 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), pp. 323-327. IEEE, 2020.

Kharate, Namrata G., and Varsha H. Patil. "Word sense disambiguation for Marathi language using WordNet and the lesk approach." In Proceeding of First Doctoral Symposium on Natural Computing Research: DSNCR 2020, pp. 45-54. Springer Singapore, 2021.

Saha, Rohan. "CMPUT 600 Project Report: Homonym Identification using BERT." arXiv:2101.02398v1 [cs.CL] 7 Jan 2021

Habibi, Amir Ahmad, Bradley Hauer, and Grzegorz Kondrak. "Homonymy and polysemy detection with multilingual information." In Proceedings of the 11th Global Wordnet Conference, pp. 26-35. 2021.

AlMousa, Mohannad, Rachid Benlamri, and Richard Khoury. "A novel word sense disambiguation approach using WordNet knowledge graph." Computer Speech & Language 74 (2022): 101337.

Rahman, Nazreena, and Bhogeswar Borah. "An unsupervised method for word sense disambiguation." Journal of King Saud University-Computer and Information Sciences 34, no. 9 (2022): 6643-6651.

Su, Ying, Hongming Zhang, Yangqiu Song, and Tong Zhang. "Multilingual word sense disambiguation with unified sense representation." arXiv preprint arXiv:2210.07447 (2022).

ILHOMOVNA, ELOV BOTIR BOLTAEVICH1&AKHMEDOVA HOLISKHON. "HOMONYMY DETECTION USING A NAÏVE BAYES CLASSIFIER.", Journal of Computer Science Engineering and Information Technology Research (JCSEITR) ISSN(P): 2250-2416; ISSN(E): Applied Vol. 13, Issue 1, Jun 2023, 5–20.

Hall Maudslay, Rowan, and Simone Teufel. “Homonymy Information for English WordNet.” arXiv e-prints (2022): arXiv-2212.

Wordnet- https://wordnet.princeton.edu/