Arabic Plagiarism Detection using Text Summarization and Arabic word Embeddings

Main Article Content

Shahed Teko, Khaled Omar

Abstract

Plagiarism detection has become a latest research area in Natural Language Processing field. In today's with the huge available content of Arabic articles on the internet this make the text plagiarism is so easy and Spread widely in academic society, so many algorithm have been developed to decrease this Harmful habit, in this article we have developed an algorithm to detect plagiarism in Arabic text using Arabic text summarization and Arabic text embedding   , that the proposed algorithm contains two main stages for plagiarism detection for the first stage the algorithm utilized T5 model for Arabic summarization , and for the second stage the algorithm used Arabic text embedding technology to convert Arabic text to numeric vectors (which taking into consideration keeping  the syntax and the meaning of sentences( to calculate the similarity score between origin and suspected files, we have tested our proposed algorithm on Arabic dataset which contains origin and suspected files and the accuracy was about 90%.

Article Details

Section
Articles