Hybrid Model for Assamese Document Classification using Doc2vec for feature extraction

Chayanika Talukdar

doi:10.52783/jes.9104

PDF

Published: Apr 28, 2025

DOI: https://doi.org/10.52783/jes.9104

Keywords:

CNN, LSTM, Doc2vec, Assamese, SVM, LR, Hybrid.

Chayanika Talukdar, Shikhar Kumar Sarma

Abstract

Document level categorization is challenging for texts with a huge number of words, often indicating contradicting categories. This research is particularly useful for vast amount of unorganized digitized text, produced as a side effect of the exponential growth of internet. Many text classification studies have been carried out using various machine learning and deep learning techniques, however, mainly for short text. In this study, we will categorize Assamese documents, a subject that has mostly gone unexplored until now. Here, we propose a hybrid model that combines the advantages of two most popular deep learning models- the CNN and LSTM. Also, Doc2vec has been used to convert documents into numeric vectors of 3 dimensions- 100, 128 and 300. When evaluated on the prepared data set of 780 Assamese documents, the model was found to have worked effectively with an accuracy of 96.5% and an F1-score of 96%, for the vectors with dimension value of 300.

Issue

Vol. 21 No. 01 (2025)

Section

Articles

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.

Announcement

Call for Papers for the New Issue.
Last Date of Submission: April 30^th, 2026

Indexing

Call for Papers & Reviewers

The Journal of Electrical Systems (JES) is inviting researchers, scholars, and experts in the field of electrical systems to submit their original and unpublished research papers for consideration in our upcoming issues. We welcome high-quality contributions that address innovative ideas, advancements, and challenges in electrical systems and related areas.

Submission Deadline: March 31^st, 2025

Topics of Interest Include, but are not Limited to:

• Power Systems and Smart Grids
• Renewable Energy
• Control Systems
• Electronics and Communication
• Signal Processing
• Artificial Intelligence in Electrical Engineering
• Internet of Things (IoT) in Electrical Systems
• Electric Vehicles and Transportation
• Robotics and Automation

Authors are requested to submit their manuscripts electronically through our online submission system by the specified deadline.

Submission Guidelines:

Manuscripts should be prepared according to the JES guidelines available on our website.
All submissions will undergo a rigorous peer-review process.
Manuscripts must be original, not previously published or under consideration elsewhere.

Call for Reviewers:
JES is also seeking qualified and experienced individuals to join our esteemed panel of reviewers. If you are interested in contributing your expertise to ensure the quality of the papers published in JES, kindly submit your resume to editor@esrgroups.org. Reviewers play a crucial role in maintaining the high standards of our journal.

We look forward to receiving your valuable contributions and appreciate your interest in the Journal of Electrical Systems.

Important Links

Home

Aims and Scope

Instructions for Authors

Editorial Board

Downloads

Download Paper Template

Article Sidebar

Main Article Content

Abstract

Article Details