A Comparative Analysis of Machine Learning Models in News Categorization

Mohammad Hossein Zolfagharnasab; Siavash Damari

doi:10.24840/2183-6493_0010-003_002464

Published: Jul 10, 2024

Issue: Vol. 10 No. 3 (2024)

Keywords:

Real-time Content Classification, News Categorization, Natural Language Processing, Machine Learning

Mohammad Hossein Zolfagharnasab

University of Porto, Faculty of Engineering.

https://orcid.org/0000-0001-6124-7507

Siavash Damari

University of Allameh Tabataba'i, Department of Statistics, Mathematics, and Computer Science.

https://orcid.org/0009-0002-8486-9549

Abstract

The constant stream of news today calls for careful assessment so that information reaches its intended audience accurately and with the least delay. Despite the flexibility and efficiency of Deep Learning (DL) models, their intricate training and substantial resource demands pose significant challenges for deployment in real-time applications. This study therefore evaluates four resource-efficient Machine Learning (ML) techniques for news categorization: Multinomial Naive Bayes (MNB), Random Forest (RF), Support Vector Machine (SVM), and Logistic Regression (LR). All the evaluated models attain a commendable level of accuracy. The SVM performs best, achieving an accuracy of 98% and a mean squared error of 0.28. These results show that classical ML models can be effective for news categorization, particularly when paired with a suitably tailored preprocessing pipeline.
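As a rough illustration of the kind of workflow the abstract describes, the sketch below runs a small preprocessing step, trains a Multinomial Naive Bayes classifier with Laplace smoothing, and reports accuracy and mean squared error. It is not the authors' implementation: the toy corpus, the stopword list, the `preprocess` and `evaluate` helpers, and the choice to compute the mean squared error over integer-encoded category labels are all assumptions made for this example.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical toy corpus; the dataset used in the study is not reproduced here.
TRAIN = [
    ("The team won the championship match last night", "sport"),
    ("The striker scored two goals in the final game", "sport"),
    ("Stocks fell as the central bank raised interest rates", "business"),
    ("The company reported strong quarterly profit and revenue", "business"),
    ("The new phone ships with a faster processor and camera", "tech"),
    ("Engineers released a software update for the operating system", "tech"),
]
TEST = [
    ("The goalkeeper saved a penalty in the match", "sport"),
    ("The bank cut rates and stocks rallied", "business"),
    ("The software update improves the camera", "tech"),
]
# Assumed stopword list, kept deliberately short for the example.
STOPWORDS = {"the", "a", "an", "and", "as", "in", "for", "with", "of", "to"}


def preprocess(text):
    """Lowercase, keep alphabetic tokens only, and drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


class MultinomialNB:
    """Multinomial Naive Bayes over token counts with Laplace smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, tokens):
        n_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label in sorted(self.class_counts):
            counts = self.word_counts[label]
            total = sum(counts.values()) + self.alpha * len(self.vocab)
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(self.class_counts[label] / n_docs)
            for token in tokens:
                if token in self.vocab:  # tokens unseen in training are skipped
                    score += math.log((counts[token] + self.alpha) / total)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def evaluate(model, samples, label_ids):
    """Return (accuracy, MSE), with MSE taken over integer-encoded labels."""
    preds = [model.predict(preprocess(text)) for text, _ in samples]
    truth = [label for _, label in samples]
    n = len(truth)
    accuracy = sum(p == t for p, t in zip(preds, truth)) / n
    mse = sum((label_ids[p] - label_ids[t]) ** 2 for p, t in zip(preds, truth)) / n
    return accuracy, mse


model = MultinomialNB().fit(
    [preprocess(text) for text, _ in TRAIN], [label for _, label in TRAIN]
)
# Assumed encoding: categories sorted alphabetically and numbered from 0.
label_ids = {label: i for i, label in enumerate(sorted(model.class_counts))}
accuracy, mse = evaluate(model, TEST, label_ids)
print(f"accuracy={accuracy:.2f} mse={mse:.2f}")
```

Note that a mean squared error over encoded labels depends on the arbitrary order in which categories are numbered, so it is best read alongside accuracy rather than on its own. The same `evaluate` helper could be reused to compare other classifiers, provided they expose a matching `predict` method.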

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish with this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors grant the journal the rights to provide the article in all forms and media so the article can be used on the latest technology even after publication and ensure its long-term preservation.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

Author Biographies

Mohammad Hossein Zolfagharnasab, University of Porto, Faculty of Engineering.

PhD student, Department of Electrical and Computer Engineering,

Faculty of Engineering, University of Porto,

Rua Dr. Roberto Frias, 4200-465 PORTO, Portugal

Siavash Damari, University of Allameh Tabataba'i, Department of Statistics, Mathematics, and Computer Science.

Master student,

Department of Statistics, Mathematics, and Computer Science,

University of Allameh Tabataba'i,

Western Azadi Stadium Blvd, Tehran, Iran
