Ensemble based high performance deep learning models for fake news detection

Mohammed E Almandouh; Mohammed F Alrahmawy; Mohamed Eisa; Mohamed Elhoseny; A S Tolba

doi:10.1038/s41598-024-76286-0

Ensemble based high performance deep learning models for fake news detection

Sci Rep. 2024 Nov 4;14(1):26591. doi: 10.1038/s41598-024-76286-0.

Authors

Mohammed E Almandouh^{1

2}, Mohammed F Alrahmawy^{3

4

5}, Mohamed Eisa⁶, Mohamed Elhoseny^{3

7}, A S Tolba^{3

8}

Affiliations

¹ Portsaid University, Portsaid, Egypt. mmandouh@himc.psu.edu.eg.
² Mansoura University, Mansoura, Egypt. mmandouh@himc.psu.edu.eg.
³ Mansoura University, Mansoura, Egypt.
⁴ New Mansoura University, New Mansoura, Egypt.
⁵ University of Economics and Human Sciences, Warsaw, Poland.
⁶ Portsaid University, Portsaid, Egypt.
⁷ University of Sharjah, Sharjah, United Arab Emirates.
⁸ New Heliopolis Institute for Engineering & Automotive and Energy Technologies, New Heliopolis, Egypt.

Abstract

Social media has emerged as a dominant platform where individuals freely share opinions and communicate globally. Its role in disseminating news worldwide is significant due to its easy accessibility. However, the increase in the use of these platforms presents severe risks for potentially misleading people. Our research aims to investigate different techniques within machine learning, deep learning, and ensemble learning frameworks in Arabic fake news detection. We integrated FastText word embeddings with various machine learning and deep learning methods. We then leveraged advanced transformer-based models, including BERT, XLNet, and RoBERTa, optimizing their performance through careful hyperparameter tuning. The research methodology involves utilizing two Arabic news article datasets, AFND and ARABICFAKETWEETS datasets, categorized into fake and real subsets and applying comprehensive preprocessing techniques to the text data. Four hybrid deep learning models are presented: CNN-LSTM, RNN-CNN, RNN-LSTM, and Bi-GRU-Bi-LSTM. The Bi-GRU-Bi-LSTM model demonstrated superior performance regarding the F1 score, accuracy, and loss metrics. The precision, recall, F1 score, and accuracy of the hybrid Bi-GRU-Bi-LSTM model on the AFND Dataset are 0.97, 0.97, 0.98, and 0.98, and on the ARABICFAKETWEETS dataset are 0.98, 0.98, 0.99, and 0.99 respectively. The study's primary conclusion is that when spotting fake news in Arabic, the Bi-GRU-Bi-LSTM model outperforms other models by a significant margin. It significantly aids the global fight against false information by setting the stage for future research to expand fake news detection to multiple languages.

Keywords: Deep Learning; Ensemble Learning; Fake News Detection; FastText.