Received 24 June 2024, accepted 20 July 2024, date of publication 29 July 2024, date of current version 27 August 2024.

Digital Object Identifier 10.1109/ACCESS.2024.3435497

Fake News Detection Using Deep Learning: A Systematic Literature Review
MOHAMMAD Q. ALNABHAN AND PAULA BRANCO
Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON K1N 6N5, Canada
Corresponding author: Mohammad Q. Alnabhan (e-mail: [email protected]).

ABSTRACT Nowadays, we witness rapid technological advancements in online communication platforms,
with increasing numbers of people using a vast range of communication solutions. The fast flow of
information and the enormous number of users open the door to the publication of non-truthful news, which
has the potential to reach many people. Disseminating this news through low- or no-cost channels has resulted in
a flood of fake news that is difficult for humans to detect. Social media networks are one of the channels
used to quickly spread this fake news, manipulating it in ways that influence readers in many respects.
This influence was evident during the COVID-19 pandemic and in various political events such as
the recent US presidential elections. Given how this phenomenon impacts society, it is crucial to understand
it well and study mechanisms that allow its timely detection. Deep learning (DL) has proven its potential
for multiple complex tasks in the last few years with outstanding results. In particular, multiple specialized
solutions have been put forward for natural language processing (NLP) tasks. In this paper, we systematically
review existing fake news detection (FND) strategies that use DL techniques. We survey
the existing research articles by investigating the DL algorithms used in the detection process. Our focus
then shifts to the datasets utilized in previous research and the effectiveness of the different DL solutions.
Special attention is given to the application of strategies for transfer learning and dealing with the class
imbalance problem. The effect of these solutions on the detection accuracy is also discussed. Finally, our
survey provides an overview of key challenges that remain unsolved in the context of FND.

INDEX TERMS Classification, deep learning, fake news, misinformation, systematic literature review.

The associate editor coordinating the review of this manuscript and approving it for publication was
Mohamad Afendee Mohamed.

© 2024 The Authors. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
For more information, see https://creativecommons.org/licenses/by-nc-nd/4.0/
VOLUME 12, 2024

I. INTRODUCTION
Due to a greater interest in the use of the internet, the spread of fake news has become more common
than ever before. Before the popularity of social media platforms, fake news was less common and much
more difficult to spread to a vast number of people, as it was achieved either through word of mouth or
through printed media. Fake news can be defined as the phenomenon that occurs when incorrect
information is purposefully spread throughout social media outlets with a significant ability to
convince the reader of the content written [1]. Nowadays, anyone can publish content without
regulation or scrutiny. Several social media platforms, such as Facebook and Twitter, serve as means
for disseminating fake news, as people and influencers utilize them to share their opinions, videos,
and various activities [2], [3].

Fake news greatly increased in 2016 during the period preceding the United States (US) presidential
election [4]. As such, fake news on social media networks has captured the attention of many
researchers. Recently, detecting fake news has become an emerging area of interest for many
researchers, such as [4] and [5]. However, fake news detection is a complicated task requiring the use
of complex models to compare related or unrelated information with known truthful information [6].
Furthermore, fake news is perceived in several ways by researchers, leading to multiple ways of
addressing and solving this issue. Some terms related to misinformation are used interchangeably in
multiple cases. These terms include fake news, rumors, spam, and disinformation, which usually contain
numerical, categorical, textual,
and image contents [7], [8], [9]. Unfortunately, many people have the urge to spread false information
on social media, backed with professionally written, long, and referenced comments that allow the
reader to more easily agree with the misinformation provided (e.g., [10], [11]). Researchers aim to
curb the increased spread of misinformation by detecting the varied manners in which misinformation
can be spread. As such, researchers have resorted to the use of deep learning (DL) algorithms to detect
fake news before it spreads (e.g., [12]). This is accomplished by collecting or creating a dataset
containing both true and false information within articles. A model is then trained on this dataset to
learn patterns that let it predict whether a given article contains true or false information.

There are noticeable gaps in the existing studies on fake news detection that our research highlights.
These include (i) a lack of clear distinction between the definitions of misinformation,
disinformation, and false information; (ii) a lack of DL-based systematic reviews on varying types of
misinformation problems; (iii) a lack of generalizable DL models that achieve an acceptable baseline
detection accuracy on different datasets, which relates to the scarce use of transfer learning in this
context; and (iv) a lack of models that deal with different levels of dataset imbalance in a fake news
detection environment.

As technology progresses, detecting misinformation becomes more complicated and thus more difficult
with standard machine learning (ML) techniques. This motivates our focus on DL techniques for the
problem of fake news detection.

In this systematic literature review (SLR), we investigate existing fake news detection (FND)
strategies that use deep learning. We focus on publicly available datasets used in FND and their NLP
approaches. We aim to gather information about the transfer learning techniques applied and the
methods used for addressing class imbalance, to examine their effect on detection accuracy. Our survey
aims to identify open issues and research gaps in current studies. To the best of our knowledge, we are
the first to provide a comprehensive SLR that investigates the effects of transfer learning and class
imbalance treatment in the fake news detection domain.

Key Contributions:
The main contributions of this paper are as follows:
• We provide a detailed discussion of the main deep learning-based algorithms used to detect fake news,
including their effectiveness.
• We discuss the main datasets available for fake news detection as well as their respective
characteristics, advantages, and disadvantages.
• We study transfer learning techniques and strategies for dealing with class imbalance in this
application domain. We also investigate their effects on the detection of fake news and the challenges
associated with implementing these strategies.

Paper Organization:
This paper is organized as follows. Section II presents the research methodology, including the search
strategy, research questions, source databases, search query, inclusion and extraction criteria, and
data collection summary. In Section III, we investigate the deep learning (DL) algorithms used for
detecting fake news. Section IV describes the publicly available datasets in the fake news domain and
the associated challenges. Section V discusses transfer learning strategies and open challenges in the
FND context. Section VI analyzes the class imbalance problem in fake news detection. Section VII
provides a summary of the data collected in this SLR and answers to our research questions.
Section VIII addresses the research threats to validity, and Section IX discusses the main gaps and
open issues that still exist in fake news detection. Lastly, Section X concludes our paper.

II. RESEARCH METHODOLOGY
A. SEARCH STRATEGY OVERVIEW
Our SLR is generated based on a set of detailed steps described in [13]. We begin by defining our
research questions, after which we build the keywords for the search query to obtain the relevant
papers for our study. Then, we select the most relevant databases to query and establish the inclusion
and exclusion criteria. Finally, we define the fields to be extracted from the retrieved documents.

B. RESEARCH QUESTIONS
The key focus of our SLR is on understanding how DL techniques have been used to address the FND
problem. We are also interested in how transfer learning (TL) has been applied in this field and how
the class imbalance problem has been tackled.
• RQ1: Which deep learning algorithms have been used for fake news detection throughout time?
• RQ2: Which datasets are used in the fake news detection domain?
• RQ3: How effective are deep learning methods for fake news detection?
• RQ4: Which solutions incorporate transfer learning mechanisms, if any?
• RQ5: Which solutions deal with different levels of imbalanced datasets (if any)?

C. SOURCE DATABASES AND SEARCH QUERY
For the purpose of collecting research articles, we selected four digital databases that are renowned
for their comprehensive coverage and relevance to our field of study. These databases include:
• Google Scholar (we selected the articles that appeared in the first thirteen retrieved pages);
• Association for Computing Machinery (ACM) Digital Library database;
• IEEE Xplore database; and
• Scopus.
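
Querying several databases usually returns the same article more than once. This section does not describe how the retrieved records are merged, so the following Python sketch is only an illustration of one possible approach: it keeps the first record seen for each DOI or normalized title. The `Record` schema, the function names, and the sample entries (including the placeholder DOIs) are assumptions made for this example and are not taken from the paper.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """One retrieved article (hypothetical schema, not taken from the paper)."""
    title: str
    source: str      # database that returned the record
    doi: str = ""    # empty when the database did not report a DOI


def normalize_title(title: str) -> str:
    """Lowercase the title and collapse each non-alphanumeric run to one space."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def merge_records(records):
    """Keep the first record seen for each DOI or normalized title."""
    seen_keys = set()
    merged = []
    for rec in records:
        keys = {"title:" + normalize_title(rec.title)}
        if rec.doi:
            keys.add("doi:" + rec.doi.lower())
        if keys & seen_keys:
            continue  # already retrieved from another database
        seen_keys |= keys
        merged.append(rec)
    return merged


# Made-up sample records; titles and DOIs are placeholders.
records = [
    Record("Example Study on Fake News", "Google Scholar"),
    Record("Example study on fake news.", "Scopus", doi="10.0000/example.1"),
    Record("Another Example Study", "IEEE Xplore", doi="10.0000/example.2"),
    Record("Another Example Study (Extended)", "ACM Digital Library",
           doi="10.0000/EXAMPLE.2"),
]
merged = merge_records(records)
for rec in merged:
    print(rec.source, "-", rec.title)
```

Matching on either key is a deliberate simplification: it catches records that lack a DOI in one database, at the cost of occasionally merging distinct papers that share a title, so a manual check of the merged list would still be advisable.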