Detection of Fake and Fraudulent Faces Via Neural Memory Networks
Detection of Fake and Fraudulent Faces Via Neural Memory Networks
1
ABSTRACT
Advances in computer vision have brought us to the point where we have the
ability to synthesise realistic fake content. Such approaches are seen as a source of
disinformation and mistrust, and pose serious concerns to governments around the
world. Convolutional Neural Networks (CNNs) demonstrate encouraging results
when detecting fake images that arise from the specific type of manipulation they
are trained on. However, this success has not transitioned to unseen manipulation
types, resulting in a significant gap in the line-of-defense. We propose a
Hierarchical Attention Memory Network (HAMN), motivated by the social
cognition processes of the human brain, for the detection of fake faces. Through
visual cues and by utilising knowledge stored in neural memories, we allow the
network to reason about the perceived face and anticipate it’s future semantic
embeddings. This renders a generalisable face tampering detection framework.
Experimental results demonstrate the proposed approach achieves superior
performance for fake and fraudulent face detection.
PROBLEM DEFINITION
• There exists a critical need for automated systems which capture the
fundamentals of video and image content tampering, and have the ability
to generalize to different manipulation attacks .
INTRODUCTION
• social media becoming a primary source of information for many people, and
with more than 100 million hours of video content watched daily , fake news
has quickly risen as a significant threat to society and democracy
• It is observed that the underlying neural network can quickly overfit to a
specific artefact left.
• a deep learning framework to detect fake and fraudulent faces in images and
video which is motivated by the social perception and social cognition
processes in the human brain.
• Advances in computer vision have brought us to the point where we have the
ability to synthesise realistic fake content. Such approaches are seen as a source
of disinformation and mistrust, and pose serious concerns to governments
around the world.
EXISTING SYSTEM
• A naive way to design such a system would be to use the entire temporal
structure of the given video from the modeled dataset and sequentially map
the temporal evolution of the facial expressions/attributes. However, this
limits applications of the system to videos only.
• In contrast, a system that operates at a frame-level is more flexible.
Furthermore, we observe that the majority of existing state-ofthe-art
frameworks for recognizing fake faces operate at the frame-level.
• Hence, to allow direct comparisons we design a framework that only
utilizes a single frame as the input to the system. It should be noted that the
proposed system requires image pairs (i.e current and future frames) only
for the training process.
DISADVANTAGE
• The main difference between the functioning of neural networks and the biological
neural network is memory. While both the human brain and neural networks have the
ability to read and write from the memory available, the brain can create/store the
memory as well. Researchers identified that this key difference is the major roadblock
neural network and linking it to external memory. The neural network would act as a
CPU with a memory attached. Such differentiable computers aim to learn programs
• Multimedia contents are generated in many different ways through the use
of consumer electronics and high-quality digital imaging devices, such as
smartphones, digital cameras, tablets, wearable sensors, and other Internet
of Things (IoT) devices. The ever-increasing convenience of image
acquisition has facilitated instant distribution and sharing of digital images
on digital social platforms, determining a great amount of the exchanged
data
SYSTEM REQUIREMENTS
HARDWARE REQUIREMENTS
SOFTWARE REQUIREMENTS