Sat - 9.Pdf - Predicting Liver Failure Using Supervised Machine Learning Approach
Sat - 9.Pdf - Predicting Liver Failure Using Supervised Machine Learning Approach
The function of liver is to filter blood that circulates through the body, converting
nutrients and drugs absorbed from the digestive tract into ready-to-use chemicals.
The liver performs many other important functions, such as removing toxins and
other chemical waste products from the blood and readying them for excretion. Liver
failure that begins in the cells of your liver. Nowadays machine learning is applied
to healthcare system where there is a chance of predicting the disease early. The
main necessity of Artificial intelligence is data. The past dataset is collected and that
dataset is used to build a machine learning model. The necessary pre-processing
techniques are applied like univariate analysis and bivariate analysis are
implemented. The data is visualized for better understanding of the features and
based on that a classification model is built by using machine learning algorithm and
comparison of algorithms are done based on their performance metrics like
accuracy, F1 score recall etc.
v
TABLE OF CONTENTS
1 INTRODUCTION
1.1 GENERAL 1
1.2 DOMAIN OVERVIEW 3
1.3 PROBLEM STATEMENT 9
2 LITERATURE SURVEY
2.1 LITERATURE REVIEW 11
2.2 SURVEY WALKTHROUGH 15
2.2.1 NUMPY 15
2.2.2 PANDAS 18
2.2.3 MATPLOTLIB 20
2.2.4 SKLEARN 23
2.3 PROJECT GOALS 25
vi
3.5.2 WORKING PROCESS 33
vii
LIST OF FIGURES
viii
4.18 Confusion Matrix of DT 65
4.19 Classification Report of DT 66
4.20 Confusion Matrix of RF 69
4.21 Classification Report of RF 70
4.22 Confusion Matrix of SVM 73
4.23 Classification Report of SVM 74
ix
CHAPTER 1
INTRODUCTION
1.1 GENERAL
The liver is a large, pyramid-shaped organ that lies behind your ribs on the right side
of your body. It‟s under the right lung. It‟s divided into right and left lobes. The liver
helps break down and store nutrients. These include sugars, starch, fats, and
proteins. It also makes proteins, such as albumin. This helps the body balance
fluids. The liver makes clotting factors, which help blood thicken or clot when a
person is bleeding. Bile made in the liver is important for digesting food and for other
bodily functions.
One of the liver‟s most important jobs is to filter out and destroy toxins in the blood.
When the liver isn‟t working well, chemicals can build up inside the body and cause
damage. Liver cancer is cancer that starts in your liver. It‟s also called primary liver
cancer. Primary liver cancer is not the same as cancer that started somewhere else
in the body and then spread (metastasized) to the liver.
Cancer that starts in another organ, such as the colon, breast, or lung, and then
spreads to the liver is called secondary liver cancer. Secondary liver cancer is far
more common in the U.S. than primary liver cancer. Cancer that has spread to the
liver from somewhere else is treated like the original cancer. For instance, lung
cancer that has spread to the liver is treated like lung cancer.
Fatty liver disease (FLD) has become a rampant condition. It is associated with a
high rate of morbidity and mortality in a population. The condition is commonly
referred as FLD. Early prediction of FLD would allow patients to take necessary
preventive, diagnosis, and treatment. Chronic liver diseases and cirrhosis are the
11th leading cause of death in the world, accounting for 1.1 million deaths annually.
The global prevalence of cirrhosis has been substantially rising from 71 million in
1990 to over 122 million in 2017. Common causes of cirrhosis are chronic hepatitis
B virus (HBV) and hepatitis C virus (HCV) infections, alcohol-related liver disease
and nonalcoholic steatohepatitis (NASH). Over the past decade, there has been a
1
temporal shift in the prevalence of causes of cirrhosis, i.e., the prevalence of NASH
has been dramatically increasing, whereas the prevalence of other causes has been
slowly decreasing. The estimated worldwide prevalence of nonalcoholic fatty liver
disease (NAFLD) is 25% and is projected to be to 33.5% by 2030, emphasizing the
importance of both cirrhosis and NAFLD.
The gold standard for the diagnosis of liver fibrosis and nonalcoholic fatty liver
disease (NAFLD) is liver biopsy. Various noninvasive modalities, e.g.,
ultrasonography, elastography and clinical predictive scores, have been used as
alternatives to liver biopsy, with limited performance. Recently, artificial intelligence
(AI) models have been developed and integrated into noninvasive diagnostic tools
to improve their performance.
2
1.2 Domain Overview
DATA SCIENCE:
The term "data science" has been traced back to 1974, when Peter Naur proposed
it as an alternative name for computer science. In 1996, the International Federation
of Classification Societies became the first conference to specifically feature data
science as a topic.
However, the definition was still in flux. The term “data science” was first coined in
2008 by D.J. Patil, and Jeff Hammerbacher, the pioneer leads of data and
analytics efforts at LinkedIn and Facebook. In less than a decade, it has become
one of the hottest and most trending professions in the market. Data science is the
field of study that combines domain expertise, programming skills, and knowledge
of mathematics and statistics to extract meaningful insights from data.
ARTIFICIAL INTELLIGENCE:
3
term may also be applied to any machine that exhibits traits associated with a human
mind such as learning and problem-solving.
Artificial intelligence (AI) is intelligence demonstrated by machines, as opposed to
the natural intelligence displayed by humans or animals. Leading AI textbooks
define the field as the study of "intelligent agents" any system that perceives its
environment and takes actions that maximize its chance of achieving its goals.
Some popular accounts use the term "artificial intelligence" to describe machines
that mimic "cognitive" functions that humans associate with the human mind, such
as "learning" and "problem solving", however this definition is rejected by major AI
researchers.
Artificial intelligence is the simulation of human intelligence processes by machines,
especially computer systems. Specific applications of AI include expert systems,
natural language processing, speech recognition and machine vision.
The various sub-fields of AI research are centered around particular goals and the
use of particular tools. The traditional goals of AI research
4
include reasoning, knowledge representation, planning, learning, natural language
processing, perception and the ability to move and manipulate objects. General
intelligence (the ability to solve an arbitrary problem) is among the field's long-term
goals. To solve these problems, AI researchers use versions of search and
mathematical optimization, formal logic, artificial neural networks, and methods
based on statistics, probability and economics. AI also draws upon computer
science, psychology, linguistics, philosophy, and many other fields.
The field was founded on the assumption that human intelligence "can be so
precisely described that a machine can be made to simulate it". This raises
philosophical arguments about the mind and the ethics of creating artificial beings
endowed with human-like intelligence. These issues have been explored
by myth, fiction and philosophy since antiquity. Science fiction and futurology have
also suggested that, with its enormous potential and power, AI may become
an existential risk to humanity.
As the hype around AI has accelerated, vendors have been scrambling to promote
how their products and services use AI. Often what they refer to as AI is simply one
component of AI, such as machine learning. AI requires a foundation of specialized
hardware and software for writing and training machine learning algorithms. No one
programming language is synonymous with AI, but a few, including Python, R and
Java, are popular.
5
are called algorithms, provide computing devices with step-by-step instructions for
how to complete a specific task.
AI is important because it can give enterprises insights into their operations that they
may not have been aware of previously and because, in some cases, AI can perform
tasks better than humans. Particularly when it comes to repetitive, detail-oriented
tasks like analyzing large numbers of legal documents to ensure relevant fields are
filled in properly, AI tools often complete jobs quickly and with relatively few errors.
Artificial neural networks and deep learning artificial intelligence technologies are
quickly evolving, primarily because AI processes large amounts of data much faster
and makes predictions more accurately than humanly possible.