Algorithm Selection
Algorithm Selection
COMPARISON OF ML ALGORITHMS
VARIOUS ML ALGORITHMS
No need to tweak
parameters
Fast to train
BUSINESS ANALYTICS
2
extreme multiclass
Recommender problems (like tagging Can fail to predict correctly due
systems text) to the curse of dimensionality
Spell checking
problems
Naive Bayes Face recognition Easy and fast to Strong and unrealistic feature
implement, doesn’t independence assumptions
Sentiment analysis require too much memory
and can be used for online Fails estimating rare
Spam detection learning occurrences
Fast to train
BUSINESS ANALYTICS
3
Bayes theorem provides a way of calculating the posterior probability, P(c|x), from P(c), P(x), and P(x|c). Naive Bayes
classifier assume that the effect of the value of a predictor (x) on a given class (c) is independent of the values of
other predictors. This assumption is called class conditional independence.
The posterior probability can be calculated by first, constructing a frequency table for each attribute against the
target. Then, transforming the frequency tables to likelihood tables and finally use the Naive Bayesian equation to
calculate the posterior probability for each class. The class with the highest posterior probability is the outcome of
prediction.
BUSINESS ANALYTICS
4
Joint posterior probability for 8 independent variables is a logical extension of two variables, by taking product of
all 8 conditional probabilities. Since python library or any statistical software would do the math, only
demonstration of logic and a sample calculation is adequate.
Learners are expected to build tables of likelihood for 8 variables. (Depending on time available they may do
so only for one or two variables.)
True positives (TP): These are cases in which we predicted yes (they have the disease),
and they do have the disease.
True negatives (TN): We predicted no, and they don't have the disease.
False positives (FP): We predicted yes, but they don't actually have the disease. (Also
known as a "Type I error.")
False negatives (FN): We predicted no, but they actually do have the disease. (Also
known as a "Type II error.")
BUSINESS ANALYTICS
5
Accuracy is not a single number but depending on precise question being answered, can be from amongst :-
The algorithm is best explained using visualization and basic concepts in coordinate geometry.
BUSINESS ANALYTICS
6
INTERPRETATION OF WE IGHTS
Magnitude of weight after due normalization of variables, helps in deciding feature importance. The interpretation
of weights can be explained using OLS (Ordinary Least Squares) regression as an analogy.
BUSINESS ANALYTICS
7
https://charlienewey.github.io/manually-calculating-an-svms-support-vectors/
While exact values of results will vary between Naïve Bayes and SVM, the interpretation of testing accuracy is
same. Previous section on Naïve Bayes accuracy refers.
BUSINESS ANALYTICS
8
HEALTHCARE APPLICATIONS
USING ML ALGORITHMS
The value of machine learning in healthcare is its ability to process huge datasets beyond the scope of human
capability, and then reliably convert analysis of that data into clinical insights that aid physicians in planning and
providing care, ultimately leading to better outcomes, lower costs of care, and increased patient satisfaction.
It has been estimated that big data and machine learning in pharma and medicine could generate a value of up to
$100B annually, based on better decision-making, improved efficiency of research/clinical trials, and new tool
creation for physicians, consumers, insurers, and regulators.
a. Disease identification / diagnosis - Creation of a platform to analyze data, and loop it back in real time to
physicians to aid in clinical decision making is CDSS. A physician sees a patient and enters symptoms, data,
and test results into the EMR, there’s machine learning behind the scenes looking at everything about that
patient, and prompting the doctor with useful information for making a diagnosis, ordering a test, or
suggesting a preventive screening. In the long term, we will be able to incorporate bigger sets of data that
can be analyzed in real time to provide all kinds of information to the provider and patient.
b. Show causal relationships in disease prognosis and help in predictions.
c. Patient risk profile – depending on various signs and symptoms and lifestyle factors.
d. Gather public health data and predict epidemic outbreaks.
e. Reduce 1-year mortality - Health systems can reduce 1-year mortality rates by predicting the likelihood of
death within one year of discharge and then match patients with appropriate interventions, care providers,
and support.
BUSINESS ANALYTICS
9
c. Reduce hospital Length-of-Stay (LOS). Health systems can reduce LOS and improve other outcomes like
patient satisfaction by identifying patients that are likely to have an increased LOS and then ensure that
best practices are followed.
d. Predict propensity-to-pay - Health systems can determine who needs reminders, who needs financial
assistance, and how the likelihood of payment changes over time and after particular events.
e. Predict no-shows - Health systems can create accurate predictive models to assess, with each scheduled
appointment, the risk of a no-show, ultimately improving patient care and the efficient use of resources.
BUSINESS ANALYTICS