0% found this document useful (0 votes)

25 views6 pages

Data Mining Algorithms

Data

Uploaded by

savvy_as_98-1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views6 pages

Data Mining Algorithms

Data

Uploaded by

savvy_as_98-1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Data Mining Algorithms (Analysis Services - Data Mining)

Page 1 of 6

Data Mining Algorithms (Analysis Services

- Data Mining)
SQL Server 2016

A data mining algorithm is a set of heuristics and calculations that creates a data mining model from data. To create
a model, the algorithm first analyzes the data you provide, looking for specific types of patterns or trends. The
algorithm uses the results of this analysis to define the optimal parameters for creating the mining model. These
parameters are then applied across the entire data set to extract actionable patterns and detailed statistics.
The mining model that an algorithm creates from your data can take various forms, including:
A set of clusters that describe how the cases in a dataset are related.

A decision tree that predicts an outcome, and describes how different criteria affect that outcome.
A mathematical model that forecasts sales.

A set of rules that describe how products are grouped together in a transaction, and the probabilities that
products are purchased together.
MicrosoftSQL ServerAnalysis Services provides multiple algorithms for use in your data mining solutions. These
algorithms are implementations of some of the most popular methodologies used in data mining. All of the
Microsoft data mining algorithms can be customized and are fully programmable using the provided APIs, or by
using the data mining components in SQL Server Integration Services.

You can also use third-party algorithms that comply with the OLEDB for Data Mining specification, or develop
custom algorithms that can be registered as services and then used within the SQL Server Data Mining framework.

Choosing the Right Algorithm

Choosing the best algorithm to use for a specific analytical task can be a challenge. While you can use different
algorithms to perform the same business task, each algorithm produces a different result, and some algorithms
can produce more than one type of result. For example, you can use the Microsoft Decision Trees algorithm not
only for prediction, but also as a way to reduce the number of columns in a dataset, because the decision tree can
identify columns that do not affect the final mining model.

Choosing an Algorithm by Type

Analysis Services includes the following algorithm types:

https://msdn.microsoft.com/en-us/library/ms175595(d=printer).aspx

4/2/2016

Data Mining Algorithms (Analysis Services - Data Mining)

Page 2 of 6

Classification algorithms predict one or more discrete variables, based on the other attributes in the
dataset.

Regression algorithms predict one or more continuous variables, such as profit or loss, based on other
attributes in the dataset.
Segmentation algorithms divide data into groups, or clusters, of items that have similar properties.

Association algorithms find correlations between different attributes in a dataset. The most common
application of this kind of algorithm is for creating association rules, which can be used in a market
basket analysis.

Sequence analysis algorithms summarize frequent sequences or episodes in data, such as a Web path
flow.
However, there is no reason that you should be limited to one algorithm in your solutions. Experienced analysts
will sometimes use one algorithm to determine the most effective inputs (that is, variables), and then apply a
different algorithm to predict a specific outcome based on that data. SQL Server data mining lets you build
multiple models on a single mining structure, so within a single data mining solution you might use a clustering
algorithm, a decision trees model, and a nave Bayes model to get different views on your data. You might also
use multiple algorithms within a single solution to perform separate tasks: for example, you could use
regression to obtain financial forecasts, and use a neural network algorithm to perform an analysis of factors
that influence sales.

Choosing an Algorithm by Task

To help you select an algorithm for use with a specific task, the following table provides suggestions for the
types of tasks for which each algorithm is traditionally used.

Examples of tasks
Predicting a discrete attribute
Flag the customers in a prospective buyers list as good or poor
prospects.
Calculate the probability that a server will fail within the next 6
months.
Categorize patient outcomes and explore related factors.

Predicting a continuous attribute

Forecast next year's sales.

Microsoft algorithms to
use
Microsoft Decision Trees
Algorithm
Microsoft Naive Bayes
Algorithm
Microsoft Clustering
Algorithm

Microsoft Neural Network

Algorithm
Microsoft Decision Trees
Algorithm

https://msdn.microsoft.com/en-us/library/ms175595(d=printer).aspx

4/2/2016

Data Mining Algorithms (Analysis Services - Data Mining)

Predict site visitors given past historical and seasonal trends.
Generate a risk score given demographics.

Predicting a sequence
Perform clickstream analysis of a company's Web site.
Analyze the factors leading to server failure.
Capture and analyze sequences of activities during outpatient visits, to
formulate best practices around common activities.
Finding groups of common items in transactions
Use market basket analysis to determine product placement.
Suggest additional products to a customer for purchase.
Analyze survey data from visitors to an event, to find which activities
or booths were correlated, to plan future activities.
Finding groups of similar items
Create patient risk profiles groups based on attributes such as
demographics and behaviors.
Analyze users by browsing and buying patterns.
Identify servers that have similar usage characteristics.

Page 3 of 6
Microsoft Time Series
Algorithm

Microsoft Linear Regression

Algorithm
Microsoft Sequence
Clustering Algorithm

Microsoft Association
Algorithm

Microsoft Decision Trees

Algorithm

Microsoft Clustering
Algorithm

Microsoft Sequence
Clustering Algorithm

Microsoft Decision Trees Algorithm

https://msdn.microsoft.com/en-us/library/ms175595(d=printer).aspx

4/2/2016

Data Mining Algorithms (Analysis Services - Data Mining)

Page 4 of 6

Microsoft Linear Regression Algorithm

Microsoft Logistic Regression Algorithm

Microsoft Naive Bayes Algorithm

Microsoft Neural Network Algorithm

Microsoft Sequence Clustering Algorithm

Microsoft Time Series Algorithm
Technical
reference

Provides technical detail about the implementation of the algorithm, with academic
references as necessary. Lists the parameters that you can set to control the behavior of
the algorithm and customize the results in the model. Describes data requirements and
provides performance tips if possible.
Microsoft Association Algorithm Technical Reference
Microsoft Clustering Algorithm Technical Reference

Microsoft Decision Trees Algorithm Technical Reference

Microsoft Linear Regression Algorithm Technical Reference

Microsoft Logistic Regression Algorithm Technical Reference

Microsoft Naive Bayes Algorithm Technical Reference

Microsoft Neural Network Algorithm Technical Reference

Microsoft Sequence Clustering Algorithm Technical Reference

Microsoft Time Series Algorithm Technical Reference
Model content

Explains how information is structured within each type of data mining model, and
explains how to interpret the information stored in each of the nodes.
Mining Model Content for Association Models (Analysis Services - Data Mining)
Mining Model Content for Clustering Models (Analysis Services - Data Mining)

Mining Model Content for Decision Tree Models (Analysis Services - Data Mining)

Mining Model Content for Linear Regression Models (Analysis Services - Data Mining)

Mining Model Content for Logistic Regression Models (Analysis Services - Data Mining)
Mining Model Content for Naive Bayes Models (Analysis Services - Data Mining)

Mining Model Content for Neural Network Models (Analysis Services - Data Mining)

Mining Model Content for Sequence Clustering Models (Analysis Services - Data Mining)

https://msdn.microsoft.com/en-us/library/ms175595(d=printer).aspx

4/2/2016

Data Mining Algorithms (Analysis Services - Data Mining)

Page 5 of 6

Mining Model Content for Time Series Models (Analysis Services - Data Mining)
Data mining
queries

Provides multiple queries that you can use with each model type. Examples include
content queries that let you learn more about the patterns in the model, and prediction
queries to help you build predictions based on those patterns.
Association Model Query Examples
Clustering Model Query Examples

Decision Trees Model Query Examples

Linear Regression Model Query Examples

Logistic Regression Model Query Examples

Naive Bayes Model Query Examples

Neural Network Model Query Examples

Sequence Clustering Model Query Examples

Time Series Model Query Examples

Determine the algorithm used by a data mining model

Query the Parameters Used to Create a Mining

Model

Create a Custom Plug-In Algorithm

Plugin Algorithms

Explore a model using an algorithm-specific viewer

Data Mining Model Viewers

View the content of a model using a generic table format

Browse a Model Using the Microsoft Generic

Content Tree Viewer

Learn about how to set up your data and use algorithms

to create models

Mining Structures (Analysis Services - Data Mining)

Mining Models (Analysis Services - Data Mining)

https://msdn.microsoft.com/en-us/library/ms175595(d=printer).aspx

4/2/2016

Data Mining Algorithms (Analysis Services - Data Mining)

Page 6 of 6

Data Mining Tools

Community Additions
2016 Microsoft

https://msdn.microsoft.com/en-us/library/ms175595(d=printer).aspx

4/2/2016

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
4/5 (6436)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (642)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brene Brown
4/5 (1174)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
4.5/5 (997)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1854)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4102)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (903)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (628)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
4/5 (1018)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
4.5/5 (361)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (297)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
4.5/5 (581)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1138)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
4.5/5 (279)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2289)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
4/5 (4360)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
4/5 (1090)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
4.5/5 (2033)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2876)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
Analysis of German Credit Data
100% (1)
Analysis of German Credit Data
24 pages
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (835)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (919)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)
Sleep Telemedicine
No ratings yet
Sleep Telemedicine
11 pages
Weight Gain and Cpap
No ratings yet
Weight Gain and Cpap
3 pages
Problem Solving Model
100% (1)
Problem Solving Model
6 pages
The Future of The CMIO: Opportunities and Challenges
No ratings yet
The Future of The CMIO: Opportunities and Challenges
5 pages
CPOE Clinical Workflow Analysis PDF
No ratings yet
CPOE Clinical Workflow Analysis PDF
59 pages
Bicep Curl To Shoulder Press
No ratings yet
Bicep Curl To Shoulder Press
5 pages
Week 5 UMLSlides
No ratings yet
Week 5 UMLSlides
8 pages
Data Mining Algorithms
No ratings yet
Data Mining Algorithms
6 pages
HIN-706 Systems Analysis and Design Syllabus 20150105
100% (1)
HIN-706 Systems Analysis and Design Syllabus 20150105
5 pages
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (815)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)

Data Mining Algorithms

Uploaded by

Data Mining Algorithms

Uploaded by

Data Mining Algorithms (Analysis Services - Data Mining)

Data Mining Algorithms (Analysis Services

Choosing the Right Algorithm

Choosing an Algorithm by Type

Data Mining Algorithms (Analysis Services - Data Mining)

Choosing an Algorithm by Task

Predicting a continuous attribute

Microsoft Neural Network

Data Mining Algorithms (Analysis Services - Data Mining)

Microsoft Linear Regression

Microsoft Decision Trees

Microsoft Decision Trees Algorithm

Data Mining Algorithms (Analysis Services - Data Mining)

Microsoft Linear Regression Algorithm

Microsoft Logistic Regression Algorithm

Microsoft Neural Network Algorithm

Microsoft Sequence Clustering Algorithm

Microsoft Decision Trees Algorithm Technical Reference

Microsoft Linear Regression Algorithm Technical Reference

Microsoft Logistic Regression Algorithm Technical Reference

Microsoft Neural Network Algorithm Technical Reference

Microsoft Sequence Clustering Algorithm Technical Reference

Data Mining Algorithms (Analysis Services - Data Mining)

Decision Trees Model Query Examples

Linear Regression Model Query Examples

Logistic Regression Model Query Examples

Neural Network Model Query Examples

Sequence Clustering Model Query Examples

Determine the algorithm used by a data mining model

Query the Parameters Used to Create a Mining

Create a Custom Plug-In Algorithm

Explore a model using an algorithm-specific viewer

Data Mining Model Viewers

View the content of a model using a generic table format

Browse a Model Using the Microsoft Generic

Learn about how to set up your data and use algorithms

Mining Structures (Analysis Services - Data Mining)

Data Mining Algorithms (Analysis Services - Data Mining)

Data Mining Tools

You might also like