
Data Science Hindi

The document discusses data science concepts such as data, data mining, and machine learning. It explains how data has evolved from small and structured to huge and unstructured. It describes the data mining life cycle and tasks such as anomaly detection and association rule mining. It also differentiates between supervised and unsupervised learning, with classification and regression as examples of supervised learning and clustering as an example of unsupervised learning.

Data Science

Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
What is Data?


Data is just a collection of facts

Examples: "This is Sparta", 23, 8000

Data Back Then

Small

Structured

Data Now

Huge

Unstructured

Data Mining Use Case

Data Mining Life Cycle

Data Acquisition → Data Pre-processing → Machine Learning Algorithm → Pattern Evaluation → Knowledge Representation

Data Acquisition

Data from Multiple Sources → Data Storage → Target Data

Data Pre-processing

Data Manipulation, Data Visualization
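
As a rough illustration of what data manipulation can involve, the sketch below fills a missing value with the column mean and then rescales the column to the range [0, 1]. These two steps, the helper names `fill_missing` and `min_max_scale`, and the `ages` column are assumptions chosen for illustration; the slides do not name specific manipulation steps.

```python
from statistics import mean


def fill_missing(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def min_max_scale(values):
    """Rescale values linearly so the smallest is 0 and the largest is 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no spread; map everything to 0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


ages = [20, None, 40, 60]      # hypothetical column with one missing entry
cleaned = fill_missing(ages)   # the missing entry becomes the mean of 20, 40, 60
scaled = min_max_scale(cleaned)
print(cleaned, scaled)
```

Mean imputation is only one option; dropping incomplete rows or using the median are common alternatives, depending on the data.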

Machine Learning

Application of intelligent algorithms to find meaningful information from the data

Classification, Regression, Clustering

Pattern Evaluation

Once Data Mining techniques have been applied, the results have to be evaluated
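
One common way to evaluate results, when the task is a yes/no prediction, is to compare predicted labels against known labels. The sketch below computes accuracy, precision, and recall; the function name `evaluate` and both label lists are hypothetical, and the slides do not prescribe these particular metrics.

```python
def evaluate(actual, predicted, positive="yes"):
    """Compare predicted labels with actual labels for a two-class problem."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == positive and p == positive)
    fp = sum(1 for a, p in pairs if a != positive and p == positive)
    fn = sum(1 for a, p in pairs if a == positive and p != positive)
    correct = sum(1 for a, p in pairs if a == p)
    return {
        "accuracy": correct / len(pairs),
        # Guard against division by zero when nothing was predicted/actual positive.
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Hypothetical labels, made up for illustration only.
actual = ["yes", "yes", "no", "no", "yes", "no", "no", "no"]
predicted = ["yes", "no", "no", "no", "yes", "yes", "no", "no"]
metrics = evaluate(actual, predicted)
print(metrics)
```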

Knowledge Representation

The identified patterns must be represented using simple, aesthetic graphs

Data Mining Tasks (Anomaly Detection)

Identifying unusual patterns and outliers helps in understanding the variation in the data
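
A minimal sketch of one simple outlier check, assuming a z-score rule: a value is flagged when it lies more than a chosen number of standard deviations from the mean. The threshold of 2.0, the function name `z_score_outliers`, and the `readings` list are assumptions for illustration; the slides do not specify a detection method.

```python
from statistics import mean, pstdev


def z_score_outliers(values, threshold=2.0):
    """Return the values whose absolute z-score exceeds the threshold."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation
    if sigma == 0:
        return []           # all values identical, nothing stands out
    return [v for v in values if abs(v - mu) / sigma > threshold]


readings = [10, 12, 11, 13, 12, 11, 95]  # hypothetical data with one extreme value
outliers = z_score_outliers(readings)
print(outliers)
```

The z-score rule is sensitive to the very outliers it is trying to find, since they inflate both the mean and the standard deviation; rules based on the median or the interquartile range are common, more robust alternatives.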

Data Mining Tasks (Association Rule Mining)

This method is used for finding interesting associations among different entities

Beer Diaper Syndrome
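
To make the beer and diaper example concrete, the sketch below computes the three standard measures for a rule "diapers → beer": support, confidence, and lift. The five baskets are invented for illustration and are not real sales data; the function names are likewise hypothetical.

```python
# Hypothetical shopping baskets, made up for illustration only.
baskets = [
    {"diapers", "beer", "milk"},
    {"diapers", "beer"},
    {"diapers", "bread"},
    {"beer", "chips"},
    {"milk", "bread"},
]


def support(itemset, baskets):
    """Fraction of baskets that contain every item in the itemset."""
    return sum(1 for b in baskets if itemset <= b) / len(baskets)


def confidence(antecedent, consequent, baskets):
    """Of the baskets containing the antecedent, the fraction that also contain the consequent."""
    base = support(antecedent, baskets)
    if base == 0:
        return 0.0
    return support(antecedent | consequent, baskets) / base


def lift(antecedent, consequent, baskets):
    """Confidence relative to how often the consequent appears overall."""
    base = support(consequent, baskets)
    if base == 0:
        return 0.0
    return confidence(antecedent, consequent, baskets) / base


rule_support = support({"diapers", "beer"}, baskets)
rule_confidence = confidence({"diapers"}, {"beer"}, baskets)
rule_lift = lift({"diapers"}, {"beer"}, baskets)
print(rule_support, rule_confidence, rule_lift)
```

A lift above 1 suggests the two items appear together more often than they would if they were bought independently.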

What is Machine Learning?


Fish

Categories of Machine Learning

Supervised Learning, Unsupervised Learning

Supervised Learning

Input Variable X

Y = f(X)

Output Variable Y

Categories of Supervised Learning

Supervised Learning

Regression, Classification

Classification

Classification is the process of predicting the class of a new observation

Smoke (Yes/No) → Cancer (Yes/No)
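
A minimal sketch of classification on the smoke/cancer example, assuming the simplest possible learner: for each value of the input, predict the class seen most often with it in the training data. The training rows are invented purely to show the mechanics and say nothing about real-world rates; `fit_one_rule` is a hypothetical name.

```python
from collections import Counter, defaultdict


def fit_one_rule(rows):
    """Learn, for each input value, the most common class observed with it."""
    counts = defaultdict(Counter)
    for feature, label in rows:
        counts[feature][label] += 1
    return {feature: c.most_common(1)[0][0] for feature, c in counts.items()}


# (smoke, cancer) pairs; hypothetical toy data, not real statistics.
training = [
    ("yes", "yes"),
    ("yes", "yes"),
    ("yes", "no"),
    ("no", "no"),
    ("no", "no"),
    ("no", "yes"),
]

model = fit_one_rule(training)
prediction = model["yes"]  # predicted class for a new observation with smoke = "yes"
print(model, prediction)
```

Here the learned mapping plays the role of f in Y = f(X): the input X is the smoke value and the output Y is the predicted class.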

Regression

This method is used to estimate the relationship between a dependent variable and one or more independent variables

Dependent Variable: Y

Independent Variable: X

Y = f(X)
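
As one concrete instance of Y = f(X), the sketch below assumes f is a straight line, Y = intercept + slope · X, and estimates both coefficients by ordinary least squares. The choice of a linear f, the function name `fit_line`, and the data points are assumptions for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx  # assumes the x values are not all identical
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Hypothetical points that lie exactly on y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

slope, intercept = fit_line(xs, ys)
predicted = intercept + slope * 6  # estimate Y for a new X
print(slope, intercept, predicted)
```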

Unsupervised Learning (Clustering)

Input data with no class labels
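
A minimal sketch of clustering unlabeled data, assuming k-means (Lloyd's algorithm) on one-dimensional points with fixed starting centers. The slides do not name a clustering algorithm, so k-means, the function name `k_means_1d`, the starting centers, and the points are all assumptions for illustration.

```python
def k_means_1d(points, centers, iterations=10):
    """Lloyd's algorithm on 1-D points, starting from the given centers."""
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster
        # (an empty cluster keeps its previous center).
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters


points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]  # hypothetical unlabeled data
centers, clusters = k_means_1d(points, centers=[1.0, 12.0])
print(centers, clusters)
```

No class labels are supplied anywhere; the two groups emerge only from the distances between points.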

