_unit2 DATA SCIENCE
Phase 1: Discovery
● The team learns which data sources are needed and available for the
project.
● The team formulates the initial hypothesis that can be later tested
with data.
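As a rough illustration of how an initial hypothesis can later be tested with data, the sketch below runs a simple permutation test on the difference between two group means. The data values and the names permutation_test, with_discount and without_discount are hypothetical, chosen only for this example; a real project would pick a test suited to its own data.

```python
import random
import statistics


def permutation_test(group_a, group_b, n_permutations=5000, seed=0):
    """Estimate a two-sided p-value for the difference in group means.

    Null hypothesis: both groups come from the same distribution, so
    reshuffling the group labels should not change the difference much.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    observed = abs(statistics.mean(group_a) - statistics.mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)  # randomly reassign observations to groups
        diff = abs(statistics.mean(pooled[:n_a]) - statistics.mean(pooled[n_a:]))
        if diff >= observed:
            extreme += 1
    return extreme / n_permutations


# Hypothetical data: order values for customers with and without a discount.
with_discount = [23.0, 25.5, 27.1, 30.2, 28.4, 26.9]
without_discount = [20.1, 22.3, 21.7, 24.0, 19.8, 23.5]

p_value = permutation_test(with_discount, without_discount)
print(f"estimated p-value: {p_value:.4f}")
```

A small p-value suggests the observed difference would be unlikely if the hypothesis of "no difference" were true; it does not by itself prove the initial hypothesis.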
and analysis.
● Several tools commonly used for this phase are – Hadoop, Alpine
suitable models.
● In this phase, the data science team develops data sets for
● Team builds and executes models based on the work done in the
● Several tools commonly used for this phase are – MATLAB and
STATISTICA.
purposes.
● Team also considers whether its existing tools will suffice for
executing models.
● Free or open-source tools – R and PL/R, Octave, WEKA.
warning, assumptions.
stakeholders.
Phase 6: Operationalize
Use of Machine Learning | Data Science makes use of machine learning algorithms to get insights. | Data Analytics does not use machine learning to get insight into the data.
Other Skills | Data Science makes use of data mining activities for getting meaningful insights. | Hadoop-based analysis is used for getting conclusions from raw data.
Scope | The scope of data science is large. | The scope of data analysis is micro, i.e., small.
Data Type | Data Science mostly deals with unstructured data. | Data Analytics deals with structured data.
Statistical Skills | Statistical skills are necessary in the field of Data Science. | Statistical skills are of minimal or no use in data analytics.
❖ Data Mining:
This step involves gathering data and information from diverse sources
analysis.
mining process.
relational databases.
future trends.
specialized tools like R, are commonly used for statistical analysis and
graphical modeling.
probability.
in any organization.