Chapter 1 Data Science Fundamentals
Chapter 1 Data Science Fundamentals
Objectives
Chapter 1 o Understanding fundamental concepts of
Data Science, AI, and Machine learning.
Data Acquisition and Analysis,
Introduction to Data Science Visualization, Statistics, and Predictive
Fundamentals Analysis.
Lecturer. Engr. Hanad Mohamud o Working Tools required for Data Analysis
Mohamed on small and large data sets
The term “Data Science” was created in the early 1960s to describe a new profession
that would support the understanding and interpretation of the large amounts of
data which was being amassed at the time. (At the time, there was no way of
predicting the truly massive amounts of data over the next fifty years.) Data Science
continues to evolve as a discipline using computer science and statistical
methodology to make useful predictions and gain insights in a wide range of fields.
While Data Science is used in areas such as astronomy and medicine, it is also used in
business to help make smarter decisions.
Statistics, and the use of statistical models, are deeply rooted within the field of Data
Science. Data Science started with statistics and has evolved to include
concepts/practices such as artificial intelligence, machine learning, and the Internet
of Things, to name a few.
Cont.?
As more and more data has become available, first by way of recorded shopping
behaviors and trends, businesses have been collecting and storing it in ever greater
amounts. With the growth of the Internet, the Internet of Things, and the
exponential growth of data volumes available to enterprises, there has been a flood
of new information or big data. Once the doors were opened by businesses seeking
to increase profits and drive better decision-making, the use of big data started being
applied to other fields, such as medicine, engineering, and social sciences.
Data All-Around
o Relational Data
(Tables/Transaction/Legacy Data)
o Unstructured Text Data (Web)
o Semi-structured Dat (XML)
o Streaming Data (images and
videos)
What do you know about Data Science?
What is Data Science?
What is Data Science?
Data science is the field of study that combines domain expertise, programming skills,
and knowledge of mathematics and statistics to extract meaningful insights from data.
Data science practitioners apply machine learning algorithms to numbers, text, images,
video, audio, and more to produce artificial intelligence (AI) systems to perform tasks
that ordinarily require human intelligence.
The company details eight ways that data scientists can add value to business
What is AI?
Alan Turing
»
Artificial Intelligence
Although there is no commonly agreed definition for big data, it can be said to mean large and
complex data, which cannot be handled with conventional data storage and processing tools
Big Data
A lot of things happen in an internet minute – millions of messages, e-mails and texts are
sent, scrolled and uploaded, and hundreds of thousands of hours of content are consumed
Open Data
https://medium.com/@mselvaraaju/open-data-value-chain-6bf628ac13ae.
Data Analytics
o Data analytics is the process of transforming raw data into meaningful insights for
better decision making, mostly using statistical processing and machine learning.
How many
customers did What are the Who are the Which customers
we loose last reasons for their likely customers should we target
churn ? to churn next? to retain ?
year ?
Past Future
* Some form of Intelligence involved
+ data or data with machine learning
Data science concepts in one picture
Source : https://youtu.be/pKPaHH7hnv8
Why Python?