Ch7-Overview of Data Science-part 1
Ch7-Overview of Data Science-part 1
of Data Science on
Modern Business
Processes
LO3: Explore The Tools And Technologies Associated With Data Science And How It Supports
Business Processes
Data Science Overview
What is Data Science?
Data science, also known as data-driven science
Modern organizations are inundated with data; there is a proliferation of devices that
can automatically collect and store information.
Online systems and payment portals capture more data in the fields of e-commerce,
medicine, finance, and every other aspect of human life. We have text, audio, video,
and image data available in vast quantities.
Data Science Importance
1. Data science helps brands to understand their customers in a much
enhanced and empowered manner.
2. It allows brands to communicate their story in such an engaging
and powerful manner.
3. Big Data is a new field that is constantly growing and evolving.
4. Its findings and results can be applied to almost any sector like
travel, healthcare, and education among others.
5. Data science is accessible to almost all sectors.
The Core Aims of Data Science
1- Making Data Useful and Retrievable: Data science involves collecting, storing, and organizing
data in a way that makes it easily accessible and useful for analysis.
2- Extracting Actionable Intelligence: Data science aims to extract insights and intelligence from
data that can inform decision-making and drive actionable outcomes.
3- Improving Business Performance: One of the primary goals of data science is to use data-driven
insights to enhance business performance.
4- Automating Extraction and Implementation: Data science also involves automating processes
for data extraction, analysis, and implementation of insights.
Sales
ID Name Birthdate ($) Notes
Bob
5 Johnson 08-10-1989 1750 VIP
The
Sales
ID Name Birthdate ($) Notes Cleaning
1 John Doe 01-05-1990 2000
Good
customer
Process of
2 jane doe 1990/06/15 1500 -
Messy
Follow up
Dataset:
3 07-20-1991 needed
Bob
5 Johnson 08-10-1989 1750 VIP
intelligent systems capable of performing tasks that typically require human intelligence.
• AI encompasses various subfields, including machine learning, natural language processing, computer
vision, robotics, and expert systems. AI techniques are used to automate tasks, make predictions, recognize
patterns, and solve complex problems across diverse domains such as healthcare, finance, autonomous
and architecture necessary for the storage, processing, and retrieval of data.
• Data engineers work with large volumes of structured and unstructured data, building pipelines and systems
for data ingestion, transformation, and storage. They ensure data quality, scalability, and reliability to
algorithms and statistical models that enable computers to learn from and make predictions or decisions
based on data.
• - Machine learning algorithms can be categorized into supervised, unsupervised, semi-supervised, and
reinforcement learning techniques. Applications of machine learning span various domains, including image
and speech recognition, natural language processing, recommendation systems, and predictive analytics.
Data Science Tools
In today’s world, there is an overwhelming amount of data. Because of this, data science
has become very popular in the tech industry.
Definition: Data science tools are software, platforms, or libraries that help data
scientists handle the entire data lifecycle—data collection, cleaning, analysis,
visualization, and modeling.
It’s like the cool and knowledgeable relative that everyone wants to spend time with at
family events. But how does data science work its magic of analyzing numbers and
finding patterns?
Purpose: These tools simplify complex tasks, improve efficiency, and allow data
scientists to derive insights and make data-driven decisions.
Data Science Tools
Definition: The foundation for writing code and performing data analysis.
Python
Data Science Tools
Definition: Used to build neural networks for complex tasks like image recognition.
Data Science Tools
Definition: Cloud platforms that provide scalable computing power and tools.
Data Science Tools
•This type is focused only on what has already happened in a business and, unlike other methods
of analysis, it is not used to draw inferences or predictions from its findings.
• Descriptive analytics is, rather, a foundational starting point used to prepare data for further
analysis down the line.
Descriptive analytics
•Generally, the most simplistic form of data analytics, descriptive analytics uses simple maths and
statistical tools, such as arithmetic, averages ,and percent changes, rather than the complex
calculations necessary for predictive and prescriptive analytics.
•Visual tools such as line graphs, pie, and bar charts are used to present findings, meaning
descriptive analytics can – and should – be easily understood by a wide business audience.
Example of Descriptive Analytics
• Predictive analytics is a way to use the past to project the future of your business. This is not, futurology but an
accurate calculation of the probabilities in any scenario, based on the processing of large volumes of data.
• The basic goal of predictive analytics is to forecast what will happen in the future with a high degree of certainty.
This distinguishes predictive analytics from descriptive analytics, which assists analysts in analyzing what has
previously occurred.
Predictive Analytics
•Predictive analytics utilizes a variety of statistical techniques, such as automated machine
learning algorithms, deep learning, data mining, and AI, to create predictive models, which
extract information from datasets, identify patterns, and provide a predictive score for an array of
organizational outcomes.
Predictive Analytics
Prescriptive Analytics
• Prescriptive analytics is a statistical method used to generate recommendations and make
• Prescriptive analytics is the third and final tier in modern, computerized data analytics.
• Prescriptive analytics is the natural progression from descriptive and predictive analytics procedures. It goes