Lesson 02 2.01 Introduction To Data Science
Lesson 02 2.01 Introduction To Data Science
Data science is created when subject expertise and scientific methodologies are combined with technology.
Data Science
Data scientists collect, explore, analyze, and visualize data. They apply mathematical and statistical models
to find patterns and solutions in the data.
Analysis
Mathematical Scientific
and statistical tools and
models methods
Domain Expertise and Scientific Methods
Data analysis helps to extract insights from data to make better business decisions.
Modern tools and technologies have made data processing and analytics faster and more efficient.
Technology
Operating system
Python Application
language design
Various sectors use data science to extract the information they need to create different services and products.
Using Data Science: Social Network Platforms
LinkedIn uses data points from its users to provide relevant digital services and data products.
Profile
Groups
Digital
Location
Services
Information
Data Points
Connections
Data
Products
Post
Likes
Using Data Science: Search Engines
Google uses data science to provide relevant search recommendations as the user types a query.
Search keyword
Influencing Factors
Fast and real-time analytics is made
possible by modern and advanced • Query volume – Unique and verifiable users
infrastructure, tools, and technologies • Geographical locations
• Keyword or phrase matches on the web
• Scrubbing for inappropriate content
Using Data Science: Healthcare
Wearable devices use data science to analyze data gathered by their biometric sensors.
Make Engagement
informed dashboard Data analytics
decisions
Using Data Science: Finance
A loan manager can easily access and sift through a loan applicant’s financial details using data science.
Governments in different countries share large datasets from various domains with the public.
Data.gov is a website hosted and maintained by the U.S. government.
Sectors or domains
The Real Challenge
Python deals with each stage of data analytics efficiently by applying different libraries and packages.
Acquire
Wrangle
Explore
Model
Python is a general-purpose, open-source programming language that lets you work quickly
and integrate systems more effectively.
Benefits of Python
Easy
Easy to
to learn
learn
Open source
Big open-source
open source community
Python is supported by well-established data platforms and processing frameworks that help
analyze data in a simple and efficient way.
Data Scientist
Big Data
Key Takeaways
A.
Asks the right questions
B.
Acquires data
C.
Performs data wrangling and data visualization
B. Acquires data
The search engine’s autocomplete feature identifies unique and verifiable users who search for a particular
keyword or phrase to build a query volume. It also helps identify the users’ locations and tags them with the
query, enabling it to be location-specific.
Knowledge
Check
In data science, the data is acquired from various sources and is then wrangled to ease its analysis. This is
followed by data exploration and data modeling. The final stage is data visualization, where the data is
presented and the patterns are identified.