Copy of Introduction to DS.pdf
INTRODUCTION TO DATA SCIENCE AND MACHINE LEARNING
DEFINING DATA SCIENCE AND ITS
IMPORTANCE
“Information is the oil of the 21st century, and
analytics is the combustion engine.” – Peter
Sondergaard, SVP, Gartner Research
Data science became a buzzword when the Harvard
Business Review called it “The Sexiest Job of the 21st
Century.” Because of this, the term tends to be used
loosely to describe predictive modeling, business
intelligence, business analytics, or other uses of data,
or to make statistics sound more interesting.
“Hiding within those mounds of data is knowledge that
could change the life of a patient or change the world.”
– Atul Butte, Stanford
WHAT IS DATA SCIENCE
Data Science is a multidisciplinary
field that uses scientific methods,
algorithms, processes, and
systems to extract knowledge and
insights from structured and
unstructured data.
Artificial Intelligence
- Human Intelligence Exhibited by Machines
Machine Learning
- An Approach to Achieve Artificial Intelligence
Deep Learning
- A Technique for Implementing Machine Learning
DATA ANALYSIS LIFE CYCLE
DATA SCIENCE LIFE CYCLE
BENEFITS OF DATA SCIENCE FOR
A BUSINESS
• It will monetize data
1. Mathematics expertise
2. Technology; hacking skills
3. Business/strategy acumen
REAL-WORLD APPLICATIONS OF
DATA SCIENCE
AREAS OF DATA SCIENCE
• Machine Learning
• Deep Learning
• Data Analysis
• Big Data
• Natural Language Processing (NLP)
• Computer Vision
• Business Intelligence (BI)
• Data Engineering
• Data Mining
• Financial Analytics
• Geospatial Data Science
• Environmental Data Science
MACHINE LEARNING
The basic idea of machine learning, or ML, is to learn to do a certain task from
data.
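As a minimal sketch of "learning a task from data", the example below fits a straight line to a handful of points and then uses it to predict a new value. The data (hours studied versus exam score) and the function names fit_line and predict are made up for illustration; they do not come from the slides.

```python
# Minimal sketch: "learning from data" as fitting y = w*x + b by least squares.
# All data and names here are hypothetical, chosen only for illustration.

def fit_line(xs, ys):
    """Return (w, b) that minimize the squared error of y = w*x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    w = cov_xy / var_x
    b = mean_y - w * mean_x
    return w, b

def predict(model, x):
    """Apply the learned line to a new input."""
    w, b = model
    return w * x + b

# Hypothetical training data: hours studied -> exam score.
hours = [1.0, 2.0, 3.0, 4.0, 5.0]
scores = [52.0, 54.0, 56.0, 58.0, 60.0]

model = fit_line(hours, scores)   # the "learning" step
print(predict(model, 6.0))        # the "task": predict an unseen case
```

The program is never told the rule relating hours to scores; it estimates the rule from the examples, which is the sense of "learning" used here.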
Herbert Alexander Simon:
"Learning is any process by which a
system improves performance from
experience."
Document Clustering:
• Input: Text documents (e.g., news articles).
• Output: Grouped topics or clusters.
• Real World: Google News uses clustering to organize similar news
stories together.
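The document clustering idea above can be sketched in a few lines. This is a deliberately simple greedy approach (word counts plus cosine similarity with a fixed threshold), not the method used by Google News or any production system. The sample headlines, the threshold of 0.3, and the function names are assumptions made for illustration.

```python
import math
from collections import Counter

def vectorize(text):
    """Bag-of-words representation: lowercase word counts."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def cluster(docs, threshold=0.3):
    """Greedy single-pass clustering.

    Each document joins the most similar existing cluster if the
    similarity reaches the threshold; otherwise it starts a new cluster.
    Returns a list of clusters, each a list of document indices.
    """
    clusters = []  # each entry: (summed word counts, member indices)
    for i, doc in enumerate(docs):
        vec = vectorize(doc)
        best, best_sim = None, threshold
        for c in clusters:
            sim = cosine(vec, c[0])
            if sim >= best_sim:
                best, best_sim = c, sim
        if best is None:
            clusters.append((Counter(vec), [i]))
        else:
            best[0].update(vec)   # fold the document into the cluster profile
            best[1].append(i)
    return [members for _, members in clusters]

# Hypothetical headlines, invented for this example.
docs = [
    "stock market falls as investors sell shares",
    "team wins football match in final minute",
    "investors buy shares as stock market rises",
    "football team loses match after late goal",
]
groups = cluster(docs)
print(groups)
```

No labels are supplied: the grouping emerges only from shared vocabulary, which is what makes this an unsupervised task.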
SUPERVISED VS UNSUPERVISED
REINFORCEMENT MACHINE
LEARNING
REAL-LIFE EXAMPLES:
Self-Driving Cars:
• Input: Sensor data (camera, lidar, radar).
• Output: Actions (steering, braking, accelerating).
• Real World: Companies like Tesla and Waymo use reinforcement
learning to train autonomous vehicles.
Game AI:
• Input: Game state (board position, opponent’s moves).
• Output: Next best move.
• Real World: AlphaGo by DeepMind defeated human champions in
the game of Go using reinforcement learning.
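The input/output pattern in these examples (observe a state, choose an action, receive a reward) can be illustrated with tabular Q-learning on a toy problem. The sketch below is far simpler than the systems named above and is not how they are built. The five-cell corridor environment, the reward of 1 at the goal, and all parameter values are assumptions chosen for illustration.

```python
import random

N_STATES = 5          # corridor cells 0..4; cell 4 is the goal
ACTIONS = (-1, +1)    # index 0 = move left, index 1 = move right

def step(state, action):
    """Toy environment: returns (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done

def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy choice; ties between actions are broken at random."""
    left, right = q[state]
    if rng.random() < epsilon or left == right:
        return rng.randrange(len(ACTIONS))
    return 0 if left > right else 1

def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning: learn action values from trial and error."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            a = choose_action(q, state, epsilon, rng)
            next_state, reward, done = step(state, ACTIONS[a])
            # Target: immediate reward plus discounted best future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q

q_table = train()
# Greedy policy for the non-goal cells.
policy = ["left" if left > right else "right" for left, right in q_table[:-1]]
print(policy)
```

The agent is never told that moving right is correct. It discovers this from the reward signal alone, which is the defining feature of reinforcement learning compared with supervised learning.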
AI ML DL AND DATA SCIENCE
JOB PROFILES IN DATA SCIENCE
• Data Scientist
• Research Scientist
• Machine Learning Engineer
• Data Engineer
• Data Analyst
• Data Journalist
• Geospatial Analyst
• Business Intelligence (BI) Analyst
• Chief Data Officer (CDO)
• Big Data Engineer
• Quantitative Analyst (Quant)
• Data Architect
• Database Administrator (DBA)
• AI Engineer
• Data Consultant
• Data Product Manager
HERE IS A BREAKDOWN OF WHERE
DATA SCIENTISTS WORK
• 2% of data scientists work in gaming
• 4% work in consumer goods and retail
• 4% work in academia
• 4% work in government
• 6% work in financial services
• 7% work in pharmaceuticals and healthcare
• 9% work in consulting
• 11% work in a corporate setting
• 13% work in marketing
• 41% work in technology
WOULD YOU BE A GOOD DATA
SCIENTIST
To figure out whether or not you would make a good
data scientist, ask yourself these questions:
informed about the latest developments is also important.
Notebook Sharing and Collaboration: Google Colab, Kaggle Notebooks
Dashboarding: Tableau, Power BI