Lecture 1 - Intro To Big Data Analytics
Lecture 1 - Intro To Big Data Analytics
Visualization
What is Data ?
• Data refers to raw facts that have no specific meaning
• Data or raw data is not enough to make decisions
2 v 1.0
What is Information?
3 v 1.0
Example of Data and Information
4 v 1.0
Data generation in Current Scenario
5 v 1.0
Facts on Big Data
6 v 1.0
Data generated everyday
• Here are some key daily statistics highlighted in the
infographic:
• 500 million tweets are sent
• 294 billion emails are sent
• 4 petabytes of data are created on Facebook
• 4 terabytes of data are created from each connected car
• 65 billion messages are sent on WhatsApp
• 5 billion searches are made
• By 2025, it’s estimated that 463 exabytes of data will be
created each day globally – that’s the equivalent of
212,765,957 DVDs per day!
7 v 1.0
Data units
8 v 1.0
Digital Data Statistics
9 v 1.0
What is Big Data ?
10 v 1.0
Where does Big Data come from?
11 v 1.0
Big Data contains both Structured and
Unstructured Data
12 v 1.0
Big Data is Growing Fast
13 v 1.0
Defining Big Data
14 v 1.0
Volume refers to the amount of data
15 v 1.0
Velocity refers to the speed of data
processing
16 v 1.0
Variety refers to the number of types of data
17 v 1.0
Small Data Vs Big Data
18 v 1.0
Traditional vs Distributed systems
19 v 1.0
Challenges for traditional Database
management system to handle Big Data
20 v 1.0
Challenge 1: Variety of Data
Big Data has got variety of data means along with structured data
which relational databases can handle very well, Big Data also includes
unstructured data (text, log, audio, streams, video stream, sensor, GPS
data). The traditional databases require the database schema to be
created in ADVANCE to define the data how it would look like which
makes it harder to handle Big unstructured data.
21 v 1.0
Challenge 2: Velocity
22 v 1.0
Challenge 3: Volume
Big Data is data in Zettabytes, growing with exponential rate. If the data
to be processed is in the degree of Terabytes and petabytes, it is more
appropriate to process them in parallel independent tasks and collate
the results to give the output. Traditional database approach can’t
handle this.
23 v 1.0
Characteristics of Big Data
24 v 1.0
Big Data Engineering
25 v 1.0
Why Big Data Analytics ???
26 v 1.0
Big Data Market
27 v 1.0
Demand for Big Data & analytics, driven by
business outcomes
28 v 1.0
Relation between Big Data and Analytics
• “Analytics is about providing people with trusted, relevant and timely
information to address business outcomes” - Neil Isford (VP, Smarter Analytics,
IBM North America
29 v 1.0
Big Data Analytics process
30 v 1.0
Big Data Domains
31 v 1.0
Big Data Applications: Healthcare
32 v 1.0
Big Data Applications: Healthcare
33 v 1.0
Big Data Applications: Manufacturing
34 v 1.0
Big Data Applications: Media &
Entertainment
35 v 1.0
Big Data Applications in Government
36 v 1.0
Big Data Application in Education Industry
37 v 1.0
Big Data in Weather Patterns
38 v 1.0
Big Data in Transportation Industry
39 v 1.0
Big Data Applications in IoT
40 v 1.0
Big Data in Banking Sector
41 v 1.0
References
1. https://www.edureka.co/blog/top-10-data-analytics-tools/
2. https://www.proschoolonline.com/blog/top-10-data-analytics-tools
3. Big Data Analytics Powerpoint Presentation Slide | PowerPoint Presentation
Designs | Slide PPT Graphics | Presentation Template Designs (slideteam.net)
4. https://financesonline.com/big-data-statistics/
5. https://firstsiteguide.com/big-data-stats/#:~:text=By%202022%2C%20the%20big
%20data,of%20data%20in%202019%20alone.
6. https://www.statista.com/topics/1464/big-data/#dossierContents__outerWrapper
7. https://blogs.sap.com/2019/06/24/what-is-big-data-and-why-do-we-need-hadoop-
for-big-data/
8. https://www.qubole.com/big-data-analytics/
9. https://intellipaat.com/blog/10-big-data-examples-application-of-big-data-in-real-life
42 v 1.0