Big Data
Big Data
• Understand visualizations
• Big data helps to analyze the in-depth concepts for the better decisions and
strategic taken for the development of the organization.
The Evolution of Big Data
Volume: scale of data
Volume: scale of data
• 90% of today’s data has been created in just the last 2 years
• Every day we create 2.5 quintillion bytes of data or enough to fill 10 million Blu-ray discs
• 40 zettabytes (4o trillion gigabytes) of data will be created by 2020, an increase of 300
times from 2005, and the equivalent of 5,200 gigabytes of data for every man, woman and
child on Earth
• Most companies in the US have over 100 terabytes (100,000 gigabytes) of data stored
What is the importance of Big Data?
Who are the ones who use the Big Data
Technology?
Brief explanation of how
exactly businesses are utilizing Big Data
Big Data Technologies
Big Data and Analytics
Where does Big Data come from?
Email Transactions
Enterprise Partner, Employee
“Dark Data” Customer, Supplier
Contracts Monitoring
Public Commercial
Sensor
Credit
Weather
Industry
Population Social Media Sentiment
Economic
Network
Types of Data
Which Big Data characteristic is the
biggest issue for your organization?
Velocity of
data
16%
Variety of
data
Volume of 48%
data
35%
Source: Getting Value from Big Data, Gartner Webinar, May 2012
Biggest opportunity for Big Data in your
organization?
• 85% of Fortune 500
organizations will be unable
to exploit big data for
competitive advantage.
• Many companies are performing new kinds of analytics (**sentiment analysis, etc.), to
better and more quickly understand and respond to what customers are saying about them
and their products.
• The cloud, and appliances are being used as data stores
• Advanced analytics are growing in popularity and importance
**Sentiment analysis (also known as opinion mining) refers to the use of natural language processing, text
analysis and computational linguistics to identify and extract subjective information in source materials.
Analytics Models How can we make it
happen?
happen? Analytics
Diagnostic
What happened? Analytics
Descriptive
Analytics
DIFFICULTY
Descriptive Analytics
• Descriptive analytics, such as reporting/ OLAP, dashboards, and data visualization, have
been widely used for some time.
• They are the core of traditional BI.
• Algorithms for predictive analytics, such as regression analysis, machine learning, and
neural networks, have also been around for some time.
• Common-sense advice
• Invented box plot
• Worked for various US
government agencies
Jacques Bertin1967
• Principle of expressiveness:
• Say everything you want to say — no
more, no less
• Don’t mislead
• Principle of effectiveness:
• Use the best method available for
showing your data
• Cartographer
Jacques Bertin
Seven Visual Variables
• Position
• Size
• Shape
• Color
• Brightness
• Orientation
• Texture
Edward Tufte
1983
Idea
generation
Exploratory
Four Types of Data Visualizations
Declarative
Idea Everyday
illustration dataviz
Conceptual Data-Driven
Idea Visual
generation discovery
Exploratory
Data Visualization
condense information
What makes a good chart?
• Large amount of
information in a very
small space
375
275
225
175
Hypothetical City Crime 125
425 75
375 25
Thefts per 100000 citizens
275
225
200
Sometimes it’s
140000
120000
100000
80000
60000
40000
really a matter of
20000
0 preference.
Order Date
These both
Sum of Extended Price
Order Date
3-D Charts
Source: Knaflic (2015). Storytelling with Data: A Data Visualization Guide for Business Professionals. Chapter 2.
Chartjunk: Data Ink “gone wild”
http://the-digital-reader.com/2015/04/13/infographic-ebooks-on-track-to-double-dutch-ebook-market-in-2014/
Summary
• Use data visualization principles to assess a visualization
• Tell a story
• Graphical integrity (lie factor)
• Minimize graphical complexity (data ink, chartjunk)
• Explain how a visualization can be improved based on those principles
• Types of visualization
THANK YOU