0% found this document useful (0 votes)
2 views

Main Big Data

The document outlines the Five V's of Big Data: Volume, Velocity, Variety, and the types of data (structured, unstructured, and semi-structured). It highlights the massive scale of data generated daily by platforms like Facebook and Twitter, and discusses the applications and potential value of Big Data analytics across various sectors. Additionally, it notes the growing market opportunities for IT services and analytics in India related to Big Data.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views

Main Big Data

The document outlines the Five V's of Big Data: Volume, Velocity, Variety, and the types of data (structured, unstructured, and semi-structured). It highlights the massive scale of data generated daily by platforms like Facebook and Twitter, and discusses the applications and potential value of Big Data analytics across various sectors. Additionally, it notes the growing market opportunities for IT services and analytics in India related to Big Data.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 40

The Five V " s of Big Data

1st Character of Big Data


Volume
• A typical PC might have had 10 gigabytes of storage in 2000.

• Today, Facebook ingests 500 terabytes of new data every


day.

• Boeing 737 will generate 240 terabytes of flight data during a


single flight across the US.

• The smart phones, the data they create and consume;


sensors embedded into everyday objects will soon result in
billions of new, constantly-updated data feeds containing
environmental, location, and other information, including
video.
2nd Character of Big Data
Velocity
Clickstreams and ad impressions capture user behavior at millions
of events per second

high-frequency stock trading algorithms reflect market changes


within microseconds

machine to machine processes exchange data between billions of


devices

infrastructure and sensors generate massive log data in real-time

on-line gaming systems support millions of concurrent users, each


producing multiple inputs per second.
3rd Character of Big Data
Variety
Big Data isn't just numbers, dates, and strings. Big Data is
also geospatial data, 3D data, audio and video, and
unstructured text, including log files and social media.

Traditional database systems were designed to address


smaller volumes of structured data, fewer updates or a
predictable, consistent data structure.

Big Data analysis includes different types of data


Why Big Data

• FB generates 10TB
daily

• Twitter generates 7TB


of data
Daily

• IBM claims 90% of


today’s
stored data was
generated
in just the last two years.
Types of Big Data

Structured
data
Unstructured
data
Semi-
structured
data
Structured data

Structured data refers to information with a high


degree of organization, such that inclusion in a
relational database is seamless and readily searchable
by simple, straightforward search engine algorithms or
other search operations; whereas unstructured data is
essentially the opposite.
Unstructured data

Unstructured data (or unstructured information) is information that either


does not have a pre-defined data model or is not organized in a pre-
defined manner. Unstructured information is typically text-heavy, but
may contain data such as dates, numbers, and facts as well.
Semi-structured data

Semi-structured data is a form of structured data that does not conform with
the formal structure of data models associated with relational databases or
other forms of data tables, but nonetheless contains tags or other markers to
separate semantic elements and enforce hierarchies of records and fields within
the data
Application Of Big Data analytics
Smarter Multi-
Healthcar channel
e sales

Homeland Telecom
Security

Trading
Traffic Analytics
Control

Search
Manufacturin Quality
g
Types of tools used in Big-
Data
Where processing is hosted?
Distributed Servers / Cloud (e.g. Amazon EC2)
Where data is stored?
Distributed Storage (e.g. Amazon S3)
What is the programming model?
Distributed Processing (e.g. MapReduce)
How data is stored & indexed?
High-performance schema-free databases (e.g. MongoDB)
What operations are performed on data?
Analytic / Semantic Processing
Potential Value of Big
Data
$300 billion potential annual
value to US health care.

$600 billion potential annual


consumer surplus from using
personal location data.

60% potential in retailers’


operating margins.
India – Big Data

Gaining attraction
Huge market opportunities for IT services
(82.9% of revenues) and analytics firms
(17.1 % )
Current market size is $200 million. By 2015 $1
billion
The opportunity for Indian service providers lies
in offering services around Big Data
implementation and analytics for global
multinationals

You might also like