Big Data Camp Intro Hadoop
Year 2000 +
Online Applications- OLTP
Web Users
Web Servers
RDBMS
RDBMS DW
Storage
Fail
Scalability
Engine + Logic
File system
Log processing: Facebook, Yahoo
Recommendation Systems: Facebook
Data Warehouse: Facebook, AOL
Video and Image Analysis: New York Times, Eyealike
Indian Government: UUID project
Map Reduce
Origin in Lisp!
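The two functional primitives behind the name can be sketched in a few lines. This is only an illustration in Python, not Hadoop code; the `readings` list and the doubling function are made-up examples.

```python
from functools import reduce

# Made-up input values, for illustration only.
readings = [3, 7, 1, 9]

# map: apply the same function to every element independently.
doubled = list(map(lambda x: x * 2, readings))

# reduce: fold the mapped values into a single result.
total = reduce(lambda acc, x: acc + x, doubled, 0)

print(doubled, total)
```

Because each map call touches only one element, the map work can be spread across many machines, which is the property MapReduce builds on.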
Hadoop Example
Weather sensors collecting data every hour at many locations across the globe gather a large volume of log data. This data is a good candidate for analysis with MapReduce because it is semi-structured and record-oriented.
Data Format: The data is stored using a line-oriented ASCII format, in which each line is a record. The format supports a rich set of meteorological elements, many of which are optional or have variable data lengths. For simplicity, we shall focus on the basic elements, such as temperature, which are always present and are of fixed width.
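A minimal sketch of how such records could flow through map, shuffle, and reduce steps, simulated in plain Python rather than on a Hadoop cluster. The fixed-width layout (year in columns 0-3, signed temperature in tenths of a degree in columns 4-8), the sample records, and the choice of "maximum temperature per year" as the analysis are all assumptions made for illustration; they are not the real sensor format.

```python
from collections import defaultdict

# Hypothetical fixed-width layout (an assumption, not the real format):
#   columns 0-3: year
#   columns 4-8: signed temperature in tenths of a degree Celsius
def map_record(line):
    """Map step: emit one (year, temperature) pair per input line."""
    year = line[0:4]
    temperature = int(line[4:9])
    return (year, temperature)

def shuffle(pairs):
    """Shuffle step: group all values that share the same key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_group(year, temperatures):
    """Reduce step: collapse one key's values to a single result."""
    return (year, max(temperatures))

def max_temperature_by_year(lines):
    pairs = [map_record(line) for line in lines]
    groups = shuffle(pairs)
    return dict(reduce_group(year, temps) for year, temps in groups.items())

# Made-up sample records in the hypothetical layout above.
records = ["1949+0111", "1949+0078", "1950-0011", "1950+0022", "1950+0000"]
result = max_temperature_by_year(records)
print(result)
```

In a real Hadoop job the framework performs the shuffle and runs the map and reduce functions on different nodes; here all three steps run in one process to keep the data flow visible.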
Unstructured Data
Engine + Logic
File system
RDBMS
Structured Data
hiho
Java Applications
Sqoop
Learn more about Hadoop
Contribute to source code
Participate in Mailing Lists/Forums
Share blogs etc.
Thank you
Visit bigdata.impetus.com
Commercial
Open source
Hybrid
Teradata/ Netezza
Informatica
SAS/ Microstrategy/ Business Objects
Pentaho/ Jasper
Web Analytics