03 - Using Big Data Lite Virtual Machine
Module 2: Data Acquisition and Storage
  Lesson 3: Introduction to the Big Data Lite VM and MoviePlex
Module 3: Data Access and Processing
  Lesson 4: Introduction to the Big Data Ecosystem
Objectives
Lesson Agenda
Oracle Big Data Lite Virtual Machine: Introduction
Provides an integrated environment to help you get started with
the Oracle Big Data platform
Oracle Big Data Lite 4.0.1 VM Components
• Oracle Enterprise Linux 6.4
• Oracle Database 12c Release 1 Enterprise Edition (12.1.0.2)
• Cloudera’s Distribution Including Apache Hadoop (CDH 5.1.2)
Oracle Big Data Lite 4.0.1 Services
Initializing the Environment for the Big Data Lite VM
Steps and commands:
1. Start VM and log in.
   id/password: welcome1
5. Start Oracle NoSQL DB and the Movie Application.
   Run this the first time (or every time you want to initialize Oracle NoSQL DB). This will take several minutes:
   ./1_start_movieapp.sh
Initializing the Environment
Steps and commands:
6. Start Hue.
   ./2_start_hue.sh
Lesson Agenda
Oracle MoviePlex Case Study: Introduction
Introduction
Big Data Challenge
Derive Value from Big Data
Oracle MoviePlex: Goal
Oracle MoviePlex: Big Data Challenges
MoviePlex: Architecture
[Figure: MoviePlex architecture diagram. Labels recoverable from the diagram:]
• Application log: log of all activity on site
• Capture activity required for MoviePlex site
• Streamed into HDFS using Flume
• Oracle NoSQL DB
• Customer profile (e.g., recommended movies)
• Clustering/Market Basket
• “Mood” recommendations
• Oracle Advanced Analytics
• Load recommendations
• Oracle Exadata
• Endeca Information Discovery
• Oracle Business Intelligence EE
• Oracle Exalytics
MoviePlex: Data Generation
{"custId":1185972,"movieId":null,"genreId":null,"time":"2012-07-01:00:00:07","recommended":null,"activity":8}
{"custId":1354924,"movieId":1948,"genreId":9,"time":"2012-07-01:00:00:22","recommended":"N","activity":7}
{"custId":1083711,"movieId":null,"genreId":null,"time":"2012-07-01:00:00:26","recommended":null,"activity":9}
{"custId":1234182,"movieId":11547,"genreId":44,"time":"2012-07-01:00:00:32","recommended":"Y","activity":7}
{"custId":1010220,"movieId":11547,"genreId":44,"time":"2012-07-01:00:00:42","recommended":"Y","activity":6}
{"custId":1143971,"movieId":null,"genreId":null,"time":"2012-07-01:00:00:43","recommended":null,"activity":8}
{"custId":1253676,"movieId":null,"genreId":null,"time":"2012-07-01:00:00:50","recommended":null,"activity":9}
{"custId":1351777,"movieId":608,"genreId":6,"time":"2012-07-01:00:01:03","recommended":"N","activity":7}
{"custId":1143971,"movieId":null,"genreId":null,"time":"2012-07-01:00:01:07","recommended":null,"activity":9}
{"custId":1363545,"movieId":27205,"genreId":9,"time":"2012-07-01:00:01:18","recommended":"Y","activity":7}
{"custId":1067283,"movieId":1124,"genreId":9,"time":"2012-07-01:00:01:26","recommended":"Y","activity":7}
{"custId":1126174,"movieId":16309,"genreId":9,"time":"2012-07-01:00:01:35","recommended":"N","activity":7}
{"custId":1234182,"movieId":11547,"genreId":44,"time":"2012-07-01:00:01:39","recommended":"Y","activity":7}
{"custId":1067283,"movieId":null,"genreId":null,"time":"2012-07-01:00:01:55","recommended":null,"activity":9}
{"custId":1377537,"movieId":null,"genreId":null,"time":"2012-07-01:00:01:58","recommended":null,"activity":9}
{"custId":1347836,"movieId":null,"genreId":null,"time":"2012-07-01:00:02:03","recommended":null,"activity":8}
{"custId":1137285,"movieId":null,"genreId":null,"time":"2012-07-01:00:03:39","recommended":null,"activity":8}
{"custId":1354924,"movieId":null,"genreId":null,"time":"2012-07-01:00:03:51","recommended":null,"activity":9}
{"custId":1036191,"movieId":null,"genreId":null,"time":"2012-07-01:00:03:55","recommended":null,"activity":8}
{"custId":1143971,"movieId":1017161,"genreId":44,"time":"2012-07-01:00:04:00","recommended":"Y","activity":7}
{"custId":1363545,"movieId":27205,"genreId":9,"time":"2012-07-01:00:04:03","recommended":"Y","activity":5}
{"custId":1273464,"movieId":null,"genreId":null,"time":"2012-07-01:00:04:39","recommended":null,"activity":9}
{"custId":1346299,"movieId":424,"genreId":1,"time":"2012-07-01:00:05:02","recommended":"Y","activity":4}
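Each line of the generated log is one self-contained JSON object, so the file can be processed line by line. The following is a minimal sketch, assuming the log is available as plain text; the names SAMPLE_LOG, parse_log, and activity_counts are illustrative and not part of the Big Data Lite VM. It parses a few records copied from the sample above and counts events per activity code.

```python
import json
from collections import Counter

# A few records copied verbatim from the sample log above.
SAMPLE_LOG = """\
{"custId":1185972,"movieId":null,"genreId":null,"time":"2012-07-01:00:00:07","recommended":null,"activity":8}
{"custId":1354924,"movieId":1948,"genreId":9,"time":"2012-07-01:00:00:22","recommended":"N","activity":7}
{"custId":1234182,"movieId":11547,"genreId":44,"time":"2012-07-01:00:00:32","recommended":"Y","activity":7}
{"custId":1010220,"movieId":11547,"genreId":44,"time":"2012-07-01:00:00:42","recommended":"Y","activity":6}
"""


def parse_log(text):
    """Parse one JSON object per non-empty line (JSON null becomes None)."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def activity_counts(records):
    """Count how many records carry each activity code."""
    return Counter(record["activity"] for record in records)


records = parse_log(SAMPLE_LOG)
counts = activity_counts(records)
for code, count in sorted(counts.items()):
    print(f"activity {code}: {count} event(s)")
```

Records without a selected movie carry null in movieId, genreId, and recommended, so downstream code should expect None for those fields.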
MoviePlex: Data Generation Format
Field        Description
custId       The customer's ID
movieId      The ID of the selected movie
genreId      The genre of the selected movie
time         The timestamp when the customer watched the movie
recommended  Whether the selected movie was recommended (Y or N)
activity     • 1: Rate movie
             • 2: Completed movie
             • 3: Not completed
             • 4: Started movie
             • 5: Browsed movie
             • 6: List movies
             • 7: Search
Example record:
{"custId":1354924,"movieId":1948,"genreId":9,"time":"2012-07-01:00:00:22","recommended":"N","activity":7}
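The field descriptions above can be applied to a record programmatically. The sketch below is illustrative only: decode, ACTIVITY_LABELS, and TIME_FORMAT are hypothetical names, and the time format string is inferred from the sample values rather than taken from any specification. Only the seven activity codes listed in the table are mapped; any other code is reported as unknown instead of being guessed.

```python
import json
from datetime import datetime

# Activity codes exactly as listed in the table above. Codes that are not
# listed there are deliberately left unmapped.
ACTIVITY_LABELS = {
    1: "Rate movie",
    2: "Completed movie",
    3: "Not completed",
    4: "Started movie",
    5: "Browsed movie",
    6: "List movies",
    7: "Search",
}

# Assumed layout, inferred from sample values such as "2012-07-01:00:00:22".
TIME_FORMAT = "%Y-%m-%d:%H:%M:%S"


def decode(line):
    """Turn one JSON log line into a dictionary with friendlier values."""
    raw = json.loads(line)
    recommended = raw["recommended"]
    code = raw["activity"]
    return {
        "custId": raw["custId"],
        "movieId": raw["movieId"],
        "genreId": raw["genreId"],
        "time": datetime.strptime(raw["time"], TIME_FORMAT),
        # "Y"/"N" becomes True/False; a missing value stays None.
        "recommended": None if recommended is None else recommended == "Y",
        "activity": ACTIVITY_LABELS.get(code, f"unknown ({code})"),
    }


event = decode(
    '{"custId":1354924,"movieId":1948,"genreId":9,'
    '"time":"2012-07-01:00:00:22","recommended":"N","activity":7}'
)
print(event["activity"], event["time"].isoformat(), event["recommended"])
```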
MoviePlex Application
[Figure: MoviePlex application screenshot with numbered callouts]
1. Simple profile updates
2. Advanced analytics: movies based on mood
3.
4. Advanced profile attributes
What is a key-value store?
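One way to approach the question above: a key-value store maps each unique key to an opaque value and is accessed mainly through put and get operations on that key. The sketch below is a toy in-memory illustration under that assumption. It is not the Oracle NoSQL DB API; the class KeyValueStore, its methods, and the profile contents are hypothetical.

```python
class KeyValueStore:
    """Toy in-memory key-value store (hypothetical, for illustration only)."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        """Store value under key, replacing any previous value."""
        self._data[key] = value

    def get(self, key, default=None):
        """Return the value stored under key, or default if the key is absent."""
        return self._data.get(key, default)

    def delete(self, key):
        """Remove key if present; return True if something was removed."""
        if key in self._data:
            del self._data[key]
            return True
        return False


store = KeyValueStore()

# Made-up profile for a customer ID taken from the sample log; the
# "recommended_movies" attribute is an assumption, not real application data.
store.put(("customer", 1354924), {"recommended_movies": [1948, 11547]})

profile = store.get(("customer", 1354924))
print(profile)
print(store.get(("customer", 0)))  # absent key falls back to the default
```

The store never inspects the value, which is what makes lookups by key simple and fast; the trade-off is that queries on anything other than the key need extra work by the application.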
Summary