Big Data Analytics Prac
-------------------------------
College seal
INDEX
Prac. No.   Practical                                            Date    Sign
1           Install, configure and run Hadoop and HDFS
2           Implement Decision tree classification techniques
3           Classification using SVM
Step 2: Download the Hadoop binaries from the official website. The binary package is
about 342 MB.
Step 3: After the download finishes, unpack the package with 7zip in two steps. First,
extract the hadoop-3.2.1.tar.gz archive; then unpack the extracted tar file.
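For readers who prefer to script this step, the following is a minimal sketch that uses Python's standard tarfile module in place of 7zip (a swapped-in tool, not the method used in this practical). It decompresses the gzip layer and unpacks the tar layer in a single pass. The function name unpack_tar_gz and the example paths are assumptions made for illustration.

```python
import tarfile
from pathlib import Path

def unpack_tar_gz(archive_path, dest_dir):
    """Decompress and unpack a .tar.gz archive; return the top-level names extracted."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    # "r:gz" handles both layers (gzip, then tar) that 7zip unpacks in two steps
    with tarfile.open(archive_path, "r:gz") as tar:
        tar.extractall(dest)
    return sorted(p.name for p in dest.iterdir())

# Example call (paths are assumptions, adjust to your machine):
# unpack_tar_gz("hadoop-3.2.1.tar.gz", "C:/hadoop")
```

Only unpack archives obtained from a trusted source, since extractall writes every member of the archive to disk.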
Step 4: When the “Advanced system settings” dialog appears, go to the “Advanced” tab
and click the “Environment variables” button at the bottom of the dialog.
Step 5: Check the Java version.
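The same check can be scripted. The sketch below is one possible way to do it from Python: it runs java -version and pulls the quoted version string out of the banner. The helper names parse_java_version and installed_java_version are hypothetical, and the regular expression assumes the usual banner format with the version in double quotes.

```python
import re
import shutil
import subprocess

def parse_java_version(text):
    """Extract the quoted version string from `java -version` style output."""
    match = re.search(r'version "([^"]+)"', text)
    return match.group(1) if match else None

def installed_java_version():
    """Return the installed Java version, or None if java is not on PATH."""
    if shutil.which("java") is None:
        return None
    result = subprocess.run(["java", "-version"], capture_output=True, text=True)
    # `java -version` writes its banner to stderr, so check stderr first
    return parse_java_version(result.stderr or result.stdout)
```

If installed_java_version() returns None, Java is either not installed or not on PATH, which usually means the environment variables from Step 4 need to be revisited.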
Step 6: Configure core-site.xml.
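As an illustration of what this file typically contains, the sketch below generates a core-site.xml body with a single property. fs.defaultFS is the standard Hadoop property for the default file system; the value hdfs://localhost:9000 is an assumption for a single-node setup and may differ on your machine. The function name build_core_site is hypothetical.

```python
import xml.etree.ElementTree as ET

def build_core_site(default_fs):
    """Return core-site.xml text containing a single fs.defaultFS property."""
    configuration = ET.Element("configuration")
    prop = ET.SubElement(configuration, "property")
    ET.SubElement(prop, "name").text = "fs.defaultFS"
    ET.SubElement(prop, "value").text = default_fs
    return ET.tostring(configuration, encoding="unicode")

# Assumed single-node address; replace with the one used in your installation
xml_text = build_core_site("hdfs://localhost:9000")
print(xml_text)
```

The printed text can be pasted between the existing tags of core-site.xml, or compared against the file to confirm the property was entered correctly.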
Step 2: Load the party package. It will automatically load the other dependent packages.
Print some records from the readingSkills data set.
Step 3: Call the ctree function to build a decision tree. The first parameter is a formula
that defines the target variable and a list of independent variables.
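To make the idea of a target variable and independent variables concrete, here is a small Python sketch of how a tree can choose its first split. It is not what ctree does internally: ctree builds conditional inference trees using statistical tests, whereas this sketch uses the simpler Gini impurity criterion. The toy rows (age, score) and labels are invented for illustration and are not taken from readingSkills.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels (0.0 means a pure node)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

def best_split(rows, labels):
    """Return (feature_index, threshold, weighted_gini) of the best single split."""
    best = None
    n = len(rows)
    for f in range(len(rows[0])):
        values = sorted(set(r[f] for r in rows))
        # Candidate thresholds are midpoints between consecutive distinct values
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or score < best[2]:
                best = (f, t, score)
    return best

# Independent variables: (age, score); target variable: the label (toy data)
rows = [(5, 30), (9, 28), (6, 50), (10, 52)]
labels = ["no", "no", "yes", "yes"]
split = best_split(rows, labels)
print(split)
```

In this toy data the second variable separates the two classes cleanly while the first does not, so the chosen split is on feature index 1. A full tree repeats the same search inside each resulting group.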
Output:
PRACTICAL NO : 3
Aim: Implement an application that stores big data in HBase / MongoDB and
manipulates it using R / Python
Description:
MongoDB is a source-available cross-platform document-oriented database program.
Classified as a NoSQL database program, MongoDB uses JSON-like documents with
optional schemas. MongoDB is developed by MongoDB Inc. and licensed under the
Server Side Public License (SSPL).
Because MongoDB is a NoSQL database, documents in the same collection need not share a
fixed schema: you can add any number of fields to any document (record).
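The point about flexible fields can be illustrated without a running server. In the sketch below, two records intended for the same collection carry different sets of fields. The field names and values are invented for illustration, and the helper field_names is hypothetical.

```python
import json

# Two records for the same collection; the second has extra fields (illustrative data)
students = [
    {"name": "Asha", "roll_no": 1},
    {"name": "Ravi", "roll_no": 2, "email": "ravi@example.com", "marks": [78, 91]},
]

def field_names(documents):
    """Return the set of field names used by each document."""
    return [set(doc) for doc in documents]

# MongoDB stores JSON-like documents, so each record serialises independently
as_json = [json.dumps(doc) for doc in students]
for line in as_json:
    print(line)
```

With the pymongo driver and a live connection, a list like students could be stored with the collection's insert_many method; that step is left out here because it needs a running MongoDB instance.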
Step 5: To start the connection, click Overview and then click Connect.
Step 6: Select “Add your current IP address” and create a MongoDB user.
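Once the user exists, the Connect dialog provides a connection string for the driver. The sketch below shows one way to assemble such a string in Python, assuming the mongodb+srv scheme. The user name, password, and cluster host are placeholders, and the helper atlas_uri is hypothetical. Credentials are percent-encoded because characters such as @ and : would otherwise be misread as separators.

```python
from urllib.parse import quote_plus

def atlas_uri(user, password, host):
    """Build a mongodb+srv connection string with percent-encoded credentials."""
    return "mongodb+srv://{}:{}@{}/".format(quote_plus(user), quote_plus(password), host)

# Placeholder values; use the user created in Step 6 and your own cluster host
uri = atlas_uri("student", "p@ss:word", "cluster0.example.mongodb.net")
print(uri)
```

The resulting string is what a driver such as pymongo expects as the argument to MongoClient. Avoid saving the real password in a script that is shared or submitted.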
# Installing Packages
# Loading package
Step 3: Plot the clusters and their centres. Note that there are four dimensions in the
data and that only the first two dimensions are used to draw the plot below.
Step 4: Some black points close to the green centre (asterisk) are actually closer to the
black centre in the four-dimensional space.
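The observation in Step 4 follows from how distance is computed: the plot uses two coordinates, while the clustering uses all four. The sketch below demonstrates this with invented numbers (two centres and one point, not values from this practical). The helper nearest_centre is hypothetical and needs Python 3.8 or later for math.dist.

```python
import math

def nearest_centre(point, centres, dims):
    """Return the index of the closest centre using only the first `dims` coordinates."""
    distances = [math.dist(point[:dims], c[:dims]) for c in centres]
    return distances.index(min(distances))

# Invented four-dimensional values for illustration
centres = [(5.0, 3.4, 1.5, 0.2), (5.9, 2.7, 4.3, 1.4)]
point = (5.1, 3.3, 4.0, 1.3)

in_plot = nearest_centre(point, centres, 2)   # what the 2-D plot suggests
in_full = nearest_centre(point, centres, 4)   # what the clustering actually uses
print(in_plot, in_full)
```

Here the point sits next to the first centre when only two coordinates are compared, but the last two coordinates place it much nearer the second centre, so the two answers differ. The same effect explains points that look misassigned in the plot.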