0% found this document useful (0 votes)
113 views

CS8091 Syllabus

This document outlines the objectives and units of study for the course CS8091 Big Data Analytics. The course aims to teach fundamental concepts of big data and analytics using tools like HDFS, MapReduce, YARN and programming models. It will cover topics like clustering, classification, association rule mining, recommendation systems. Students will learn about data streams, real-time analytics, graph analytics and NoSQL databases. The course totals 45 periods and aims to equip students to work with big data tools and analysis techniques and apply algorithms to large data volumes.

Uploaded by

yasmeen banu
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
113 views

CS8091 Syllabus

This document outlines the objectives and units of study for the course CS8091 Big Data Analytics. The course aims to teach fundamental concepts of big data and analytics using tools like HDFS, MapReduce, YARN and programming models. It will cover topics like clustering, classification, association rule mining, recommendation systems. Students will learn about data streams, real-time analytics, graph analytics and NoSQL databases. The course totals 45 periods and aims to equip students to work with big data tools and analysis techniques and apply algorithms to large data volumes.

Uploaded by

yasmeen banu
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

www.padeepz.

net

CS8091 BIG DATA ANALYTICS L T P C


3 0 0 3
OBJECTIVES:
 To know the fundamental concepts of big data and analytics.
 To explore tools and practices for working with big data
 To learn about stream computing.
 To know about the research that requires the integration of large amounts of data.

t
ne
UNIT I INTRODUCTION TO BIG DATA 9
Evolution of Big data - Best Practices for Big data Analytics - Big data characteristics - Validating - The
Promotion of the Value of Big Data - Big Data Use Cases- Characteristics of Big Data Applications -
Perception and Quantification of Value -Understanding Big Data Storage - A General Overview of High-
Performance Architecture - HDFS - MapReduce and YARN - Map Reduce Programming Model

.
UNIT II CLUSTERING AND CLASSIFICATION 9

pz
Advanced Analytical Theory and Methods: Overview of Clustering - K-means - Use Cases - Overview of
the Method - Determining the Number of Clusters - Diagnostics - Reasons to Choose and Cautions .-
Classification: Decision Trees - Overview of a Decision Tree - The General Algorithm - Decision Tree
Algorithms - Evaluating a Decision Tree - Decision Trees in R - Naïve Bayes - Bayes‘ Theorem -
Naïve Bayes Classifier.

UNIT III
ee
ASSOCIATION AND RECOMMENDATION SYSTEM 9
Advanced Analytical Theory and Methods: Association Rules - Overview - Apriori Algorithm - Evaluation
of Candidate Rules - Applications of Association Rules - Finding Association& finding similarity -
Recommendation System: Collaborative Recommendation- Content Based Recommendation -
Knowledge Based Recommendation- Hybrid Recommendation Approaches.
ad
UNIT IV STREAM MEMORY 9
Introduction to Streams Concepts – Stream Data Model and Architecture - Stream Computing,
Sampling Data in a Stream – Filtering Streams – Counting Distinct Elements in a Stream – Estimating
moments – Counting oneness in a Window – Decaying Window – Real time Analytics Platform(RTAP)
applications - Case Studies - Real Time Sentiment Analysis, Stock Market Predictions. Using Graph
.p

Analytics for Big Data: Graph Analytics

UNIT V NOSQL DATA MANAGEMENT FOR BIG DATA AND VISUALIZATION 9


NoSQL Databases : Schema-less Models‖: Increasing Flexibility for Data Manipulation-Key Value
w

Stores- Document Stores - Tabular Stores - Object Data Stores - Graph Databases Hive - Sharding –-
Hbase – Analyzing big data with twitter - Big data for E-Commerce Big data for blogs - Review of Basic
Data Analytic Methods using R.
TOTAL: 45 PERIODS
w

OUTCOMES:
Upon completion of the course, the students will be able to:
 Work with big data tools and its analysis techniques
 Analyze data by utilizing clustering and classification algorithms
w

 Learn and apply different mining algorithms and recommendation systems for large volumes of
data
 Perform analytics on data streams
 Learn NoSQL databases and management.

www.padeepz.net
www.padeepz.net

TEXT BOOKS:
1. Anand Rajaraman and Jeffrey David Ullman, "Mining of Massive Datasets", Cambridge University
Press, 2012.
2. David Loshin, "Big Data Analytics: From Strategic Planning to Enterprise Integration with Tools,
Techniques, NoSQL, and Graph", Morgan Kaufmann/El sevier Publishers, 2013.

t
REFERENCES:
1. EMC Education Services, "Data Science and Big Data Analytics: Discovering, Analyzing,

ne
Visualizing and Presenting Data", Wiley publishers, 2015.
2. Bart Baesens, "Analytics in a Big Data World: The Essential Guide to Data Science and its
Applications", Wiley Publishers, 2015.
3. Dietmar Jannach and Markus Zanker, "Recommender Systems: An Introduction", Cambridge
University Press, 2010.
4. Kim H. Pries and Robert Dunnigan, "Big Data Analytics: A Practical Guide for Managers " CRC

.
Press, 2015.

pz
5. Jimmy Lin and Chris Dyer, "Data-Intensive Text Processing with MapReduce", Synthesis
Lectures on Human Language Technologies, Vol. 3, No. 1, Pages 1-177, Morgan Claypool
publishers, 2010.

ee
ad
.p
w
w
w

www.padeepz.net

You might also like