BIG DATA ANALYTICS - Syllabus
Concept of big data, Big Data Analytics, Applications of big data in industry,
Overview of Big Data, Big Data Storage: File system and Distributed File System,
Importance of Distributed File System, Big Data Applications-Industry Examples
Introduction, Hive modules, Data types and file formats, Hive, HiveQL, HiveQL Data
Definition and Data Manipulation, HiveQL queries, HiveQL views (reduce query
complexity), HBase, HDFS
Overview of Spark: Hadoop vs. Spark, Spark fast data analysis, Data analytics
project life cycle, Downloading Spark and Getting Started
Unit V: NoSQL (9 Lectures)
Text Books
1. Big Data, Big Analytics: Emerging Business Intelligence and Analytics,
Michael Minelli, Michele Chambers, and Ambiga Dhiraj
2. Big Data and Analytics, Seema Acharya, Subhashini Chellappan, Wiley
Reference Books
COURSE OBJECTIVES
1. Analyzing big data allows analysts, researchers, and business users to make better and faster
decisions using data that was previously inaccessible or unusable.
2. Using advanced analytics techniques such as text analytics, machine learning, predictive analytics,
data mining, statistics, and natural language processing, businesses can analyze previously untapped
data sources, independently or together with their existing enterprise data, to gain new insights that
lead to significantly better and faster decisions.
3. This course introduces the concepts of big data and big data analytics.
COURSE OUTCOMES
1. Understand the basics of Big Data.
2. Appreciate the various Big Data Platforms.
3. Understand the various Big Data storage and processing techniques.
4. Learn about “Big Data” in the enterprise.
5. Appreciate the Big Data lifecycle.
UNIT-I INTRODUCTION TO BIG DATA: Fundamental Terminologies and Concepts, A Brief
History of Big Data, Business Drivers that have led to Big Data Innovations, Characteristics of Big
Data, Benefits of adopting Big Data, Challenges and Limitations of Big Data.
UNIT-II FUNDAMENTALS OF BIG DATA ANALYTICS: Basic Big Data Analytics, “Big Data”
in the Enterprise, Big Data and Traditional Business Intelligence and Data Warehouses, Big Data
Visualization, Common Adoption Issues, Planning for Big Data Initiatives, New Roles Introduced by
Big Data Projects, Emerging Trends.
UNIT-III BIG DATA PLATFORMS: Development of scalable yet elastic virtualized platforms
that cluster commodity hardware components (either by cycle harvesting from local
resources or through cloud-based utility computing services), coupled with open source tools and
technology. Big Data Storage and Processing: Big Data Storage (Query Workload, Sharding,
Replication, CAP, ACID, BASE), Big Data Processing (Parallel Data Processing, Distributed Data
Processing, Shared Everything/Nothing Architecture, SCV).
UNIT-IV “BIG DATA” IN THE ENTERPRISE: The New Information Management Paradigm, Big
Data Implications for Industry, Emerging Database Landscape, Application Architectures for Big Data
and Analytics, Data Modeling Approaches for Big Data and Analytics Solutions, Big Data Analytics
Methodology, Extracting Value from Big Data: In-Memory Solutions, Real Time Analytics and
Recommendation Systems.
UNIT-V THE BIG DATA ANALYSIS LIFECYCLE: From Dataset Identification to Integration,
Analysis and Visualization, Common Analysis and Analytics Techniques, A/B Testing, Regression,
Correlation, Text Analytics, Sentiment Analysis, Time Series Analysis, Network Analysis, Spatial
Analysis, Automated Recommendation, Classification, Clustering, Machine Learning, Natural
Language, Semantics, Data Visualization and Visual Analysis, Assessing Hierarchies, Part-to-Whole
Relationships, Plotting Connections and Relationships, Mapping Geo-Spatial Data, Foundational Big
Data Technology Mechanisms, Big Data & Cloud Computing.
SUGGESTED BOOKS
1. Turban, E., Aronson, J. E., Liang, T. P. & Sharda, R., Decision Support and Business Intelligence Systems, 8th
Edition, John Wiley & Sons
2. Frank J. Ohlhorst, Big Data Analytics, 1st Edition, Wiley
3. Efraim Turban, Ramesh Sharda, Jay Aronson, David King, Decision Support and Business
Intelligence Systems, 9th Edition, Pearson Education