50% found this document useful (2 votes)
216 views3 pages

Big Data Analytics

This document outlines the course curriculum for a semester on big data analytics. It includes 5 units that cover: 1) an introduction to big data and analytics, including characteristics of data and challenges; 2) an introduction to Hadoop, including its history, architecture, and uses; 3) MapReduce programming; 4) introductions to Hive and Pig for querying and analyzing large datasets; and 5) Spark, an analytics engine for large-scale data processing. References include textbooks on Hadoop, big data analytics, and online learning resources.

Uploaded by

Athithya R
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
50% found this document useful (2 votes)
216 views3 pages

Big Data Analytics

This document outlines the course curriculum for a semester on big data analytics. It includes 5 units that cover: 1) an introduction to big data and analytics, including characteristics of data and challenges; 2) an introduction to Hadoop, including its history, architecture, and uses; 3) MapReduce programming; 4) introductions to Hive and Pig for querying and analyzing large datasets; and 5) Spark, an analytics engine for large-scale data processing. References include textbooks on Hadoop, big data analytics, and online learning resources.

Uploaded by

Athithya R
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

BIG DATA ANALYTICS

Semester III
20MITC15

UNIT I INTRODUCTION TO HADOOP AND BIG DATA ANALYTICS

Introduction to big data: Data, Characteristics of data and Types of digital


data, Sources of data, Working with unstructured data, Evolution and
Definition of big data, Characteristics and Need of big data, Challenges of big
data
Big data analytics: Overview of business intelligence, Data science and
Analytics, Meaning and Characteristics of big data analytics, Need of big data
analytics, Classification of analytics, Challenges to big data analytics,
Importance of big data analytics, Basic terminologies in big data environment

UNIT-II INTRODUCTION TO HADOOP


Introducing Hadoop, need of Hadoop, limitations of RDBMS, RDBMS versus
Hadoop, Distributed Computing Challenges, History of Hadoop , Hadoop
Overview, Use Case of Hadoop, Hadoop Distributors, HDFS (Hadoop
Distributed File System) , Processing Data with Hadoop, Managing Resources
and Applications with Hadoop YARN (Yet another Resource Negotiator),
Interacting with Hadoop Ecosystem
UNIT-III INTRODUCTION TO MAPREDUCE PROGRAMMING
Introduction , Mapper, Reducer, Combiner, Partitioner, Searching, Sorting ,
Compression, Real time applications using MapReduce, Data serialization and
Working with common serialization formats, Big data serialization formats

UNIT-IV: INTRODUCTION TO HIVE AND PIG


HIVE: Introduction to Hive, Hive Architecture, Hive Data Types, Hive File
Format, Hive Query Language (HQL), User-Defined Function (UDF) in Hive.
PIG: Introduction to Pig, The Anatomy of Pig, Pig on Hadoop, Pig Philosophy,
Use Case for Pig: ETL Processing, Pig Latin Overview, Data Types in Pig, Running
Pig, Execution Modes of Pig, HDFS Commands, Relational Operators, Piggy
Bank, Word Count Example using Pig , Pig at Yahoo!, Pig versus Hive

UNIT-V SPARK
Introduction to data analytics with Spark, What is Apache Spark, A Unified
Stack, Downloading Spark, Spark’s Python and Scala Shells, Core Spark
concepts, Programming with RDDS, RDD Basics, RDD Operations, Passing
functions to Spark, Working with key/value pairs, Data Partitioning, Loading
and Saving your Data, File Formats

REFERENCES
1. Big Data Analytics, Seema Acharya, Subhashini Chellappan, Wiley
2. Learning Spark: Lightning-Fast Big Data Analysis, Holden Karau, Andy
Konwinski, Patrick
Wendell, Matei Zaharia, O'Reilly Media, Inc.
3. Boris lublinsky, Kevin t. Smith, AlexeyYakubovich, “Professional Hadoop
Solutions”, Wiley,
ISBN: 9788126551071, 2015.
4. Chris Eaton,Dirk derooset al. , “Understanding Big data ”, McGraw Hill, 2012.
5. Tom White, “HADOOP: The definitive Guide”, O Reilly 2012.
6. VigneshPrajapati, “Big Data Analyticswith R and Hadoop”, Packet Publishing
2013.
WEB REFERENCES
1. http://www.bigdatauniversity.com/

You might also like