Big Data Analytics
Semester III
20MITC15
UNIT-V SPARK
Introduction to data analytics with Spark, What is Apache Spark, A Unified
Stack, Downloading Spark, Spark’s Python and Scala Shells, Core Spark
concepts, Programming with RDDs, RDD Basics, RDD Operations, Passing
functions to Spark, Working with key/value pairs, Data Partitioning, Loading
and Saving your Data, File Formats
REFERENCES
1. Big Data Analytics, Seema Acharya, Subhashini Chellappan, Wiley
2. Learning Spark: Lightning-Fast Big Data Analysis, Holden Karau, Andy
Konwinski, Patrick Wendell, Matei Zaharia, O'Reilly Media, Inc.
3. Boris Lublinsky, Kevin T. Smith, Alexey Yakubovich, “Professional Hadoop
Solutions”, Wiley, ISBN: 9788126551071, 2015.
4. Chris Eaton, Dirk deRoos et al., “Understanding Big Data”, McGraw Hill, 2012.
5. Tom White, “Hadoop: The Definitive Guide”, O'Reilly, 2012.
6. Vignesh Prajapati, “Big Data Analytics with R and Hadoop”, Packt Publishing,
2013.
WEB REFERENCES
1. http://www.bigdatauniversity.com/