DEPARTMENT OF INFORMATION TECHNOLOGY
FACULTY OF COMPUTING
MODULE OUTLINE
Module Name Massive Data Processing and Cloud Computing
Module Code IT3061 Version No. 2018 - 0
Year/Level 3 Semester 2
Credit Points 04
Pre-requisites None
Co-requisites None
Methods of Delivery Lectures (Face-to-face) 2 Hours/Week
Tutorials 1 Hour/Week
Labs 2 Hours/Week
Course Web Site http://courseweb.sliit.lk/
Date of Original Approval April, 2018
Date of Next Review April, 2020
MODULE DESCRIPTION
Introduction This is a compulsory unit for 3rd year students who follow the BSc (Hons) in IT
specializing in Data Science program. The objective of this unit is to provide a learning
environment in which students gain an in-depth understanding of both the theory and
practice of Big Data and Cloud Computing.
In this unit, the students will learn:
- The Hadoop ecosystem and how the MapReduce algorithm works
- Scala programming and Stream processing
- Concepts related to Cloud Computing
- Storage mechanisms used with the cloud
- Different security aspects of the cloud
- Different architectures in cloud computing
Learning Outcomes At the end of the module, students will be able to:
LO1: Configure the Hadoop framework and supporting tools to execute the MapReduce
programming model for distributed processing
LO2: Configure the fundamental elements of cloud computing
LO3: Compare different storage aspects in cloud computing
LO4: Explain different aspects of security in cloud computing
LO5: Design and develop databases in the cloud
LO6: Use different architectures when developing cloud-based applications
Assessment Criteria The following assessments will be held during the semester.
Assignments: The test will be based on the practical work, the questions discussed in
tutorial sessions, and the lecture material covered up to the week before it is held.
Mini Project: Students are required to develop an application that runs on the Hadoop
framework to address a real-world problem.
The final examination will be a comprehensive examination based on the practical work
and lecture materials covered during the semester.
The distribution of marks for the assessed components of the unit is as follows:
Continuous Assessments
Assignments 40 % LO1-LO6
Mini Project 20 % LO1
End Semester Assessment
Final Examination 40 % LO1-LO6
TOTAL 100 %
Student Workload
Estimated Contact Hours
Lecture 26 hours
Tutorial 13 hours
Laboratory 26 hours
Time Allocated for Assessments
Continuous Assessments 15 hours
Final Examination 02 hours
Reading and Independent Study 118 hours
TOTAL 200 hours
Module Requirement To pass this module, students need to obtain a pass mark in both the
“Continuous Assessments” and “End of the Semester Examination” components, resulting
in an overall mark that qualifies for a “C” grade or above.
Primary References
1. Ray J. Rafaels, Cloud Computing: From Beginning to End, 1st Edition, CreateSpace
Independent Publishing Platform, 2015.
2. Arshdeep Bahga, Vijay Madisetti, Cloud Computing: A Hands-On Approach, 1st
Edition, Arshdeep Bahga, Vijay Madisetti, 2014.
CONTENTS OF THE MODULE
1. Hadoop Eco-system LO1
Map-Reduce algorithm
Hadoop Architecture
Installing and configuring Hadoop framework
HDFS Architecture and HDFS operations
MapReduce program model for distributed processing
2. Introduction to Cloud Computing LO2
What is Cloud?
Characteristics of cloud computing
Cloud service models
Cloud deployment models
Advantages of cloud computing
Cloud providers
3. Elastic Cloud Computing LO3
What is elasticity in cloud computing?
Benefits of elastic cloud computing
Amazon EC2
4. Elastic Cloud Storage LO3
What is ECS?
Benefits of ECS
Amazon Storage services
5. Load balancer LO4
What is load balancing?
Why do we need load balancing?
Load balancing methods
Amazon provided load balancers
6. Autoscaling LO4
What is Autoscaling?
Why do we need Autoscaling?
Amazon Autoscaling
7. Cloud Security LO5
What is cloud security?
Security issues in the cloud
How to minimize security issues?
Amazon security
8. Cloud Database LO6
What is a Cloud Database?
Why cloud databases?
AWS Database services (RDS)
9. APIs/ Endless Endpoint/ Digital Transformation LO7
What is a Cloud API?
What can we do with a Cloud API?
AWS CLI
10. Containers/ Docker LO7
What is containerization?
Introduction to Docker
11. Container Orchestration / Kubernetes LO7
What is container orchestration?
Introduction to Kubernetes
12. Microservice Architecture LO8
Introduction to application architecture
What is microservices architecture?
Differences between monolithic and microservices architecture
Benefits of microservices architecture
13. Container Native Architecture / Serverless Architecture / Service Mesh Architecture LO8
Introduction to Serverless Architecture
Serverless architecture patterns
Serverless best practices
AWS Lambda
GENERIC INFORMATION
Plagiarism of any type is not allowed.
Plagiarism: Academic honesty is crucial to a student’s credibility and self-esteem, and
ultimately reflects the values and morals of the Institute as a whole. A student may work
together with one or a group of students discussing assignment content, identifying relevant
references, and debating issues relevant to the subject. Plagiarism occurs when the work of
another person, or persons, is used and presented as one’s own.
--------------------------End of Module Outline--------------------------