0% found this document useful (0 votes)
116 views3 pages

Data Science CS481 - Course Outline Spring 2020

This document provides information about the CS481 - Data Science course offered at National University of Computer & Emerging Sciences, Lahore. The course is a 3 credit hour elective course that introduces concepts of data science including data collection, analysis, machine learning algorithms, and visualization. It is taught on Mondays and Wednesdays from 5-6:20pm in room CS-3. Assessment includes quizzes, assignments, midterms, and a final exam. The primary textbook is Doing Data Science by Cathy O'Neil and Rachel Schutt, supplemented by other references.

Uploaded by

Hassaan Ahmed
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
116 views3 pages

Data Science CS481 - Course Outline Spring 2020

This document provides information about the CS481 - Data Science course offered at National University of Computer & Emerging Sciences, Lahore. The course is a 3 credit hour elective course that introduces concepts of data science including data collection, analysis, machine learning algorithms, and visualization. It is taught on Mondays and Wednesdays from 5-6:20pm in room CS-3. Assessment includes quizzes, assignments, midterms, and a final exam. The primary textbook is Doing Data Science by Cathy O'Neil and Rachel Schutt, supplemented by other references.

Uploaded by

Hassaan Ahmed
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

NATIONAL UNIVERSITY

of Computer & Emerging Sciences, Lahore

Department of Computer Science


CS481 – Data Science
SPRING 2020
Instructor Name: Zeeshan Ali Rana
Email address: [email protected]
Office Location/Number: 1st Floor Library Building
Telephone Ext.: 328
Office Hours: Mon – Thu 1100 – 1200 Hrs, Tue, Thu 1500 – 1600 Hrs

Course Information
Program: BS Credit Hours: 3 Type: Elective
Pre-requisites (if any): Programming competence, Discrete Maths, Linear Algebra, Probabilty &
Statistics

Class Time: Mon, Wed 1700 – 1820 Hrs


Class Venue: CS-3
Exam: See date sheet

Course Description/Objectives/Goals:
Data Science is the study of the generalizable extraction of knowledge from data. Being a data
scientist requires an integrated skill set spanning computer science, mathematics, statistics, and
domain expertise along with a good understanding of the art of problem formulation to engineer
effective solutions. The goal of this course is to teach students to answer questions with data. To do
this, we will learn the necessary skills to manage and analyze data with case studies. In this course
student learn concepts such as data collection and integration, exploratory data analysis, statistical
inference and modeling, machine learning, and high-dimensional data analysis.

Course Learning Outcomes:


At the end of the course students will be able to:
understand the basics of Data Science,
prepare and wrangle the data for analysis,
perform exploratory data analysis to investigate data so as to discover patterns, to spot anomalies, to test
hypothesis and to check assumptions with the help of summary statistics and graphical representations,
understand and apply machine learning algorithms to gain insight from the data
Textbook(s) /Supplementary Readings:
There is no standard one "textbook" for this course. The following book will be used as a primary
text to guide some of the discussions, but it will be heavily supplemented with lecture notes and
reading assignments from other sources.

Cathy O'Neil and Rachel Schutt. Doing Data Science, Straight Talk From The Frontline. O'Reilly. 2014.
ISBN 978-1-449-35865-5.
Additional references and books related to the course:
 Jure Leskovek, Anand Rajaraman and Jeffrey Ullman. Mining of Massive Datasets. v2.1,
Cambridge University Press. 2014. (Free online.)
 Jiawei Han, Micheline Kamber and Jian Pei. Data Mining: Concepts and Techniques, Third
Edition. Morgan Kaufmann Publishers. 2012. ISBN 978-0-12-381479-1.
 Kevin P. Murphy. Machine Learning: A Probabilistic Perspective. MIT Press. 2013. ISBN
0262018020. ( Online info available here.)
 Foster Provost and Tom Fawcett. Data Science for Business: What You Need to Know about
Data Mining and Data-analytic Thinking. O'Reilly 2013. ISBN 978-1-449-36132-7.

Tentative Weekly Schedule


Week Topics to be covered
1  Introduction to Data Science
 Introduction to Machine Learning: Supervised, unsupervised
learning
 Intro to Linear Regression: Model Representation for single variable
Linear Regression,
2  Gradient Descent
 Multiple Variable Linear Regression and Polynomial Regression,
Normal Equation, Logistic Regression
3  Logistic Regression
 Regularization

4 -5  Advice for applying ML , ML System Design, model selection,


parameters optimization, learning curves, model’s underfitting and
overfitting detection and solution, evaluation measures

5-6  Support Vector Machines (linear, SVM with Kernels)


 Clustering Algorithms

7  Features engineering, Dimensionality Reduction (Principal


Component Analysis)
8  Deep Learning topics
9-10  Introduction to Statistics basics, descriptive stat, statistical
inference
 Data Wrangling
o Data cleaning, data reshaping, data preprocessing

11  Exploratory Data Analysis


o Basic tools (plots, graphs and summary statistics) of EDA
 Statistical measures for analysis

12-13  Exploratory Data Analysis


o Use of R-squared and other measures for analysis
o Model transformations

14-15  Data visualization

(Tentative) Grading Criteria


Quizzes 10%
Assignments/In-class Exercises/Project 15 - 20%
Midterms 30%
Final Exam 40 - 45%
Total: 100 %

Course Policies
 Course outline may change 10-20% as we proceed in the semester
 There will be no retake of quizzes or exams.
 Integrity in the assignments/quizzes is expected; otherwise result would be an F
grade in the course or may be the case is forwarded to Disciplinary committee.
 Attendance MUST be ensured according to the University policy to avoid
disqualification.

You might also like