0% found this document useful (0 votes)
3 views2 pages

Principles of Data Science

The document outlines a course on Principles of Data Science, detailing prerequisites, course outcomes, and five units covering exploratory data analysis, data collection and preparation, linear algebra and statistics, probability and hypothesis testing, and recommender systems. Students will learn various data visualization techniques, statistical analysis, and how to apply Python toolkits for data science applications. The course includes textbooks and references for further study.

Uploaded by

iothod
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views2 pages

Principles of Data Science

The document outlines a course on Principles of Data Science, detailing prerequisites, course outcomes, and five units covering exploratory data analysis, data collection and preparation, linear algebra and statistics, probability and hypothesis testing, and recommender systems. Students will learn various data visualization techniques, statistical analysis, and how to apply Python toolkits for data science applications. The course includes textbooks and references for further study.

Uploaded by

iothod
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

PRINCIPLES OF DATA SCIENCE

(Professional Elective-V)

Course Code: 22IT1159 LTPC


3 0 03

Pre-Requisites:Programming with Python Lab, Numerical Methods, Probability and Statistics

COURSE OUTCOMES:
At the end of the course, a student will be able to
CO 1: Describe Various Data Visualization techniques. (L2)
CO 2: Explain the mathematical foundations needed for data science. (L2)
CO 3: Illustrate the proficiency with statistical analysis of data. (L3)
CO 4: Apply various data preparation operations. (L3)
CO 5: Model data science applications using Python based toolkits. (L3)

UNIT-I (10 Lectures)


EXPLORATORY DATA ANALYSIS - A FIRST LOOK: Exploring a new dataset, Summarizing numerical data,
Anomalies in numerical data. [Textbook – 2]
VISUALIZING DATA: Visualizing relations between variables [Textbook – 2], The Ascendance of Data, Data Science
Definition, Motivating Hypothetical: DataSciencester, matplotlib, Bar Charts, Line Charts, Scatterplots. [Textbook – 1]

Learning Outcomes: At the end of the module, student will be able to


1. Explain Data science with applications. (L2)
2. Summarize the numerical data for a given domain data (L2)
3. Distinguish various visualization techniques. (L2)

UNIT-II (10 Lectures)


GETTING DATA:
Standard input and standard output, Reading Files, Scraping the Web, Using APIs, Example: Using the Twitter APIs.
WORKING WITH DATA:
Exploring Your Data, Using Named Tuples, Data classes, Cleaning and Munging, Manipulating Data, Rescaling, An Aside:
tqdm, Dimensionality Reduction

Learning Outcomes: At the end of the module, student will be able to


1. Select various data collection techniques. (L3)
2. Organize the input data to be suitable for the model to be trained. (L3)
3. Choose appropriate data pre-processing techniques.

UNIT-III (8 Lectures)
LINEAR ALGEBRA, STATISTICS FOR DATA SCIENCE:
Linear Algebra: Vectors, Matrices, Statistics: Describing a Single Set of Data, Correlation, Simpson’s Paradox, Some
Other Correlational Caveats, Correlation and Causation. [Textbook – 1]

Learning Outcomes: At the end of the module, student will be able to


1. Illustrate the use of Vectors and Matrices for input dataset preparation. (L2)
2. Describe Central Tendencies and Dispersion of the data. (L2)
3. Summarize various correlation techniques to explore input dataset. (L2)

UNIT-IV (12 Lectures)


PROBABILITY, HYPOTHESIS AND INFERENCE FOR DATA SCIENCE:
Dependence and Independence, Conditional Probability, Bayes’s Theorem, Random Variables, Continuous Distributions,
The Normal Distribution, The Central Limit Theorem.
Hypothesis and Inference: Statistical Hypothesis Testing, Example: Flipping a Coin, p-Values, Confidence Intervals, p-
Hacking, Example: Running an A/B Test, Bayesian Inference.
[Textbook – 1]

Learning Outcomes: At the end of the module, student will be able to


1. Describe conditional probability and Bayes’ theorem with a real-world example. (L2)
2. Illustrate the use of distributions to understand the data. (L2)
3. Explain various statistical techniques to understand the data. (L2)

UNIT-V (10 Lectures)


RECOMMENDER SYSTEMS:
Manual Curation, Recommending What’s Popular, User-Based Collaborative Filtering, Item-Based Collaborative Filtering,
Matrix Factorization. [Textbook – 1]

Learning Outcomes: At the end of the module, student will be able to


1. Apply various Collaborative filtering techniques. (L3)
2. Develop Recommender systems application. (L3)
3. Describe Matrix Factorization. (L2)
TEXT BOOKS:
1. Joel Grus, “Data Science from Scratch”, 2nd Edition, O’Reilly Media, Inc, 2019.
2. Ronald K. Pearson, “Exploratory Data Analysis using R”, Chapman & Hall/CRC Data Mining and Knowledge
Discovery Series, 2018.
REFERENCES:
1. Anil Maheshwari, “Data Analytics”, 2nd Edition, MC Graw Hill Education, 2017.
2. Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani, “An Introduction to Statistical Learning with
Applications in R”, 7th Edition, Springer, 2016.
WEB REFERENCES:
1. https://nptel.ac.in/courses/106106179/

You might also like