Foundations of Data Science.docx
Foundations of Data Science.docx
30 2 4
COURSE OBJECTIVES:
UNIT- I INTRODUCTION 8
Healthcare- Drug development, Virtual healthcare assistance- Finance- Fraud detection- Marketing-
Targeted advertising, Customer interactions- Transportation - Driverless cars, Airline routing.
TOTAL: 45 PERIODS
PRACTICAL EXERCISES:
1. Download, install and explore the features of NumPy, SciPy, Jupyter, Statsmodels and Pandas
packages.
2. Working with Numpy arrays
3. Working with Pandas data frames
4. Reading data from text files, Excel and the web and exploring various commands for doing
descriptive analytics on the Iris data set.
5. Use the diabetes data set from UCI and Pima Indians Diabetes data set for performing the
following:
a. Univariate analysis: Frequency, Mean, Median, Mode, Variance, Standard Deviation,
Skewness and Kurtosis.
b. Bivariate analysis: Linear and logistic regression modeling
c. Multiple Regression analysis
d. Also compare the results of the above analysis for the two data sets.
6. Apply and explore various plotting functions on UCI data sets.
a. Normal curves
b. Density and contour plots
c. Correlation and scatter plots
d. Histograms
e. Three dimensional plotting
7. Visualizing Geographic Data with Basemap
8. Importing Data from External Source Using Python
SOFTWARE REQUIREMENTS
Python, Numpy, Scipy, Matplotlib, Pandas, statmodels, seaborn, plotly, bokeh
TOTAL : 30 PERIODS
TOTAL: 75 PERIODS
COURSE OUTCOMES:
TEXT BOOKS
1. David Cielen, Arno D. B. Meysman, and Mohamed Ali, “Introducing Data Science”, Manning
Publications, 2016. (first two chapters for Unit I)
2. Robert S. Witte and John S. Witte, “Statistics”, Eleventh Edition, Wiley Publications, 2017.
(Chapters 1–7 for Units II)
3. Jake VanderPlas, “Python Data Science Handbook”, O’Reilly, 2016. (Parts of chapters 2–4 for
Units III,IV and V)
REFERENCES
1. Allen B. Downey, “Think Stats: Exploratory Data Analysis in Python”, Green Tea Press, 2014
2.Sanjeev J. Wagh, Manisha S. Bhende, Anuradha D. Thakare, “Fundamentals of Data
Science”, CRC Press, 2022
3.Chirag Shah, “A Hands-On Introduction to Data Science”, Cambridge University Press
CO PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11
CO1 2 2 1 2 2 - - - 1 1 1
CO2 2 1 - 1 1 - - - 2 1 1
CO3 2 2 1 2 2 1 1 - 1 2 1
CO4 3 2 2 1 2 - - - 1 1 2
CO5 2 2 1 2 2 - - - 1 1 1
CO6 2 2 1 2 2 - - - 1 1 1