0% found this document useful (0 votes)
80 views

Data Analytics: Department of Computer Science & Engineering

This document provides information about the Data Analytics course (CS61061) taught by Dr. Debasis Samanta. It discusses the course organization, syllabus, objectives, plan, materials, evaluation plan, and attendance policy. The course aims to cover fundamental algorithms and techniques in data analytics, including statistical foundations, machine learning, data management and visualization. Students will learn how to find patterns in data, implement analytic algorithms, handle large scale analytics projects, and develop decision support systems. The syllabus covers topics like descriptive statistics, basic/advanced analysis techniques, and case studies. Reference materials and the course calendar are available online. Students will be continuously evaluated through tests, with a minimum 75% attendance required.

Uploaded by

ARUOS Soura
Copyright
© © All Rights Reserved
Available Formats
Download as PPSX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
80 views

Data Analytics: Department of Computer Science & Engineering

This document provides information about the Data Analytics course (CS61061) taught by Dr. Debasis Samanta. It discusses the course organization, syllabus, objectives, plan, materials, evaluation plan, and attendance policy. The course aims to cover fundamental algorithms and techniques in data analytics, including statistical foundations, machine learning, data management and visualization. Students will learn how to find patterns in data, implement analytic algorithms, handle large scale analytics projects, and develop decision support systems. The syllabus covers topics like descriptive statistics, basic/advanced analysis techniques, and case studies. Reference materials and the course calendar are available online. Students will be continuously evaluated through tests, with a minimum 75% attendance required.

Uploaded by

ARUOS Soura
Copyright
© © All Rights Reserved
Available Formats
Download as PPSX, PDF, TXT or read online on Scribd
You are on page 1/ 13

Data Analytics

(CS61061)

Dr. Debasis Samanta


Associate Professor
Department of Computer Science & Engineering
Let us discuss about…
• Course organization

• Syllabus

• Course objective

• Course plan

• Reference and study materials

• Virtual classroom with Team Microsoft

• Contact details
…Course organization
• Title: Data Analytics

• Code: CS61061

• Credit: 3-1-0 = 4

• Slot: D4

• Timing
Monday: 12:00-12:55
Tuesday: 10:00-11:55
Thursday: 08:00-08:55

• Mode: Online Classes with Video Conferencing


…Course objectives
This course will cover fundamental algorithms and techniques used in Data
Analytics. The statistical foundations will be covered first, followed by various
machine learning and data mining algorithms. Technological aspects like data
management, scalable computation and visualization will also be covered.
In summary, this course will provide exposure to theory as well as practical
systems and software used in data analytics.

After completing this course, you will learn how to:

• Find a meaningful pattern in data


• Graphically interpret data
• Implement the analytic algorithms
• Handle large scale analytics projects from various domains
• Develop intelligent decision support systems
…Syllabus
• Data definition
• Concept of data
• Data vs. Information
• Data categorization

• Descriptive Statistics
• Measure of central tendency
• Measure of location of dispersion

• Basic Analysis Techniques


• Statistical hypothesis generation and testing
• Chi-Square test
• t-Test, Analysis of variance, Correlation analysis
• Maximum likelihood test
…Syllabus
• Advanced Analysis Techniques
• Regression analysis
• Classification techniques
• Clustering techniques
• Association analysis

• Case Studies and Projects


• Understanding few business scenarios
• Feature engineering and visualization
• Scalable and parallel computing with Hadoop and MapReduce
• Sensitivity analysis
…Study materials
1. Probability & Statistics for Engineers & Scientists (9th Edn.), Ronald E. Walpole, Raymond
H. Myers, Sharon L. Myers and Keying Ye, Prentice Hall Inc.

2. The Elements of Statistical Learning, Data Mining, Inference, and Prediction (2nd Edn.),
Trevor Hastie Robert Tibshirani, Jerome Friedman, Springer, 2014

3. An Introduction to Statistical Learning: with Applications in R, G. James, D. Witten, T


Hastie, and R. Tibshirani, Springer, 2013

4. Software for Data Analysis: Programming with R (Statistics and Computing),


John M. Chambers, Springer, 2012

5. Mining Massive Data Sets, A. Rajaraman and J. Ullman, Cambridge University


Press, 2012.

6. Advances in Complex Data Modeling and Computational Methods in


Statistics, Anna Maria Paganoni and Piercesare Secchi, Springer, 2013
…Study materials
7. Data Mining and Analysis, Mohammed J. Zaki, Wagner Meira, Cambridge
University Press, 2012

8. Hadoop: The Definitive Guide (2nd Edn.) by Tom White, O-Reilly, 2014

9. MapReduce Design Patterns: Building Effective Algorithms and Analytics for


Hadoop and Other Systems, Donald Miner, Adam Shook, O'Reilly, 2014

10. Beginning R: The Statistical Programming Language, Mark Gardener, Wiley,


2013

Lecture slides, videos, and tutorial materials will


be available at SharePoint of Microsoft Teams’s virtual
classroom portal of Data Analytics
Course Plan

Course calendar is here…


(Also, find it at SharePoint of Microsoft Teams’s virtual
classroom portal of Data Analytics)
…Evaluation plan
•• Continuous
  evaluation throughout the semester

Three short tests (2 best scores out of 3)


(Weightage: 2 = 40)

Two long tests


(Weightage: 2 = 60)

Note:
Other than some medical emergency, no compensatory test will be allowed.
Attendance…
• Minimum attendance required:
75% of the total classes
Automated Attendance Marking …
Happy
Learning!

You might also like