0% found this document useful (0 votes)
74 views

Post Graduate Diploma in Data Science (PGDDS)

The document outlines the curriculum for a Post Graduate Diploma in Data Science program over 4 semesters. The first two semesters cover the basics of statistics, data structures, algorithms, R and Python programming, data warehousing, mining and big data. Semester 3 includes courses on NoSQL databases, data visualization, machine learning with R and Python. Semester 4 covers emerging trends like deep learning, AI, business intelligence and a capstone project. Students are required to complete submissions and a final project to demonstrate their learning.

Uploaded by

yash borkar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
74 views

Post Graduate Diploma in Data Science (PGDDS)

The document outlines the curriculum for a Post Graduate Diploma in Data Science program over 4 semesters. The first two semesters cover the basics of statistics, data structures, algorithms, R and Python programming, data warehousing, mining and big data. Semester 3 includes courses on NoSQL databases, data visualization, machine learning with R and Python. Semester 4 covers emerging trends like deep learning, AI, business intelligence and a capstone project. Students are required to complete submissions and a final project to demonstrate their learning.

Uploaded by

yash borkar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

POST GRADUATE DIPLOMA IN DATA SCIENCE (PGDDS)

PROGRAMME CURRICULUM

Semester - I Semester II

Basics of Statistics Big data with Data Warehousing and Data Mining
1. Basics of Statistics 1. Fundamentals of Data Warehouse
2. Data Collection and Measurement 2. Architecture of Data Warehouse
3. Data Presentation 3. Dimensional Modelling
4. Data Processing and Analysis 4. ETL and OLAP
5. Measures of Central Tendency (Mean, 5. Introduction to Data Mining
Median and Mode) 6. Data Mining Techniques
6. Measures of Dispersion 7. Applications of Data Mining
7. Correlation
8. Introduction to Big Data
Introduction to Data Science 9. Hadoop Ecosystem
1. Basics of Data 10. Querying big data with Hive
2. Basics of Data Science
3. Big Data, Datafication & its impact on Data Advanced Statistics
Science 1. Sampling and Sampling Technique
4. Data Science Pipeline, EDA & Data 2. Probability
Preparation 3. Normal Distribution
5. Data Scientist Toolbox, Applications & Case 4. Linear Regression
Studies 5. Multiple Linear Regression
6. Random Variables
Data Structures and Algorithms
1. Programming Fundamentals
Python Programming
2. Control Flow
1. Introduction to Python
3. Arrays and Pointers
2. Variables, expressions and statements
4. Functions
6. Stacks and Queues 3. Control Structures, Data structures- Arrays
7. Linked Lists and Linked lists, Queues
8. Trees 4. Functions
9. Searching Algorithms 5. Conditionals, recursion and iteration
10. Sorting Algorithms 6. Strings
11. Graphs
7. Lists and Tuples
Introduction to R Programming 8. Dictionaries
1. Introduction to R 9. Object Oriented Programming
2. Data Types and Data Structures 11. Files and Error Handling
3. Loops and Functions in R 12. Testing, Debugging and Profiling
4. Mathematics in R 13. Handling data with Python
5. Graphs 14. Python Graphical User Interface
6. String Manipulation and Input/output Development
7. Object Oriented Programming – I Submission I
8. Object Oriented Programming – II In Semester II students are required to submit a
9. Debugging and Condition Handling submission as per guidelines given by SCDL.
10. Introduction to Parallel Computing in R

1|Page
POST GRADUATE DIPLOMA IN DATA SCIENCE (PGDDS)
PROGRAMME CURRICULUM

Semester III
Ethical and Legal Issues in Data Science
NoSQL Databases 1. What are Ethics?
1. Introduction to NoSQL
2. Some Ethical concern of Data Science
2. Basics of NoSQL
3. History, Concept of Informed Consent
3. Replication and Sharding
4. Data Ownership
4. Key-Value Databases
5. Privacy, Anonymity, Data Validity
5. Document Databases
6. Algorithmic Fairness
6. Column-Oriented Databases
7. Societal Consequences
7. Graph Databases
8. Code of Ethics
8. Advanced NoSQL

Data Visualisation Semester IV


1. Introduction to Data Visualisation Emerging Trends in Data Science
2. Visualisation of Numerical Data 1. Big Data
3. Visualisation of Non-numerical Data 2. Apache Spark and Scala
4. Common Visualisation Idioms 3. Deep Learning
5. Visualisation of Spatial Data, Networks and 4. Artificial Intelligence
5. Business Intelligence
Trees
6. Natural language processing
6. Data Reduction
7. Data Analytics
7. Introduction to Tableau 8. Web Analytics
8. Data Visualisation with SPSS 9. Case Study

Machine Learning with R and Python Submission II


1. Basics of Machine Learning In Semester IV students are required to submit a
2. Supervised Machine Learning submission as per guidelines given by SCDL.
3. Unsupervised Learning
Project
4. Regression Algorithms Student should choose a technical or Techno-
5. Clustering Models business topic of his/her interest and is required
6. R Markdown, Knitr, Rpubs to develop the Project based on the provided
7. ggplot2 guidelines.
8. Computation with Python – NumPy, SciPy
9. Pandas
10. Aggregating and Analysing Data with dplyr
11. Data Visualisation in Python – Matplotlib
12. Introduction to scikit-learn
13. Web Scraping in Python – Beautiful Soup
14. Introduction to (Py) Spark

2|Page

You might also like