Post Graduate Diploma in Data Science (PGDDS)
Post Graduate Diploma in Data Science (PGDDS)
PROGRAMME CURRICULUM
Semester - I Semester II
Basics of Statistics Big data with Data Warehousing and Data Mining
1. Basics of Statistics 1. Fundamentals of Data Warehouse
2. Data Collection and Measurement 2. Architecture of Data Warehouse
3. Data Presentation 3. Dimensional Modelling
4. Data Processing and Analysis 4. ETL and OLAP
5. Measures of Central Tendency (Mean, 5. Introduction to Data Mining
Median and Mode) 6. Data Mining Techniques
6. Measures of Dispersion 7. Applications of Data Mining
7. Correlation
8. Introduction to Big Data
Introduction to Data Science 9. Hadoop Ecosystem
1. Basics of Data 10. Querying big data with Hive
2. Basics of Data Science
3. Big Data, Datafication & its impact on Data Advanced Statistics
Science 1. Sampling and Sampling Technique
4. Data Science Pipeline, EDA & Data 2. Probability
Preparation 3. Normal Distribution
5. Data Scientist Toolbox, Applications & Case 4. Linear Regression
Studies 5. Multiple Linear Regression
6. Random Variables
Data Structures and Algorithms
1. Programming Fundamentals
Python Programming
2. Control Flow
1. Introduction to Python
3. Arrays and Pointers
2. Variables, expressions and statements
4. Functions
6. Stacks and Queues 3. Control Structures, Data structures- Arrays
7. Linked Lists and Linked lists, Queues
8. Trees 4. Functions
9. Searching Algorithms 5. Conditionals, recursion and iteration
10. Sorting Algorithms 6. Strings
11. Graphs
7. Lists and Tuples
Introduction to R Programming 8. Dictionaries
1. Introduction to R 9. Object Oriented Programming
2. Data Types and Data Structures 11. Files and Error Handling
3. Loops and Functions in R 12. Testing, Debugging and Profiling
4. Mathematics in R 13. Handling data with Python
5. Graphs 14. Python Graphical User Interface
6. String Manipulation and Input/output Development
7. Object Oriented Programming – I Submission I
8. Object Oriented Programming – II In Semester II students are required to submit a
9. Debugging and Condition Handling submission as per guidelines given by SCDL.
10. Introduction to Parallel Computing in R
1|Page
POST GRADUATE DIPLOMA IN DATA SCIENCE (PGDDS)
PROGRAMME CURRICULUM
Semester III
Ethical and Legal Issues in Data Science
NoSQL Databases 1. What are Ethics?
1. Introduction to NoSQL
2. Some Ethical concern of Data Science
2. Basics of NoSQL
3. History, Concept of Informed Consent
3. Replication and Sharding
4. Data Ownership
4. Key-Value Databases
5. Privacy, Anonymity, Data Validity
5. Document Databases
6. Algorithmic Fairness
6. Column-Oriented Databases
7. Societal Consequences
7. Graph Databases
8. Code of Ethics
8. Advanced NoSQL
2|Page