SOE - Syllabus CSE (Data Science) 2 March 2023
Sr No  Course Code  Course Title                                 Hours per Week    Credits  Maximum Marks
                                                                 L     T     P              Continual Assessment  End Sem Examination  Total
1      DS501T       Theory of Computation                        3     -     -     3        30                    70                   100
2      DS502T       Software Engineering and Project Management  2     -     -     2        15                    35                   50
3      DS503T       Machine Learning for Data Science            3     -     -     3        30                    70                   100
4      DS503P       Machine Learning for Data Science Lab        -     -     2     1        25                    25                   50
5      DS504T       Professional Elective-I                      3     -     -     3        30                    70                   100
6      DS505P       Computer Lab-II                              -     -     2     1        25                    25                   50
7      DS506T       Open Elective-I                              3     -     -     3        30                    70                   100
8      AS501T       Economics and Management                     4     -     -     4        30                    70                   100
9      AS502T       English for Engineers                        2     -     -     2        25                    25                   50
10     DS507P       Technical Skill Development                  -     -     2     1        50                    -                    50
11     DS508P       Career Development *                         -     -     4     0        -                     -                    -
                    Total                                        20    0     10    23       255                   495                  750
Sr No  Course Code  Course Title      Hours per Week    Credits  Maximum Marks
                                      L     T     P              Continual Assessment  End Sem Examination  Total
1      DS601T       Operating System  3     -     -     3        30                    70                   100
FIFTH SEMESTER
                                                         Evaluation
Course Code  Course Name            Th  Tu  Pr  Credits  CA   ESE  Total
DS501T       Theory of Computation  3   -   -   3        30   70   100
Unit I [7Hrs]
Strings, Alphabet, Language operations, Finite state machine definitions, Finite automaton model, Acceptance of strings and
language, Non deterministic finite automaton, Deterministic finite automaton, Equivalence between NFA and DFA, Conversion of
NFA into DFA, Moore and Mealy machines.
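As an illustration of the NFA-to-DFA conversion topic in this unit, the following is a minimal sketch of the subset construction, assuming an NFA without epsilon moves. The function names, the dictionary encoding of the transition function, and the example automaton are all hypothetical choices made for this sketch; they are not prescribed by the syllabus.

```python
from collections import deque


def nfa_to_dfa(nfa, start, accepting):
    """Subset construction for an NFA without epsilon moves.

    nfa maps (state, symbol) -> set of next states.
    Each DFA state is a frozenset of NFA states.
    """
    alphabet = sorted({symbol for (_, symbol) in nfa})
    dfa_start = frozenset([start])
    transitions = {}
    seen = {dfa_start}
    queue = deque([dfa_start])
    while queue:
        current = queue.popleft()
        for symbol in alphabet:
            # Union of all NFA moves from every state in the current subset.
            target = frozenset(
                nxt for state in current for nxt in nfa.get((state, symbol), ())
            )
            transitions[(current, symbol)] = target
            if target not in seen:
                seen.add(target)
                queue.append(target)
    # A subset is accepting if it contains at least one accepting NFA state.
    dfa_accepting = {subset for subset in seen if subset & set(accepting)}
    return transitions, dfa_start, dfa_accepting


def dfa_accepts(transitions, start, accepting, word):
    """Run the DFA on word and report whether it ends in an accepting state."""
    state = start
    for symbol in word:
        state = transitions[(state, symbol)]
    return state in accepting


# Hypothetical NFA over {0, 1} that accepts strings ending in "01".
example_nfa = {
    ("q0", "0"): {"q0", "q1"},
    ("q0", "1"): {"q0"},
    ("q1", "1"): {"q2"},
}
dfa_transitions, dfa_start, dfa_accepting = nfa_to_dfa(example_nfa, "q0", {"q2"})
```

Only subsets reachable from the start state are generated, which is why the resulting DFA is usually much smaller than the full power set of NFA states.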
Unit II [7Hrs]
Regular sets, Regular expressions, Identity Rule, Manipulation of regular expressions, Equivalence between RE and FA, Inter
conversion, Pumping lemma, Closure properties of regular sets (proofs not required), Chomsky hierarchy of languages, Regular
grammars, Right linear and left linear grammars, Equivalence between regular grammar and finite automaton, Inter conversion
between RE and RG.
Unit III [7Hrs]
Context free grammar, Derivation trees (Syntax tree and Parse tree), Ambiguous Grammar, Context Free Language (CFL),
Closure properties of CFL, Normal Form of grammar: Chomsky Normal form, Greibach normal form, CYK algorithm.
Unit IV [7Hrs]
Push Down Automaton, Turing Machine: Definition, Model of TM, Design of TM, Universal Turing Machine, Types of TMs (proofs
not required), Turing Computable Functions, Linear bounded automaton.
Unit V [7Hrs]
Decidability and Undecidability of problems, Properties of recursive & recursively enumerable languages, Halting problem, Post
correspondence problem, Ackermann function, Recursive Function: Basic functions and operations on them, Primitive recursive
function, μ-recursive function, Bounded Minimization, Unbounded Minimization.
Text Books
S.N  Title                                                            Authors                                   Edition      Publisher
1    Introduction to Automata Theory, Languages and Computation       J. E. Hopcroft, R. Motwani, J. D. Ullman  2nd Edition  Pearson Education, Asia
2    Theory of Computer Science, Automata, Languages and Computation  K. L. P. Mishra and N. Chandrasekaran     3rd Edition  PHI Learning
Reference Books
S.N  Title                                  Authors  Edition      Publisher
1    Introduction to Theory of Computation  Sipser   2nd Edition  Cengage publications
FIFTH SEMESTER
                                                                               Evaluation
Course Code  Course Name                                  Th  Tu  Pr  Credits  CA   ESE  Total
DS502T       Software Engineering and Project Management  2   -   -   2        15   35   50
Unit I [5Hrs]
What is Software?, Role of Software Engineer, Software development phases. Process Models: Waterfall Model, Prototype model.
Evolutionary Models: Incremental model, Spiral Model. Agile process: Scrum, Extreme programming.
Unit II [5Hrs]
Requirements Engineering: Initiating the process, Eliciting Requirements, Building the Requirements Model, Negotiating,
Validating requirements, Requirements Analysis, Scenario-Based Analysis, Requirements Modeling strategies, Flow-Oriented
Modeling, Class based modeling, SRS.
Unit V [5Hrs]
Software Quality Assurance: Concepts, Approaches, Software Quality Factor, Software Reviews, Software Reliability. Software
Configuration Management
Text Books
S.N  Title                                                              Authors                          Edition      Publisher
1    Software Engineering, A Practitioner's Approach                    Roger Pressman                   7th Edition  Tata McGraw Hill
2    Object Oriented Software Engineering Using UML, Patterns and Java  Bernd Bruegge & Allen H. Dutoit  2nd Edition
Reference Books
S.N  Title           Authors      Edition  Publisher
1    OOA and Design  Grady Booch           Addison-Wesley
FIFTH SEMESTER
                                                                     Evaluation
Course Code  Course Name                        Th  Tu  Pr  Credits  CA   ESE  Total
DS503T       Machine Learning for Data Science  3   -   -   3        30   70   100
Unit I [7Hrs]
Foundations for ML: ML Techniques overview, Validation Techniques (Cross-Validations), Overfitting and under-fitting, Data
Normalization, Hypothesis Evaluation, Feature Reduction/Dimensionality reduction, Linear and Logistic regression.
Unit II [7Hrs]
Discriminative ML Methods: Classification and its use cases, Decision Tree, Algorithm for Decision Tree Induction, Creating a
Perfect Decision Tree, Confusion Matrix, Random Forest algorithm, Bagging and Boosting
Unit IV [7Hrs]
Bayes Decision Theory: Bayes decision rule, Minimum error rate classification, Normal density and discriminant functions, Naïve
Bayes Classifiers, Probably Approximately Correct (PAC) learning, Support Vector Machine.
Unit V [7Hrs]
Unsupervised Learning: Clustering (K means, Hierarchical Clustering), Reinforcement Learning, Elements of Reinforcement
Learning, Exploration vs. Exploitation dilemma, Q-Learning.
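To make the Q-Learning and exploration vs. exploitation topics concrete, here is a minimal sketch of the tabular Q-learning update together with an epsilon-greedy action choice. The function names, the dictionary-based Q-table, and the default values of alpha (learning rate) and gamma (discount factor) are assumptions made for this illustration only.

```python
import random


def q_update(q, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """Apply one tabular Q-learning update and return the new Q-value.

    q is a dict keyed by (state, action); missing entries count as 0.0.
    Update rule: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a)).
    """
    best_next = max(q.get((next_state, a), 0.0) for a in actions)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q[(state, action)]


def epsilon_greedy(q, state, actions, epsilon, rng):
    """Explore with probability epsilon, otherwise exploit the best known action."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q.get((state, a), 0.0))


# Hypothetical two-action example: repeated reward of 1.0 for moving "right".
q_table = {}
moves = ["left", "right"]
first = q_update(q_table, "s0", "right", 1.0, "s1", moves)
second = q_update(q_table, "s0", "right", 1.0, "s1", moves)
choice = epsilon_greedy(q_table, "s0", moves, epsilon=0.0, rng=random.Random(0))
```

With epsilon set to 0 the agent always exploits; raising epsilon trades some immediate reward for exploration of less-visited actions.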
Text Books
S.N  Title                                                      Authors                            Edition  Publisher
1    Understanding Machine Learning: From Theory to Algorithms  Shalev-Shwartz, S., Ben-David, S.           Cambridge University Press
2    Pattern Recognition and Machine Learning                   Christopher Bishop                          Springer Verlag
Reference Books
S.N  Title                                                  Authors                     Edition      Publisher
1    Machine Learning: An Artificial Intelligence Approach  Tom Mitchell                2nd Edition  Morgan Kaufmann
2    Algorithms for Clustering Data                         A. K. Jain and R. C. Dubes  5th Edition  Prentice Hall PTR
3    Introduction to Machine Learning                       Ethem Alpaydin              2nd Edition  PHI
FIFTH SEMESTER
                                                                         Evaluation
Course Code  Course Name                            Th  Tu  Pr  Credits  CA   ESE  Total
DS503P       Machine Learning for Data Science Lab  -   -   2   1        25   25   50
2 To implement the concept of data pre-processing (working with various data types, fixing values, handling missing values and outliers, etc.) using a public dataset.
3 To learn various modern tools, packages and techniques for machine learning.
Text Books
S.N  Title                                                      Authors                            Edition  Publisher
1    Understanding Machine Learning: From Theory to Algorithms  Shalev-Shwartz, S., Ben-David, S.           Cambridge University Press
Reference Books
S.N  Title                                                  Authors                     Edition      Publisher
1    Machine Learning: An Artificial Intelligence Approach  Tom Mitchell                2nd Edition  Morgan Kaufmann
2    Algorithms for Clustering Data                         A. K. Jain and R. C. Dubes  5th Edition  Prentice Hall PTR
3    Introduction to Machine Learning                       Ethem Alpaydin              2nd Edition  PHI
Unit I [6Hrs]
Introduction to Distributed Database: Distributed Data Processing, Concepts of Distributed Database. Distributed vs.
Centralized Database System; advantage and application. Transparency, performance and reliability, Problem areas of Distributed
Database. Integrity Constraints in Distributed databases.
Unit II [6Hrs]
Distributed Database Architectures: DBMS standardization, Architectural models for Distributed DBMS – autonomy,
distribution and heterogeneity, Distributed Database architecture – Client/Server, Peer-to-peer distributed systems, MDBMS
Architecture, Distributed Catalog management.
Unit III [8Hrs]
Distributed Database Design: Design strategies and issues. Data Replication. Data Fragmentation – Horizontal, Vertical and
Mixed. Resource allocation. Semantic Data Control in Distributed DBMS. Distributed Query Processing, Distributed
Transaction Management
Unit IV [8Hrs]
Object Oriented Database Concept: Data types and Object, Evolution of Object Oriented Concepts, Characteristics of Object
Oriented Data Model. Object Hierarchies – Generalization, Specialization, Aggregation. Object Schema. Inter-object
Relationships, Similarities and difference between Object Oriented Database model and Other Data models.
Unit V [8Hrs]
OODBMS Architecture Approach : The Object Oriented DBMS Architecture, Performance Issue in Object Oriented DBMS,
Application Selection for Object Oriented DBMS, the Database Design for an Object Relational DBMS. The Structured Types and
ADTs, Object identity, Extending the ER Model, Storage and Access Methods, Query Processing, Query Optimization, Data
Access API (ODBC, DB Library, DAO, ADO, JDBC, OLEDB), CORBA.
Text Books
S.N  Title                                                           Authors                               Edition  Publisher
1    Principles of Distributed Database Systems                      Ozsu, M. Tamer and Patrick Valduriez           Pearson Education
2    Object Oriented Database System – Approaches and Architectures  C.S.R. Prabhu                                  PHI
Reference Books
S.N  Title                                                     Authors                                                Edition  Publisher
1    Database System Concepts                                  Abraham Silberschatz, Henry F. Korth and S. Sudarshan           McGraw Hill
2    Database Management System                                Gerald V. Post                                                  McGraw Hill
3    Database Systems – Design, Implementation and Management  Peter Rob, Carlos Coronel                                       Course Technology
FIFTH SEMESTER
                                                                                      Evaluation
Course Code  Course Name                                         Th  Tu  Pr  Credits  CA   ESE  Total
DS504T(ii)   Professional Elective-I: Data Warehousing & Mining  3   -   -   3        30   70   100
Unit I [8Hrs]
Data Warehousing and Business Analysis: - Data warehousing Components –Building a Data warehouse – Data Warehouse
Architecture – DBMS Schemas for Decision Support – Data Extraction, Cleanup, and Transformation Tools - Overview of ETL and
OLAP OLTP integration – comparison of OLAP with OLTP systems – ROLAP, MOLAP and HOLAP – Data Cube Computation
methods –Metadata – reporting tools – Query tools and Applications – Online Analytical Processing (OLAP) – OLAP and
Multidimensional Data Analysis – Data Warehouse implementation - Parallel execution – Materialized views. – Advanced SQL
support for OLAP.
Unit II [8Hrs]
Data Mining: - Fundamentals of Data Mining – Data Mining Functionalities – Data Preprocessing – Data Cleaning – Data
Integration and Transformation – Data Reduction – Data Discretization and Concept Hierarchy Generation – Efficient and Scalable
Data Mining Systems – Classification of Data Mining Systems.
Basic concepts of Association Rule Mining: - Frequent Item set Mining Methods – Mining Various Kinds of Association Rules –
Association Mining to Correlation Analysis – Constraint-Based Association Mining.
Unit IV [8Hrs]
Cluster Analysis: - Types of Data in Cluster Analysis - Measuring Data Similarity and Dissimilarity – A Categorization of Major
Clustering Methods – Partitioning Methods – Hierarchical methods – Density-Based Methods – Grid-Based Methods – Model-
Based Clustering Methods – Clustering High-Dimensional Data – Constraint-Based Cluster Analysis – Outlier Analysis.
Unit V [8Hrs]
Mining Object, Spatial, Multimedia, Text and Web Data: Multidimensional Analysis and Descriptive Mining of Complex Data
Objects – Spatial Data Mining – Multimedia Data Mining – Text Mining – Mining the World Wide Web – Graph Mining – Mining
biological data, social networks – Mining time series and sequence data.
Text Books
S.N  Title                                Authors                                    Edition  Publisher
1    Data Mining Concepts and Techniques  Jiawei Han, Micheline Kamber and Jian Pei  Third    Elsevier
Reference Books
S.N  Title                        Authors                                           Edition  Publisher
1    Introduction to Data Mining  Pang-Ning Tan, Michael Steinbach and Vipin Kumar           Pearson Education, 2007
To prepare students with the technical knowledge and skills needed to protect and defend computer systems and networks.
Students will be able to:
1. Understand common security terminology, threats, vulnerabilities, and security design principles.
2. Understand basic cryptography concepts, and specific commonly used algorithms and protocols.
3. Protect and defend computer systems and networks from cybersecurity attacks.
4. Express professional responsibilities and make judgments based on legal and ethical principles in the context of computing practice.
5. Obtain hands-on experience in using common security tools, such as firewalls, intrusion detection systems, and port scanning software.
Text Books
S.N  Title                              Authors                                   Edition  Publisher
1    Introduction to Computer Security  Michael T. Goodrich and Roberto Tamassia           Addison Wesley, 2011
2    Cryptography and Network Security  Atul Kahate                               4e       McGraw Hill
Reference Books
S.N  Title                              Authors            Edition  Publisher
1    Cryptography and Network Security  William Stallings  7e       Pearson
FIFTH SEMESTER
                                                   Evaluation
Course Code  Course Name      Th  Tu  Pr  Credits  CA   ESE  Total
DS505P       Computer Lab-II  -   -   2   1        25   25   50
2 To demonstrate local repository management and the version control mechanism (VCS) using Git.
3 To demonstrate remote repository management and the version control mechanism (VCS) using GitHub.
4 To demonstrate the concept of branching in local and remote repositories, along with the conflict resolution mechanism, using Git and GitHub.
5 To demonstrate build automation on a project available in a remote repository (GitHub) using a build tool (Maven).
10 To demonstrate platform as a service that uses OS-level virtualization for software delivery, with resource management and configuration, using Docker.
Text Books
S.N Title Authors Edition Publisher
Unit I [7Hrs]
Introduction: Introduction to Data Science – Evolution of Data Science – Data Science Roles – Stages in a Data Science Project –
Applications of Data Science in various fields – Data Security Issues.
Unit II [7Hrs]
Data Collection and Data Pre-Processing: Data Collection Strategies – Data Pre-Processing Overview – Data Cleaning – Data
Integration and Transformation – Data Reduction – Data Discretization.
Unit IV [6Hrs]
Data Visualization: Grids, axes, plots. Markers, colors, fonts, and styling. Types of plots: bar graphs, pie charts, histograms, scatter
plots.
Unit V [7Hrs]
Exploratory Data Analytics: Descriptive Statistics – Mean, Standard Deviation, Skewness and Kurtosis – Box Plots –Pivot Table –
Heat Map, Generalization Error – Out-of-Sample Evaluation Metrics – Cross Validation – Overfitting – Under Fitting.
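The descriptive statistics named in this unit can be computed directly from central moments. The sketch below is one possible illustration, assuming the population (biased) definitions of skewness and kurtosis and a hypothetical helper name `describe`; library implementations may apply sample corrections and can therefore return slightly different values.

```python
from statistics import fmean


def describe(values):
    """Return mean, standard deviation, skewness and kurtosis of a sample.

    Uses population (biased) central moments:
    skewness = m3 / m2**1.5 and kurtosis = m4 / m2**2 (not excess kurtosis).
    """
    n = len(values)
    mean = fmean(values)
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    m4 = sum((x - mean) ** 4 for x in values) / n
    return {
        "mean": mean,
        "std": m2 ** 0.5,
        "skewness": m3 / m2 ** 1.5,
        "kurtosis": m4 / m2 ** 2,
    }


# Hypothetical sample used only for illustration.
summary = describe([2, 4, 4, 4, 5, 5, 7, 9])
```

For this sample the positive skewness reflects the longer right tail created by the values 7 and 9.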
Text Books
S.N  Title                                                                 Authors                                    Edition  Publisher
1    Smarter Decisions: The Intersection of IoT and Data Science           Jojo Moolayil                                       PACKT
2    Doing Data Science                                                    Cathy O’Neil, Rachel Schutt                         O'Reilly
3    Data Science and Big Data Analytics                                   David Dietrich, Barry Heller, Beibei Yang           EMC, 2013
4    Handbook of Research on Cloud Infrastructures for Big Data Analytics  Raj, Pethuru                                        IGI Global
5    The Data Science Design Manual                                        Skiena, Steven S                                    CRC Press
Reference Books
S.N  Title                          Authors                 Edition  Publisher
1    Practical Data Science with R  Nina Zumel, John Mount           Manning
2    Data Science for Business      F. Provost, T. Fawcett
FIFTH SEMESTER
                                                               Evaluation
Course Code  Course Name                  Th  Tu  Pr  Credits  CA   ESE  Total
DS507P       Technical Skill Development  -   -   2   1        50   -    50
Text Books
S.N  Title                                                                                            Authors      Edition      Publisher
1    Full Stack JavaScript Development With MEAN: MongoDB, Express, AngularJS, and Node.JS            Colin Ihrig  1st Edition  SitePoint
2    Learning PHP, MySQL, JavaScript, CSS & HTML5: A Step-by-Step Guide to Creating Dynamic Websites  Robin Nixon  3rd Edition  O'Reilly Media
Unit I [8Hrs]
Importance of Operating Systems, Basic Concepts and Terminology, An Operating System Architecture, Types of
Operating System, Operating System as a Manager: Memory Management Functions, Processor Management
Functions, Device Management Functions, Information Management Functions. Operating system concepts, Files and Security,
System calls for Process Management, for File Management and for Directory Management.
Unit II [8Hrs]
Process Management Concepts: Processes and Threads, Process Model and Thread Model, Job Scheduler, Process
Scheduling, Operations on processes. Overview of Inter-process communication: Race Conditions, Critical Regions, Mutual Exclusion
with busy waiting, etc. CPU Scheduling: Introduction to Scheduling, Scheduling criteria, Scheduling Algorithms, Algorithm
Evaluation and Scheduling in different Systems.
Unit IV [8Hrs]
Memory Management: Single Contiguous Memory Management. Fixed Partition Memory Management: Introduction,
Allocation Algorithm, swapping, relocation and address translation. Variable Partition: Introduction, Allocation Algorithm, swapping,
relocation and address translation. Non-contiguous Allocation: general concepts, Paging, Segmentation. Virtual Memory
Management system: general concepts, Page replacement algorithm.
Unit V [8Hrs]
Operating system Security and Protection: Introduction, Security Threats, Attacks on Security, Security Violation through
parameters, Computer Worms, Computer Virus: Types of virus, Infection method, mode of operation, detection,
removal, prevention. Authentication: In Centralised Environment, Distributed Environment. Protection Mechanism: Protection
Framework, Access Control List, Capability List, Combined Schemes.
Text Books
S.N  Title                      Authors                                       Edition      Publisher
1    Operating System           A. Godbole                                    3rd Edition  TMH
2    Operating System Concepts  A. Silberschatz, Peter B. Galvin, Greg Gagne  8th Edition
Unit I [8Hrs]
Mathematical foundation, Important summation, combinatorics and logarithmic formulas for algorithmic analysis, algorithmic design
principles, Review of asymptotic notations & growth of functions, Analysis Framework – Asymptotic Notations and its properties –
mathematical analysis for recursive and non-recursive algorithms, recurrences, solutions of recurrence relations using technique of
recursion tree method, substitution method, and master method, probability distributions, analyzing control structures.
Unit II [9Hrs]
Asymptotic notations for analysis of algorithms, best, worst case and average case analysis, amortized analysis and its
applications, analysis of sorting algorithms such as selection sort, insertion sort, bubble sort, heap sort, lower bound proof,
elementary and advanced data structures with operations on them and their time complexity, sorting networks.
Unit III [8Hrs]
Divide and conquer basic strategy, binary search, quick sort, merge sort, Fast Fourier Transform etc. Greedy method - basic
strategy, 0/1 knapsack problem, application to job sequencing with deadlines problem, Strassen’s Matrix Multiplication, minimum
cost spanning trees - Prim’s algorithm and Kruskal’s algorithm, single source shortest path - Dijkstra’s and Bellman Ford algorithm,
closest-pair and convex-hull problems etc.
Unit IV [9Hrs]
Dynamic Programming basic strategy, multistage graphs, all pairs shortest path Floyd Warshall algorithm, single source shortest
paths, optimal binary search trees, travelling salesman problem, matrix chain multiplication, longest common subsequence
problem, Optimal Merge pattern – Huffman Trees.
Unit V [8Hrs]
Basic Traversal and Search Techniques, breadth first search and depth first search, connected components. Backtracking basic
strategy, 8-Queen’s problem, graph coloring, Hamiltonian Circuit Problem – Subset Sum Problem , randomized and approximate
algorithms, NP-hard and NP-complete problems, basic concepts, non-deterministic algorithms, NP-hard and NP-complete, Cook’s
Theorem, decision and optimization problems, polynomial reduction.
Text Books
S.N  Title                       Authors       Edition  Publisher
1    Introduction to Algorithms  T. H. Cormen           Prentice Hall of India
Reference Books
S.N  Title                           Authors                                             Edition       Publisher
1    Computer Algorithms             Horowitz, Sahni, Rajasekaran                                      Galgotia Publications Pvt. Ltd
2    Fundamentals of Algorithms      Brassard, Bratley                                                 Prentice Hall
3    Data Structures and Algorithms  Alfred V. Aho, John E. Hopcroft, Jeffrey D. Ullman  Reprint 2006  Pearson Education
1 Implementation and Time Analysis of sorting algorithms: Bubble Sort, Selection Sort, Insertion Sort, Merge Sort and Quick Sort.
2 Implementation and Time Analysis of linear and binary search algorithms.
3 Implementation of Max-Heap sort algorithm.
4 Implementation and Time Analysis of factorial program using iterative and recursive method.
5 Implementation of a knapsack problem using dynamic programming.
6 Implementation of chain matrix multiplication using dynamic programming.
7 Implementation of a knapsack problem using greedy method.
8 Implementation of Graph and Searching (BFS and DFS).
9 Implement Prim’s algorithm.
10 Implement Kruskal’s algorithm.
11 Implement LCS problem.
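As a starting point for the LCS experiment, the following is a minimal sketch of the standard dynamic-programming solution, shown in Python as an assumption since the list does not fix an implementation language. The function name `lcs` and the tie-breaking rule used while backtracking are choices made for this example; other valid longest common subsequences may exist for the same inputs.

```python
def lcs(a, b):
    """Return one longest common subsequence of strings a and b.

    table[i][j] holds the LCS length of a[:i] and b[:j]; the subsequence
    itself is recovered by walking back from table[len(a)][len(b)].
    """
    m, n = len(a), len(b)
    table = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            if a[i - 1] == b[j - 1]:
                table[i][j] = table[i - 1][j - 1] + 1
            else:
                table[i][j] = max(table[i - 1][j], table[i][j - 1])

    # Backtrack from the bottom-right corner to rebuild one optimal answer.
    result = []
    i, j = m, n
    while i > 0 and j > 0:
        if a[i - 1] == b[j - 1]:
            result.append(a[i - 1])
            i -= 1
            j -= 1
        elif table[i - 1][j] >= table[i][j - 1]:
            i -= 1
        else:
            j -= 1
    return "".join(reversed(result))


# Classic textbook-style input pair, used here only as an illustration.
common = lcs("ABCBDAB", "BDCABA")
```

Filling the table takes time and space proportional to len(a) * len(b), which is a useful figure to compare against measured running times in the lab.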
Text Books
S.N  Title                       Authors       Edition  Publisher
1    Introduction to Algorithms  T. H. Cormen           Prentice Hall of India
Reference Books
S.N  Title                           Authors                                             Edition       Publisher
1    Computer Algorithms             Horowitz, Sahni, Rajasekaran                                      Galgotia Publications Pvt. Ltd
2    Fundamentals of Algorithms      Brassard, Bratley                                                 Prentice Hall
3    Data Structures and Algorithms  Alfred V. Aho, John E. Hopcroft, Jeffrey D. Ullman  Reprint 2006  Pearson Education
SIXTH SEMESTER
Unit I [6Hrs]
Introduction to Data Center: Data Center History, Components of Data centers, Data Center Tiers & Classes, Current Data
Center Standards, Data Center Topology,
Data Center Structure, Data Center Construction, Data Center Space Relationships, Storage, Networks, Virtualization & Cloud,
High Availability Designs, Security components, Multi-tier Applications.
Unit II [6Hrs]
Data Center Resource Management and Load Balancing: Distributed Management of Virtual Infrastructures, Server
consolidation, dynamic provisioning and resource management, Resource Optimization, Resource dynamic reconfiguration, Load
Balancing, various load balancing techniques.
Unit III [6Hrs]
Introduction to Cloud: Virtualization, Definition, Understanding and Benefits of Virtualization. Implementation Level of
Virtualization, Virtualization Structure and Mechanisms, Issues with virtualization, Cloud and Cloud Computing, Cloud Computing
Service Models, Cloud Computing Deployment Models.
Unit IV [8Hrs]
Computing and Database Services: Virtual machine, Security Groups, Authentication through Key-pair, Hardware Tenancy –
Shared vs. Dedicated, Understand cloud Storage, Block Store: Its Features and Volume Type, File System and Its Features.
Cloud RDS and its benefits, Read Replica RDS, Authentication, DynamoDB, ElastiCache: Working, Redis vs Memcached,
RedShift, Kinesis
Unit V [8Hrs]
Networking and Monitoring Services: Networking Layer: VPC Elastic Network Interface (ENI) and Its Attributes, Different
Categories of IP Address, Public IP vs. Elastic IP, VPC – Benefits and Components, CIDR Notations, Network Access Control List
vs. Security Groups, NAT (Network Address Translation): NAT Devices, NAT Gateway and NAT instance, VPC peering. Monitoring
related services.
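The CIDR notation topic in this unit can be explored without any cloud account. The sketch below uses Python's standard `ipaddress` module to split a VPC-sized block into /24 subnets; the 10.0.0.0/16 range, the public/private naming, and the helper `subnet_for` are hypothetical choices for illustration, not values taken from the syllabus.

```python
import ipaddress

# Hypothetical VPC address range written in CIDR notation.
vpc = ipaddress.ip_network("10.0.0.0/16")

# Split the /16 block into /24 subnets (256 addresses each).
subnets = list(vpc.subnets(new_prefix=24))
public_subnet, private_subnet = subnets[0], subnets[1]


def subnet_for(address, candidates):
    """Return the first subnet in candidates that contains address, else None."""
    ip = ipaddress.ip_address(address)
    for net in candidates:
        if ip in net:
            return net
    return None


# Look up which subnet a hypothetical instance address belongs to.
home = subnet_for("10.0.1.25", subnets)
```

The prefix length fixes the network part of the address, so a /16 leaves 16 host bits (65536 addresses) and each /24 leaves 8 host bits (256 addresses).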
Text Books
S.N  Title                                                                          Authors                         Edition  Publisher
1    Cloud Computing: Principles and Paradigms                                      Rajkumar Buyya, James Broberg            Wiley Publishers, 2011
2    Cloud Security and Privacy: An Enterprise Perspective on Risks and Compliance  Tim Mather, Subra Kumaraswamy            O'Reilly
Reference Books
S.N  Title                                                                                Authors                                          Edition  Publisher
1    Distributed and Cloud Computing: From Parallel Processing to the Internet of Things  Kai Hwang, Geoffrey C. Fox and Jack J. Dongarra           Elsevier, 2012
2    Cloud Computing: A Practical Approach                                                Toby Velte, Anthony T. Velte, Robert Elsenpeter           McGraw Hill, 2009
List of Experiments
1. To understand Data center and its components.
2. Sign up for an AWS free-tier account, explore the Console and configure the CLI.
3. User management through Identity and Access Management (IAM).
4. Create an AMI. Create an Elastic IP.
5. Host your website inside EC2 instance.
6. Storing an application data in MySQL DB using Relational Database Service (RDS) Creating Tables,
loading sample data and running queries
7. Create a Classic Load Balancer
8. Create a Non-default VPC and attach it to an EC2 instance
9. Accessing Internet inside Private Subnet using NAT Gateway
10. Connect two instances in different VPCs using VPC peering.
Text Books
S.N  Title                                                                                  Authors                                          Edition        Publisher
1    Cloud Computing: A Practical Approach                                                  Toby Velte, Anthony T. Velte, Robert Elsenpeter                 McGraw Hill, 2009
2    Learning Amazon Web Services (AWS): A Hands-On Guide to the Fundamentals of AWS Cloud  Mark Wilkins                                     First Edition  Pearson, 2019
Reference Books
S.N  Title                             Authors              Edition  Publisher
1    AWS Certified Cloud Practitioner  Anthony J. Sequeira           Pearson, 2020
Unit V [8Hrs]
NEURO FUZZY MODELING: Adaptive Neuro-Fuzzy Inference Systems, Hybrid Learning Algorithm, Coactive Neuro Fuzzy
Modeling – Framework Neuron Functions for Adaptive Networks – Neuro Fuzzy Spectrum. Genetic algorithm: Fundamentals,
basic concepts, working principle, Applications & advances in GA.
Text Books
S.N  Title                                                                        Authors                                   Edition  Publisher
1    Artificial Intelligence                                                      Rich E. and Knight K.                              TMH
2    Neural Networks, Fuzzy Logic & Genetic Algorithms, Synthesis & Applications  S. Rajasekaran & G. A. Vijayalakshmi Pai           PHI Publication
Reference Books
S.N  Title                                                                  Authors                         Edition  Publisher
1    Principles of Soft Computing                                           S. N. Sivanandam & S. N. Deepa           Wiley Publications
2    Neural Network Fundamentals with Graphs, Algorithms, and Applications  Bose                                     TMH
Unit I [8Hrs]
Introduction to Compilation Process, Compilers & Translators, Phase structure of Compiler, Design of Lexical Analysis.
Unit II [8Hrs]
Specifying Syntactic Structure of Programming Language using Context Free Grammars, The role of Parser, Top-down Parsing,
Bottom-Up Parsing, Predictive Parsers, Recursive Descent Parser.
Unit IV [8Hrs]
Syntax Directed Translation Scheme: Intermediate Code, Postfix notation, Parse tree and Syntax Trees, Three address codes,
quadruples, triples, Translation of Arithmetic Expression, Boolean expressions, Control Statements. Array references, Procedure
Calls, Declarations, Case Statements, Use of Compiler writing tools (Lex/Flex, Yacc/Bison).
Unit V [8Hrs]
Symbol Tables: Contents, Representing scope information. Error detection and Recovery: Error handling, Lexical-phase, Syntactic
phase and semantic phase. The principal sources of optimization, Loop optimization, The DAG representation, Introductory Data
Flow analysis, Introduction to Code Generation: Object programs, Problems in Code Generation, Register allocation and
assignment, Code generation from DAG, Peephole optimization.
Text Books
S.N  Title                                      Authors                                        Edition  Publisher
1    Compilers: Principles, Techniques & Tools  Alfred V. Aho, Jeffrey D. Ullman & Ravi Sethi  2nd      Pearson Education
2    Principles of Compiler Design              Alfred V. Aho, Jeffrey D. Ullman                        Addison Wesley Publication
Reference Books
S.N  Title                                                                          Authors          Edition  Publisher
1    Compiler Design                                                                Dr. O. G. Kakde           Laxmi Publication
2    Introduction to Compiling Techniques: First Course Using ANSI C, Lex and Yacc  J. P. Bennett             McGraw-Hill Publication
SIXTH SEMESTER
                                                                           Evaluation
Course Code  Course Name                              Th  Tu  Pr  Credits  CA   ESE  Total
DS605T(i)    Open Elective II: Business Intelligence  3   -   -   3        30   70   100
Unit I [8Hrs]
Introduction: Data Definitions and Analysis Techniques: Elements, Variables, and Data categorization, Levels of
Measurement, Data management and indexing, Introduction to statistical learning and R-Programming.
Descriptive Statistics: Measures of central tendency, Measures of location of dispersion, Practice and analysis with R
programming.
Unit II [8Hrs]
Basic Analysis Techniques: Statistical hypothesis generation and testing, Chi-Square test, t-Test, Analysis of variance, Correlation
analysis, Maximum likelihood test.
Data analysis techniques: Regression analysis, Classification techniques, Clustering, Association rules analysis.
Unit III [8Hrs]
Case studies and projects: Understanding business scenarios, Feature engineering and visualization, Scalable and parallel
computing with Hadoop and Map-Reduce, Sensitivity Analysis.
Business Intelligence Essentials: Introduction & Overview of BI, Types of Business Intelligence, Business Intelligence Platform,
Dynamic roles in Business Intelligence, Roles of Business Intelligence in Modern Business, Challenges of BI, Creating Business
Intelligence Environment, Business Intelligence Landscape.
Unit IV [8Hrs]
Decision Making and Decision Support Systems: A BI perspective, Foundation and Technologies for Decision
Making, Information gathering, Managing BI, BI User Segmentation, Gathering BI Requirements, Introduction to Tableau.
Unit V [8Hrs]
Modeling and Analysis, Knowledge Management and Collaborative Systems, Strategic Approach to BI, Social media, big data and
data mining, Business Analytics: Emerging Trends and Future Impacts.
Text Books
S.N  Title                                                           Authors                                  Edition  Publisher
1    Business Intelligence: The Savvy Manager's Guide                David Loshin                                      Newnes
2    Business Intelligence: Practices, Technologies, and Management  Rajiv Sabherwal, Irma Becerra-Fernandez           John Wiley & Sons
Reference Books
S.N  Title                                                           Authors      Edition  Publisher
1    Business Intelligence: Making Decisions Through Data Analytics  Jerzy Surma           Business Expert Press