SEM 4 stuff
SEM 4 stuff
Item No. :
Faculty: Science
Program: B.Sc.
Subject: Data Science
Academic year: 2024-2025
PO1. Solving Complex Problems:- Apply the knowledge gained in breaking down complex
problems into simple components; and to design processes required for problem solving.
PO2. Critical Thinking: - Ability to apply the acquired knowledge to identify assumptions and
evaluate their accuracy and validity.
PO3. Reasoning ability and Rational thinking: - Ability to analyse, interpret data and draw
logical conclusions; to evaluate ideas rationally.
PO4. Research Aptitude: - Ability to ask relevant questions to identify and define the problem,
applying research tools for analysis and interpretation of data. Understand and comply with
research ethics.
PO5. Effective Communication skill: - Demonstrate the ability to listen and to clearly express
ideas verbally. Equip to write reports, make presentations effectively.
PO6. Information and Digital Literacy: - Equip to use appropriate tools and techniques inclusive
of internet and electronic media for acquiring, assessing and analysing data from diverse
resources.
PO7. Social Interactive Skills and team work: - Exhibit networking and social interactive skills;
function effectively as an individual and as a member in diverse groups; demonstrate
leadership quality useful for employability
PO8. Self-directed and Lifelong Learning: Ability to explore and gain knowledge in
independent and self-reliant ways. Demonstrate ability to adapt and upgrade with the global ,
social and technological changes.
PSO1. Sound Knowledge: Demonstrate the knowledge of core data science concepts and apply
them to develop a user- friendly, scalable, and robust applications
PSO2. Critical and Rational Thinking: Exhibit higher order skills to adapt to the everchanging
technological environment
PSO3. Logic Building and Programming Skills: The ability to apply logic to problem solving and
acquire proficiency in various programming languages.
PSO4. Data Analysis : Apply quantitative modelling and data analysis techniques to solve real
world business problems, Learn tools and techniques for transformation of data and
statistical data analysis
PSO5. Pursue Higher Education: Make students competent to take up advanced degree courses
like MSc(Data Science),MCA, MSc(CS), MSc(IT) and MBA etc.
S.Y.B.Sc.(DS) under NEP - Semester III
Course Objective:
• Master the use of the R interactive environment and expanding by installing R
packages
• Read Structured Data into R from various sources
• Understand the different data types and data structures in R
• Manipulate strings, dates in R
• Understand basic regular expressions in R
• Understand base R graphics
• Focus on GGplot2 graphics for R and be familiar with trellis (lattice) graphics.
Course Outcomes:
After completion of this course, student will be able to:
• CO1: To use R Studio and explore the features for R programming
• CO2: To use R functions and graphics with in R programming for solving problems.
• CO3: To work with advanced graphics of R, import and use the data and represent the
data into tables.
• CO4: To manipulating Data Frames and make use of Dates in R application.
Peter O'Reill
4 Bruce,
Practical Statistics y
for Data Scientists Andrew Media, 3rd 2018
Bruce Inc.
SIUDSMJP211 : Data Science with R Practical
List of Practical:
1 Introduction to R Programming Elements
a. Write an R Program to implement expressions, assignment and decision making
b. Write an R Program to design and implement loops.
Write a R program to demonstrate the use of essential data structures in R [Hint:
c. Vectors,
Matrix, Arrays]
2 Using List, Data Frames and Functions in R
Write an R program to manage data and exhibit operations on it using List data
a.
structure
b. Write an R program to manage data and exhibit operations on it using Data Frames
Write an R program to demonstrate the use of:
c. User-defined functions
Built-in numeric function, character functions etc.
3 Implement decision making in R
Write an R program to implement if, if-else, if-else-if ladder, nested if-else, and
a.
switch
4 Implementing Strings in R
Write an R program to store and access string in R objects(vectors, matrix, arrays,
a.
data frames, and lists)
Write an R program to demonstrate use of various string manipulation functions.
b.
[Hint: paste(), print(), noquote(),format(), cat(), toString(), sprint()]
5 Performing Statistics with R-I
Write an R program to apply built-in statistical functions. [Hint: mean, median,
a.
standard deviation and others]
b. Write an R program to demonstrate Linear and Multiple Regression analysis.
Write an R program to demonstrate data interface with CSV files [Hint: creating
a.
data for CSV, analyzing, writing CSV files]
Write an R program to work with spreadsheet (Excel) programs. [Hint: installing,
b.
loading, verifying, creating data for xlsx file]
c. Write an R program to manage data using XML files. [Develop data interface for
maintaining Employee Information]
9 Handling Errors in R
Course Objective:
• To explore about random variables and implement various distribution functions
• To familiarize with concepts of probability and learn implementation of different
types of probabilities.
• Learn and implement the concept of expectation, related theorems and generating
functions.
• To know the concept and implementation of discrete distributions including
Bernoulli, Binomial and power series distributions.
Course Outcomes:
After completion of this course, student will be able to:
• CO1: Organize, manage and present data.
• CO2: Analyse statistical data graphically using frequency distributions and
cumulative frequency distributions.
• CO3: Use the basic probability rules, including additive and multiplicative laws,
using the terms independent and mutually exclusive events.
• CO4: Derive the probability density function of the transformation of random
variables.
• CO5: Calculate probabilities and derive the marginal and conditional distributions of
bivariate random variables.
Sr.
No. Title Author/s Publisher Edition Year
S.C, Gupta,
1 Fundamentals of S. Chand
and V. 10th 2002
Mathematical Statistics and Sons
K. Kapoor
2 Douglas C.
Applied Statistics and
Montgomery Wiley
Probability for 6th 2014
and George C.
Engineers
Runger
Peter
Probability,
Olofsson
Statistics, and
3 And Wiley 2nd 2012
Stochastic
Mikael
Processes
Andersson
SIUDSMJP212 : Probability and Distributions Practical
Create spreadsheet application for computing probabilities and zvalues for the
a.
standard normal distribution. [Hint: Use NORMSDIST and NORMSINV]
Create spreadsheet application for computing probabilities for the exponential
b.
probability distribution. [Hint: Use EXPONDIST]
10 Theoretical Continuous Distributions2
Create spreadsheet application for demonstrating Weibull Distribution to obtain a
a. model for
data sets. [Hint: Use WEIBULL.DIST]
b. Create spreadsheet application for demonstrating Pearson\s Distributions.
SIUDSMN211: Database Technology-I
Course Objective:
• This course concentrates the concept of the DBMS with respect To principles, design and
implementation of DBMS. It aims to specify the functional and Data requirements for a
typical database application and to understand creation, manipulation And querying of data in
databases.
• To understand Organizing, structuring and storing data
• Understand Database as Relational model
• To understand SQL to retrieve data and concept of redundancy
• To specify the functional and data requirements for a typical database application
• To understand creation, manipulation and querying of data in databases
Course Outcomes:
After completion of this course, student will be able to:
• CO1: Students should be able to evaluate business information problem and find the
requirements of a problem in terms of data.
• CO2: Students should be able to draw database design in logical structure and can
identify the entities which exist in a system
• CO3: Students should be able to construct normalized database and functional
dependencies between attributes and relational algebra queries
• CO4: Students should be able to design the database schema with the use of
appropriate datatypes for storage of data in database.
• CO5: Students should be able to create, manipulate, query and backup the databases
with features of SQL.
Sr.
No. Title Author/s Publisher Edition Year
Database System and A Silberschatz,
1 Mc Graw
Concepts H Korth, 5th
Hill
SSudarshan
Cengae
2 Database Systems Rob Coronel 12th
Learning
1st 2003
Pearson
Introduction to
3 C.J.Date
Database
System
SIUDSMNP211 : Database Technology-I Practical
List of Practical:
1 Create ER-diagram for Hospital and library system.
2 Draw E-R diagram and convert entities it to table.
3 Write SQL query for given problem statement
a. Creating a Database
Course Objective:
• Understand the Big Data Platform and its Use cases.
• Provide an overview of Apache Hadoop.
• Provide HDFS Concepts and Interfacing with HDFS.
• Understand Map Reduce Jobs Provide hands on Hadoop Eco System.
• Apply analytics on Structured, Unstructured Data.
Course Outcomes:
After completion of this course, student will be able to:
• CO1: Identify Big Data and its Business Implications.
• CO2: List the components of Hadoop and Hadoop Eco-System.
• CO3: Access and Process Data on Distributed File System.
• CO4: Manage Job Execution in Hadoop Environment.
• CO5: Develop Big Data Solutions using Hadoop Eco System.
• CO6: Analyze Infosphere BigInsights Big Data Recommendations.
Sr.
No. Title Author/s Publisher Edition Year
List of Practical:
1. Installation of Spark. Write a Hello World in Spark.
2. Implement a program in Pig.
3. Implement word count/ frequency program using MapReduce.
4. Configure the Hive and implement the application in Hive.
5. Implement Spark SQL.
6. Implement machine learning with Spark or Hadoop.
7. Implement Spark Streaming.
8. Demonstrate Spark Shell commands.
SIUDSMJ222 : Matrix Theory and Linear Algebra
Course Objective:
• Understand the basic concepts of matrix algebra and its applications.
• Solving computational problems of linear algebra.
Course Outcomes:
After completion of this course, student will be able to:
• CO1: Understand basic matrix properties like rank, determinant, inverse, and a
special type of matrices
• CO2: Introduce Gaussian / Gauss-Jordan elimination methods, LU factorization
technique
• CO3: Use computational techniques for singular value decomposition (Computational
and Algebraic Skills).
• CO4: Understand the concepts of vector space and subspaces.
• CO5: Understand the use of linear algebra and matrices in several important, modern
applications of research and industrial problems involving statistics.
Course Objective:
• To understand the basics of PL/SQL.
• To understand control and conditional statement in PL/SQL.
• To understand working of sequences and cursor in PL/SQL.
• To understand concept of stored procedure and functions.
• To understand triggers and packages in PL/SQL.
Course Outcomes:
After completion of this course, student will be able to:
• CO1: Understand the basics of PL/SQL.
• CO2: Use of the control and conditional statement in PL/SQL.
• CO3: Apply sequences and cursor in PL/SQL.
• CO4: Know the concept of stored procedure and functions
• CO5: Create the triggers and packages in PL/SQL.
List of Practical:
1 PL/SQL Basics
a. Use of variables.
b. Write executable statement.
c. Interacting with Oracle Server.
d. Create anonymous PL/SQL block
2 Control Structure in PL/SQL
a. Using while loop
b. Do loop
c. For loop
d. Use of GOTO statement
3 Create conditional statement using PL/SQL
a. Using if statement
b. Using if else statement
c. Using elsif ladder
d. Using case expression.
4 Creation of Sequence in PL/SQL
5 Create cursor in PL/SQL
a. Implicit cursor
b. Explicit
c. Parameterized cursor
d. Cursor for loop
6 Creation of Procedures in PL/SQL
7 Functions in PL/SQL
a. Compute and returns the maximum value
b. Compute factorial of given number.
8 Creation of Trigger
a. Create Row level trigger
b. Create Statement level trigger
c. Create instead of trigger
9 Handling exceptions
a. Creation of user defined exception