0% found this document useful (0 votes)
6 views

lab Question set

The document outlines the practical lab exercises for the Data Science Fundamentals course at Velammal College of Engineering and Technology. It includes various tasks related to data analytics, such as data cleaning, statistical analysis, and data visualization using tools like Excel and R programming. Each exercise is linked to specific course outcomes aimed at enhancing students' understanding and application of data science concepts.

Uploaded by

jsh
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views

lab Question set

The document outlines the practical lab exercises for the Data Science Fundamentals course at Velammal College of Engineering and Technology. It includes various tasks related to data analytics, such as data cleaning, statistical analysis, and data visualization using tools like Excel and R programming. Each exercise is linked to specific course outcomes aimed at enhancing students' understanding and application of data science concepts.

Uploaded by

jsh
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 7

Velammal College of Engineering and Technology, Madurai-625 009

Department of Information Technology


(Regulation VCETR2021)

21OAD03-Datascience Fundamentals

Model Practical Lab-Set I

Time: 1hr 40minutes Maximum Marks: 50


Year/Sem: III/VI Date: 25.03.2025

Course Outcomes:
Students will be able to
CO 1: Explain the concept of data science and role of data analytics.
CO 2: Understand the overview of life cycle of data analytics.
CO 3: Apply data analytics on data and use different analytics method related to data.
CO 4: Create informative visualization and summarize data sets.
CO 5: Analyze applications using data analysis.

S. No. QUESTIONS CO/

K-Level
1. Find how many times 'Bob' appears and remove duplicates in the CO1-K2
following dataset using Excel
ID Name ID Name
1 Alice
2 Bob
3 Charlie
4 Alice
5 David
6 Bob
7 Eve
8 Charlie
9 Frank
10 Alice
2. Find Duplicate values and handle it using R programming in the CO1-K2
following table.

ID Name ID Name
1 Alice
2 Bob
3 Charlie
4 Alice
5 David
6 Bob
7 Eve
8 Charlie
9 Frank
10 Alice
3. Find Simple Linear Regression Model in Microsoft Excel for the CO2-K2
following data set

Years_Experience Salary ($)

1 35,000
2 40,000
3 45,000
4 50,000
5 55,000
6 60,000
7 65,000
8 70,000
9 75,000
10 80,000
4. Find Univariate analysis for given data set CO2-K2

Student_ID Score
1 78
2 85
3 90
4 76
5 88
6 95
7 82
8 89
9 91
10 84
5. Construct Summary Statistics in Excel for given dataset CO3-K3

Student_ID Score
S01 78
S02 85
S03 92
S04 88
S05 74
S06 90
S07 81
S08 95
S09 87
S10 80
6. Construct Comparative Statistics in Excel for the given dataset CO3-K3

Student Math_Score Science_Score


S1 85 78
S2 90 84
S3 76 80
S4 88 85
S5 92 89
S6 79 77
S7 85 82
S8 91 87
S9 80 79
S10 87 88

7. Construct a line chart to analyze the temperature variation over a 10-day CO4-K3
period using the following dataset

Day Temperature (°C)


1 22.5
2 24.0
3 23.8
4 21.2
5 25.6
6 22.9
7 26.3
8 24.5
9 23.1
10 25.0
8. Construct a pie chart to represent the percentage contribution of each CO4-K4
product category to total sales using the dataset below

Product Category Sales ($)


Electronics 35000
Clothing 22000
Home & Kitchen 18000
Toys 12000
Books 8000
9. Assume two matrices of different sizes and implement matrix CO5-K4
multiplication in R to compare their computation times
10. Assume you have two data frames: one containing customer details and CO5-K4
another containing their purchase history. Write an R program to merge
them by Customer ID

Mark Split-up

Aim & Procedure Implementatio Output Viv Total


(10) n (15) a (50)
(15) (10)

Course Incharge Module Coordinator HoD/IT


Mrs. R. Chitra Dr. R. Kavitha Dr. R. Kavitha
Velammal College of Engineering and Technology, Madurai-625 009
Department of Information Technology
(Regulation VCETR2021)

21OAD03-Datascience Fundamentals

Model Practical Lab Set-II

Time: 1hr 40minutes Maximum Marks: 50


Year/Sem: III/VI Date: 25.03.2025

Course Outcomes:
Students will be able to
CO 1: Explain the concept of data science and role of data analytics.
CO 2: Understand the overview of life cycle of data analytics.
CO 3: Apply data analytics on data and use different analytics method related to data.
CO 4: Create informative visualization and summarize data sets.
CO 5: Analyze applications using data analysis.

S. No. QUESTIONS CO/

K-Level
1. In the following table Replace @ with 0 and # with 1 and remove extra spaces CO1-K2

ID Value
1 23
2 45
3 67
4 89
5 12
6 @5
7 34
8 #8
9 56
10 78
2. Find missing values and handle it using R programming in the following CO1-K2
table.

ID Name ID Name
1 Alice
2 Bob
3
4 Alice
5
6 Bob
7 Eve
8 Charlie
9
10 Alice
3. Find Univariate Analysis: Frequency, Mean, Median, Mode, Variance, CO2-K2
Standard Deviation and Skewness for the following dataset in Excel
Age Salary ($)
25 40000
30 40,000
35 45,000
40 50,000
45 120000
50 60,000
55 130000
60 70,000
65 100000
70 80,000

4. Find Simple Linear Regression Model in Microsoft Excel for the CO2-K2
following data set

Temperature (°C) Ice_Cream_Sales


15 100
18 150
20 200
22 250
25 300
27 350
30 400
32 450
35 500
38 550
5. Construct Comparative Statistics in Excel for the given dataset CO3-K3
Month Product_A_Sales Product_B_Sales
Jan 120 100
Feb 150 110
Mar 170 130
Apr 160 125
May 180 140
Jun 190 135
Jul 210 150
Aug 220 155
Sep 230 165
Oct 250 170
6. Construct Summary Statistics in Excel for given dataset CO3-K3

Employee_ID Salary ($)


E001 55000
E002 62000
E003 58000
E004 67000
E005 72000
E006 60000
E007 75000
E008 71000
E009 64000
E010 69000
7. Construct a scatter plot to examine the relationship between students' CO4-K3
math and science scores using the following dataset.
Student Math_Score Science_Score

S1 85 78
S2 90 84
S3 76 80
S4 88 85
S5 92 89
S6 79 77
S7 85 82
S8 91 87
S9 80 79
S10 87 88
8. Construct a histogram to visualize the distribution of employee salaries using CO4-K3
the following dataset
Employee_ID Salary ($)
E001 55000
E002 62000
E003 58000
E004 67000
E005 72000
E006 60000
E007 75000
E008 71000
E009 64000
E010 69000
9. Analyze the process of matrix Addition for 3 × 3 matrix in R by CO5-K4
implementing a function and examining its computational complexity.
10. Assume you have two data frames: one containing Student detail and CO5-K4
another containing their marks in internal. Write an R program to merge
them by Student ID

Mark Split-up
Aim & Procedure Implementatio Output Viv Total
(10) n (15) a (50)
(15) (10)

Course Incharge Module Coordinator HoD/IT

Mrs. R. Chitra Dr. R. Kavitha Dr. R. Kavitha


Course Incharge HoD/IT

Mrs. R.Chitra AP-II/IT Dr. R. Kavitha

You might also like