lab Question set
lab Question set
21OAD03-Datascience Fundamentals
Course Outcomes:
Students will be able to
CO 1: Explain the concept of data science and role of data analytics.
CO 2: Understand the overview of life cycle of data analytics.
CO 3: Apply data analytics on data and use different analytics method related to data.
CO 4: Create informative visualization and summarize data sets.
CO 5: Analyze applications using data analysis.
K-Level
1. Find how many times 'Bob' appears and remove duplicates in the CO1-K2
following dataset using Excel
ID Name ID Name
1 Alice
2 Bob
3 Charlie
4 Alice
5 David
6 Bob
7 Eve
8 Charlie
9 Frank
10 Alice
2. Find Duplicate values and handle it using R programming in the CO1-K2
following table.
ID Name ID Name
1 Alice
2 Bob
3 Charlie
4 Alice
5 David
6 Bob
7 Eve
8 Charlie
9 Frank
10 Alice
3. Find Simple Linear Regression Model in Microsoft Excel for the CO2-K2
following data set
1 35,000
2 40,000
3 45,000
4 50,000
5 55,000
6 60,000
7 65,000
8 70,000
9 75,000
10 80,000
4. Find Univariate analysis for given data set CO2-K2
Student_ID Score
1 78
2 85
3 90
4 76
5 88
6 95
7 82
8 89
9 91
10 84
5. Construct Summary Statistics in Excel for given dataset CO3-K3
Student_ID Score
S01 78
S02 85
S03 92
S04 88
S05 74
S06 90
S07 81
S08 95
S09 87
S10 80
6. Construct Comparative Statistics in Excel for the given dataset CO3-K3
7. Construct a line chart to analyze the temperature variation over a 10-day CO4-K3
period using the following dataset
Mark Split-up
21OAD03-Datascience Fundamentals
Course Outcomes:
Students will be able to
CO 1: Explain the concept of data science and role of data analytics.
CO 2: Understand the overview of life cycle of data analytics.
CO 3: Apply data analytics on data and use different analytics method related to data.
CO 4: Create informative visualization and summarize data sets.
CO 5: Analyze applications using data analysis.
K-Level
1. In the following table Replace @ with 0 and # with 1 and remove extra spaces CO1-K2
ID Value
1 23
2 45
3 67
4 89
5 12
6 @5
7 34
8 #8
9 56
10 78
2. Find missing values and handle it using R programming in the following CO1-K2
table.
ID Name ID Name
1 Alice
2 Bob
3
4 Alice
5
6 Bob
7 Eve
8 Charlie
9
10 Alice
3. Find Univariate Analysis: Frequency, Mean, Median, Mode, Variance, CO2-K2
Standard Deviation and Skewness for the following dataset in Excel
Age Salary ($)
25 40000
30 40,000
35 45,000
40 50,000
45 120000
50 60,000
55 130000
60 70,000
65 100000
70 80,000
4. Find Simple Linear Regression Model in Microsoft Excel for the CO2-K2
following data set
S1 85 78
S2 90 84
S3 76 80
S4 88 85
S5 92 89
S6 79 77
S7 85 82
S8 91 87
S9 80 79
S10 87 88
8. Construct a histogram to visualize the distribution of employee salaries using CO4-K3
the following dataset
Employee_ID Salary ($)
E001 55000
E002 62000
E003 58000
E004 67000
E005 72000
E006 60000
E007 75000
E008 71000
E009 64000
E010 69000
9. Analyze the process of matrix Addition for 3 × 3 matrix in R by CO5-K4
implementing a function and examining its computational complexity.
10. Assume you have two data frames: one containing Student detail and CO5-K4
another containing their marks in internal. Write an R program to merge
them by Student ID
Mark Split-up
Aim & Procedure Implementatio Output Viv Total
(10) n (15) a (50)
(15) (10)