Lab_questionbank

Uploaded by

devaadi0713

Lab Exam: Mid-Semester 1

Subject: AIML (Artificial Intelligence and Machine Learning)

Institute: IEM (Institute of Engineering and Management)

Answer one question from each group: Group A and Group B

*************************************************************************************
Group A
1. Load a dataset into Colab from your local system.
Analysis Part: After loading, explore the dataset using df.info() and df.describe() to understand its
structure.
2. Identify outliers in the 'DIS' column of the Boston dataset using a box plot.
Analysis Part: After plotting, identify any data points that lie outside the whiskers and discuss possible
causes.
3. Visualize the relationship between 'INDUS' and 'TAX' columns in the Boston dataset using a scatter
plot.
Analysis Part: Discuss whether there is a noticeable correlation between industrial areas and tax rates.
4. Load the 'titanic.csv' dataset from Kaggle into Colab.
Analysis Part: Use sns.boxplot to identify outliers in the 'Fare' column.
5. Handle missing values in the 'Age' column of the Titanic dataset.
Analysis Part: Use df['Age'].fillna(df['Age'].median()) to fill missing values and visualize the distribution
of the 'Age' column using a box plot.
6. Encode categorical variables in the 'titanic.csv' dataset using one-hot encoding.
Analysis Part: Apply one-hot encoding to the 'Embarked' column and display the first five rows.
7. Perform label encoding on the 'Species' column of the 'iris.csv' dataset.
Analysis Part: Apply label encoding using LabelEncoder from sklearn and visualize the distribution of
encoded species.
8. Normalize the 'sepal_length' and 'sepal_width' columns of the 'iris.csv' dataset.
Analysis Part: Use StandardScaler to normalize these columns and visualize them with a scatter plot.
9. Identify missing values in the 'housing.csv' dataset from UCI Machine Learning Repository.
Analysis Part: Use df.isnull().sum() to check for missing values and visualize using a bar plot.
10. Drop rows with missing values in the 'housing.csv' dataset.
Analysis Part: Use df.dropna() and compare the shape of the dataset before and after dropping rows.
11. Replace missing values in the 'TotalBsmtSF' column of the 'housing.csv' dataset with the mean value.
Analysis Part: Use df['TotalBsmtSF'].fillna(df['TotalBsmtSF'].mean()) and visualize using a histogram.
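
The Group A techniques can be sketched in a few lines of pandas and scikit-learn. This is a minimal sketch, not a model answer: the small DataFrame below is hypothetical toy data standing in for 'titanic.csv' (the values are made up), and the helper names iqr_outliers and box_plot are illustrative assumptions. The 1.5 * IQR rule is used because it matches the default whiskers of a box plot.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import LabelEncoder, StandardScaler

# Hypothetical toy data standing in for titanic.csv (values are made up).
df = pd.DataFrame({
    "Age": [22.0, 38.0, np.nan, 35.0, np.nan, 54.0],
    "Fare": [7.25, 71.28, 7.92, 53.10, 8.05, 512.33],
    "Embarked": ["S", "C", "S", "S", "Q", "C"],
})

# Questions 9 and 10 style: count missing values per column.
missing_counts = df.isnull().sum()

# Question 5 style: fillna returns a new Series, so assign it back.
df["Age"] = df["Age"].fillna(df["Age"].median())

# Question 6 style: one-hot encode a categorical column.
encoded = pd.get_dummies(df, columns=["Embarked"])

# Question 7 style: label encoding maps each class to an integer
# (classes are sorted, so here C -> 0, Q -> 1, S -> 2).
labels = LabelEncoder().fit_transform(df["Embarked"])

# Question 8 style: StandardScaler rescales each column to
# zero mean and unit variance.
scaled = StandardScaler().fit_transform(df[["Age", "Fare"]])


def iqr_outliers(series):
    """Return values beyond 1.5 * IQR from the quartiles (box plot whiskers)."""
    q1, q3 = series.quantile(0.25), series.quantile(0.75)
    iqr = q3 - q1
    return series[(series < q1 - 1.5 * iqr) | (series > q3 + 1.5 * iqr)]


def box_plot(series, title):
    """Draw a box plot of one column (illustrative helper, not called here)."""
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    ax.boxplot(series.dropna())
    ax.set_title(title)
    plt.show()


# Questions 2 and 4 style: the points a box plot would draw beyond the whiskers.
fare_outliers = iqr_outliers(df["Fare"])
print(missing_counts)
print(encoded.head())
print(fare_outliers)
```

In Colab, calling box_plot(df["Fare"], "Fare") would draw the plot itself. With a real dataset, replace the toy DataFrame with pd.read_csv on the uploaded file.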

*************************************************************************************
Group B

12. Visualize outliers in the 'GrLivArea' column of the 'housing.csv' dataset using a box plot.
Analysis Part: After plotting, discuss any visible outliers and potential impact on model performance.
13. Split the 'diabetes.csv' dataset from Kaggle into train and test sets.
Analysis Part: Use train_test_split to create training and testing sets, and display their shapes.
14. Normalize the 'BMI' and 'Glucose' columns of the 'diabetes.csv' dataset.
Analysis Part: Apply MinMaxScaler and visualize the normalized data with a scatter plot.
15. Handle missing values in the 'Pregnancies' column of the 'diabetes.csv' dataset.
Analysis Part: Use df['Pregnancies'].fillna(0) to fill missing values and visualize the distribution with a bar plot.
16. Detect missing values in the 'abalone.csv' dataset from UCI Machine Learning Repository.
Analysis Part: Use df.isnull().sum() to check for missing values and create a heatmap visualization.
17. Drop columns with missing values in the 'abalone.csv' dataset.
Analysis Part: Use df.dropna(axis='columns') and discuss any columns that were dropped.
18. Use one-hot encoding on the 'Sex' column of the 'abalone.csv' dataset.
Analysis Part: Apply one-hot encoding and visualize the distribution of each sex category.
19. Perform label encoding on the 'diagnosis' column of the 'cancer.csv' dataset from UCI Machine Learning Repository.
Analysis Part: Use LabelEncoder to encode the 'diagnosis' column and visualize the class distribution.
20. Identify outliers in the 'area_mean' column of the 'cancer.csv' dataset using a box plot.
Analysis Part: Plot and identify any outliers in the 'area_mean' column.
21. Normalize the 'perimeter_mean' and 'concavity_mean' columns of the 'cancer.csv' dataset.
Analysis Part: Use StandardScaler and visualize the normalized data with a scatter plot.
22. Visualize the relationship between 'age' and 'cholesterol' in the 'heart.csv' dataset from Kaggle.
Analysis Part: Use a scatter plot and discuss any visible patterns or correlations.
23. Handle missing values in the 'thalach' column of the 'heart.csv' dataset.
Analysis Part: Use df['thalach'].fillna(df['thalach'].median()) to fill missing values and visualize using a histogram.
24. Encode the 'gender' column in the 'adult.csv' dataset from UCI Machine Learning Repository.
Analysis Part: Use label encoding and visualize the distribution of genders.
25. Normalize the 'hours-per-week' column of the 'adult.csv' dataset.
Analysis Part: Apply MinMaxScaler and create a box plot for the normalized data.
26. Drop rows with missing values in the 'LoanAmount' column of the 'loan.csv' dataset from Kaggle.
Analysis Part: Use df.dropna(subset=['LoanAmount']) and compare the dataset size before and after.
27. Perform a scatter plot analysis between 'ApplicantIncome' and 'LoanAmount' in the 'loan.csv' dataset.
Analysis Part: Plot and discuss any visible correlations or patterns.
28. Identify outliers in the 'Age' column of the 'credit.csv' dataset from UCI Machine Learning
Repository.
Analysis Part: Use a box plot to identify any outliers and discuss their implications.
29. Use one-hot encoding for the 'Education' column in the 'credit.csv' dataset.
Analysis Part: Apply one-hot encoding and visualize the new column distribution.
30. Handle missing values in the 'CreditAmount' column of the 'credit.csv' dataset.
Analysis Part: Use df['CreditAmount'].fillna(df['CreditAmount'].mean()) and visualize using a histogram.
31. Visualize the correlation between 'Age' and 'CreditAmount' in the 'credit.csv' dataset using a scatter plot.
Analysis Part: Plot and analyze any potential relationships.
32. Encode the 'Smoker' column in the 'insurance.csv' dataset from Kaggle using label encoding.
Analysis Part: Use label encoding and visualize the distribution of smokers.
33. Normalize the 'BMI' and 'Charges' columns of the 'insurance.csv' dataset.
Analysis Part: Use StandardScaler and create a scatter plot to visualize normalized data.
34. Drop columns with missing values in the 'cars.csv' dataset from UCI Machine Learning Repository.
Analysis Part: Use df.dropna(axis='columns') and discuss which columns were dropped.
35. Identify and handle outliers in the 'Horsepower' column of the 'cars.csv' dataset.
Analysis Part: Use a box plot to detect outliers and discuss strategies for handling them (e.g., capping, removing).
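
The Group B operations (splitting, min-max scaling, dropping rows or columns with missing values, and capping outliers) can be sketched as follows. This is a sketch under stated assumptions: the DataFrame is hypothetical toy data that borrows column names from several of the questions (the values are made up), and test_size=0.2 and random_state=42 are arbitrary choices, not values given in the question bank. Capping at the 1.5 * IQR fences is one of several possible strategies for question 35.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler

# Hypothetical toy data; column names are borrowed from the questions,
# values are made up for illustration.
df = pd.DataFrame({
    "Glucose": [85.0, 120.0, 150.0, 99.0, 180.0, 110.0, 95.0, 140.0, 130.0, 105.0],
    "BMI": [22.1, 30.5, 35.2, 26.4, 40.0, 28.3, 24.9, 33.1, 31.7, 27.0],
    "LoanAmount": [120.0, np.nan, 95.0, 200.0, np.nan, 150.0, 80.0, 110.0, 130.0, 600.0],
    "Outcome": [0, 1, 1, 0, 1, 0, 0, 1, 1, 0],
})

# Question 26 style: drop rows where one specific column is missing.
rows_dropped = df.dropna(subset=["LoanAmount"])
print(df.shape, "->", rows_dropped.shape)

# Questions 17 and 34 style: drop every column that contains a missing value.
cols_dropped = df.dropna(axis="columns")
removed_columns = sorted(set(df.columns) - set(cols_dropped.columns))
print("Dropped columns:", removed_columns)

# Question 13 style: split features and target into train and test sets.
X = df[["Glucose", "BMI"]]
y = df["Outcome"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
print(X_train.shape, X_test.shape)

# Question 14 style: MinMaxScaler maps each column to the range [0, 1].
# Fitting on the training set only avoids leaking test-set statistics.
scaler = MinMaxScaler().fit(X_train)
X_train_scaled = scaler.transform(X_train)
X_test_scaled = scaler.transform(X_test)

# Question 35 style: cap outliers at the 1.5 * IQR fences.
loan = rows_dropped["LoanAmount"]
q1, q3 = loan.quantile(0.25), loan.quantile(0.75)
iqr = q3 - q1
capped = loan.clip(lower=q1 - 1.5 * iqr, upper=q3 + 1.5 * iqr)
print(capped.max())
```

Capping keeps every row but pulls extreme values back to the fence, whereas removing outliers shrinks the dataset. Which is preferable depends on whether the extreme values are errors or genuine observations.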
