MidTerm assignment

The document outlines a mid-term assignment for a Machine Learning lab course at Vellore Institute of Technology for Fall 2024-25. It requires students to develop Python code for various machine learning tasks, including naïve Bayesian classification, CART decision trees, and building an Artificial Neural Network with specific data preprocessing and visualization steps. The assignment is due on 10/10/2024, with a lab quiz scheduled for 08/11/2024.

Uploaded by

jayavardhand2004
Vellore Institute of Technology,

Fall 2024-25
BCSE209P – Machine Learning lab
Mid-Term assignment

Instructions:
- Write Python code to develop machine learning models for the tasks given below.
- For each of the following tasks, visualize the results (before and after fitting the model).
- Your submission must include the following for each of the tasks given (wherever applicable):
  o Data pre-processing
  o Visualization of data before model fitting/learning
  o Code to train, test, and fit the model
  o Visualization of data after model fitting
- Due: 10/10/2024, 10:00 PM. Note: submissions after the due time will not be considered for grading. The lab quiz will be conducted on 08/11/2024 during lab hours.

1. Predict the class label of the following unknown sample using naïve Bayesian classification:
X = (color = “red”, type = “ordinary”, origin = “domestic”)
The training samples for the car theft database are given below.
The attributes are color, type, and origin; the class label, stolen, is either yes or no.
------------------------------------------------------------
ID   color    type       origin     Class: stolen?
------------------------------------------------------------
1    red      sports     domestic   yes
2    red      sports     domestic   no
3    red      sports     domestic   yes
4    yellow   sports     domestic   no
5    yellow   sports     imported   yes
6    yellow   ordinary   imported   no
7    yellow   ordinary   imported   yes
8    yellow   ordinary   domestic   no
9    red      ordinary   imported   no
10   red      sports     imported   yes
------------------------------------------------------------
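
As a starting point, the naïve Bayes calculation for this table can be sketched in plain Python. This is a minimal sketch, not a prescribed solution: it assumes plain relative-frequency estimates with no Laplace smoothing, and the names `ROWS` and `naive_bayes_scores` are hypothetical helpers introduced here for illustration.

```python
from collections import Counter
from fractions import Fraction

# Training samples from the car theft table: (color, type, origin, stolen)
ROWS = [
    ("red", "sports", "domestic", "yes"),
    ("red", "sports", "domestic", "no"),
    ("red", "sports", "domestic", "yes"),
    ("yellow", "sports", "domestic", "no"),
    ("yellow", "sports", "imported", "yes"),
    ("yellow", "ordinary", "imported", "no"),
    ("yellow", "ordinary", "imported", "yes"),
    ("yellow", "ordinary", "domestic", "no"),
    ("red", "ordinary", "imported", "no"),
    ("red", "sports", "imported", "yes"),
]


def naive_bayes_scores(rows, sample):
    """Return the unnormalized posterior P(c) * prod_i P(x_i | c) per class c."""
    class_counts = Counter(row[-1] for row in rows)
    total = len(rows)
    scores = {}
    for label, n_label in class_counts.items():
        score = Fraction(n_label, total)  # prior P(c)
        for i, value in enumerate(sample):
            # Count rows of this class whose i-th attribute matches the sample
            matches = sum(1 for row in rows if row[-1] == label and row[i] == value)
            score *= Fraction(matches, n_label)  # likelihood, no smoothing (assumption)
        scores[label] = score
    return scores


sample = ("red", "ordinary", "domestic")
scores = naive_bayes_scores(ROWS, sample)
prediction = max(scores, key=scores.get)

for label, score in scores.items():
    print(f"score(stolen={label}) = {score} = {float(score):.3f}")
print("Predicted class:", prediction)
```

Under these assumptions the score for stolen = yes is 1/2 × 3/5 × 1/5 × 2/5 = 0.024 and for stolen = no is 1/2 × 2/5 × 3/5 × 3/5 = 0.072, so the sample is classified as not stolen. Exact fractions are used so the hand calculation can be checked term by term.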

2. Construct a CART decision tree for the Iris data set and implement it in Python.
Use the fitted tree to predict class labels.
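
One possible way to approach this task is sketched below. It assumes scikit-learn is available and relies on the fact that its `DecisionTreeClassifier` is an optimized CART variant with Gini impurity as the default criterion. The 70/30 split, `max_depth=3`, the random seeds, and the `new_flower` measurements are illustrative choices, not requirements from the assignment.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()

# Hold out 30% of the samples for testing (illustrative split, stratified by class)
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.3, random_state=42, stratify=iris.target
)

# CART-style tree: binary splits chosen by Gini impurity
tree = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=42)
tree.fit(X_train, y_train)

# Evaluate on the held-out samples
y_pred = tree.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)

# Print the learned rules as text and report the test accuracy
print(export_text(tree, feature_names=iris.feature_names))
print(f"Test accuracy: {accuracy:.3f}")

# Predict the label of one hypothetical flower
# (sepal length, sepal width, petal length, petal width in cm)
new_flower = [[5.1, 3.5, 1.4, 0.2]]
predicted_name = iris.target_names[tree.predict(new_flower)[0]]
print("Predicted species:", predicted_name)
```

The text dump from `export_text` shows each threshold the tree tests, which makes it straightforward to trace by hand how a prediction is reached.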

3. The Diabetes data set has 10 baseline variables (age, sex, body mass index,
average blood pressure, and six blood serum measurements) obtained for each
of n = 442 diabetes patients, as well as the response of interest, a quantitative
measure of disease progression one year after baseline.

bmi body mass index
bp average blood pressure
s1 tc, total serum cholesterol
s2 ldl, low-density lipoproteins
s3 hdl, high-density lipoproteins
s4 tch, total cholesterol / HDL
s5 ltg, possibly log of serum triglycerides level
s6 glu, blood sugar level

- Create a data frame for the given values.
- Are there any missing values? Find and handle them using any statistical method.
- Apply data scaling methods to average blood pressure and high-density lipoproteins.
- Find the optimal training and test data sizes.
- Build an Artificial Neural Network by implementing the MLP algorithm and test its
accuracy (MAE, MSE, RMSE, R²) for at least 3 different parameters.

************
