0% found this document useful (0 votes)

9 views40 pages

AI - ML Lab Manual)

The document is a lab manual for the Artificial Intelligence and Machine Learning Laboratory course at Anjalai Ammal - Mahalingam Engineering College. It outlines course objectives, a list of experiments, and expected outcomes, including the implementation of various AI algorithms and models using Python. The manual also provides detailed instructions and sample programs for key exercises such as BFS, DFS, A* search, Naive Bayes, and Bayesian Networks.

Uploaded by

dkdeva57

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views40 pages

AI - ML Lab Manual)

Uploaded by

dkdeva57

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 40

ANJALAI AMMAL - MAHALINGAM ENGINEERING COLLEGE

Accredited by NAAC
KOVILVENNI - 614403
DEPARTMENT OF ELECTRONICS AND COMMUNICATION
ENGINEERING

LAB MANUAL

Course Name : Artificial Intelligence and Machine Learning

Laboratory
Course Code : CS3491
Course Credits : 02
Semester / year : VI / III Year
Regulation : 2021
Prepared By : Dr.C.Thiruvengadam, Associate Professor
CS3491 - Artificial Intelligence and Machine Learning Lab

COURSE OBJECTIVES:

The objectives of this course are to:

• Study about uninformed and Heuristic search techniques. (BL1)
• Learn techniques for reasoning under uncertainty (BL1)
• Introduce Machine Learning and supervised learning algorithms (BL1)
• Study about ensembling and unsupervised learning algorithms (BL2)
• Learn the basics of deep learning using neural networks (BL1)
 Bloom’s Level (BL)

LIST OF EXPERIMENTS

1. Implementation of Uninformed search algorithms (BFS, DFS)

2. Implementation of Informed search algorithms (A*, memory-bounded A*)
3. Implement naïve Bayes models
4. Implement Bayesian Networks
5. Build Regression models
6. Build decision trees and random forests
7. Build SVM models
8. Implement ensembling techniques
9. Implement clustering algorithms
10. Implement EM for Bayesian networks
11. Build simple NN models
12. Build deep learning NN models

TOTAL : 30 PERIODS
COURSE OUTCOMES
At the end of this course, the students will be able to:

Write python program to implement different AI search algorithms utilized in problem

C607.1
solving
C607.2 Explain the process of applying reasoning under uncertainty
C607.3 Design and simulate the supervised learning models using python programs
C607.4 Design and implement ensembling and unsupervised learning models
C607.5 Implement deep learning neural network models.

List of First Cycle Exercises:

1. Implementation of Uninformed search algorithms (BFS, DFS)
2. Implementation of Informed search algorithms (A*, memory-bounded A*)
3. Implement naïve Bayes models
4. Implement Bayesian Networks
5. Build Regression models
6. Build decision trees and random forest

List of Second Cycle Exercises:

7. Build SVM models

8. Implement ensembling techniques
9. Implement clustering algorithms
10. Implement EM for Bayesian networks
11. Build simple NN models
12. Build deep learning NN models
Exercise 1 Implementation of BFS & DFS

Aim: To write a program in Python to implement Breadth First Search and Depth First Search algorithm.

Breadth First Search:

Breadth-First Search (BFS) is a fundamental graph traversal algorithm that explores a graph
systematically by visiting all the neighboring nodes of a given starting node before moving on to the
next level of nodes. It traverses the graph level by level, starting from the root node or a specified
starting node. BFS is particularly useful for finding the shortest path between two nodes in an
unweighted graph and for exploring and analyzing graph structures. Breadth-First Search is a recursive
algorithm to search all the vertices of a graph or a tree.

Algorithm:
Step1: Initialize a queue and a set to keep track of visited nodes.
Step2: Enqueue the starting node into the queue and mark it as visited.
Step3: While the queue is not empty, dequeue a node from the queue.
Step4: Process the dequeued node.
Step5: Iterate through the neighbors of the dequeued node.
Step6: If a neighbor has not been visited, enqueue it into the queue and mark it as visited.

Program 1:
graph = {
'5' : ['3','7'],
'3' : ['2', '4'],
'7' : ['8'],
'2' : [],
'4' : ['8'],
'8' : []
}
visited = [] # List for visited nodes.
queue = [] #Initialize a queue
def bfs(visited, graph, node): #function for BFS
visited.append(node)
queue.append(node)
while queue: # Creating loop to visit each node
m = queue.pop(0)
print (m, end = "")
for neighbour in graph[m]:
if neighbour not in visited:
visited.append(neighbour)
queue.append(neighbour)
# Driver Code
print("Following is the Breadth-First Search")
bfs(visited, graph, '5') # function calling

Output:
Following BFS 5 3 7 2 4 8
Depth First Search:
The Depth-First Search is a recursive algorithm that uses the concept of backtracking. It involves
thorough searches of all the nodes by going ahead if potential, else by backtracking.

1. Start the program by putting any one of the graph's vertex on top of the stack.
2. After that take the top item of the stack and add it to the visited list of the vertex.
3. Next, create a list of that adjacent node of the vertex. Add the ones which aren't in the visited list of
vertexes to the top of the stack.
4. Lastly, keep repeating steps 2 and 3 until the stack is empty.

Program 2 :
graph = {
'5' : ['3','7'],
'3' : ['2', '4'],
'7' : ['8'],
'2' : [],
'4' : ['8'],
'8' : []
}

visited = set() # Set to keep track of visited nodes of graph.

def dfs(visited, graph, node): #function for dfs
if node not in visited:
print (node)
visited.add(node)
for neighbour in graph[node]:
dfs(visited, graph, neighbour)
# Driver Code
print("Following is the Depth-First Search")
dfs(visited, graph, '5')

Output:

Following DFS
5
3
2
4
8
7

Result:

Thus the Python Program for implementing BFS & DFS is executed successfully.
Exercise 2 Implementation of A* search & Memory bounded A* search

Aim : To write a program in Python to implement ASearch and SMA Search.

A* Search:
1. Initialize Open and Closed Sets: Create two sets, often implemented as priority queues, to keep track of
nodes that are being considered for expansion (Open Set) and nodes that have already been visited (Closed
Set).
2. Initialize Start Node: Create a node representing the initial state. Set its cost from the start node (g) to 0 and
its heuristic value (h) to an estimate of the cost to reach the goal from this node.
3. Add Start Node to Open Set: Add the start node to the Open Set.
4. Repeat Until Open Set is Empty or Goal is Found:
a. Pop the node with the lowest f value (f = g + h) from the Open Set. This node will be the current
node.
b. If the current node is the goal node, reconstruct the path from the start node to the goal node and
return it.
c. Otherwise, move the current node from the Open Set to the Closed Set.
d. Generate successor nodes of the current node and calculate their costs (g) and heuristic values (h).
e. For each successor node:
i. If the successor node is already in the Closed Set and the new path is not better, skip it.
ii. If the successor node is not in the Open Set or the new path is better, update the cost and
heuristic values, set the current node as its parent, and add it to the Open Set.
5. If the Open Set becomes empty and the goal node has not been found, the search fails (goal is unreachable).
6. Return the path from the start node to the goal node, if found.

Program:
def astaralgo(start_nod,stop_nod):
open_set=set(start_nod)
closed_set=set()
g={}
parents={}
g[start_nod]=0
parents[start_nod]=start_nod
while len(open_set) > 0:
n=None
for v in open_set:
if n == None or g[v] + heuristic(v) < g[n] + heuristic(n):
n=v
if n==stop_nod or graph_nod[n] == None:
pass
else:
for(m,weight) in get_neighbours(n):
if m not in open_set and m not in closed_set:
open_set.add(m)
parents[m] = n
g[m] = g[n]+weight
else:
if g[m]>g[n]+weight:
g[m]=g[n]+weight
parents[m]=n
if m in closed_set:
closed_set.remove(m)
open_set.add(m)
if n==None:
print("path doesn't exist")
return None
if n==stop_nod:
path=[]
while parents[n]!=n:
path.append(n)
n=parents[n]
path.append(start_nod)
path.reverse()
print("path found:{}".format(path))return path
open_set.remove(n) closed_set.add(n)
print("path doesn't exist !")

return None
def get_neighbours(v):
if v in graph_nod:
return graph_nod[v]
else:
return None
def heuristic(n):
h_dist={'a':11,'b':6,'c':5,'d':7,'e':3,'f':6,'g':5,'h':3,'i':1,'j':0}
return h_dist[n]
graph_nod={
'a':[('b',6),('f',3)],
'b':[('a',6),('c',3),('d',2)],
'c':[('b',3),('d',1),('e',5)],
'd':[('b',2),('c',1),('e',8)],
'e':[('c',5),('d',8),('i',5),('j',5)],
'f':[('a',3),('g',1),('h',7)],
'g':[('f',1),('i',3)],
'h':[('f',7),('i',2)],
'i':[('e',5),('g',3),('h',2),('j',3)]
}
astaralgo('a' , 'j')
Output:

path found:['a', 'f', 'g', 'i', 'j']

Result:
Thus the Python Program for implementing A* search is executed successfully.
Exercise 3 Implementation of Naive Bayes Classifier model

Aim : To implement Naïve Bayes Classifier model in Python

Algorithm :

Naive Bayes classifier calculates the probability of an event in the following steps:

Step 1: Calculate the prior probability for given class labels

Step 2: Find Likelihood probability with each attribute for each class
Step 3: Put these value in Bayes Formula and calculate posterior probability.
Step 4: See which class has a higher probability, given the input belongs to the higher probability class.

Program :
from functools import reduce
import pandas as pd
import pprint
class Classifier():
data = None
class_attr = None
priori = {}
cp = {}
hypothesis=None
def init (self, filename=None, class_attr=None):
self.data=pd.read_csv(filename, sep=',', header = (0))
self.class_attr = class_attr

def calculate_priori(self):
class_values = list (set (self.data[self.class_attr]))
class_data = list (self.data[self.class_attr])
for i in class_values:
self.priori[i] = class_data.count (i)/float (len (class_data))
print ("Priori Values: ", self.priori)

def get_cp(self, attr, attr_type, class_value):

data_attr= list(self.data[attr])
class_data=list (self.data[self.class_attr])
total =1
for i in range (0, len (data_attr)):
if class_data[i] == class_value and data_attr[i] == attr_type:
total+=1
return total/float (class_data.count (class_value))
def calculate_conditional_probabilities (self, hypothesis):
for i in self.priori:
self.cp[i] = {}
for j in hypothesis:
self.cp[i].update({ hypothesis [j]: self.get_cp (j, hypothesis [j], i)})
print ("\nCalculated Conditional Probabilities: \n")
pprint.pprint (self.cp)
def classify (self):
print ("Result: ")

for i in self.cp:
print (i, "==>", reduce (lambda x,y: x*y, self.cp[i].values())*self.priori[i])
if name ==" main ":
c =Classifier(filename="form.csv",class_attr="Play" )
c.calculate_priori()
c.hypothesis ={"Outlook": 'Rainy', "Temp": 'Mild', "Humidity": 'Normal', "Windy": 't'}
c.calculate_conditional_probabilities (c.hypothesis)
c.classify()

Output:
Priori Values: {'no': 0.35714285714285715, 'yes': 0.6428571428571429}

Calculated Conditional Probabilities:

{'no': {'Mild': 0.6, 'Normal': 0.4, 'Rainy': 0.8, 't': 0.6},

'yes': {'Mild': 0.5555555555555556,
'Normal': 0.7777777777777778,
'Rainy': 0.3333333333333333,
't': 0.4444444444444444}}
Result:
no ==> 0.04114285714285714
yes ==> 0.04115226337448559

Result:
Thus the Python Program for implementing Naïve Baye’s classification is executed successfully.
Exercise 4 Implementation of Bayesian Network

Aim : To implement Bayesian Networks using Python.

Description:
A Bayesian Network falls under the category of Probabilistic Graphical Modelling (PGM) technique that
is used to compute uncertainties by using the concept of probability. Popularly known as Belief
Networks, Bayesian Networks are used to model uncertainties by using Directed Acyclic
Graphs (DAG).

Program :
import numpy as np
import pandas as pd
import csv
from pgmpy.estimators import MaximumLikelihoodEstimator
from pgmpy.models import BayesianNetwork
from pgmpy.inference import VariableElimination

heartDisease = pd.read_csv('heart.csv')
heartDisease = heartDisease.replace('?',np.nan)

print('Sample instances from the dataset are given below')

print(heartDisease.head())

print('\n Attributes and datatypes')

print(heartDisease.dtypes)

model=
BayesianNetwork([('age','heartdisease'),('gender','heartdisease'),('exang','heartdisease'),('cp','heartdisease'),
('heartdisease','restecg'),('heartdisease','chol')])
print('\nLearning CPD using Maximum likelihood estimators')
model.fit(heartDisease,estimator=MaximumLikelihoodEstimator)
print('\n Inferencing with Bayesian Network:')
HeartDiseasetest_infer = VariableElimination(model)

print('\n 1. Probability of HeartDisease given evidence= restecg')

q1=HeartDiseasetest_infer.query(variables=['heartdisease'],evidence={'restecg':1})
print(q1)

print('\n 2. Probability of HeartDisease given evidence= cp ')

q2=HeartDiseasetest_infer.query(variables=['heartdisease'],evidence={'cp':2})
print(q2)

Output:
Priori Values: {'yes': 0.6428571428571429, 'no': 0.35714285714285715}

Calculated Conditional Probabilities:

{'no': {'Mild': 0.6, 'Normal': 0.4, 'Rainy': 0.8, 't': 0.8},

'yes': {'Mild': 0.5555555555555556,
'Normal': 0.7777777777777778,
'Rainy': 0.3333333333333333,
't': 0.4444444444444444}}
Result:
yes ==> 0.04115226337448559
no ==> 0.05485714285714286

Result:
Thus the Python Program for implementing Bayesian Network is executed successfully.
Exercise 5 Implementing Regression model

Aim : To implement a Regression Model using Python.

Description:
Linear regression is a regression model that estimates the relationship between one independent
variable and one dependent variable using a straight line.

Algorithm:
Step 1: Import the packages and classes that you need.
Step 2: Provide data to work with, and eventually do appropriate transformations.
Step 3: Create a regression model and fit it with existing data.
Step 4: Check the results of model fitting to know whether the model is satisfactory.
Step 5: Apply the model for predictions.

[Linear Regression]

Program:
import numpy as np
import matplotlib.pyplot as plt
def estimate_coef(x, y):
n = np.size(x)
m_x = np.mean(x)
m_y = np.mean(y)
SS_xy = np.sum(y*x) - n*m_y*m_x
SS_xx = np.sum(x*x) - n*m_x*m_x
b_1 = SS_xy / SS_xx
b_0 = m_y - b_1*m_x
return (b_0, b_1)
def plot_regression_line(x, y, b):
plt.scatter(x, y, color = "m",marker = "o", s = 30)
y_pred = b[0] + b[1]*x
plt.plot(x, y_pred, color = "g")
plt.xlabel('x')
plt.ylabel('y')
plt.show()

def main():
x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
y = np.array([1, 3, 2, 5, 7, 8, 8, 9, 10, 12])
b = estimate_coef(x, y)
print("Estimated coefficients:\nb_0 = {} \nb_1 = {}".format(b[0], b[1]))
plot_regression_line(x, y, b)
if name == " main ":
main()
Output:

Estimated coefficients:
b_0 = 1.2363636363636363
b_1 = 1.1696969696969697

[Logistic Regression]

Description:

Logistic regression is a data analysis technique that uses mathematics to find the relationships between
two data factors. It then uses this relationship to predict the value of one of those factors based on the other. The
prediction usually has a finite number of outcomes, like yes or no.

Program:
import numpy
from sklearn import linear_model

X = numpy.array([3.78, 2.44, 2.09, 0.14, 1.72, 1.65, 4.92, 4.37, 4.96, 4.52, 3.69, 5.88]).reshape(-1,1)
y = numpy.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1])
logr = linear_model.LogisticRegression()
logr.fit(X,y)

predicted = logr.predict(numpy.array([3.46]).reshape(-1,1))
print(predicted)

Output: [0]
[Least Square Regression]

Description:

The least-square regression is the technique used in regression analysis of the ML model and AI
implementation. It is one of the popular mathematical methods for finding the best possible fit line that defines the
connection between dependent and independent variables.

Program:

def calculateB(x, y, n):

sx = sum(x)
sy = sum(y)
sxsy = 0
sx2 = 0
for i in range(n):
sxsy += x[i] * y[i]
sx2 += x[i] * x[i]
b = (n * sxsy - sx * sy)/(n * sx2 - sx * sx)
return b
def leastRegLine(X,Y,n):
b = calculateB(X, Y, n)
meanX = int(sum(X)/n)
meanY = int(sum(Y)/n)
a = meanY - b * meanX
print("Regression line:")
print("Y = ", '%.3f'%a, " + ", '%.3f'%b, "*X",
sep="")
X = [95, 85, 80, 70, 60 ]
Y = [90, 80, 70, 65, 60 ]
n = len(X)
leastRegLine(X, Y, n)

Output:

Regression line:
Y = 5.685 + 0.863*X

[Bayesian Regression]
Bayesian linear regression is a type of conditional modeling in which the mean of one variable is
described by a linear combination of other variables, with the goal of obtaining the posterior probability of the
regression coefficients

Program:

from sklearn.datasets import fetch_california_housing

from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score
from sklearn.linear_model import BayesianRidge
dataset =fetch_california_housing()
X, y = dataset.data, dataset.target
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.15, random_state = 42)

model = BayesianRidge() model.fit(X_train, y_train) prediction = model.predict(X_test)

print(f"r2 Score Of Test Set : {r2_score(y_test, prediction)}")

Output:

r2 Score Of Test Set : 0.5904164911079081

Result:
Thus the Python Program for implementing Regression Model is executed successfully.
Exercise 6a. Decision Tree Classification

Aim: To implement Decision tree classification in Python.

Decision Trees (DTs) are a non-parametric supervised learning method used

for classification and regression. The goal is to create a model that predicts the value of a target variable
by learning simple decision rules inferred from the data features. A tree can be seen as a piecewise
constant approximation.

Program:

from matplotlib import pyplot as plt

from sklearn import datasets
from sklearn.tree import DecisionTreeClassifier
from sklearn import tree
iris = datasets.load_iris()
X = iris.data
y = iris.target
clf = DecisionTreeClassifier(random_state=12345)
model = clf.fit(X, y)
#fig = plt.figure(figsize=(250,200))
tree.plot_tree(clf,feature_names=iris.feature_names,class_names=iris.target_names,filled=True)
plt.show()

Output:

Result:
Thus the Python Program for implementing Decision tree classification is executed successfully.
Exercise 6b. Random Forest Classifier

Aim: To implement Random Forest Classifier in Python.

Random forest Classifier is a commonly-used machine learning algorithm, which combines the output
of multiple decision trees to reach a single result. Its ease of use and flexibility have fueled its adoption,
as it handles both classification and regression problems.

Program:

from sklearn.datasets import load_wine

wine = load_wine()
X = wine.data
y = wine.target
from sklearn.ensemble import RandomForestClassifier

rf = RandomForestClassifier(n_estimators=100,
max_depth=3,
min_samples_leaf=4,
bootstrap=True,
n_jobs=-1,

random_state=0)
rf.fit(X, y)
#rf.estimators_[index]
import matplotlib.pyplot as plt
from sklearn.tree import plot_tree

fig = plt.figure(figsize=(15, 10))

plot_tree(rf.estimators_[0],
feature_names=wine.feature_names,
class_names=wine.target_names,
filled=True, impurity=True,
rounded=True)
plt.show()
Output:

Result:
Thus the Python Program for implementing Random Forest Classifier is executed successfully.
Exercise 7: SVM Models

Aim:
The aim of this Python code is to demonstrate how to use the scikit-learn library to
train support vector machine (SVM) models for classification tasks.

Algorithm:
1. Load a dataset using the pandas library
2. Split the dataset into training and testing sets using train_test_split function from
scikit-learn
3. Train three SVM models with different kernels (linear, polynomial, and RBF) using
SVC function from scikit-learn
4. Predict the test set labels using the trained models
5. Evaluate the accuracy of the models using the accuracy_score function from scikit-
learn
6. Print the accuracy of each model

Program:
import pandas as pd
fromsklearn.model_selection import train_test_split
fromsklearn.svm import SVC
fromsklearn.metrics import accuracy_score

# Load the dataset

data = pd.read_csv('data.csv')

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(data.drop('target', axis=1), data['target'],
test_size=0.3, random_state=42)

# Train an SVM model with a linear kernel

svm_linear = SVC(kernel='linear')
svm_linear.fit(X_train, y_train)

# Predict the test set labels

y_pred = svm_linear.predict(X_test)
# Evaluate the model's accuracy
accuracy = accuracy_score(y_test, y_pred)
print(f'Linear SVM accuracy: {accuracy:.2f}')

# Train an SVM model with a polynomial kernel

svm_poly = SVC(kernel='poly', degree=3)
svm_poly.fit(X_train, y_train)

# Predict the test set labels

y_pred = svm_poly.predict(X_test)

# Evaluate the model's accuracy

accuracy = accuracy_score(y_test, y_pred)
print(f'Polynomial SVM accuracy: {accuracy:.2f}')

# Train an SVM model with an RBF kernel

svm_rbf = SVC(kernel='rbf')
svm_rbf.fit(X_train, y_train)

# Predict the test set labels

y_pred = svm_rbf.predict(X_test)

# Evaluate the model's accuracy

accuracy = accuracy_score(y_test, y_pred)
print(f'RBF SVM accuracy: {accuracy:.2f}')

Output:
Accuracy: 0.9777777777777777

Result:
Thus the program for Build SVM Model has been executed successfully and output is verified.
Exercise 8: Implementation of Ensembling techniques

Aim:
The aim of ensembling is to combine the predictions of multiple individual models,
Known as base models, in order to produce a final prediction that is more accurate and
reliable than any individual model. (Bagging)

Algorithm:
1. Load the dataset and split it into training and testing sets.
2. Choose the base models to be included in the ensemble.
3. Train each base model on the training set.
4. Combine the predictions of the base models using the chosen ensembling technique
(bagging).
5. Evaluate the performance of the ensemble model on the testing set.
6. If the performance is satisfactory, deploy the ensemble model for making predictions
on new data.

Program:

from sklearn.datasets import make_classification

# define dataset
X, y = make_classification(n_samples=1000, n_features=20, n_informative=15, n_redundant=5,
random_state=5)
# summarize the dataset
print(X.shape, y.shape)
from numpy import mean
from numpy import std
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.model_selection import RepeatedStratifiedKFold
from sklearn.ensemble import BaggingClassifier
# define dataset
X, y = make_classification(n_samples=1000, n_features=20, n_informative=15, n_redundant=5,
random_state=5)
# define the model
model = BaggingClassifier()
# evaluate the model
cv = RepeatedStratifiedKFold(n_splits=10, n_repeats=3, random_state=1)
n_scores = cross_val_score(model, X, y, scoring='accuracy', cv=cv, n_jobs=-1, error_score='raise')
# report performance
print('Accuracy: %.3f (%.3f)' % (mean(n_scores), std(n_scores)))
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
# define dataset
X, y = make_classification(n_samples=1000, n_features=20, n_informative=15, n_redundant=5,
random_state=5)
# define the model
model = BaggingClassifier()
# fit the model on the whole dataset
model.fit(X, y)
# make a single prediction
row = [[-4.7705504,-1.88685058,-0.96057964,2.53850317,-6.5843005,3.45711663,-
7.46225013,2.01338213,-0.45086384,-1.89314931,-2.90675203,-0.21214568,-
0.9623956,3.93862591,0.06276375,0.33964269,4.0835676,1.31423977,-2.17983117,3.1047287]]
yhat = model.predict(row)
print('Predicted Class: %d' % yhat[0])

Output:
(1000, 20) (1000,)
Accuracy: 0.862 (0.042)
Predicted Class: 1

Result:
Thus the program for Bagging has been executed successfully and output is verified
Exercise 9: Implementation of Clustering Techniques

Aim:
The aim of clustering is to find patterns and structure in data that may not be
immediately apparent, and to discover relationships and associations between data points.

Algorithm:
1. Data preparation: The first step is to prepare the data that we want to cluster. This may involve
data cleaning, normalization, and feature extraction, depending on the type and quality of the data.
2. Choosing a distance metric: The next step is to choose a distance metric or similarity measure
that will be used to determine the similarity between data points. Common distance metrics
include Euclidean distance, Manhattan distance, and cosine similarity.
3. Choosing a clustering algorithm: There are many clustering algorithms available, each with its
own strengths and weaknesses. Some popular clustering algorithms include K-Means,
Hierarchical clustering, and DBSCAN.
4. Choosing the number of clusters: Depending on the clustering algorithm chosen, we may need to
specify the number of clusters we want to form. This can be done using domain knowledge or by
using techniques such as the elbow method or silhouette analysis.
5. Cluster assignment: Once the clusters have been formed, we need to assign each data point to its
nearest cluster based on the chosen distance metric.
6. Interpretation and evaluation: Finally, we need to interpret and evaluate the results of the
clustering algorithm to determine if the clustering has produced meaningful and useful insights.

Program:

import pandas as pd
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
# load the customer data into a DataFrame
customer_df = pd.read_csv('customer_data.csv')
# Check the first 5 rows
customer_df.head()
plt.scatter(customer_df["Age"],
customer_df["Spending Score (1-100)"])
plt.xlabel("Age")
plt.ylabel("Spending Score (1-100)")
plt.scatter(customer_df["Age"],
customer_df["Annual Income (k$)"])
plt.xlabel("Age")
plt.ylabel("Annual Income (k$)")
relevant_cols = ["Age", "Annual Income (k$)", "Spending Score (1-100)"]
customer_df = customer_df[relevant_cols]
from sklearn.preprocessing import StandardScaler
scaler = StandardScaler()
scaler.fit(customer_df)
scaled_data = scaler.transform(customer_df)
def find_best_clusters(df, maximum_K):

clusters_centers = []
k_values = []

for k in range(1, maximum_K):

kmeans_model = KMeans(n_clusters = k)
kmeans_model.fit(df)

clusters_centers.append(kmeans_model.inertia_)
k_values.append(k)
return clusters_centers, k_values
def generate_elbow_plot(clusters_centers, k_values):

figure = plt.subplots(figsize = (12, 6))

plt.plot(k_values, clusters_centers, 'o-', color = 'orange')
plt.xlabel("Number of Clusters (K)")
plt.ylabel("Cluster Inertia")
plt.title("Elbow Plot of KMeans")
plt.show()
clusters_centers, k_values = find_best_clusters(scaled_data, 12)

generate_elbow_plot(clusters_centers, k_values)
kmeans_model = KMeans(n_clusters = 5)

kmeans_model.fit(scaled_data)
customer_df["clusters"] = kmeans_model.labels_

customer_df.head()
plt.scatter(customer_df["Spending Score (1-100)"],
customer_df["Annual Income (k$)"],
c = customer_df["clusters"])
Result:
Thus the K-Means program is executed successfully and output is verified.
Exercise 10: Implementation of EM algorithm

Aim :
To implement Expectation- Maximization algorithm in Python.

What is an EM algorithm?

The Expectation-Maximization (EM) algorithm is defined as the combination of various

unsupervised machine learning algorithms, which is used to determine the local maximum
likelihood estimates (MLE) or maximum a posteriori estimates (MAP) for unobservable
variables in statistical models. Further, it is a technique to find maximum likelihood estimation
when the latent variables are present. It is also referred to as the latent variable model.

Algorithm:

 st
Step: The very first step is to initialize the parameter values. Further, the system is provided with
incomplete observed data with the assumption that data is obtained from a specific model.

 2nd Step: This step is known as Expectation or E-Step, which is used to estimate or guess the values
of the missing or incomplete data using the observed data. Further, E-step primarily updates the
variables.

 3rd Step: This step is known as Maximization or M-step, where we use complete data obtained
from the 2nd step to update the parameter values. Further, M-step primarily updates the hypothesis.

 4th step: The last step is to check if the values of latent variables are converging or not. If it gets
"yes", then stop the process; else, repeat the process from step 2 until the convergence occurs.

Program :
import numpy as np # import numpy
from numpy.linalg import inv # for matrix inverse
import matplotlib.pyplot as plt # import matplotlib.pyplot for plotting framework
from scipy.stats import multivariate_normal # for generating pdf
m1 = [1,1] # consider a random mean and covariance value
m2 = [7,7]
cov1 = [[3, 2], [2, 3]]
cov2 = [[2, -1], [-1, 2]]x = np.random.multivariate_normal(m1, cov1, size=(200,)) # Generating 200
samples for each mean and covariance
y = np.random.multivariate_normal(m2, cov2, size=(200,))d = np.concatenate((x, y), axis=0)
plt.figure(figsize=(10,10))
plt.scatter(d[:,0], d[:,1], marker='o')
plt.axis('equal')
plt.xlabel('X-Axis', fontsize=16)
plt.ylabel('Y-Axis', fontsize=16)
plt.title('Ground Truth', fontsize=22)
plt.grid()
plt.show()
m1 = random.choice(d)

m2 = random.choice(d)
cov1 = np.cov(np.transpose(d))
cov2 = np.cov(np.transpose(d))
pi = 0.5
x1 = np.linspace(-4,11,200)
x2 = np.linspace(-4,11,200)
X, Y = np.meshgrid(x1,x2)

Z1 = multivariate_normal(m1, cov1)
Z2 = multivariate_normal(m2, cov2)

pos = np.empty(X.shape + (2,)) # a new array of given shape and type, without initializing
entries
pos[:, :, 0] = X; pos[:, :, 1] = Y

plt.figure(figsize=(10,10)) # #creating the figure and assigning the

size
plt.scatter(d[:,0], d[:,1], marker='o')
plt.contour(X, Y, Z1.pdf(pos), colors="r" ,alpha = 0.5)
plt.contour(X, Y, Z2.pdf(pos), colors="b" ,alpha = 0.5)
plt.axis('equal') # making both the axis equal
plt.xlabel('X-Axis', fontsize=16) # X-Axis
plt.ylabel('Y-Axis', fontsize=16) # Y-Axis
plt.title('Initial State', fontsize=22) # Title of the plot
plt.grid() # displaying gridlines
plt.show()
##Expectation step
def Estep(lis1):
m1=lis1[0]
m2=lis1[1]
cov1=lis1[2]
cov2=lis1[3]
pi=lis1[4]

pt2 = multivariate_normal.pdf(d, mean=m2, cov=cov2)

pt1 = multivariate_normal.pdf(d, mean=m1, cov=cov1)
w1 = pi * pt2
w2 = (1-pi) * pt1
eval1 = w1/(w1+w2)

return(eval1)

## Maximization step
def Mstep(eval1):
num_mu1,din_mu1,num_mu2,din_mu2=0,0,0,0

for i in range(0,len(d)):
num_mu1 += (1-eval1[i]) * d[i]
din_mu1 += (1-eval1[i])
num_mu2 += eval1[i] * d[i]
din_mu2 += eval1[i]
mu1 = num_mu1/din_mu1
mu2 = num_mu2/din_mu2
num_s1,din_s1,num_s2,din_s2=0,0,0,0
for i in range(0,len(d)):
q1 = np.matrix(d[i]-mu1)
num_s1 += (1-eval1[i]) * np.dot(q1.T, q1)
din_s1 += (1-eval1[i])

q2 = np.matrix(d[i]-mu2)
num_s2 += eval1[i] * np.dot(q2.T, q2)
din_s2 += eval1[i]

s1 = num_s1/din_s1
s2 = num_s2/din_s2

pi = sum(eval1)/len(d)

lis2=[mu1,mu2,s1,s2,pi]
return(lis2)
def plot(lis1):
mu1=lis1[0]
mu2=lis1[1]
s1=lis1[2]
s2=lis1[3]
Z1 = multivariate_normal(mu1, s1)
Z2 = multivariate_normal(mu2, s2)

pos = np.empty(X.shape + (2,)) # a new array of given shape and type, without initializing
entries
pos[:, :, 0] = X; pos[:, :, 1] = Y

plt.figure(figsize=(10,10)) # creating the figure and assigning the

size
plt.scatter(d[:,0], d[:,1], marker='o')
plt.contour(X, Y, Z1.pdf(pos), colors="r" ,alpha = 0.5)
plt.contour(X, Y, Z2.pdf(pos), colors="b" ,alpha = 0.5)
plt.axis('equal') # making both the axis equal
plt.xlabel('X-Axis', fontsize=16) # X-Axis
plt.ylabel('Y-Axis', fontsize=16) # Y-Axis
plt.grid() # displaying gridlines
plt.show()
iterations = 20
lis1=[m1,m2,cov1,cov2,pi]
for i in range(0,iterations):
lis2 = Mstep(Estep(lis1))
lis1=lis2
if(i==0 or i == 4 or i == 9 or i == 14 or i == 19):
plot(lis1)
Output :
Result :
Thus the EM algorithm is executed and verified successfully.
Exercise 11: Building of simple Neural Networks

Aim:
The aim of building simple neural network (NN) models is to create a basic architecture that
can learn patterns from data and make predictions based on the input. This can involve defining the
structure of the NN, selecting appropriate activation functions, and tuning the hyperparameters to
optimize the performance of the model.

Algorithm:

1. Data preparation: Preprocess the data to make it suitable for training the NN. This may involve
normalizing the input data, splitting the data into training and validation sets, and encoding the output
variables if necessary.
2. Define the architecture: Choose the number of layers and neurons in the NN, and define the
activation functions for each layer. The input layer should have one neuron per input feature, and the
output layer should have one neuron per output variable.
3. Initialize the weights: Initialize the weights of the NN randomly, using a small value to avoid
saturating the activation functions.
4. Forward propagation: Feed the input data forward through the NN, applying the activation
functions at each layer, and compute the output of the NN.
5. Compute the loss: Calculate the error between the predicted output and the true output, using a
suitable loss function such as mean squared error or cross-entropy.
6. Backward propagation: Compute the gradient of the loss with respect to the weights, using the
chain rule and backpropagate the error through the NN to adjust the weights.
7. Update the weights: Adjust the weights using an optimization algorithm such as stochastic
gradient descent or Adam, and repeat steps 4-7 for a fixed number of epochs or until the performance
on the validation set stops improving.
8. Evaluate the model: Test the performance of the model on a held-out test set and report the
accuracy or other performance metrics.

Program:

from numpy import exp, array, random, dot

class NeuralNetwork():
def init (self):
# Seed the random number generator, so it generates the same numbers
# every time the program runs.
random.seed(1)
# We model a single neuron, with 3 input connections and 1 output connection.
# We assign random weights to a 3 x 1 matrix, with values in the range -1 to 1
# and mean 0.
self.synaptic_weights = 2 * random.random((3, 1)) - 1

# The Sigmoid function, which describes an S shaped curve.

# We pass the weighted sum of the inputs through this function to
# normalise them between 0 and 1.
def sigmoid(self, x):
return 1 / (1 + exp(-x))

# The derivative of the Sigmoid function.

# This is the gradient of the Sigmoid curve.
# It indicates how confident we are about the existing weight.
def sigmoid_derivative(self, x):
return x * (1 - x)

# We train the neural network through a process of trial and error.

# Adjusting the synaptic weights each time.
def train(self, training_set_inputs, training_set_outputs, number_of_training_iterations):
for iteration in range(number_of_training_iterations):
# Pass the training set through our neural network (a single neuron).
output = self.think(training_set_inputs)

# Calculate the error (The difference between the desired output

# and the predicted output).
error = training_set_outputs - output

# Multiply the error by the input and again by the gradient of the Sigmoid curve.
# This means less confident weights are adjusted more.
# This means inputs, which are zero, do not cause changes to the weights.
adjustment = dot(training_set_inputs.T, error * self. sigmoid_derivative(output))

# Adjust the weights.

self.synaptic_weights += adjustment

# The neural network thinks.

def think(self, inputs):
# Pass inputs through our neural network (our single neuron).
return self. sigmoid(dot(inputs, self.synaptic_weights))

if name == " main ":

#Intialise a single neuron neural network.
neural_network = NeuralNetwork()

print ("Random starting synaptic weights: ")

print (neural_network.synaptic_weights)

# The training set. We have 4 examples, each consisting of 3 input values

# and 1 output value.
training_set_inputs = array([[0, 0, 1], [1, 1, 1], [1, 0, 1], [0, 1, 1]])
training_set_outputs = array([[0, 1, 1, 0]]).T

# Train the neural network using a training set.

# Do it 10,000 times and make small adjustments each time.
neural_network.train(training_set_inputs, training_set_outputs, 10000)

print ("New synaptic weights after training: ")

print (neural_network.synaptic_weights)

# Test the neural network with a new situation.

print ("Considering new situation [1, 0, 0] -> ?: ")
print (neural_network.think(array([1, 0, 0])))

Output :
Random starting synaptic weights:
[[-0.16595599]
[ 0.44064899]
[-0.99977125]]
New synaptic weights after training:
[[ 9.67299303]
[-0.2078435 ]
[-4.62963669]]
Considering new situation [1, 0, 0] -> ?:
[0.99993704]
Result:
Thus the simple Neural Network is built and executed successfully.
Exercise 12: Building of Deep Neural Networks

Aim : To build deep neural networks using Python programming.

What is Deep Learning?

Deep Learning is a part of machine learning that deals with algorithms inspired by the structure and
function of the human brain. It uses artificial neural networks to build intelligent models and solve complex
problems. We mostly use deep learning with unstructured data.

Program :
from numpy import loadtxt
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense
# load the dataset
dataset = loadtxt('pima-indians-diabetes.csv', delimiter=',')
# split into input (X) and output (y) variables
X = dataset[:,0:8]
y = dataset[:,8]
# define the keras model
model = Sequential()
model.add(Dense(12, input_shape=(8,), activation='relu'))
model.add(Dense(8, activation='relu'))
model.add(Dense(1, activation='sigmoid'))
# compile the keras model
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
# fit the keras model on the dataset
model.fit(X, y, epochs=150, batch_size=10)
# evaluate the keras model
_, accuracy = model.evaluate(X, y)
print('Accuracy: %.2f' % (accuracy*100))
# fit the keras model on the dataset without progress bars
model.fit(X, y, epochs=150, batch_size=10, verbose=0)
# evaluate the keras model
_, accuracy = model.evaluate(X, y, verbose=0)

Output :
Accuracy: 75.00
Result :
Thus the Deep Neural network is built and executed successfully.

CS3491-AI &ML Lab Manual
No ratings yet
CS3491-AI &ML Lab Manual
38 pages
Cs 3491 Ai ML Lab Manual
No ratings yet
Cs 3491 Ai ML Lab Manual
43 pages
AIML Lab Manual
No ratings yet
AIML Lab Manual
44 pages
Wa0001
No ratings yet
Wa0001
62 pages
Aiml Lab Manual Upto DT
No ratings yet
Aiml Lab Manual Upto DT
40 pages
Aiml Lab Manual Upto DT
No ratings yet
Aiml Lab Manual Upto DT
40 pages
AI & ML LABORATORY Final
No ratings yet
AI & ML LABORATORY Final
104 pages
AI ML Lab Manual - Prepared by Mrs. R. Viniba-1
No ratings yet
AI ML Lab Manual - Prepared by Mrs. R. Viniba-1
44 pages
Aiml-Lab-Manual 24-25
No ratings yet
Aiml-Lab-Manual 24-25
39 pages
Aimlrecord 8
No ratings yet
Aimlrecord 8
31 pages
AIML
No ratings yet
AIML
32 pages
AIMLLABMANUALWITHOUTPUT
No ratings yet
AIMLLABMANUALWITHOUTPUT
34 pages
Cs3491 - Ai&Ml Labrecord
No ratings yet
Cs3491 - Ai&Ml Labrecord
29 pages
CS3491 AI & ML Laboratory Notes
No ratings yet
CS3491 AI & ML Laboratory Notes
34 pages
Cs3491-Artificial Intelligence and Machine - Learning Laboratory
No ratings yet
Cs3491-Artificial Intelligence and Machine - Learning Laboratory
45 pages
Cs3491-Aiml Lab Manual
No ratings yet
Cs3491-Aiml Lab Manual
59 pages
Cs3491-Aiml Lab Manual
No ratings yet
Cs3491-Aiml Lab Manual
48 pages
AIML Lab - SRMTRPEC - Observation - 20 Feb 24
No ratings yet
AIML Lab - SRMTRPEC - Observation - 20 Feb 24
73 pages
AI ML Manual
No ratings yet
AI ML Manual
36 pages
Lab Manual
No ratings yet
Lab Manual
43 pages
AIML Sem4 Record
No ratings yet
AIML Sem4 Record
34 pages
Final Copy Aiml
No ratings yet
Final Copy Aiml
68 pages
Aiml Lab Manual Upto DT
No ratings yet
Aiml Lab Manual Upto DT
40 pages
Ai&Ml Lab Manual (1) Cse II
No ratings yet
Ai&Ml Lab Manual (1) Cse II
35 pages
Lab Manual For Aiml
No ratings yet
Lab Manual For Aiml
28 pages
Aiml Final
No ratings yet
Aiml Final
107 pages
AI & ML Lab Manual for IT Students
No ratings yet
AI & ML Lab Manual for IT Students
18 pages
AI 0ML-1 (1) Merged
No ratings yet
AI 0ML-1 (1) Merged
47 pages
AI Lab Report for Students
No ratings yet
AI Lab Report for Students
24 pages
Aiml Lab Record
No ratings yet
Aiml Lab Record
58 pages
Clustering With HDBSCAN Model - Pleiades Clustering
No ratings yet
Clustering With HDBSCAN Model - Pleiades Clustering
19 pages
ML 07 Clustering
No ratings yet
ML 07 Clustering
56 pages
UNIT 1 Machine Learning (KCS-055)
No ratings yet
UNIT 1 Machine Learning (KCS-055)
184 pages
Aiml Important Questions
No ratings yet
Aiml Important Questions
14 pages
Data Science Paper
No ratings yet
Data Science Paper
8 pages
10-1108 - Aa-10-2019-0173
No ratings yet
10-1108 - Aa-10-2019-0173
12 pages
Unit-5 - Part 2
No ratings yet
Unit-5 - Part 2
11 pages
Unsupervised Learning Pre-Learning
No ratings yet
Unsupervised Learning Pre-Learning
5 pages
AI UNIT - 5 Notes
No ratings yet
AI UNIT - 5 Notes
10 pages
FemaleLiver 02 NetworkConstr Blockwise
No ratings yet
FemaleLiver 02 NetworkConstr Blockwise
6 pages
3 Mahout Clustering
No ratings yet
3 Mahout Clustering
24 pages
AI & ML Basics for Business Students
No ratings yet
AI & ML Basics for Business Students
32 pages
Data Mining in Telecommunication Industr
No ratings yet
Data Mining in Telecommunication Industr
3 pages
Updated Lecture Zero Int234 1
No ratings yet
Updated Lecture Zero Int234 1
44 pages
Oldoni Et Al. - 2019 - Delineation of Management Zones in A Peach Orchard Using Multivariate and Geostatistical Analyses
No ratings yet
Oldoni Et Al. - 2019 - Delineation of Management Zones in A Peach Orchard Using Multivariate and Geostatistical Analyses
10 pages
Python Scripting & Libraries Overview
100% (1)
Python Scripting & Libraries Overview
15 pages
Top 200+ Data Mining Viva Questions and Answers For Interviews
No ratings yet
Top 200+ Data Mining Viva Questions and Answers For Interviews
24 pages
2 Data Pre-Processing
No ratings yet
2 Data Pre-Processing
50 pages
BCA 4th Sem: AI & PHP Course Overview
No ratings yet
BCA 4th Sem: AI & PHP Course Overview
15 pages
Data Mining 1
No ratings yet
Data Mining 1
7 pages
AI-Driven Bridge Design Optimization
No ratings yet
AI-Driven Bridge Design Optimization
9 pages
S06 - Joos Et Al - 2013
No ratings yet
S06 - Joos Et Al - 2013
11 pages
Udemy Test4
No ratings yet
Udemy Test4
41 pages
Machine Learning in Crash Simulations
100% (2)
Machine Learning in Crash Simulations
6 pages
An Introduction To Statistical Learning PDF
No ratings yet
An Introduction To Statistical Learning PDF
35 pages
2azure Machine Learning Final Test
No ratings yet
2azure Machine Learning Final Test
9 pages
ML Systems & Data Science Guide
No ratings yet
ML Systems & Data Science Guide
26 pages
AI CourseSyllabus UnderGraduate
No ratings yet
AI CourseSyllabus UnderGraduate
11 pages
Fyug English Sem 2
No ratings yet
Fyug English Sem 2
15 pages
Machine Learning for Pavement Monitoring
No ratings yet
Machine Learning for Pavement Monitoring
16 pages

AI - ML Lab Manual)

Uploaded by

AI - ML Lab Manual)

Uploaded by

ANJALAI AMMAL - MAHALINGAM ENGINEERING COLLEGE

Course Name : Artificial Intelligence and Machine Learning

The objectives of this course are to:

1. Implementation of Uninformed search algorithms (BFS, DFS)

Write python program to implement different AI search algorithms utilized in problem

List of First Cycle Exercises:

List of Second Cycle Exercises:

7. Build SVM models

Breadth First Search:

visited = set() # Set to keep track of visited nodes of graph.

Aim : To write a program in Python to implement A*Search and SMA* Search.

path found:['a', 'f', 'g', 'i', 'j']

Aim : To implement Naïve Bayes Classifier model in Python

Step 1: Calculate the prior probability for given class labels

def get_cp(self, attr, attr_type, class_value):

Calculated Conditional Probabilities:

{'no': {'Mild': 0.6, 'Normal': 0.4, 'Rainy': 0.8, 't': 0.6},

Aim : To implement Bayesian Networks using Python.

print('Sample instances from the dataset are given below')

print('\n Attributes and datatypes')

print('\n 1. Probability of HeartDisease given evidence= restecg')

print('\n 2. Probability of HeartDisease given evidence= cp ')

Calculated Conditional Probabilities:

{'no': {'Mild': 0.6, 'Normal': 0.4, 'Rainy': 0.8, 't': 0.8},

Aim : To implement a Regression Model using Python.

def calculateB(x, y, n):

from sklearn.datasets import fetch_california_housing

model = BayesianRidge() model.fit(X_train, y_train) prediction = model.predict(X_test)

r2 Score Of Test Set : 0.5904164911079081

Aim: To implement Decision tree classification in Python.

Decision Trees (DTs) are a non-parametric supervised learning method used

from matplotlib import pyplot as plt

Aim: To implement Random Forest Classifier in Python.

from sklearn.datasets import load_wine

fig = plt.figure(figsize=(15, 10))

# Load the dataset

# Split the data into training and testing sets

# Train an SVM model with a linear kernel

# Predict the test set labels

# Train an SVM model with a polynomial kernel

# Predict the test set labels

# Evaluate the model's accuracy

# Train an SVM model with an RBF kernel

# Predict the test set labels

# Evaluate the model's accuracy

from sklearn.datasets import make_classification

for k in range(1, maximum_K):

figure = plt.subplots(figsize = (12, 6))

The Expectation-Maximization (EM) algorithm is defined as the combination of various

plt.figure(figsize=(10,10)) # #creating the figure and assigning the

pt2 = multivariate_normal.pdf(d, mean=m2, cov=cov2)

plt.figure(figsize=(10,10)) # creating the figure and assigning the

from numpy import exp, array, random, dot

# The Sigmoid function, which describes an S shaped curve.

# The derivative of the Sigmoid function.

# We train the neural network through a process of trial and error.

# Calculate the error (The difference between the desired output

# Adjust the weights.

# The neural network thinks.

if name == " main ":

print ("Random starting synaptic weights: ")

# The training set. We have 4 examples, each consisting of 3 input values

# Train the neural network using a training set.

print ("New synaptic weights after training: ")

# Test the neural network with a new situation.

Aim : To build deep neural networks using Python programming.

What is Deep Learning?

You might also like

Aim : To write a program in Python to implement ASearch and SMA Search.