vertopal.com_experiment11

The document outlines a feature selection workflow on the scikit-learn breast cancer dataset, applying four methods: univariate selection with SelectKBest (ANOVA F-test), Recursive Feature Elimination (RFE) with logistic regression, Random Forest feature importances, and L1-regularized (Lasso-style) logistic regression. Each method prints the set of features it selects, and a bar plot visualizes the top 10 Random Forest importances.


import numpy as np
import pandas as pd
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.feature_selection import SelectKBest, f_classif, RFE
from sklearn.ensemble import RandomForestClassifier
import matplotlib.pyplot as plt
import seaborn as sns

# Load dataset
data = load_breast_cancer()
X = pd.DataFrame(data.data, columns=data.feature_names)
y = data.target

# Standardize the features
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

# Train-test split
X_train, X_test, y_train, y_test = train_test_split(X_scaled, y,
test_size=0.3, random_state=42)

# ------------------------------
# 1. Univariate Feature Selection (SelectKBest)
# ------------------------------
select_k = SelectKBest(score_func=f_classif, k=10)
select_k.fit(X_train, y_train)
selected_features_kbest = X.columns[select_k.get_support()]

print("\n📌 Top 10 Features (SelectKBest):")


print(selected_features_kbest)
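
# Illustrative addition (not in the original run): SelectKBest exposes the
# ANOVA F-scores via scores_ after fitting, so the selection can be
# inspected as a ranked list rather than just a boolean mask.
kbest_scores = pd.Series(select_k.scores_, index=X.columns)
print(kbest_scores.sort_values(ascending=False).head(10))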

# ------------------------------
# 2. Recursive Feature Elimination (RFE)
# ------------------------------
model = LogisticRegression(max_iter=10000)
rfe = RFE(estimator=model, n_features_to_select=10)
rfe.fit(X_train, y_train)
selected_features_rfe = X.columns[rfe.support_]

print("\n📌 Top 10 Features (RFE):")


print(selected_features_rfe)
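
# Illustrative addition (not in the original run): RFE's ranking_ array
# records the elimination order (1 = kept; higher ranks were dropped
# earlier), which shows how close each feature came to being selected.
rfe_ranking = pd.Series(rfe.ranking_, index=X.columns)
print(rfe_ranking.sort_values().head(10))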

# ------------------------------
# 3. Feature Importances from Random Forest
# ------------------------------
rf = RandomForestClassifier(random_state=42)
rf.fit(X_train, y_train)
importances = rf.feature_importances_
indices = np.argsort(importances)[::-1]
top_10_rf = X.columns[indices[:10]]

print("\n📌 Top 10 Features (Random Forest Importance):")


print(top_10_rf)

# Plot top 10 feature importances (hue mirrors y with legend=False to
# avoid seaborn's deprecated palette-without-hue usage)
plt.figure(figsize=(8, 5))
sns.barplot(x=importances[indices[:10]], y=X.columns[indices[:10]],
            hue=X.columns[indices[:10]], palette="viridis", legend=False)
plt.title("Top 10 Feature Importances (Random Forest)")
plt.xlabel("Importance")
plt.ylabel("Feature")
plt.tight_layout()
plt.show()
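
# Illustrative addition (not in the original run): the spread of each
# feature's importance across the individual trees (rf.estimators_) gives
# a rough sense of how stable the ranking is.
importance_std = np.std(
    [tree.feature_importances_ for tree in rf.estimators_], axis=0)
print(pd.Series(importance_std, index=X.columns)[top_10_rf])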

# ------------------------------
# 4. Lasso (L1-based) Feature Selection
# ------------------------------
lasso = LogisticRegression(penalty='l1', solver='liblinear', C=0.1,
max_iter=10000)
lasso.fit(X_train, y_train)
selected_features_lasso = X.columns[lasso.coef_[0] != 0]

print("\n📌 Features selected by Lasso (L1 regularization):")


print(selected_features_lasso)
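
# Illustrative addition (not in the original run): intersect the four
# selections computed above to see which features every method agrees on.
common = (set(selected_features_kbest) & set(selected_features_rfe)
          & set(top_10_rf) & set(selected_features_lasso))
print("\nFeatures common to all four methods:", sorted(common))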

📌 Top 10 Features (SelectKBest):

Index(['mean radius', 'mean perimeter', 'mean area', 'mean concavity',
       'mean concave points', 'worst radius', 'worst perimeter', 'worst area',
       'worst concavity', 'worst concave points'],
      dtype='object')

📌 Top 10 Features (RFE):

Index(['mean concave points', 'radius error', 'area error',
       'fractal dimension error', 'worst radius', 'worst texture',
       'worst perimeter', 'worst area', 'worst concavity',
       'worst concave points'],
      dtype='object')

📌 Top 10 Features (Random Forest Importance):

Index(['mean concave points', 'worst concave points', 'worst area',
       'mean concavity', 'worst radius', 'worst perimeter', 'mean perimeter',
       'mean area', 'worst concavity', 'mean radius'],
      dtype='object')
[Bar plot: Top 10 Feature Importances (Random Forest)]

📌 Features selected by Lasso (L1 regularization):

Index(['mean concave points', 'radius error', 'worst radius', 'worst texture',
       'worst smoothness', 'worst concavity', 'worst concave points',
       'worst symmetry'],
      dtype='object')
