0% found this document useful (0 votes)

87 views17 pages

Machine Downtime Prediction Model

Q: Why is the deployment of ensemble methods such as Gradient Boosting and Random Forest highly recommended for machine downtime prediction in comparison to SVM or Bayesian Logistic Regression?

Ensemble methods like Gradient Boosting and Random Forest are recommended for deployment due to their superior performance metrics, particularly in areas critical to machine downtime like F1-Score and ROC AUC. These methods benefit from blending multiple decision trees, achieving high accuracy and robustness. They handle feature interactions more effectively, leading to better generalization on unseen data, which is essential for reliable downtime prediction. Conversely, SVM and Bayesian Logistic Regression lag in performance and are potentially less flexible in modeling complex data patterns .

Q: What insights can be drawn from the model’s F1-Score regarding its precision and recall in the context of machine downtime prediction?

The F1-Score, being the harmonic mean of precision and recall, provides a balance between identifying actual machine downtimes (recall) and minimizing false alarms (precision). A high F1-Score indicates that the model effectively captures most downtime events with few false positives, as seen with Gradient Boosting’s performance. This makes it suitable for scenarios where exact identification of downtime is critical, as it ensures a balance between not missing true events and not raising excessive false alerts .

Q: In the context of feature importance, how does Gradient Boosting differ from XGBoost with respect to the emphasized predictors in machine downtime prediction?

Gradient Boosting places greater emphasis on Hydraulic Pressure (Pa), with it being an even more dominant predictor compared to XGBoost. While XGBoost distributes importance more evenly across features such as Torque (Nm) and Cutting (N), Gradient Boosting relies more on a few strong predictors. Coolant and Vibration features contribute less in Gradient Boosting than in XGBoost, making XGBoost balance its dependence among more features .

Q: Why might an organization prefer using Gradient Boosting over Bayesian Logistic Regression for machine downtime prediction despite the performance similarities observed during cross-validation?

Gradient Boosting is preferred over Bayesian Logistic Regression due to its superior performance metrics, particularly in achieving higher F1-Scores. This demonstrates its ability to better balance precision and recall, which is crucial for accurate downtime prediction. Additionally, the ensemble nature of Gradient Boosting allows it to make more sophisticated decisions by leveraging interactions between features, thus enhancing generalization and predictive accuracy .

Q: How does the understanding of a feature's skewness and kurtosis influence the choice between normalization and standardization preprocessing techniques?

Understanding a feature's skewness and kurtosis informs the preprocessing method as these metrics indicate the asymmetry and peakness of data relative to a normal distribution. Features with high skewness and kurtosis can distort model performance. For such skewed data, normalization techniques like RobustScaler are preferred as they adjust for outliers, smoothing the distribution. Conversely, standardization is chosen for data with low skewness and kurtosis, bringing feature means to zero and standard deviations to one, thus preserving normality and stabilizing variance .

Q: What are the primary reasons for using Stratified K-Fold cross-validation in classification tasks?

Stratified K-Fold cross-validation is used because it ensures that each fold maintains the same proportion of classes as the entire dataset. This is crucial for achieving consistent class distribution across folds, which is important for obtaining reliable performance estimates. By preserving class balance, it prevents models from being biased due to uneven distribution, especially in cases without visible class imbalance .

Q: In what ways can feature selection impact model performance, particularly in the context of machine downtime production datasets?

Feature selection affects model performance by determining which variables contribute most significantly to predicting outcomes like machine downtime. Proper selection boosts model accuracy and efficiency by focusing on impactful variables such as Hydraulic Pressure and Torque, which were identified as strong predictors. It reduces overfitting, improves model interpretability, and decreases computational cost. Ignoring irrelevant or redundant features, like certain Machine ID encodings, enhances prediction power and clarity in the decision-making process .

Q: How does the interpretation of ROC AUC scores relate to the selection of machine learning models for predicting machine downtime?

ROC AUC measures the model's ability to distinguish between different classes. A value closer to 1 indicates superior discrimination. Models with high ROC AUC values, like XGBoost, Gradient Boosting, and Random Forest, demonstrate strong capability in correctly identifying machine downtimes versus no-downtimes, thereby making them preferred choices for deployment due to their reliability in predicting machine failures .

Q: What factors might influence a model’s resilience or sensitivity to hyperparameters during machine learning model training and tuning?

A model's resilience or sensitivity to hyperparameters is especially influenced by feature interactions, the distribution of input data, the complexity of the model architecture, and the degree of regularization applied. Models like Gradient Boosting and XGBoost, which are tree-based ensembles, are highly sensitive to hyperparameters like learning rate and maximum depth, as they control model complexity and overfitting tendencies. Hyperparameter stability can also be seen in how small changes affect model performance during cross-validation, as explored through Optuna trials which systematically evaluate their impact .

Q: How does the preprocessing of features differ between skewed and normally distributed columns when preparing data for machine learning models?

The preprocessing approach involves using different scaling methods based on distribution characteristics. For skewed columns, a RobustScaler is applied to normalize these features by accounting for outliers and ensuring stability in the model. Normal distributions are managed using a StandardScaler, which scales the data to have mean 0 and standard deviation 1, ideal for normally distributed features without outlier distortion. This differentiated approach reduces model bias and improves performance .

The document outlines the development of a machine learning model for predicting machine downtime using various algorithms, including XGBoost and Random Forest. It details data preprocessing steps, model training with cross-validation, and performance evaluation metrics such as precision, recall, and ROC AUC. The results indicate that ensemble methods consistently outperform single models, with XGBoost achieving the highest performance metrics.

Uploaded by

Yusuf Aliyu U

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

87 views17 pages

Machine Downtime Prediction Model

Uploaded by

Yusuf Aliyu U

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

Machine Learning Model Development

Import the necessary libraries

In [1]:  import pandas as pd

import numpy as np
from sklearn.model_selection import train_test_split, cross_val_score, Str
cross_val_predict
from [Link] import RobustScaler, StandardScaler, LabelEncod
from [Link] import ColumnTransformer
from [Link] import Pipeline
from [Link] import RandomForestClassifier, GradientBoostingClass
from sklearn.linear_model import LogisticRegression
from [Link] import SVC
from [Link] import DecisionTreeClassifier
import xgboost as xgb
from [Link] import accuracy_score, precision_score,\
recall_score, f1_score, roc_auc_score

import [Link] as plt

import seaborn as sns
from [Link] import plot_param_importances

import optuna
c:\Users\Administrator\anaconda3\envs\machineind\lib\site-packages\tqdm
\[Link]: TqdmWarning: IProgress not found. Please update jupyter and
ipywidgets. See [Link]
[Link] ([Link]
from .autonotebook import tqdm as notebook_tqdm

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 1/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

Some useful Functions

In [23]:  def get_feature_importance(model, model_name):

"""
Extracts and plots feature importance for a trained model.

Parameters:
- model: Trained Pipeline containing the classifier.
- model_name: Name of the model ('Gradient Boosting' or 'XGBoost').
"""
# Extract classifier from pipeline
classifier = model.named_steps['classifier']

# Get feature importance values

importance = classifier.feature_importances_

# Get transformed feature names from the preprocessor

preprocessor = model.named_steps['preprocessor']

try:
feature_names = preprocessor.get_feature_names_out()
except AttributeError:
feature_names = X_train.columns # Fallback if `get_feature_names_

# Ensure feature_names and importance lengths match

if len(importance) != len(feature_names):
print(f"Warning: Mismatch in feature importance length! ({len(impo
feature_names = [f"Feature {i}" for i in range(len(importance))]

# Sort feature importance values

sorted_idx = [Link](importance)[::-1]

# Plot feature importance

[Link](figsize=(10, 6))
[Link]([Link](feature_names)[sorted_idx], importance[sorted_idx])
[Link]("Feature Importance")
[Link]("Features")
[Link](f"{model_name} Feature Importance")
[Link]().invert_yaxis()
[Link]()

# Return feature importance as a dictionary

return dict(zip(feature_names, importance))

# Return feature importance as a dictionary
return dict(zip(feature names, importance))

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 2/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

In [3]:  def plot_param_importances_(study_model):

'''
plot the importance of the most important hyperparameter

study_model: optuna optimized and tuned model

model: str. The model of interest
'''
plotly_config = {"staticPlot": True}
fig = plot_param_importances(study_model)
[Link](config=plotly config)

In [4]:  # load the dataset

machine = pd.read_csv("../data/machine_downtime_cleaned.csv", parse_dates=

# make a copy of the data
machine_ori = [Link]()
# print the first few rows
[Link]()
Out[4]: Date Machine_ID Assembly_Line_No Coolant_Temperature Hydraulic_Oil_Temperature

2021- Makino-L2-
0 Shopfloor-L2 4.5 47.9
12-08 Unit1-2015

2021- Makino-L2-
1 Shopfloor-L2 21.7 47.5
12-17 Unit1-2015

2021- Makino-L1-
2 Shopfloor-L1 5.2 49.4
12-17 Unit1-2013

2021- Makino-L1-
3 Shopfloor-L1 24.4 48.1
12-17 Unit1-2013

2021- Makino-L2-
4 Shopfloor-L2 14.1 51.8
12-21 Unit1-2015

Preprocessing
we have to divide the numeric columns into those that are skewed and those that are normal in
order to be able to apply the necessary standardization or normalization to avoid bias

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 3/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

In [5]:  # create an empty list to store columns that are normally or

# skewly distributed
normal_cols = []
skewed_cols = []

# loop through the numerical features
for col in machine_ori.select_dtypes(include=[Link]):
skewness = machine_ori[col].skew()
kurtosis = machine_ori[col].kurt()

# set a threshold for kurtosis and skewness and then append the necess
if -0.2 <= skewness <= 0.3 and -0.2 <= kurtosis <= 0.2: # Adjust thre
normal_cols.append(col)
print(f"{col}: Skewness = {skewness:.2f}, Kurtosis = {kurtosis:.2f
else:
skewed_cols.append(col)
print(f"{col}: Skewness = {skewness:.2f}, Kurtosis = {kurtosis:.2f

Coolant_Temperature: Skewness = -0.22, Kurtosis = -1.35 (Not Normally Di
stributed)
Hydraulic_Oil_Temperature: Skewness = -0.00, Kurtosis = 0.05 (Approximat
ely Normal)
Spindle_Bearing_Temperature: Skewness = -0.03, Kurtosis = -0.05 (Approxi
mately Normal)
Spindle_Vibration: Skewness = 0.03, Kurtosis = -0.11 (Approximately Norm
al)
Tool_Vibration: Skewness = -0.06, Kurtosis = 0.01 (Approximately Normal)
Voltage(volts): Skewness = -0.03, Kurtosis = -0.09 (Approximately Norma
l)
Torque(Nm): Skewness = 0.03, Kurtosis = -0.46 (Not Normally Distributed)
Hydraulic_Pressure(Pa): Skewness = 0.21, Kurtosis = -0.98 (Not Normally
Distributed)
Coolant_Pressure(Pa): Skewness = -0.01, Kurtosis = -0.13 (Approximately
Normal)
Air_System_Pressure(Pa): Skewness = -0.05, Kurtosis = 0.01 (Approximatel
y Normal)
Cutting(N): Skewness = 0.12, Kurtosis = -1.09 (Not Normally Distributed)
Spindle_Speed(RPS): Skewness = 0.22, Kurtosis = -0.45 (Not Normally Dist
ributed)

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 4/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

Model Parameters Preparation

In [6]:  # Define target and features

X = machine_ori.drop(columns=["Downtime", "Date", "Assembly_Line_No"]) #

# define encoder
label_encode = LabelEncoder()
y = label_encode.fit_transform(machine_ori["Downtime"]) # Target variable

# Identify numerical and categorical columns
numerical_cols = X.select_dtypes(include=['float64', 'int64']).columns
category_col = X.select_dtypes(include=['object']).columns

# Define transformers
preprocessor = ColumnTransformer([
("robust", RobustScaler(), skewed_cols), # Skewed data
("standard", StandardScaler(), normal_cols), # Normal data
('one-hot-encoder', OneHotEncoder(), category_col) # Machine_ID column
])

# Train-test split
# Step 1: Split into Train (60%), Validation (20%), Test (20%)
X_train_val, X_test, y_train_val, y_test = train_test_split(X, y, test_siz
X_train, X_val, y_train, y_val = train_test_split(X_train_val, y_train_val

# Define models
models = {
"Bayesian Logistic Regression": LogisticRegression(solver="lbfgs"),
"Random Forest": RandomForestClassifier(n_estimators=100, random_state
"Gradient Boosting": GradientBoostingClassifier(n_estimators=100, rand
"Decision Tree": DecisionTreeClassifier(random_state=42),
"SVM": SVC(kernel="rbf", probability=True, random_state=42),
"XGBoost": [Link](eval_metric="auc", random_state = 42)
}

Train the model

Cross Validation

Since our problem is a classification task, Stratified K-Fold (StratifiedKFold) will be use for the
cross validation.

Why Use Stratified K-Fold?

Preserves Class Distribution: Stratified K-Fold ensures that each fold maintains the same
proportion of classes as the overall dataset, which is crucial when dealing with
classification problems, even if there is no visible class imbalance.
More Reliable Performance Estimates: It provides a more stable and representative
estimate of your model’s performance compared to ShuffleSplit, which may produce folds
with different class distributions.
localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 5/17
3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

Better Generalization: Ensures that all classes are well represented in training and
validation splits, reducing the risk of biased results.

Key Performance Metrics and Their Meaning

Precision: Measures how many of the predicted failures were actually failures. A high
precision means fewer false positives.
Recall: Measures how many of the actual failures were correctly identified. A high recall
means fewer false negatives.
F1-Score: Harmonic mean of precision and recall, balancing both. Higher is better.
ROC AUC: Measures the model’s ability to distinguish between classes. A value closer to
1 is better.

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 6/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

In [7]:  # craete an empty list to store model result

model_results = []

# Initialize Stratified K-Fold
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)

for name, model in [Link]():
precision_scores, recall_scores, f1_scores, roc_auc_scores = [], [], [

for train_index, val_index in [Link](X_train_val, y_train_val):
X_train_fold, X_val_fold = X_train_val.iloc[train_index], X_train_
y_train_fold, y_val_fold = y_train_val[train_index], y_train_val[v

# Create a pipeline
pipeline = Pipeline([
('preprocessor', preprocessor),
('classifier', model)
])

# Train the model
[Link](X_train_fold, y_train_fold)

# Make predictions
y_pred = [Link](X_val_fold)
y_prob = pipeline.predict_proba(X_val_fold)[:, 1] if hasattr(model

# Evaluate Metrics
precision_scores.append(precision_score(y_val_fold, y_pred))
recall_scores.append(recall_score(y_val_fold, y_pred))
f1_scores.append(f1_score(y_val_fold, y_pred))
roc_auc_scores.append(roc_auc_score(y_val_fold, y_prob) if y_prob

# Compute mean scores across folds
mean_precision = [Link](precision_scores)
mean_recall = [Link](recall_scores)
mean_f1 = [Link](f1_scores)
mean_roc_auc = [Link](roc_auc_scores)

# Append results
model_results.append({
"Model": name,
"Precision": round(mean_precision, 4),
"Recall": round(mean_recall, 4),
"F1-Score": round(mean_f1, 4),
"ROC AUC": round(mean_roc_auc, 4)
})

# Convert results to DataFrame
model results df = [Link](model results)

Model Performance and Best Result

Model Performance Interpretation

1. XGBoost (0.9993 ROC AUC, 0.9919 F1-Score)

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 7/17
3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

Remains a top performer with exceptional discrimination ability (ROC

AUC) and a near-perfect balance of precision and recall (F1-Score).
It's likely to generalize well to the test set.

2. Random Forest (0.9990 ROC AUC, 0.9858 F1-Score)

Also demonstrates excellent performance, very close to XGBoost.

If interpretability is crucial, it might be preferable.

3. Gradient Boosting (0.9991 ROC AUC, 0.9919 F1-Score)

Achieves top-tier performance, comparable to XGBoost, with a slight

edge in recall.

4. Decision Tree (0.9694 ROC AUC, 0.9692 F1-Score)

Shows good performance but falls short compared to the ensemble

methods (XGBoost, Random Forest, Gradient Boosting).

5. SVM (0.9439 ROC AUC, 0.8779 F1-Score)

Exhibits decent performance but is outperformed by the ensemble

models.

6. Bayesian Logistic Regression (0.9292 ROC AUC, 0.8625 F1-Score)

Shows moderate performance, lagging behind the other models.

Observations

Ensemble methods (XGBoost, Random Forest, Gradient Boosting)

consistently outperform the single models (Decision Tree, SVM, Bayesian
Logistic Regression).
XGBoost, Random Forest, and Gradient Boosting have shown remarkable
performance, with very high ROC AUC and F1-Scores.

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 8/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

In [8]:  model_results_df.head(10)

Out[8]: Model Precision Recall F1-Score ROC AUC

0 Bayesian Logistic Regression 0.8650 0.8607 0.8625 0.9292

1 Random Forest 0.9809 0.9908 0.9858 0.9990

2 Gradient Boosting 0.9889 0.9949 0.9919 0.9991

3 Decision Tree 0.9630 0.9756 0.9692 0.9694

4 SVM 0.8799 0.8760 0.8779 0.9439

5 XGBoost 0.9909 0.9929 0.9919 0.9993

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 9/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

Hyperparameter Tuning

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 10/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

In [15]:  # Cross-validation function

def cross_validate_model(model):
skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
f1_scores, precision_scores, recall_scores, roc_auc_scores = [], [], [

for train_idx, val_idx in [Link](X_train, y_train):

X_train_fold, X_val_fold = X_train.iloc[train_idx], X_train.iloc[v
y_train_fold, y_val_fold = y_train[train_idx], y_train[val_idx]

pipeline = Pipeline([
('preprocessor', preprocessor),
('classifier', model)
])

[Link](X_train_fold, y_train_fold)
y_pred = [Link](X_val_fold)
y_prob = pipeline.predict_proba(X_val_fold)[:, 1] if hasattr(model

f1_scores.append(f1_score(y_val_fold, y_pred))
precision_scores.append(precision_score(y_val_fold, y_pred))
recall_scores.append(recall_score(y_val_fold, y_pred))
roc_auc_scores.append(roc_auc_score(y_val_fold, y_prob))

return [Link]([[Link](f1_scores), [Link](precision_scores), [Link]

# Define Optuna objective functions for each model
def objective_xgb(trial):
params = {
'n_estimators': trial.suggest_int('n_estimators', 100, 500, step=5
'max_depth': trial.suggest_int('max_depth', 3, 12),
'learning_rate': trial.suggest_loguniform('learning_rate', 0.01, 0
'subsample': trial.suggest_float('subsample', 0.6, 1.0),
'colsample_bytree': trial.suggest_float('colsample_bytree', 0.6, 1
'gamma': trial.suggest_float('gamma', 0, 10),
'reg_alpha': trial.suggest_float('reg_alpha', 0, 10),
'reg_lambda': trial.suggest_float('reg_lambda', 0, 10),
'random_state': 42,
# 'use_label_encoder': False,
'eval_metric': 'auc'
}
return cross_validate_model([Link](**params))

def objective_gb(trial):
params = {
'n_estimators': trial.suggest_int('n_estimators', 100, 500, step=5
'learning_rate': trial.suggest_loguniform('learning_rate', 0.01, 0
'max_depth': trial.suggest_int('max_depth', 3, 12),
'subsample': trial.suggest_float('subsample', 0.6, 1.0),
'random_state': 42
}
return cross_validate_model(GradientBoostingClassifier(**params))

# Run Optuna for each model
study_xgb = optuna.create_study(direction='maximize')
localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 11/17
3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook
study_xgb.optimize(objective_xgb, n_trials=50, timeout=1800)

study_gb = optuna.create_study(direction='maximize')
study_gb.optimize(objective_gb, n_trials=50, timeout=1800)

# Train best models
best_gb = Pipeline([
('preprocessor', preprocessor),
('classifier', GradientBoostingClassifier(**study_gb.best_params, rand
])
best_gb.fit(X_train, y_train)

best_xgb = Pipeline([
('preprocessor', preprocessor),
('classifier', [Link](**study_xgb.best_params, random_state
])
best_xgb.fit(X_train, y_train)

...

Get the best parameters for each model

In [16]:  # print the best hyperparameters for the gradient boost

print("Gradient Boost Best params:")
for key, value in study_gb.best_params.items():
print(f"\t{key}: {value}")
Gradient Boost Best params:
n_estimators: 300
learning_rate: 0.18476368934488233
max_depth: 3
subsample: 0.8349830456457842

In [17]:  # print the best hyperparameters for the XG Boost

print("XGBoost Best params:")
for key, value in study_xgb.best_params.items():
print(f"\t{key}: {value}")
XGBoost Best params:
n_estimators: 150
max_depth: 8
learning_rate: 0.04664543050831571
subsample: 0.7942231875177621
colsample_bytree: 0.6581279765160521
gamma: 1.6864430842970046
reg_alpha: 0.016904277260539224
reg_lambda: 0.28709776773493223

Evaluate model on the Test set

Interpretation of Test Set Results

1. XGBoost (0.9991 ROC AUC, 0.9816 F1-Score)

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 12/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

Maintains excellent performance on the test set, with a very high ROC AUC and F1-Score.
This indicates strong generalization ability, meaning it's likely to perform well on new,
unseen data.

2. Gradient Boosting (0.9989 ROC AUC, 0.9857 F1-Score)

Also shows outstanding performance on the test set, comparable to XGBoost.

Achieves a slightly higher F1-Score than XGBoost, indicating a marginally better balance
of precision and recall.

3. Random Forest (0.9989 ROC AUC, 0.9837 F1-Score)

Performs very well on the test set, with a high ROC AUC and F1-Score.
While slightly behind XGBoost and Gradient Boosting, it's still a strong model.

Observations

All three models generalize well to the test set, confirming their strong performance observed
during training and validation. Gradient Boosting has a slight edge in F1-Score on the test set,
suggesting a better balance of precision and recall compared to XGBoost. The performance
differences between the models are relatively small, indicating that all three are good
candidates for deployment.

Recommendations

Model Selection:

Our primary focus in selecting a predictive model is maximizing accuracy in identifying

potential machine downtime. While computational efficiency and interpretability are valuable,
the ability to proactively prevent downtime is paramount.

In this regard, Gradient Boosting emerged as the top performer, achieving the highest F1-
score among the models evaluated. This signifies its superior balance between precision
(minimizing false alarms) and recall (capturing the majority of actual downtime events).

Therefore, we will be deploying Gradient Boosting as our predictive model to proactively

mitigate machine downtime and enhance operational efficiency.

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 13/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

In [18]:  # Evaluate on test set

def evaluate_model(model, name):
y_pred = [Link](X_test)
y_prob = model.predict_proba(X_test)[:, 1] if hasattr(model, 'predict_
return {
'Model': name,
'Precision': round(precision_score(y_test, y_pred), 4),
'Recall': round(recall_score(y_test, y_pred), 4),
'F1-Score': round(f1_score(y_test, y_pred), 4),
'ROC AUC': round(roc_auc_score(y_test, y_prob) , 4) if y_prob is n
}

results = [
evaluate_model(best_xgb, 'XGBoost'),
evaluate_model(best_gb, 'Gradient Boosting')
]

import pandas as pd
results_df = [Link](results).sort_values(by=[])
print(results df)
Model Precision Recall F1-Score ROC AUC
0 XGBoost 0.9798 0.9837 0.9817 0.9988
1 Gradient Boosting 0.9918 0.9837 0.9878 0.9991

Plot Feature Importance After evaluating on test set

1. Key Takeaways from the Plots

Top Features:

Both models strongly prioritize Hydraulic Pressure (Pa), Torque (Nm), and Cutting (N) as the
most influential factors. This suggests that variations in these parameters significantly impact
machine failures.

Coolant Pressure and Temperature:

Features related to coolant pressure and temperature also have noticeable

importance, indicating that overheating or coolant system inefficiencies might
lead to failures.

Spindle Speed and Vibration:

Spindle Speed (RPS), Tool Vibration, and Spindle Vibration appear as

moderately important features. This aligns with the mechanical behavior of
precision machining—irregular spindle movement or excessive vibration can
indicate wear and tear. Machine ID Encoding:

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 14/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

The one-hot encoded Machine_ID features have the lowest importance: This
suggests that machine-specific factors are not as crucial as operational
parameters (e.g., pressure, torque, cutting force).

2. XGBoost vs. Gradient Boosting Comparison

XGBoost:

Hydraulic Pressure (Pa) dominates with the highest importance (~0.35).

More balanced importance distribution across features.
Slightly higher weight for Torque (Nm) and Cutting (N) compared to other features.

Gradient Boosting:

Hydraulic Pressure (Pa) is even more dominant (~0.42).

Less variation in importance among the remaining features, meaning it relies more on a
few strong predictors.
Coolant Temperature and Vibration features contribute less compared to XGBoost.

3. Summary of the Analysis

Hydraulic Pressure (Pa), Torque (Nm), and Cutting (N) are the strongest
predictors of machine downtime. If these parameters exceed a threshold,
the likelihood of failure increases.
Coolant and spindle-related factors play a secondary role, suggesting that
temperature regulation and machine stability (vibration) contribute to
faults.

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 15/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

In [24]:  # Get feature importance for the optimized Gradient Boosting model
gb_feature_importance = get_feature_importance(best_gb, "Gradient Boosting

# Get feature importance for the optimized XGBoost model
xgb feature importance = get feature importance(best xgb, "XGBoost")

Plot Hyperparameter Importance

Visualize how much each hyperparameter contributes to model performance

In [19]:  # plot of Gradient boost most hyperparameter importance

plot_param_importances_(study_gb)

In [20]:  # plot XGBoost hyperparameter importance

plot param importances (study xgb)

Type Markdown and LaTeX: 𝛼2

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 16/17

3/3/25, 6:42 PM Machine_downtime_ML_model - Jupyter Notebook

localhost:8888/notebooks/Documents/Data Science Projects/Machine-Downtime-Prediction/notebook/Machine_downtime_ML_model.ipynb# 17/17

Common questions