Is Data Normalization Always Necessary Before Training ML Models?
5/25/23, 10:15 AM Yahoo Mail - Is Data Normalization Always Necessary Before Training ML Models?
For instance, in the image above, Income is measured on a much larger scale
than the other feature, so it could dominate the overall prediction. Normalizing
the data by scaling both features to the same range can mitigate this and
improve the model’s performance.
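
To make the effect concrete, here is a minimal sketch of min-max scaling in plain Python. The toy values and the Age feature are made up for illustration (the original image is not reproduced here), and Euclidean distance stands in for any scale-sensitive computation a model might perform.

```python
import math

def min_max_scale(values):
    """Rescale a list of numbers to the [0, 1] range."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant feature carries no information; map it to 0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

def euclidean(a, b):
    """Euclidean distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Hypothetical toy data: three people described by (Age, Income).
ages = [25, 50, 26]
incomes = [50_000, 51_000, 90_000]

raw = list(zip(ages, incomes))
scaled = list(zip(min_max_scale(ages), min_max_scale(incomes)))

# Distances from person 0 to the other two, before and after scaling.
raw_d01 = euclidean(raw[0], raw[1])        # large Age gap, small Income gap
raw_d02 = euclidean(raw[0], raw[2])        # small Age gap, large Income gap
scaled_d01 = euclidean(scaled[0], scaled[1])
scaled_d02 = euclidean(scaled[0], scaled[2])

print(f"raw:    d01={raw_d01:.1f}  d02={raw_d02:.1f}")
print(f"scaled: d01={scaled_d01:.4f}  d02={scaled_d02:.4f}")
```

On the raw values, the distance is driven almost entirely by Income, so a 25-year age gap barely registers. After scaling, both features contribute on comparable terms.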
The following visual depicts which algorithms typically need normalized data
and which don’t.
Consider a decision tree, for instance. It splits the data based on thresholds
determined solely by the feature values, regardless of their scale.
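
As a rough illustration, the sketch below finds the best single threshold on one feature using weighted Gini impurity, then repeats the search after rescaling the feature. The helper names and the toy Income values are assumptions made for this example, not any library's API. The threshold changes with the units, but the resulting partition of the rows does not.

```python
def gini(labels):
    """Gini impurity for a list of binary (0/1) labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n
    return 2 * p * (1 - p)

def best_split(xs, ys):
    """Return (threshold, left_mask) for the split with the lowest weighted Gini."""
    order = sorted(set(xs))
    best = None
    # Candidate thresholds are midpoints between consecutive distinct values.
    for a, b in zip(order, order[1:]):
        t = (a + b) / 2
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(ys)
        if best is None or score < best[0]:
            best = (score, t)
    _, t = best
    return t, [x <= t for x in xs]

# Hypothetical toy data: Income in dollars and a binary label.
incomes = [20_000, 35_000, 50_000, 80_000, 120_000]
labels = [0, 0, 0, 1, 1]

# The same feature expressed in thousands of dollars.
incomes_k = [x / 1000 for x in incomes]

t_raw, mask_raw = best_split(incomes, labels)
t_scaled, mask_scaled = best_split(incomes_k, labels)

print(t_raw, t_scaled)          # thresholds differ by the scaling factor
print(mask_raw == mask_scaled)  # the rows sent left/right are identical
```

Because only the ordering of values matters when choosing a split, any rescaling that preserves that ordering leaves the tree's partitions unchanged.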
[Figure: Decision tree]
Thus, it’s important to understand the nature of your data and the algorithm
you intend to use.
You may never need data normalization if the algorithm is insensitive to the
scale of the data.
Over to you: What other algorithms typically work well without normalizing
data? Let me know :)
I like to explore, experiment and write about data science concepts and tools.
You can read my articles on Medium. Also, you can connect with me on
LinkedIn and Twitter.