1. The document outlines a 12-step process for machine learning projects: setting objectives, obtaining data, splitting data, exploring the data, selecting tools, training models, tuning and debugging, validating models on new data, testing models, building production systems, launching models, and monitoring and maintaining systems.
2. Key steps include setting measurable objectives, exploring data to understand patterns, evaluating models on held-out validation and test data to avoid overfitting, gradually increasing complexity of models, and continuously monitoring models in production.
3. Successful machine learning requires cross-functional teams including decision makers, domain experts, engineers, analysts, and reliability engineers working through each step of the process.
What is Machine Learning? ● Machine Learning (ML) is an approach to making many decisions that involves algorithmically finding patterns in old data and using these patterns to create models (recipes) for dealing correctly with brand new situations. ● In a nutshell: ML is all about finding and using patterns in data to perform new tasks.
Jargon: ● Instances: examples, observations. (rows) ● Labels: targets, ground truth. (correct answers) ● Features: information about the instances, variables, attributes. (columns)
Check you need ML ● If you can’t even imagine what sort of decisions (labels) you’d like your ML system to make for you, stop. It might be too early for you to consider ML. ● Try imagining that you’d get 1 million free but distracted human workers who would all work on similar small tasks. What would you ask them to work on? What does good work look like? How would you know if they were slacking off? How would you choose between these million or those million free human workers? Imagine the work and imagine how you would assess its quality.
Step-by-Step Machine Learning
Step 1: Set Objective (Decision Maker, Domain Expert) ● Steps: ○ Write down outputs/labels. ○ Consider mistakes. ○ Decide how you would score one mistake vs. another. ○ Create a business performance metric (BPM) by stitching together the individual outcome scores for all of your outcomes. ○ Look up some classic loss functions used for these types of outputs (e.g. loss functions for binary outputs, for text outputs, for numerical outputs, etc.). ○ Compare the business performance metric with the loss function (a sketch of this comparison follows this step). ○ Set minimum performance criteria to productionize and to launch. ● Thinking carefully about what success means and picking a metric that captures business performance is important! This is the decision-maker's responsibility, and without it the ML process is doomed. ● Imagine all the labels are made by an imperfect human worker instead of an ML system. Just focus on the output. ● In ML, the proof of the pudding (model) is always in the eating (performance on new data). Always evaluate performance based on your business metric.
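To make the comparison in Step 1 concrete, here is a minimal sketch in Python, assuming scikit-learn and a made-up binary fraud-style task where the decision maker has decided that a missed case costs far more than a false alarm. The cost numbers, labels, and scores are all illustrative, not from the source.

```python
import numpy as np
from sklearn.metrics import log_loss

# Hypothetical outcome scores chosen by the decision maker: a missed
# case (false negative) is assumed to cost far more than a false alarm.
COST_FALSE_NEGATIVE = 50.0  # illustrative dollars lost per missed case
COST_FALSE_POSITIVE = 1.0   # illustrative dollars lost per false alarm

def business_performance_metric(y_true, y_pred):
    """Average dollar cost per decision: the BPM for this toy example."""
    fn = np.sum((y_true == 1) & (y_pred == 0))
    fp = np.sum((y_true == 0) & (y_pred == 1))
    return (fn * COST_FALSE_NEGATIVE + fp * COST_FALSE_POSITIVE) / len(y_true)

y_true = np.array([0, 0, 1, 1, 0, 1])                # ground-truth labels
y_prob = np.array([0.1, 0.4, 0.35, 0.8, 0.2, 0.9])   # model scores
y_pred = (y_prob >= 0.5).astype(int)                 # hard decisions

print("Classic loss function (log loss):", log_loss(y_true, y_prob))
print("Business performance metric ($/decision):",
      business_performance_metric(y_true, y_pred))
```

Printing the two numbers side by side makes the Step 1 question visible: the loss function is what the algorithm optimizes, the BPM is what the business actually cares about, and this is where you check how well the two line up.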
Step 2: Get Data (Engineer, Domain Expert) ● Your ML system is only as good as the data that went into it. ● Getting data involves lots of engineering effort. ● Just focus on getting IDs and a few inputs (“features” or “variables”) per ID. These won’t be the right ones, just a starting point for your analysts to work on in Step 4.
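As a tiny illustration of the “IDs plus a few inputs per ID” starting point, here is a hypothetical feature table using pandas; every column name and value here is invented for the example.

```python
import pandas as pd

# One row per instance ID, a few candidate features per ID. These won't
# be the right features yet; they're a starting point for Step 4.
raw = pd.DataFrame({
    "customer_id": [101, 102, 103],
    "account_age_days": [420, 35, 1900],
    "purchases_last_30d": [3, 0, 12],
})
print(raw.set_index("customer_id"))
```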
Step 3: Split Data (Engineer) ● Overfitting happens when you model noise instead of reality. ● If you don’t evaluate fit on totally fresh data, your models are no good to you. ● You need fresh data for checking performance! ● Split your data into: ○ Training dataset ○ Validation dataset ○ Test dataset ● Key Message: Your ML system is no good to you if it can’t deal with new data. It’s too easy to build a system that’s really good at old data but fails miserably on new data. Make sure you avoid this by evaluating performance on fresh data.
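A minimal splitting sketch, assuming scikit-learn; the 60/20/20 proportions are an illustrative choice, not a prescription from the source.

```python
import numpy as np
from sklearn.model_selection import train_test_split

data = np.arange(1000).reshape(-1, 1)  # stand-in for your instances

# Split off the test set first, then carve validation from the remainder.
train_val, test = train_test_split(data, test_size=0.2, random_state=42)
train, val = train_test_split(train_val, test_size=0.25,  # 0.25 * 0.8 = 0.2
                              random_state=42)

print(len(train), len(val), len(test))  # 600 200 200
```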
Step 4: Explore (Analyst, Domain Expert) ● Plotting data is your secret weapon for machine learning. ● You’re only allowed to look at your training data! ● Don’t look at your validation and test datasets. ● Key Message: Your data are your most valuable resource. If you don’t explore your training dataset, you’re missing out on taking full advantage of it.
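A short exploration sketch using pandas and matplotlib on synthetic stand-in data; the two feature names are invented. Note that it only ever touches the training split.

```python
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Explore the TRAINING split only; validation and test stay untouched.
rng = np.random.default_rng(0)
train = pd.DataFrame({
    "account_age_days": rng.integers(1, 2000, size=500),
    "purchases_last_30d": rng.poisson(3, size=500),
})

train.hist(figsize=(8, 3))   # quick distribution check per feature
plt.tight_layout()
plt.show()
print(train.describe())      # summary stats to spot oddities early
```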
Step 5: Get Tools (Engineer) ● The math is in service of: 1) Finding patterns (in old data). 2) Assessing models (on new data). ● Unless you’re designing brand new algorithms, you can get away with: ○ Sufficient computer skills to use ML tools others have built ○ Sufficient statistical skills to evaluate model performance ● Key Message: Start with a list of available tools and aggressively eliminate everything that obviously won’t work, then pick as many of the remaining tools as you can and try them in parallel (see the sketch after Step 6). Invest in the ability to run many algorithms in parallel. ● Don’t worry about picking the “right” algorithm. Worry about giving as many of them as possible a chance. The proof of the pudding is in the eating.
Step 6: Train (Engineer, Analyst) ● In training, your goal is to run lots of algorithms in parallel on your data and assess the models they produce on that same data. ● You’re making a shortlist of models that seem to work. ● Don’t worry about getting it right the first time - it’ll take a few tries. ● Start simple and only build up the complexity if the simple solution doesn’t work.
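Here is one way Steps 5 and 6 might look in code: a minimal scikit-learn sketch that runs a few candidate algorithms, simplest first, and scores each on the training data to form a shortlist. The candidate list and dataset are illustrative assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier

X_train, y_train = make_classification(n_samples=1000, random_state=0)

# Candidate algorithms, simplest first; the list itself is an assumption.
candidates = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "decision_tree": DecisionTreeClassifier(random_state=0),
    "random_forest": RandomForestClassifier(random_state=0),
}

shortlist = {}
for name, model in candidates.items():
    model.fit(X_train, y_train)
    # Step 6 assesses on the same data; fresh-data checks come in Steps 7-9.
    score = model.score(X_train, y_train)
    print(f"{name}: training accuracy = {score:.3f}")
    shortlist[name] = model
```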
Step 7: Tune and Debug (Engineer, Analyst) ● If you want the safest, most effective debugging strategy, then: ○ Run your algorithm (Step 6) on some data. ○ Debug its performance by using it to label different data. ○ Since you’re not allowed to debug using validation or test data, you’re going to need to allocate a separate dataset for tuning/debugging. You’ll have a 4th dataset in play for debugging. ● You can create the tuning/debugging dataset on the fly by allocating some of the training data for this. ● If you have hyperparameters (numerical settings you must choose before running the algorithm), use this data to tune them.
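A minimal tuning sketch, assuming scikit-learn: carve a tuning/debugging set out of the training data, then pick a hyperparameter (here tree depth, chosen purely for illustration) by its score on that set, never on validation or test data.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, random_state=0)

# Carve a 4th dataset for tuning/debugging out of the training data.
X_train, X_tune, y_train, y_tune = train_test_split(
    X, y, test_size=0.25, random_state=0)

# Tune a hyperparameter on the tuning set only.
best_depth, best_score = None, -1.0
for depth in [2, 4, 8, 16]:
    model = DecisionTreeClassifier(max_depth=depth, random_state=0)
    model.fit(X_train, y_train)
    score = model.score(X_tune, y_tune)
    if score > best_score:
        best_depth, best_score = depth, score

print(f"chosen max_depth={best_depth} (tuning accuracy {best_score:.3f})")
```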
Step 8: Validate (Analyst) ● Validation is all about checking whether your model succeeds on a new dataset. ● Validation protects you from blindly overfitting. It keeps you safe, so don’t skip it! ● Only view the final metric, not individual validation data points. Don’t debug in your validation data. ● Repeatedly validating erodes your protection. That’s why we have Step 9 (testing).
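A tiny sketch of the validation discipline, again assuming scikit-learn: compute the one aggregate metric on the validation split and stop there, without inspecting individual validation rows.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.2, random_state=0)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Report only the aggregate metric; inspecting individual validation
# rows would turn validation back into debugging.
print(f"validation accuracy = {model.score(X_val, y_val):.3f}")
```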
Step 9: Test (Decision Maker, Analyst) ● Testing is the final frontier before you take your model live. This is where statistical rigor enters the picture. ● This is what the discipline of statistical inference is all about. Ask your statisticians for help. ● You only get one shot at this per test dataset. ● If testing fails, the only way to start again is to collect a pristine new test dataset. ● Never test on data that was involved in any way in training/tuning/debugging/testing.
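One hedged illustration of statistical rigor at test time: a confidence interval on test accuracy, compared against the minimum launch criterion set in Step 1. This assumes statsmodels and invented numbers, including the 90% bar; your statisticians may well prefer a different test entirely.

```python
from statsmodels.stats.proportion import proportion_confint

# One shot per test set: the model got `correct` of `n` pristine test
# instances right. All numbers, including the 0.90 bar, are invented.
correct, n = 1830, 2000
low, high = proportion_confint(correct, n, alpha=0.05, method="wilson")
print(f"test accuracy {correct / n:.3f}, 95% CI [{low:.3f}, {high:.3f}]")

if low >= 0.90:
    print("meets the minimum criterion from Step 1")
else:
    print("fails: collect a pristine new test dataset before trying again")
```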
Step 10: Build (Engineer) ● Your algorithm got you a favorite model (a model is just a recipe for turning inputs into outputs). The engineering team’s job is to get this recipe into production. ● You can keep the model updated automatically by building retraining capabilities into production. ● Changing anything changes everything, so always test after a change. ● Don’t forget to think about: ○ Retraining data, speed, and frequency. ○ The ability to restrict retraining data inputs and detect fleeting changes. ○ Logging bugs and changes to logging. ○ Tracking and safety nets for outliers. ○ Plans for when retesting fails. ● Policy layers (logic that checks the model output) are a very good idea. Build them before launching in production and make them easy to add case-by-case corrections to.
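A minimal sketch of a policy layer wrapping model output; the thresholds, override table, and fallback behavior are all invented placeholders.

```python
# Case-by-case corrections, easy to extend after launch (illustrative).
MANUAL_OVERRIDES = {"customer_42": "approve"}

def policy_layer(instance_id, model_score):
    """Check the model's output before it reaches users."""
    if instance_id in MANUAL_OVERRIDES:
        return MANUAL_OVERRIDES[instance_id]
    if not 0.0 <= model_score <= 1.0:       # safety net for outliers
        return "fallback_to_old_system"
    return "approve" if model_score >= 0.5 else "decline"

print(policy_layer("customer_7", 0.83))    # approve
print(policy_layer("customer_42", 0.10))   # approve (manual override)
print(policy_layer("customer_9", 7.5))     # fallback_to_old_system
```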
Step 11: Launch (Analyst, Decision Maker) ● You need to make sure the ML system is good enough for your business needs. ● Do an experiment to measure its impact and check that launching it at 100% is the right decision. ● Components of an experiment: ○ Hypothesis. (Performance of ML system good enough? This criterion was decided in Step 1.) ○ Different treatments. (ML system vs not ML system.) ○ Randomization to treatments. (Live traffic sent at random to ML system or old system.)
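A toy illustration of the experiment’s moving parts: random assignment of live traffic to the two treatments, then a per-arm comparison of outcome scores. The outcome simulation is pure pretend data, standing in for real measurements against the Step 1 criterion.

```python
import random

random.seed(0)

def simulate_outcome(arm):
    # Purely illustrative: pretend the ML system scores slightly higher.
    return random.gauss(1.05 if arm == "ml_system" else 1.00, 0.1)

def handle_request(request_id):
    # Randomization: each live request goes to one treatment at random.
    arm = "ml_system" if random.random() < 0.5 else "old_system"
    return arm, simulate_outcome(arm)

results = {"ml_system": [], "old_system": []}
for rid in range(10_000):
    arm, score = handle_request(rid)
    results[arm].append(score)

for arm, scores in results.items():
    print(f"{arm}: mean outcome = {sum(scores) / len(scores):.3f}")
```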
Step 12: Monitor and Maintain (Reliability Engineer, Analyst) ● Invest in: ○ Monitoring plan ○ Maintenance plan (and ensure there’s headcount for carrying it out) ○ Tracking dashboards ○ Good documentation
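As one example of what a monitoring plan might automate, here is a toy drift check that alerts when the live prediction mean wanders from a launch-time baseline; the baseline and threshold are invented.

```python
import numpy as np

# Illustrative launch-time baseline and alerting tolerance.
BASELINE_MEAN, ALERT_THRESHOLD = 0.42, 0.05

def check_prediction_drift(recent_predictions):
    drift = abs(np.mean(recent_predictions) - BASELINE_MEAN)
    if drift > ALERT_THRESHOLD:
        print(f"ALERT: prediction mean drifted by {drift:.3f}; investigate")
    else:
        print(f"OK: drift {drift:.3f} within tolerance")

# Simulated batch of recent live predictions.
check_prediction_drift(np.random.default_rng(0).uniform(0.3, 0.6, size=1000))
```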