DDDM_Lecture3_ExperimentBasics_Dec11

The document discusses the fundamentals of experimental design in data-driven decision making, focusing on A/B testing and best practices for conducting rigorous experiments. It covers hypothesis development, randomization strategies, sample size determination, and statistical analysis methods, emphasizing the importance of proper execution to avoid misleading conclusions. Key statistical concepts such as hypothesis testing, Type I and II errors, and various test statistics are also outlined to ensure effective evaluation of experimental results.


Data Driven Decision Making in Business

Lecture 3 Experimental Design – Fundamentals

Shuyang Yang
Dec 11th
Agenda
I. Best Practices for Conducting Rigorous A/B Tests

II. Statistics in Experiments:

• Hypothesis testing
• Test Statistics
• MDE (Minimum Detectable Effect)



0. Introduction: AB Testing / RCT/ Experiments

The concept is simple:


1. Randomly split the sample into two (or more) groups
   • A (Control) vs. B (Treatment)
2. Collect data and calculate metrics of interest
   • Run statistical tests
3. Analyze and decide which group to go with



0. Introduction: AB Testing / RCT/ Experiments
Why experiments?
• Causal interpretation: randomized assignment provides the ground truth for evaluating the impact of a treatment/intervention/strategy/feature ---- Golden Rule
• Iterative improvement: small, incremental tests compound over time to create significant improvement ---- culture of experimentation
• Measuring real-world impact: evaluate changes directly with real users/customers in real-world conditions

"It doesn't matter how beautiful your theory is, it doesn't matter how smart you are. If it doesn't agree with experiment[s], it's wrong." -- Richard Feynman



I. Best Practices for Conducting Rigorous A/B Tests

• The advantages of A/B testing only hold if the test is conducted rigorously and adheres to best practices.

• A poorly designed or executed A/B test can lead to misleading conclusions, wasted resources, or even harmful changes to the business…

Common pitfalls:
• Insufficient sample size
• Incorrect randomization
• Poor monitoring during the test
• Ignoring statistical significance





I. Best Practices: Develop Hypothesis

Identify the Problem
• Identify specific phenomena or problems from historical data or experience
• Example: too many push ads disturb non-active customers and reduce retention

Form a Hypothesis
• Formalize the problem into a measurable/quantifiable hypothesis
• Subjects – Who: low-engagement customers
• Treatment – What: who receive 10+ push messages (compared to customers who receive fewer than 10 messages)
• Quantifiable outcome – How: have 10% lower 1-month retention


I. Best Practices: Develop Hypothesis
A good hypothesis should be:
• Plausible Mechanism: there should be a logical or theoretical explanation for how the cause leads to the effect.
• Measurable: clearly defines the variables involved and the expected outcome, and includes metrics or criteria that can be measured.





I. Best Practices: Set up Experiment
• Target Population
• Experiment Unit
• Randomization Strategy
• Define Success Metrics
• Traffic Requirement



I. Best Practices: Set up Experiment
1. Target Population:
The entire group of individuals, users, or entities that the test aims to represent and from which the experimental sample is drawn. It defines the scope of the A/B test and ensures that the results are generalizable to the group of interest.
• Inclusivity: represent all potential individuals affected by the change being tested

Full population vs. Specific subpopulation
• All dimensions: demographics/states | By dimension: female; age 30+; low-engaged
• W/o trigger behavior: all registered users | W/ trigger behavior: customers who visited a particular page
• No exclusion: all registered users | Exclusion: blocklist



I. Best Practices: Set up Experiment
Experiment Unit:
An experimental unit in an A/B test is the smallest entity or subject to which a treatment (or variation) is applied and for which an outcome is measured.

Granularity: at what level the randomization occurs (see the sketch below)
• Individual level
• Individual-session level
• Group/cluster level: e.g., geographic regions
• Time-based units

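In practice, assignment at a chosen granularity is often implemented by deterministically hashing the randomization unit's ID. This is a minimal sketch, not the lecture's code; the salt name and the 50/50 split are made-up assumptions.

```python
# Sketch: deterministic bucketing by hashing the randomization unit's ID.
# The salt ("exp3_push_ads") and 50/50 split are illustrative assumptions.
import hashlib

def assign(unit_id: str, salt: str = "exp3_push_ads", treatment_share: float = 0.5) -> str:
    """Map a unit (user, session, cluster, ...) to 'treatment' or 'control'."""
    bucket = int(hashlib.md5(f"{salt}:{unit_id}".encode()).hexdigest(), 16) % 10_000
    return "treatment" if bucket < treatment_share * 10_000 else "control"

print(assign("user_42"))             # user-level unit
print(assign("user_42:session_7"))   # individual-session-level unit
```

The same unit always lands in the same group, which keeps the experience consistent across visits; changing the salt re-randomizes for a new experiment.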




I. Best Practices: Set up Experiment
Randomization Strategy:
Randomization strategies determine how participants are assigned to experimental groups (control and variation). Proper randomization ensures that the groups are comparable, reducing bias and increasing the reliability of causal inferences.

• Simple randomization -- works with large sample sizes, which yield more homogeneous distributions
• Stratified randomization -- participants are grouped into strata (subgroups) based on specific characteristics (e.g., age, location), and randomization occurs within each subgroup (see the sketch below)
• Cluster randomization -- entire clusters are randomly assigned (Han's school example)
• Adaptive randomization -- a dynamic approach that adjusts allocation over time to improve efficiency
  • Multi-armed bandit algorithms

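As a concrete illustration of stratified randomization, here is a minimal sketch (not the lecture's code); the strata and field names are made-up assumptions.

```python
# Sketch: 50/50 assignment within each stratum (e.g., age bucket x city).
import random
from collections import defaultdict

def stratified_assign(units, stratum_of, seed=42):
    """Shuffle and split units 50/50 inside every stratum."""
    rng = random.Random(seed)
    by_stratum = defaultdict(list)
    for u in units:
        by_stratum[stratum_of(u)].append(u)

    assignment = {}
    for members in by_stratum.values():
        rng.shuffle(members)                  # randomize order within the stratum
        half = len(members) // 2
        for u in members[:half]:
            assignment[u["id"]] = "control"
        for u in members[half:]:
            assignment[u["id"]] = "treatment"
    return assignment

# Usage: stratify on a (decade-of-age, city) key
users = [{"id": i, "age": 20 + i % 40, "city": "HK" if i % 2 else "SZ"} for i in range(1_000)]
groups = stratified_assign(users, stratum_of=lambda u: (u["age"] // 10, u["city"]))
```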




I. Best Practices: Set up Experiment
Define Success Metrics:

Type of Metric | Definition | Bing Example
Primary Metrics | Directly driven by the test; the main outcome | Clickthrough rate
North Star Metric | The most important metric, measuring long-term success | Searches per user; ad income
Guardrail Metrics | Metrics that ensure the test doesn't negatively impact critical areas of the business | Loading time
Secondary Metrics | Indirect outcomes; process metrics to uncover mechanism | Mouse hover duration


I. Best Practices: Set up Experiment
Traffic Requirement – determine sample size
Required inputs:
• Baseline value: the current value of the metric you are measuring; e.g. 10% CTR
• (+) Minimum Detectable Effect (MDE): the smallest effect size you want to detect
  • Previous test results
  • Based on ROI, the impact has to be XX% to cover the cost
• (-) Significance level (𝛼) – Type I error (false positive)
• (+) Statistical power (1−𝛽) – the probability of detecting a true effect when it exists

Output: sample size N (a minimal calculation sketch follows)

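For intuition, here is a minimal sketch of one standard two-proportion sample-size approximation, assuming a two-sided test with equal allocation. Calculators such as the one linked on the next slide may use slightly different formulas, so outputs can differ a little.

```python
# Sketch: approximate N per group for a two-proportion test.
import math
from scipy.stats import norm

def sample_size_per_group(baseline, mde, alpha=0.05, power=0.80):
    """N per group needed to detect an absolute lift `mde` over `baseline`."""
    p1, p2 = baseline, baseline + mde
    z_alpha = norm.ppf(1 - alpha / 2)      # critical value for Type I error
    z_beta = norm.ppf(power)               # critical value for Type II error
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return math.ceil((z_alpha + z_beta) ** 2 * variance / mde ** 2)

# e.g. 10% baseline CTR, detect an absolute +1pp lift
print(sample_size_per_group(0.10, 0.01))   # about 14,750 users per group
```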


I. Best Practices: Set up Experiment
Traffic Requirement – determine sample size

https://www.evanmiller.org/ab-testing/sample-size.html
I. Best Practices: Set up Experiment
Traffic Requirement – Establish Test Duration

• Based on the required sample size (see the sketch below)
• Account for seasonal and temporal effects
  • Run for at least one full cycle (e.g., a full week)
  • Avoid major anomalies
• Account for metric calculation windows
  • e.g., a 7-day attrition rate needs at least 7 days of follow-up

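A toy sketch of turning the sample-size requirement into a duration; the daily-traffic figure is a made-up assumption and your numbers will differ.

```python
# Sketch: convert required sample size into a test duration, rounded to full weeks.
import math

required_per_group = 14_749        # from the sample-size step above
num_groups = 2
daily_eligible_users = 4_000       # assumption: replace with your actual traffic

days_needed = math.ceil(required_per_group * num_groups / daily_eligible_users)
duration_days = max(7, math.ceil(days_needed / 7) * 7)   # at least one full weekly cycle
print(duration_days)               # 8 days of traffic needed -> run for 14 days
```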


I. Best Practices: Run an Experiment
Monitor the process

• Traffic allocation (see the sample-ratio check below):
  • Ensure that participants are correctly allocated to control and variation groups
  • No significant discrepancies in sample size
• Early indicators of results:
  • Monitoring trends can help identify potential issues
• Data integrity audits:
  • Detect and resolve data issues before analysis

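One common way to monitor traffic allocation is a sample-ratio-mismatch (SRM) check: a chi-square goodness-of-fit test of observed group sizes against the intended split. A sketch with made-up counts and an assumed 50/50 design:

```python
# Sketch: sample-ratio-mismatch (SRM) check against an intended 50/50 split.
from scipy.stats import chisquare

control_n, treatment_n = 50_600, 49_400          # observed unit counts (made up)
total = control_n + treatment_n
stat, p_value = chisquare([control_n, treatment_n], f_exp=[total / 2, total / 2])

print(f"SRM check p-value: {p_value:.5f}")
if p_value < 0.001:                              # a strict threshold is typical for SRM
    print("Possible sample ratio mismatch - investigate before trusting results")
```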


I. Best Practices: Analyze and Decide
Analysis

• Analyze results for all metrics of interest
• Check statistical significance (perform statistical tests)
  • Consider statistical power; ensure sufficient power to detect the MDE
• Assess "real-world" significance
  • Is the impact big enough to drive the North Star metric?
• Examine segment-level results
  • Identify variations across different user segments ---- more on this later



I. Best Practices: Analyze and Decide
Make a Decision

• Roll out the treatment to all units
  • Monitor post-implementation metrics
  • Confirm the impact in the full population
• Maintain the status quo
  • Deep dive on causes
  • Explore alternative hypotheses
• Conduct a follow-up test
  • Refine the test design
  • Refine the hypothesis



II. Statistics in AB Testing
Hypothesis Testing – Key Definitions

Hypothesis testing is conducted around two key hypotheses:

• Null hypothesis (H₀)
  • Assumes no difference between the control and variation
  • Example: "The underlying design does not increase click-through rate (CTR)."
• Alternative hypothesis (H₁)
  • Assumes there is a difference
  • Example: "The underlying design increases click-through rate."



II. Statistics in AB Testing
Hypothesis Testing – Steps

1. Define H₀ and H₁
2. Determine a test statistic to use and choose a significance level (𝛼)
3. Calculate the test statistic based on sample data and obtain the p-value
   • P-value: the probability of obtaining a test statistic at least as extreme as the one observed, under H₀
4. Compare the p-value to 𝛼
   • Reject H₀ if p < 𝛼



II. Statistics in AB Testing
Hypothesis Testing – Type I and Type II Errors
• Type I error:
  • False positive (H₀ is rejected when it is true)
  • Denoted 𝛼
  • e.g. no change in CTR -> but conclude that there is a change
• Type II error:
  • False negative (fail to reject H₀ when H₁ is true), denoted 𝛽
  • e.g. CTR is increased -> but conclude that there is no impact
• Statistical power: 1 − 𝛽



II. Statistics in AB Testing
Hypothesis Testing – Type I and Type II Errors

• Statistical power: 1 − 𝛽
  • The test's ability to detect a true effect when it exists (the probability of avoiding a Type II error)
  • Commonly desired: 80%
  • Ensures a test is sensitive enough to detect a meaningful effect



II. Statistics in AB Testing
Hypothesis Testing – Balancing Type I and Type II

• Set an appropriate significance level (𝛼)
  • The lower the significance level (𝛼), the more likely a Type II error occurs
  • Common standard: set 𝛼 = 0.05, 𝛽 = 0.2
• Increase statistical power:
  • Reduce variability in the data – more precise estimates
  • Increase sample size
  • Maximize effect size
• Context matters!
  • In high-stakes situations, a Type I error is more critical (e.g., clinical trials)





II. Statistics in AB Testing
Test Statistics – T-test (most common)
• Difference-in-means test: compares the means of two groups
  • Used when the sample is small or the population standard deviation is unknown
• Assumptions:
  o Normality: the sample mean follows a normal distribution (CLT)
  o Independent observations: within and between groups
  o Equal variance: the variances of the two groups are equal
    o If not, use Welch's t-test
• Applications: comparing metrics that are averages (see the sketch below)
  o Average duration on site; average order value
• Limitations:
  o Sensitive to outliers

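A minimal sketch of the difference-in-means test with SciPy, using Welch's variant (equal_var=False) since equal variances are rarely guaranteed; the session durations are synthetic.

```python
# Sketch: Welch's t-test on made-up "average duration on site" data (seconds).
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
control = rng.normal(loc=180, scale=60, size=5_000)
treatment = rng.normal(loc=185, scale=60, size=5_000)

t_stat, p_value = stats.ttest_ind(treatment, control, equal_var=False)
print(f"t = {t_stat:.2f}, p = {p_value:.4f}")   # reject H0 if p < 0.05
```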


II. Statistics in AB Testing
Test Statistics – Proportion Test

• Difference-in-proportions test: compares proportions between groups (Bernoulli outcomes)
• Assumptions: a special case of the t-test
• Applications: comparing metrics that are binary outcomes (see the sketch below)
  o Clickthrough rate; conversion rate
• Limitations:
  o Sensitive to rare events (very low probabilities)

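A minimal sketch of the difference-in-proportions test using statsmodels; the click and user counts are made up.

```python
# Sketch: two-proportion z-test on made-up CTR data.
from statsmodels.stats.proportion import proportions_ztest

clicks = [1_150, 1_000]        # successes in treatment, control
users = [10_000, 10_000]       # users exposed in each group

z_stat, p_value = proportions_ztest(count=clicks, nobs=users)
print(f"z = {z_stat:.2f}, p = {p_value:.4f}")   # reject H0 if p < alpha
```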


II. Statistics in AB Testing
Test Statistics – Chi-square Test

• Tests the association or independence between categorical variables; handles multiple categories
• Assumptions: flexible (large-sample requirement)
• Applications: comparing metrics of binary/categorical outcomes (see the sketch below)
  o Test whether click-through or conversion rates differ between groups
  o Test the balance of traffic allocation (sample sizes of the control and treatment groups)
• Limitations:
  o Only tests for significance, not the size of the effect

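A minimal sketch using SciPy's chi-square test of independence on a 2x2 contingency table (clicked vs. not clicked, by group); the counts are made up.

```python
# Sketch: chi-square independence test on a 2x2 click table.
from scipy.stats import chi2_contingency

#              clicked   not clicked
table = [[1_150,  8_850],    # treatment
         [1_000,  9_000]]    # control

chi2, p_value, dof, expected = chi2_contingency(table)
print(f"chi2 = {chi2:.2f}, p = {p_value:.4f}")
# Note: this only tests significance; report the effect size (e.g. lift in CTR) separately.
```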


II. Statistics in AB Testing
Test Statistics – Test for Derived Metric

• Use case:
  • The analysis unit is different from the experiment unit
    • e.g. average clicks per content item
  • Observations at the analysis level are NOT independent – the naive variance is incorrect
• Ratio metrics measure the ratio of two metrics
  • e.g. page-level CTR = # clicks / # page visits
    = (# clicks / # users) / (# page visits / # users)
    = (user-level) / (user-level) ------ a ratio metric with i.i.d. (numerator, denominator) pairs across users
• Delta method: Taylor expansion (1st order)

$$\mathrm{CTR} = \frac{\sum_i X_i}{\sum_i Y_i} = \frac{\tfrac{1}{n}\sum_i X_i}{\tfrac{1}{n}\sum_i Y_i} = \frac{\bar{X}}{\bar{Y}} = f(\bar{X}, \bar{Y})$$



II. Statistics in AB Testing
Test Statistics – Test for Derived Metric

• Using delta-method, the variance becomes:
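For $f(\bar{X}, \bar{Y}) = \bar{X}/\bar{Y}$, the standard first-order delta-method approximation is

$$\mathrm{Var}\!\left(\frac{\bar{X}}{\bar{Y}}\right) \approx \frac{\mathrm{Var}(\bar{X})}{\mu_Y^2} - \frac{2\,\mu_X\,\mathrm{Cov}(\bar{X}, \bar{Y})}{\mu_Y^3} + \frac{\mu_X^2\,\mathrm{Var}(\bar{Y})}{\mu_Y^4}$$

with the means and (co)variances estimated from the experiment-unit-level (e.g., user-level) data.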

• More use cases:
  • Average messages sent in each group – randomized at the user level
  • Revenue per rider – randomized at the device level (shared bikes)
  • Session CTR – randomized at the user level (one user has multiple sequential sessions)

A sketch of the corresponding test follows.

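This is a sketch of a delta-method z-test for a user-level ratio metric such as page-level CTR; the per-user clicks and page views are synthetic, and the helper name is made up.

```python
# Sketch: delta-method variance and z-test for a ratio metric (sum clicks / sum views).
import numpy as np
from scipy.stats import norm

def ratio_mean_and_var(clicks, views):
    """Delta-method mean and variance of sum(clicks)/sum(views) across users."""
    n = len(clicks)
    mx, my = clicks.mean(), views.mean()
    vx, vy = clicks.var(ddof=1) / n, views.var(ddof=1) / n     # Var of the means
    cov = np.cov(clicks, views, ddof=1)[0, 1] / n              # Cov of the means
    ratio = mx / my
    var = vx / my**2 - 2 * mx * cov / my**3 + mx**2 * vy / my**4
    return ratio, var

rng = np.random.default_rng(1)
views_c = rng.poisson(20, 5_000); clicks_c = rng.binomial(views_c, 0.100)
views_t = rng.poisson(20, 5_000); clicks_t = rng.binomial(views_t, 0.105)

r_c, v_c = ratio_mean_and_var(clicks_c, views_c)
r_t, v_t = ratio_mean_and_var(clicks_t, views_t)
z = (r_t - r_c) / (v_t + v_c) ** 0.5
print(f"diff = {r_t - r_c:.4f}, z = {z:.2f}, p = {2 * norm.sf(abs(z)):.4f}")
```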


II. Statistics in AB Testing
MDE Revisit

• Minimum detectable effect: the smallest effect size that a statistical test can reliably detect given the sample size, significance level (α), and statistical power (1−β)
• Represents the smallest change in a metric (e.g., conversion rate or revenue) that you want to detect with a specified level of confidence -> determined prior to the test based on the business hypothesis
• Larger MDE -> smaller sample size required


Assuming n₁ = k · n₂, under H₁ the power can be calculated as:

$$1 - \beta = \Pr\left( \frac{\hat{p}_1 - \hat{p}_2}{\sqrt{\dfrac{\hat{p}_1(1-\hat{p}_1)}{n_1} + \dfrac{\hat{p}_2(1-\hat{p}_2)}{n_2}}} > Z_{1-\alpha/2} \;\middle|\; p_1, p_2 \right) = 0.8$$

(a numerical sketch follows)

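A small numerical sketch of this power calculation, plus a crude search for the MDE reachable at a given per-group sample size. The rates and sample sizes are assumed values, and the code uses the same normal approximation as the formula above.

```python
# Sketch: power of a two-proportion test and the MDE reachable at a given n.
from scipy.stats import norm

def power(p1, p2, n1, n2, alpha=0.05):
    """Pr(reject H0 | true rates p1, p2) under the normal approximation."""
    se = (p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2) ** 0.5
    z_crit = norm.ppf(1 - alpha / 2)
    return norm.sf(z_crit - (p2 - p1) / se)    # upper-tail probability

def mde_for_power(p1, n, target=0.80, alpha=0.05):
    """Smallest absolute lift over p1 detectable with `target` power at n per group."""
    lift = 0.0001
    while power(p1, p1 + lift, n, n, alpha) < target:
        lift += 0.0001
    return lift

print(power(0.10, 0.11, 14_749, 14_749))   # ~0.80 with ~14.7k users per group
print(mde_for_power(0.10, 5_000))          # larger MDE (~1.7pp) needed at a smaller n
```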




III. Reading:
Kohavi, Ron, Diane Tang, and Ya Xu. Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing. Cambridge University Press, 2020.

