MDP - Thesis - Assignment-Ankit Khatri

The document describes a proposed experiment to test whether human metacognitive learning follows reinforcement learning mechanisms. The experiment would use a Mouselab MDP paradigm where participants plan routes between nodes to maximize rewards. Data on cumulative rewards would be collected across 12 trials with varying reward structures to test for effects of scarcity and misalignment. The data would be analyzed by plotting reward trends to compare performance under different conditions and test predictions from reinforcement learning. A sample size would be determined using confidence intervals and would include participants of different ages and experience levels.

Uploaded by

ANKIT KHATRI

Assignment

Understanding human metacognitive learning at the Max Planck Institute for Intelligent Systems - Rationality Enhancement Group

Submitted by: Ankit Khatri

Task 1: We want to test whether human metacognitive learning follows a reinforcement learning
mechanism by testing the three predictions (scarcity, delay, misalignment). Please select one of them and
describe how you would test it by answering the following questions:

1. Which experimental design would you use and what would be the procedures and materials
of the experiment?

Ans. I have considered two situations for this assignment: scarcity, where many decisions go
unrewarded, and misalignment, where rewards are determined more by luck than by the quality of
the plan. Reinforcement learning struggles to learn in both scenarios. To test whether human
metacognitive learning is affected in the same way, I have designed the experiment described
below. I expect that, under scarcity and misalignment, the participant's rewards will grow less
markedly as they continue to play.

Experiment. People’s metacognitive behavior and learning patterns are difficult to observe
directly. To externalize the mental representations that people use for planning, I have used the
Mouselab-MDP paradigm, a process-tracing paradigm that makes people’s behavior highly diagnostic
of their planning strategies. In this paradigm, each participant plans a route from a starting
node to a final node. Each node on the path carries a gain or a loss, so the goal is to choose
the path that maximizes the overall score. The node values are initially hidden, but a
participant can reveal a value by clicking the node and paying a fixed price. The price
encourages participants to reveal information only as needed, rather than clicking every node at
once.
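
The click-and-move mechanics described above can be illustrated with a small Python sketch. It is not the code of the Mouselab-MDP library or of the hosted experiment; the class name `MouselabTrial`, the helper `make_trial`, and the reward range of -10 to 10 are assumptions made only for illustration.

```python
import random
from dataclasses import dataclass, field


@dataclass
class MouselabTrial:
    """Toy model of one trial: hidden node rewards that cost money to reveal."""
    rewards: dict       # node id -> hidden reward (hypothetical values)
    click_cost: float   # price charged for revealing one node
    revealed: set = field(default_factory=set)
    score: float = 0.0

    def click(self, node):
        """Reveal a node's reward; the cost is charged only on the first click."""
        if node not in self.revealed:
            self.revealed.add(node)
            self.score -= self.click_cost
        return self.rewards[node]

    def move(self, path):
        """Walk along a path and collect the reward of every node on it."""
        self.score += sum(self.rewards[node] for node in path)
        return self.score


def make_trial(nodes, click_cost, rng):
    """Draw a fresh random reward for every node (assumed range: -10..10)."""
    return MouselabTrial({n: rng.randint(-10, 10) for n in nodes}, click_cost)


# Usage: reveal two nodes at a cost of 1 each, then take the path a -> c.
trial = MouselabTrial({"a": 5, "b": -3, "c": 8}, click_cost=1.0)
trial.click("a")
trial.click("c")
print(trial.move(["a", "c"]))  # 11.0 (13 collected minus 2 paid for clicks)
```

In this sketch the score changes after every click and every move, which mirrors the trade-off the paradigm is meant to expose: each reveal improves the plan but lowers the final score.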

Materials. Using the Mouselab-MDP paradigm, I created an experiment with a total of 12 rounds for
each participant. In the first four rounds, the nodes contain random reward values; two of these
rounds have a low click cost and two have a high click cost. Each node, when clicked, displays an
emoticon mapped to a random reward value, which is re-drawn in each subsequent round so that the
participant cannot memorize it. In the next four rounds, clicking a node displays its reward
value directly, so each participant can plan their path accordingly; again, two rounds have a low
click cost and two have a high click cost. In the last four rounds, the maze contains certain
nodes with zero reward. These nodes are selected randomly in each round so that the participant
cannot memorize their locations. These four rounds test the scarcity prediction, that is, the
extent to which zero rewards (actions that go unrewarded) affect human metacognitive learning.

Procedure. After recruiting participants (their selection is discussed in the next section), each
participant completes all 12 rounds of the experiment. The score is displayed on the screen and
updated after each action (click or move).
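
The 12-round structure described under Materials can be written down as a small schedule. The sketch below is one possible encoding, not the configuration of the hosted experiment: the condition labels, the click costs of 1 and 5, the low-then-high ordering within each block, and the function names are all assumptions.

```python
import random

# Rounds 1-4, 5-8 and 9-12 respectively (labels are hypothetical).
BLOCKS = ["misalignment", "control", "scarcity"]
LOW_COST, HIGH_COST = 1, 5  # assumed click costs; the source gives no values


def build_schedule():
    """Return 12 round descriptions: 3 blocks x (2 low-cost + 2 high-cost)."""
    schedule = []
    for block_index, condition in enumerate(BLOCKS):
        costs = [LOW_COST, LOW_COST, HIGH_COST, HIGH_COST]  # order is assumed
        for i, cost in enumerate(costs):
            schedule.append({
                "round": block_index * 4 + i + 1,
                "condition": condition,
                "click_cost": cost,
            })
    return schedule


def scarcity_rewards(nodes, n_zero, rng):
    """Give every node a non-zero reward, then zero out n_zero random nodes."""
    non_zero = [v for v in range(-10, 11) if v != 0]  # assumed reward range
    rewards = {n: rng.choice(non_zero) for n in nodes}
    for n in rng.sample(list(nodes), n_zero):
        rewards[n] = 0
    return rewards


for entry in build_schedule():
    print(entry)
```

Because `scarcity_rewards` picks the zero-reward nodes afresh on every call, their locations differ from round to round, as the Materials paragraph requires.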

2. How would you determine the sample size?

Ans. The sample size is the number of observations drawn from a population for a survey or
experiment. It must be suited to the experiment at hand and depends on factors such as the nature
of the population (homogeneous or heterogeneous), the availability of resources, the sampling
method used, and the degree of accuracy required. There are several methods for determining a
suitable sample size:

● Arbitrary approach (5% or 10%)
● Conventional approach (sample size of similar studies)
● Cost benefit analysis
● Confidence interval approach (Cochran's formula & Slovin's formula)

Ideally, we will first calculate the sample size for an infinite population and then adjust it to
the size of the required population, using the confidence interval approach. The required
population may consist of participants from different age groups. I have divided the total
population into three age groups:

● The first group will contain individuals aged 14-18 with a high school percentage above 85%.
● The second group will contain individuals aged 21-35 who have scored 500 or above out of 900
on the AMCAT (All India level) examination or have cleared other exams such as JEE/CDS/SSC.
● The third group will contain participants over 50 years of age with at least 15 years of
experience in their respective fields.

The sample size can also be determined separately for each age group using the confidence
interval approach. The required population may also consist of participants who have completed
100+ Human Intelligence Tasks (HITs), have a HIT approval rate of at least 90%, and are located
in the United States, as in other similar studies [1].
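
The confidence interval approach mentioned above can be made concrete with the standard forms of Cochran's and Slovin's formulas. The sketch below is illustrative only: the inputs (95% confidence, z = 1.96; maximum variability, p = 0.5; a 5% margin of error; a population of 1000) are assumed example values, not figures from this study, and Cochran's formula is designed for estimating a proportion, so it gives only a rough guide for an experiment of this kind.

```python
import math


def cochran_n0(z, p, e):
    """Cochran's sample size for an infinite population: z^2 * p * (1 - p) / e^2."""
    return z ** 2 * p * (1 - p) / e ** 2


def finite_population_correction(n0, population):
    """Adjust an infinite-population sample size to a finite population."""
    return n0 / (1 + (n0 - 1) / population)


def slovin(population, e):
    """Slovin's formula: N / (1 + N * e^2)."""
    return population / (1 + population * e ** 2)


# Assumed example inputs (not taken from the study).
n0 = cochran_n0(z=1.96, p=0.5, e=0.05)
print(math.ceil(n0))                                      # 385
print(math.ceil(finite_population_correction(n0, 1000)))  # 278
print(math.ceil(slovin(1000, 0.05)))                      # 286
```

The first call corresponds to the infinite-population step and the second to the adjustment to the required population; the same two steps could be repeated for each age group with that group's population size.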

3. What data would you collect?

Ans. The data collected would be the average cumulative reward for each trial and for each age
group.

4. How would you analyze the collected data?

Ans. As discussed above, the data collected will be the average cumulative reward for each trial
and each age group, used to test the hypothesis that human metacognitive learning follows a
reinforcement learning mechanism.
Trials 1-4 test the misalignment prediction and trials 9-12 test the scarcity prediction. The
reward values collected from these trials would be plotted on a line graph for each age group,
and the trends in these two conditions would be compared with the trend in trials 5-8.
Concretely, we predict that in the trials with misalignment (random rewards) and scarcity
(unrewarded decisions), the average cumulative reward should grow less markedly than in trials
5-8, where the participant can plan better because the rewards of future states are clearly
visible. Trials 5-8 should therefore show a more pronounced upward trend than the other trials.
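
One simple way to put a number on the trends compared above is the least-squares slope of the mean reward within each block of four trials. This is an illustrative choice rather than a method stated in the proposal, and a slope over four points is only a coarse summary; the function names and the toy data below are hypothetical.

```python
from statistics import mean


def mean_reward_per_trial(data):
    """data: one list per participant, holding one cumulative reward per trial."""
    return [mean(trial_scores) for trial_scores in zip(*data)]


def slope(ys):
    """Least-squares slope of ys against trial positions 0, 1, 2, ..."""
    xs = range(len(ys))
    x_bar, y_bar = mean(xs), mean(ys)
    numerator = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    denominator = sum((x - x_bar) ** 2 for x in xs)
    return numerator / denominator


def block_slopes(trial_means):
    """Slope per block: trials 1-4, 5-8 and 9-12 (0-based slices)."""
    blocks = {"misalignment": (0, 4), "control": (4, 8), "scarcity": (8, 12)}
    return {name: slope(trial_means[a:b]) for name, (a, b) in blocks.items()}


# Made-up rewards for two participants over 12 trials (not real data).
toy_data = [
    [0, 2, 0, 2, 1, 3, 5, 7, 4, 2, 4, 2],
    [2, 0, 2, 0, 3, 5, 7, 9, 2, 4, 2, 4],
]
print(block_slopes(mean_reward_per_trial(toy_data)))
```

Under the prediction, the slope for trials 5-8 would be larger than the slopes for the misalignment and scarcity blocks; the same computation could be run separately for each age group.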

Task 2

Hosted Web App : https://ankitknitj.github.io/mcl_experiment/

GitHub Repository : https://github.com/ankitknitj/Masters_Thesis_MDPAssignment

References

1. He, R., Lieder, F., & Jain, Y. (2021, July). Measuring and modeling how people learn how to
plan and how people adapt their planning strategies to the structure of the environment.
2. He, R., Jain, Y. R., & Lieder, F. (2022). Have I done enough planning or should I plan more?
arXiv preprint arXiv:2201.00764.
3. jsPsych: https://www.jspsych.org
4. Mouselab MDP: https://github.com/fredcallaway/Mouselab-MDP
