#Wang Symbolic Model
#Wang Symbolic Model
www.elsevier.com/locate/cogsys
School of Health Information Sciences, University of Texas Health Science Center at Houston,
7000 Fannin, Suite 600 Houston, TX 77030, USA
Sackler Institute for Developmental Psychobiology, Weill Medical College of Cornell University,
1300 York Ave., Box 140, New York, NY 10021, USA
Received 15 August 2003; accepted 8 December 2003
Abstract
An increasing body of evidence has shown that attention is a multi-type and multilevel cognitive faculty. The
dominant computational modeling approaches to attention have often focused on one specic type of attention at one
specic level. In particular, various connectionist modeling techniques at the subsymbolic level have been widely
adopted. In this paper, we report a symbolic computational model of the Attentional Network Test, which simultaneously involves dierent types of attention (alerting, orienting, and executive control), each subserved by distinctive
attentional networks in the brain. The model was developed in ACT-R, a rule-based cognitive architecture. The results
show that the model, by sequentially ring rules at a rate of about one every 40 ms, was able to capture the eect of each
attentional network. The model implies that while the attentional networks can be distinguished at both neuroanatomical and behavioral levels, dierent attentional networks may adopt similar computational operations at least at a
symbolic rule level.
2004 Elsevier B.V. All rights reserved.
1. Introduction
Although everybody knows what attention is
(James, 1890), the nature of attention remains
elusive after more than one hundred years of active
research and hundreds of publications in this
subject. This is hardly surprising given the com-
1389-0417/$ - see front matter 2004 Elsevier B.V. All rights reserved.
doi:10.1016/j.cogsys.2004.01.001
120
121
122
3. ACT-R
ACT-R is a production rule based cognitive
modeling architecture developed by John Anderson and colleagues over a period of nearly two
decades (Anderson, 1983, 1990, 1993; Anderson &
Bower, 1973; Anderson & Lebiere, 1998). In essence, ACT-R explains human cognition by proposing a model of the knowledge structures and
knowledge deployment that underlie cognition.
Although ACT-R consists of a nontrivial subsymbolic component for computations involving
activation and association, it relies extensively on
various symbolic structures for knowledge representation. For example, ACT-R makes a fundamental distinction between declarative and
procedural knowledge (see also Schacter & Tulving, 1994). Declarative knowledge corresponds to
things people are aware of and can usually describe to others. Procedural knowledge is knowledge that people display in behavior but are not
conscious of. Declarative knowledge in ACT-R is
represented in terms of chunks. Procedural
knowledge is represented in terms of production
rules, which are condition-action pairs. Both
chunks and production rules are fundamental
symbolic structures in ACT-R and are regarded as
the atomic components of thought in the sense that
they are as far down as one can go in the symbolic
decomposition of cognition. In ACT-R, on average every 50 ms, one production rule is chosen to
re, a few declarative chunks are processed, and
cognition advances one step. Therefore, it is
claimed that ACT-R captures the symbolic grain
size of cognition.
Canonical ACT-R is mainly a theory of higherlevel cognition, excelling in modeling phenomena
in human memory, learning, and problem solving.
One recent advance in ACT-R is the release of
ACT-R 5.0, which intends to close the gap between ACT-R and the external environment by
augmenting ACT-R with a set of perceptual-motor
modules (Byrne & Anderson, 1998). To implement
the visual system, ACT-R 5.0 implements some
rudimentary attention functions. For example,
there is a move-attention command that higher
cognition can send to the visual system. This
command allows the visual system to shift atten-
123
4. ANT on ACT-R
4.1. The model
We developed a computational model for the
ANT task in the framework of ACT-R 5.0. Our
purpose is two-fold. First, we want to explore how
alerting and executive control of attention can be
modeled symbolically in ACT-R 5.0 and how the
three types of attention work together to produce
the cognitive performance. In particular, we hope
to show how the eects of dierent attentional
networks can be explained by just ring dierent
sets of production rules but without resorting to
various subsymbolic mechanisms. Second, such a
model oers a possibility for us to cross-validate
those models based on various connectionist
modeling results and neuroimaging data, and by
doing so we hope that we can probe the possible
connections about the function of attention among
dierent levels of description.
We started by analyzing the major functional
components in the ANT task. Based on the design
presented in Fig. 1, we distinguished 6 stages in
performing a generic ANT trial (see Fig. 2):
1. Fixation and cue expectation. In this stage, the
participant xates at the xation point and prepares for something to occur. Note that at this
time the expectation of the participant was uncertain either a cue (in the cued conditions)
or a stimulus row (in the no-cue condition)
could appear. As a result, we hypothesize that
the participant has to engage in an uncertain
124
Fig. 2. A functional decomposition of the ANT task. The number in parentheses in each stage represents the number of production
rules that are used to implement these functions.
4. Stimulus expectation. In this stage the participant is expecting a stimulus to appear, with attention remaining at the cued location(s). Note
that in this stage the participant has engaged in
a preparatory state that is particularly suitable
for stimulus processing.
5. Stimulus processing. In this stage the stimulus
has to be processed and the direction of the target arrow identied. Specically, in the spatialcue condition, since attention has already been
allocated to the location where the stimulus appears, presumably a more accurate and rapid
stimulus processing can occur. In the centercue condition, an additional attentional movement needs to be initiated to shift attention to
the stimulus location. Depending upon the result from stage 3, similar operations need to
be performed in the double-cue condition.
Again, we leave the details of this condition to
the model discussion section.
6. Response. In this nal stage, the participant
responds by initiating a key pressing process
based on the result from the previous stage.
We implemented our model by mapping these
functional components into ACT-R production
rules. We used 36 rules to fully implement these
functions and cover all the conditions. The distribution of the rules in each stage is also shown
in Fig. 2. The name and a brief description of
each rule are provided in Appendix A.
With these rules our model could perform the
ANT task and interact with the same experimental environment that human participants
use. In a typical simulation trial, visual objects,
including the xation, the cue(s), and the stimulus, are presented in a computer window in a
timed sequence according to the design as shown
in Fig. 1. The model can see what is currently
in the window through its visual system. At any
time point, based on the goal and what the
model currently sees, one of the best matching
rules is selected to re, which may lead to a
chain of more rule rings. In the current model
each rule ring is set to cost 40 ms instead of
the default 50 ms (see the model discussion
section for more details). The trial ends when
the model makes a response by pressing a
response key. The time from the stimulus pre-
125
126
Table 1
Means RT and error rates under each condition
Cue type
Congruency
No-cue
Center-cue
Double-cue
Spatial-cue
(a) Mean RTs Standard deviations in ms from the experiment and (the model simulation)
Neutral
529 47 (527 3)
483 46 (487 3)
472 44 (467 5)
Congruent
530 49 (526 4)
490 48 (486 3)
479 45 (466 6)
Incongruent
605 59 (621 14)
585 57 (580 14)
574 57 (562 15)
442 39 (441 4)
446 41 (441 4)
515 58 (522 16)
(b) Error rates in % from the experiment and (the model simulation)
Neutral
1.17 (0.96)
0.93 (0.92)
Congruent
0.73 (0.75)
0.54 (1.00)
Incongruent
3.49 (3.25)
4.88 (3.79)
0.78 (0.79)
0.44 (0.83)
3.51 (2.67)
1.56 (0.71)
0.59 (0.79)
4.27 (3.50)
Alerting
Orienting
Fig. 3. Mean RTs in various conditions from (a) the experimental study and (b) our model simulation.
Orienting
Experiment
Simulation
Table 2
Eects of attentional networks
Executive control
Experiment
Simulation
0.05
)0.07
)0.12
)0.14
Mean RT
Experiment
Simulation
0.09
)0.14
0.29
)0.15
Attentional networks
Eects (ms)
(mean SD)
Alerting
Orienting
Executive
control
Experiment
Simulation
47 18
55 7.4
51 21
45 7.0
84 25
86 7.4
Executive
control
0.08
0.04
0.44b
0.80b
127
128
Note that the cost of the production rule (40 ms) does not
match the simulated alerting eect (55 ms, see Table 2). The
reason is that the simulated eect is an average number that
incorporates multiple sources although the cost of the production rules accounts for the major part of the eect. Similar
mechanisms apply to the orienting and executive control eects
discussed next.
for the anker eect. Second, there are three production rules that specically distinguish between
the incongruent and congruent condition in the
cases when a distracter arrow, but not the center
target, is focused on. When this situation arises, we
hypothesize that instead of performing an automatic but expensive (costly) re-focusing, a simpler
congruency detection is conducted. Specically, if a
congruent condition is detected, the system moves
on to respond since a re-focusing is not necessary.
This is implemented by the rule named goaheadresponding-if-congruent (rule 28 in Appendix A). If
an incongruent condition is detected, the system
can either move on to do a direct re-encoding
(harvest-target-directly-if-incongruent, rule 30 in
Appendix A, equivalent to a simple direction reversal) or perform a re-focusing again (refocusagain-if-incongruent, rule 29 in Appendix A).
Compared to the congruent condition, the rst
strategy costs an additional 85 ms (a direct moveattention) and the second strategy costs an
additional 125 ms (one additional rule ring for
refocusing rst and then a move-attention). With
the second strategy a little more likely to be selected
than the rst one (the odds are set to be about 2:1),
this leads to an additional cost of about 111 ms on
average for the incongruent conditions. It is this
cost, accrued quite often since there are four times
more ankers than the target, that is mainly responsible for the executive control eect.
One interesting empirical result from Fan et al.
(2002)s study is the small but reliable dierence
(11 ms) between the center and double-cue conditions. This can be conveniently explained by a
diused attention assumption. More specically,
the center-cue induces the participant to focus attention on the xation location while the doublecue makes the participant diuse attention at both
the top and bottom locations. As a result, compared to the center-cue condition where the stimulus location receives no priming, the stimulus
location in the double-cue condition is primed a
small amount, which may speed up later stimulus
processing. This explanation, however, provides a
challenge to a symbolic model of attention. How
can attention be diused symbolically when we
only have in hand a move-attention command,
which presumably shifts the focus of attention to a
129
5. General discussion
Though attention is a complex multilevel cognitive function, the dominant approaches to computational modeling of attention have emphasized
its close association with lower level processes such
as perception and thus often adopted some connectionist and neural modeling techniques. In addition, typically only one type of attention is
focused on. In this article, we reported a symbolic
computational model of the ANT task, which incorporates the work of multiple attentional networks including alerting, orienting, and executive
control. The model was developed in ACT-R.
Using basic symbolic knowledge modeling structures such as rules and strategies, all of which often
have clear and straightforward psychological
130
Acknowledgements
This research was supported in part by the
Grant N00014-01-1-0074 from the Oce of Naval
Research Cognitive Science Program. We would
like to thank Dr. Michael I. Posner for providing
constructive comments that have improved the
quality of the paper. Correspondence and request
for reprints should be sent to Hongbin Wang,
School of Health Information Sciences, University
of Texas Health Science Center at Houston, 7000
Fannin Suite 600, Houston, TX 77030, USA.
E-mail: [email protected].
131
132
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
P notice-stimulus-at-cued-bottom-locationbut-a-neutral-item-is-selected
Notice a neutral item at the cued top
location so refocus on the center item
P notice-stimulus-with-centercue-and-shift
The stimulus row appears while attention
is on the center location so shift attention
to it
P notice-stimulus-with-doublecue-and-shift
The stimulus row appears in a location
that is dierent from the current attended
location in the doublecue condition so shift
attention to it
P notice-stimulus-with-doublecue-and-an
-arrow-is-focused-on-so-attend
The stimulus row appears in a location
that is same as the current attended
location in the doublecue condition so encode the arrow
P notice-stimulus-with-doublecue-but-aneutral-item-is-focused-on-so-shift
The stimulus row appears in a location
that is same as the current attended
location in the doublecue condition but it
seems a neutral distracter is selected so
refocus
P attend-to-at-large-target
Encode the arrow that is currently
focused on
P shift-to-at-large-target-from-a-neutralitem
If the currently focused on item is a
neutral distract, refocus
P harvest-target
Encode the target after refocusing
P goahead-responding-if-it-is-the-target
If the encoded item is the center arrow,
goahead to respond
P hurryup-responding-no-matter-whethertarget-or-not
As long as the encoded item is an arrow,
hurry up to respond
P attended-item-is-right-to-the-target
Hold on, the currently attended arrow is to
the right of the target
P attended-item-is-left-to-the-target
Hold on, the currently attended arrow is to
the left of the target
28.
P goahead-responding-if-congruent
But move on to respond since they are
congruent
29. P refocus-again-if-incongruent
Refocus on the center location
30. P harvest-target-directly-if-incongruent
Encode the center arrow directly since its
location is available
Stage 6. Response
31. P decide-left
Decide to make a left response since the
target is a left arrow
32. P decide-right
Decide to make a right response since the
target is a right arrow
33. P random-left
Randomly decide to make a left response
34. P random-right
Randomly decide to make a right
response
35. P respond
Respond by pressing the decided key
36. P rexating-and-wait-for-next-trial
refocusing on the xation and be ready
for the next trial
133
References
Altmann, E. M., & Davidson, D. J. (2001). An integrative
approach to stroop: Combining a language model and a
unied cognitive theory. In Proceedings of the 23rd annual
meeting of the Cognitive Science Society (pp. 2126).
Hillsdale, NJ: Erlbaum.
Anderson, J. R. (1983). The architecture of cognition. Cambridge, MA: Harvard University Press.
Anderson, J. R. (1990). The adaptive character of thought.
Hillsdale, NJ: Lawrence Erlbaum Associates.
Anderson, J. R. (1993). Rules of the mind. Hillsdale, NJ:
Lawrence Erlbaum Associates.
Anderson, J. R., Bothel, D., Byrne, M., & Lebiere, C.
(submitted). An integrated theory of the mind. Psychological Review.
Anderson, J. R., & Bower, G. H. (1973). Human
associative memory. Mahwah, NJ: Lawrence Erlbaum
Associates.
134
Luck, S. J., Woodman, G. F., & Vogel, E. K. (2000). Eventrelated potential studies of attention. Trends in Cognitive
Sciences, 4(11), 432440.
Meyer, D. E., & Kieras, D. E. (1997a). A computational theory
of executive cognitive processes and mutliple-task performance: Part 1. Basic mechanism. Psychological Review,
104(1), 365.
Meyer, D. E., & Kieras, D. E. (1997b). A computational theory
of executive cognitive processes and mutliple-task performance: Part 2. Accounts of psychological refractory-period
phenomena. Psychological Review, 104(4), 749791.
Mozer, M. C. (1991). The perception of multiple objects: A
connectionist approach. Cambridge, MA: MIT Press.
Mozer, M. C., & Sitton, M. (1998). Computational modeling of
spatial attention. In H. E. Pashler (Ed.), Attention. East
Sussex, UK: Psychology Press.
Newell, A. (1990). Unied theories of cognition. Cambridge,
MA: Harvard University Press.
OReilly, R. C., & Munakata, Y. (2000). Computational
explorations in cognitive neuroscience. Cambridge, MA:
MIT Press.
Parasuraman, R., & Davies, D. R. (1984). Varieties of attention.
San Diego, CA: Academic Press.
Pashler, H. E. (1998). The psychology of attention. Cambridge,
MA: MIT Press.
Posner, M. I. (1980). Orienting of attention. Quarterly Journal
of Experimental Psychology, 32, 325.
Posner, M. I., & Dehaene, S. (2000). Attentional networks. In
M. S. Gazzaniga (Ed.), Cognitive neuroscience: A reader.
Malden, MA: Blackwell Publishers.
Posner, M. I., DiGirolamo, G. J., & Fernandez-Duque, D.
(1997). Brain mechanisms of cognitive skills. Consciousness
and Cognition, 6, 267290.
Posner, M. I., & Petersen, S. E. (1990). The attention systems of
the human brain. Annual Review of Neuroscience, 13,
2542.
Posner, M. I., & Raichle, M. E. (1994). Images of mind. New
York: Scientic American Library.
Posner, M. I., Rothbart, M. K., Vizueta, N., Levy, K. N.,
Evans, D. E., Thomas, K. M., & Clarkin, J. F. (2002).
Attentional mechanisms of borderline personality disorder.
Proceedings of the National Academy of Sciences USA,
99(25), 1636616370.
Rueda, M. R., Fan, J., McCandliss, B. D., Halparin, J. D.,
Gruber, D. B., Lercari, L. P., & Posner, M. I. (in press).
Development of attentional networks in childhood. Neuropsychologia.
Rumelhart, D. E., & McClelland, J. L. (1986). Parallel
distributed processing: Explorations in the microstructure
of cognition. Foundations (Vol. 1). Cambridge, MA: MIT
Press.
Schacter, D. L., & Tulving, E. (1994). Memory systems 1994.
Cambridge, MA: MIT Press.