[Trends in Linguistics. Studies and Monographs] Christian Chiarcos, Berry Claus, Michael Grabski (Editors) - Salience_ Multidisciplinary Perspectives on Its Function in Discourse (Trends in Linguistics. Stud
[Trends in Linguistics. Studies and Monographs] Christian Chiarcos, Berry Claus, Michael Grabski (Editors) - Salience_ Multidisciplinary Perspectives on Its Function in Discourse (Trends in Linguistics. Stud
Trends in Linguistics
Studies and Monographs 227
Editor
Volker Gast
Founding Editor
Werner Winter
Editorial Board
Walter Bisang
Hans Henrich Hock
Matthias Schlesewsky
Niina Ning Zhang
De Gruyter Mouton
Salience
Multidisciplinary Perspectives
on its Function in Discourse
Edited by
Christian Chiarcos
Berry Claus
Michael Grabski
De Gruyter Mouton
ISBN 978-3-11-024072-6
e-ISBN 978-3-11-024102-0
ISSN 1861-4302
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 279
Language index . . . . . . . . . . . . . . . . . . . . . . . . . . . . 279
Index of determinants, manifestations and aspects of salience . . . . 279
Subject index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 281
Introduction: Salience in linguistics and beyond
1. Introduction
“A point that hops and jumps like a living being”. The Latin translation of Aris-
totle’s description of the heart of an embryo in a hen’s egg – a red point that
stands out from the yellow yolk – has been the source of metaphors through-
out all languages of Europe. The salient point, le point saillant, der springende
Punkt all refer to things that are particularly important or relevant. Starting from
this metaphor, salient and salience in English became to mean things that stand
out from the ground, can be easily recognized, are in the focus of attention, or
foremost to a person’s state of mind.1
This volume addresses perspectives and functions of salience in discourse.
The volume emanates from the 6th International Workshop on Multidisciplin-
ary Approaches to Discourse that was held in October, 2005, in Chorin, Ger-
many, under the theme Salience in Discourse. The goal of the workshop was
to illustrate the differences and commonalities in research and perspectives on
salience within and between various areas of research, including computational
linguistics, discourse studies, and psycholinguistics.
Looking for a general definition of an entity as being salient, one may start
with the familiar function salience has in linguistics. In the context of the ana-
phoric binding of noun phrases, salience of possible antecedents is an important
contextual feature, that may be established by complex linguistic means. A re-
lated phenomenon is the salience of discourse segments that allows subsequent
segments to be linked to them by means of some discourse relation. Moreover,
the use of the term salience can be stretched further to extra-linguistic entities –
for example, non-linguistic objects can be defined as salient, as they restrict
linguistic reference to real entities in the discourse-external context. As gener-
1. For a detailed description of the history of the term and the metaphor see von
Heusinger (1997).
2 Christian Chiarcos, Berry Claus, and Michael Grabski
ents) and their salience ranking. Centering also formalized an important insight,
that entities in discourse have both a backward-looking and a forward-looking
aspect that correspond to the schema of input and output contexts of a given
utterance.
2.2. An example
(1) (a) For insurance agent Toni Johnson, dealing with the earthquake
has been more than just a work experience.
(b) She lives in Oakland, a community hit hard by the earthquake.
(c) The apartment she shares with her sister was rattled, but nothing
was severely damaged.
RST Discourse Treebank (Carlson, Marcu, and Okurowski, 2003), file 3
(slightly simplified)
The centers in this short text are the discourse referents Toni, the earthquake,
Oakland, the apartment, and Toni’s sister. Consider sentences (1.b) and (1.c).
Sentence (1.b) contains three forward-looking centers, Toni, Oakland, and
the earthquake. From these, both Toni and the earthquake are equally feasi-
ble candidates for the backward-looking center. Thus, the backward-looking
center Cb (U1.b ) cannot be identified on grounds of grammatical roles alone.
However, this uncertainty can be resolved by means of Rule 2, as Toni is the
backward-looking center of (1.c). Identifying Toni with the backward-looking
center Cb (U1.b ) thus results in a continue transition between (1.b) and (1.c),
while assuming the earthquake to be Cb (U1.b ), means that a shift occurred.
Thus, Rule 2 predicts that the (preferred) backward-looking center of (1.b)
is Toni. In fact, this prediction is also consistent with Rule 1, as the referent
pronominalized in (1.b) is Toni.
With respect to the forward-looking aspect of Centering, Toni is realized as
subject referent, the other referents being oblique, and thus, Toni represents the
preferred center.
8 Christian Chiarcos, Berry Claus, and Michael Grabski
Navaretta (2002), Arnold (2005), and Kaiser (2006) showed that func-
tionally different constructions, i.e., topic-marking constructions as opposed
to focus-marking constructions, have similar implications on the choice of
pronominal as opposed to nominal expressions in the forthcoming discourse.
This is taken as evidence that both topic-marking and focus-marking construc-
tions have a specific forward-looking function that exists independently from
backward-looking aspects of salience. Moreover, a psycholinguistic study by
Stevenson, Crawley, and Kleinman (1994; see also Miltsakaki, 2007) points to
a crucial role of semantic factors on focusing, such as the assignment of the-
matic roles that can be interpreted as a forward-looking aspect of discourse
coherence.
would break a cultural taboo. The use of the specific locative-marked ergative
constructions can thus be analyzed as being motivated by an act of intentionally
diminishing the salience of an entity (i.e., the speaker, in the examples brought
forward).
Ralph Rose’s contribution addresses the question whether a discourse en-
tity’s syntactic salience is indeed the major determinant of the use of referring
expressions. Starting from the observation that in English, syntactic role is often
confounded with semantic role, he conducted a corpus study to investigate the
relative contribution of syntactic and semantic factors on the use of pronominal
reference. Semantic role was operationalized in terms of FrameNet (Baker, Fill-
more, and Lowe, 1998) as well as in terms of PROTO-role entailments (Dowty,
1991). To analyze the corpus data, Rose adopted the notion of information value
from Information Theory (Shannon, 1948). The results indicate that syntactic
prominence and semantic prominence independently affect the salience of dis-
course entities
Finally, Christian Chiarcos’ contribution introduces the Mental Salience
Framework, a computational framework of salience metrics for the context-
adequate generation of referring expressions, i.e., the choice of referring expres-
sions, the assignment of grammatical roles, and word order preferences. The
Mental Salience Framework describes the realization of referring expressions
on the basis of two dimensions of salience, backward-looking (hearer-centered)
salience, and forward-looking (speaker-centered) salience. On this basis, an in-
tegrated architecture for aspects of attention control in discourse is proposed
which provides necessary preconditions for the technical operationalization of
multi-dimensional accounts of salience in discourse.
3.2. An example
Consider the following text, taken from Lascarides and Asher (2007):
(2) a. John had a great evening last night.
b. He had a great meal.
c. He ate salmon.
d. He devoured lots of cheese.
e. He won a dancing competition.
f. ??It was a beautiful pink.
To describe the coherence structure in (2) we use the terminology from Seg-
mented Discourse Representation Theory (SDRT; cf. Asher, 1993; Asher and
Lascarides, 2003; Lascarides and Asher, 2007). SDRT combines classical Dis-
course Representation Theory (DRT; Kamp, 1981; Kamp and Reyle, 1993) with
discourse relations that determine, in a stepwise processing of a text, the type
of coherence of each of its utterances with a preceding text segment.
In (2) the first relation to be established is between text utterances (2.a) and
(2.b). In this case, the fact can be exploited that (2.a) presents an event of which
the event mentioned in (2.b) is a part. This normally permits that the relation
Elaboration is derived (cf. Mann and Thompson, 1988). Utterances (2.c) and
(2.d) both can attach to (2.b), using the latter as an input context for linking
them, again by Elaboration. There is also a temporal sequence of the events
Introduction: Salience in linguistics and beyond 13
(3)
Now, in (2.e) the text mentions an event that cannot be interpreted as a sub-event
of the event in (2.b). But the input context made up by (2.a–2.d) and represented
in (3), offers the possibility to interpret it as a sub-event of the event mentioned
in (2.a). The node ke introduced by (2.e) is therefore attached, by Elaboration,
to the higher node ka , i.e., ‘discourse popping’ occurs. The event is also related
to (2.b) by Narration, cf. (4).
(4)
The discourse structure obtained so far poses a problem for attachment of (2.f).
By its content, it is best interpreted as elaborating the ‘Salmon constituent’,
(2.c). But that constituent is beyond the Right Frontier, which by now consists
14 Christian Chiarcos, Berry Claus, and Michael Grabski
of the branch ka –ke . The oddness of (2.f) is explained by the specific input
context that has been built up so far.
We may now sketch the effect of the choice of specific coherence relations
on output contexts. In several approaches, relations are classified as for the
‘weight’ of their arguments. In Rhetorical Structure Theory (RST; cf. Carlson
and Marcu, 2001; Mann and Thompson, 1988), most relations make a differ-
ence between their nucleus and satellite arguments; in SDRT relations are either
coordinating, i.e., relate sister nodes (Narration, Result etc.), or are subordinat-
ing, i.e., relate a daughter node to its mother node (Elaboration, Explanation
etc.). This difference shapes different salience contours at the output context
in the following way: By a coordinating relation, the newly attached node be-
comes the lowest element of the Right Frontier and the most salient node for the
next discourse constituent to be attached. In the example above, this happened
when (2.d) was attached by Narration, that is by a coordinating relation. In (3)
the node kd has become the most salient node, and kc , being no longer on the
Right Frontier, has lost its salience for subsequent text utterances. In contrast,
a subordinating relation preserves the position of an attachment site k on the
Right Frontier, as instantiated by the link of (2.b) to (2.a) by Elaboration, cf.
the position of nodes ka and kb in (3), (4).
they are novel or one-of-a-kind. Being more salient, these stimuli will capture
attention.
hypertexts act on the assumption that the target text of a hypertext link pro-
vides additional information on the linked-marked entity. To put it in Bexten’s
words, a hypertext link allows for predictions of the content of the target text.
In this respect, a hypertext link resembles the Centering Theory’s notion of a
forward-looking center. It is by virtue of this parallel that Bexten proposes that
hypertext links should be regarded as preferred centers as well. What makes
a hypertext an interesting theoretical case is that hypertext links do not neces-
sarily coincide with linguistic criteria that define forward-looking centers. As a
consequence, a single utterance of a hypertext can contain more than one pre-
ferred center. In her contribution, Bexten develops a descriptive approach of
coherence in hypertexts, taking into account the commonalities and differences
between hypertextual and linguistic salience marking.
The contribution by Berry Claus takes a look at the issue of salience from
the perspective of the simulation view of language comprehension. Proponents
of this view (e.g., Barsalou, 1999; Glenberg, 2007, Zwaan, 2004) assume that
language comprehension is tantamount to mentally simulating the actual ex-
perience of the described situations. In her contribution, Claus proposes that
(non-linguistic) salience of discourse entities derives from the mental simula-
tions constructed during language comprehension. Her considerations are con-
fined to narrative text comprehension and she argues that what makes an en-
tity salient – over and above linguistic factors – may depend on facets of the
narrated situation. The main implication of the simulation view with regard to
non-linguistic salience is that the most salient entities are those that are present
in the described situation. Claus reviews empirical support for this notion stem-
ming from studies that investigated whether the mental accessibility of entities
mentioned in narrative text is affected by properties of the described situation.
References
Almor, Amit
1999 Noun-phrase anaphora and focus: the informational load hypothesis.
Psychological Review 106: 748–765.
Ariel, Mira
1990 Accessing noun-phrase antecedents. London: Routledge.
Arnold, Jennifer
2005 Marking salience: The similarity of topic and focus. [On-line] Avail-
able: http://www.unc.edu/∼jarnold/papers/top.foc.html
Asher, Nicolas
1993 Reference to abstract objects in discourse. Dordrecht: Kluwer.
20 Christian Chiarcos, Berry Claus, and Michael Grabski
Chafe, Wallace
1994 Discourse, consiousness, and time. The flow and displacement of con-
scious experience in speaking and writing. Chicago: University of
Chicago Press.
Chambers, Craig G., Michael K. Tanenhaus, Kathleen M. Eberhard, Hana Filip, and
Greg N. Carlson
2002 Circumscribing referential domains during real-time language com-
prehension. Journal of Memory and Language 47: 30–49.
Chambers, Craig G., Michael K. Tanenhaus, and James S. Magnuson
2004 Actions and affordances in syntactic ambiguity resolution. Journal
of Experimental Psychology: Learning, Memory, and Cognition 30:
687–696.
Chiarcos, Christian and Olga Krasavina
2008 Rhetorical distance revisited: A parameterized approach. In Con-
straints in Discourse, Anton Benz and Peter Kühnlein (eds.), 97–115,
Amsterdam: John Benjamins.
Clamons, C. Robin, Ann E. Mulkern, and Gerald Sanders
1993 Salience signaling in Oromo. Journal of Pragmatics 19: 519–536.
Clark, Herbert H. and Meredyth A. Krych
2004 Speaking while monitoring addressees for understanding. Journal of
Memory and Language 50: 62–81.
Cristea, Dan, Nancy Ide, and Laurent Romary
1998 Veins Theory: A model of global discourse cohesion and coherence.
In Proceedings of the 36th Meeting of the Association for Computa-
tional Linguistics and 17th Conference on Computational Linguistics,
281–285, San Francisco.
Dixon, Robert M. W.
1994 Ergativity. Cambridge: Cambridge University Press.
Dowty, David
1991 Thematic proto-roles and argument selection. Language 67: 547–619.
Einhäuser, Wolfgang, Ueli Rutishauser, and Christof Koch
2008 Task-demands can immediately reverse the effects of sensory-driven
saliency in complex visual stimuli. Journal of Vision 8: 1–19.
Ellsiepen, Emilia, Pia Knoeferle, and Matthew W. Crocker
2008 Incremental syntactic disambiguation using depicted events: Plausi-
bility, co-presence and dynamic presentation. In Proceedings of the
30th Annual Conference of the Cognitive Science Society, Brad C.
Love, Ken McRae, and Vladimir M. Sloutsky (Eds.), 2398–2403.
Austin, TX: Cognitive Science Society.
22 Christian Chiarcos, Berry Claus, and Michael Grabski
Fillmore, Charles J.
1977 Topics in lexical semantics. In Current issues in linguistic theory,
Roger W. Cole (ed.), 76–138. Bloomington: Indiana University Press.
Fox, Barbara A.
1987 Discourse structure and anaphora: Written and conversational En-
glish. Cambridge: Cambridge University Press.
Givón, Talmy
1983 Introduction. In Topic continuity in discourse: a quantitative cross-
language study, Talmy Givón (ed.), 5–41. Amsterdam: John Ben-
jamins.
Givón, Talmy
1995 Functionalism and grammar. Amsterdam: John Benjamins.
Givón, Talmy
2001 Syntax (2nd edition). Amsterdam: John Benjamins.
Glenberg, Arthur M.
2007 Language and action: Creating sensible combinations of ideas. In Ox-
ford Handbook of Psycholinguistics, Gareth Gaskell (ed.), 361–370.
Oxford, UK: Oxford University Press.
Glenberg, Arthur M., Marion Meyer, and Karen Lindem
1987 Mental models contribute to foregrounding during text comprehen-
sion. Journal of Memory and Language 26: 69–83.
Gordon, Peter C., Barbara J. Grosz, and Laura A. Gilliom
1993 Pronouns, names, and the centering of attention in discourse. Cogni-
tive Science 3: 311–347.
Gordon, Peter C. and Randall Hendrick
1998 The representation and processing of coreference in discourse. Cog-
nitive Science 22: 389–424.
Groenendijk, Jeroen and Martin Stokhof
1991 Dynamic predicate logic. Linguistics and Philosophy 14: 39–100.
Grosz, Barbara
1981 Focusing and description in natural language dialogues. In Elements
of discourse understanding, Aravind K. Joshi, Bonnie L. Webber, and
Ivan A. Sag (eds.), 85–105. Cambridge: Cambridge University Press.
Grosz, Barbara J., Aravind K. Joshi, and Scott Weinstein
1983 Providing a unified account of definite noun phrases in discourse. In
Proceedings of the 21st Annual Meeting of the Association of Com-
putational Linguistics, 44–50.
Grosz, Barbara J., Aravind K. Joshi, and Scott Weinstein
1995 Centering: A framework for modelling the local coherence of dis-
course. Computational Linguistics 21: 203–225.
Introduction: Salience in linguistics and beyond 23
Kaiser, Elsi
2006 Effects of topic and focus on salience. In Proceedings of Sinn und
Bedeutung 10, Christian Ebert and Cornelia Endriss (eds), 139–154.
Berlin: ZAS Working Papers in Linguistics, Vol. 44.
Kaiser, Elsi and John Trueswell
2004 The referential properties of Dutch pronouns and demonstratives: Is
salience enough? In Proceedings of the Conference „sub8 – Sinn und
Bedeutung“, Cecile Meier and Matthias Weisgerber (eds.), 137–149.
Universität Konstanz: Arbeitspapier Nr. 177, FB Sprachwissenschaft.
Konstanz, Germany.
Kaiser, Elsi and John Trueswell
to appear 2011 Investigating the interpretation of pronouns and demonstratives in
Finnish: Going beyond salience. In The processing and acquisition of
reference, Edward Gibson and Neal J. Pearlmutter (eds). Cambridge,
Mass.: MIT Press.
Kamide, Yuki, Gerry T. M. Altmann, and Sarah L. Haywood
2003 The time-course of prediction in incremental sentence processing:
Evidence from anticipatory eye movements. Journal of Memory and
Language 49: 133–156.
Kamp, Hans
1981 A theory of truth and semantic representation. In Formal methods in
the study of language, Jeroen A.G. Groenendijk, Theo M. V. Janssen,
and Martin B. J. Stokhof (eds.), 277–322. Amsterdam: Foris.
Kamp, Hans and Uwe Reyle
1993 From discourse to logic. Dordrecht: Kluver.
Kelter, Stephanie, Barbara Kaup, and Berry Claus
2004 Representing a described sequence of events: A dynamic view of nar-
rative comprehension. Journal of Experimental Psychology: Learn-
ing, Memory, and Cognition 30: 451–464.
Knott, Alistair, Jon Oberlander, Mick O’Donnell, and Chris Mellish
2000 Beyond elaboration: The interactions of relations and focus in coher-
ent text. In Text representation: linguistic and psycholinguistic as-
pects, Ted Sanders, Joost Schilperoord, and Wilbert Spooren (eds.),
181–196. Amsterdam: John Benjamins.
Krifka, Manfred
2007 Basic notions of information structure. In Interdisciplinary studies
of information structure 6, Caroline Féry and Manfred Krifka (eds.),
Potsdam.
Introduction: Salience in linguistics and beyond 25
Sidner, Candace L.
1978 The use of focus as a tool for disambiguation of definite noun phrases.
In Theoretical issues in natural language processing, TINLAP 2,
David L. Waltz (ed.), 86–95. Association for Computing Machinery,
University of Illinois at Urbana- Champaign.
Sidner, Candace L.
1981 Focusing for the interpretation of pronouns. American Journal of
Computational Linguistics 7: 217–231.
Sidner, Candace L.
1983 Focusing in the comprehension of definite anaphora. In Computa-
tional models of discourse, Michael Brady and Robert Berwick (eds.),
267–330. Cambridge: MIT Press.
Stevenson, Rosemary
2002 The role of salience in the production of referring expressions. In In-
formation sharing: Reference and presupposition in language gener-
ation and interpretation, Kees van Deemter and Rodger Kibble (eds.),
167–192. Stanford: CSLI Publications.
Stevenson, Rosemary J., Rosalind A. Crawley, and David Kleinman
1994 Thematic roles, focus and the representation of events. Language and
Cognitive Processes 9: 519–548.
Tanenhaus, Michael K., Michael J. Spivey-Knowlton, Kathleen M. Eberhard, and
Julie C. Sedivy
1995 Integration of visual and linguistic information in spoken language
comprehension. Science 268: 1632–1634.
Taylor, Shelley E. and Susan T. Fiske
1978 Salience, attention, and attributions: Top of the head phenomena. In
Advances in experimental psychology – Vol. 11, Leonard Berkowitz
(ed.), 249–287. New York: Academic Press.
Tomlin, Russel S.
1995 Focal attention, voice, and word order. an experimental, cross-linguis-
tic study. In Word order in discourse, Mickey Noonan and Pamela
Downing (eds.), 517–554. Amsterdam: John Benjamins.
Tversky, Amos
1977 Features of similarity. Psychological Review 84: 327–352.
Vallduví, Enric
1992 The informational component. New York: Garland.
Vallduví, Enric and Elisabeth Engdahl
1996 The linguistic realization of information packaging, Linguistics 34:
459–519.
28 Christian Chiarcos, Berry Claus, and Michael Grabski
Olga Krasavina
Abstract. The current article focuses on the use of demonstratives in Russian within
the broader discourse phenomenon of referential choice. It has been repeatedly claimed
that referential choice and the activation level (salience) of a referent in the memory of
the speaker/listener are interconnected (e.g. Chafe 1994; Tomlin and Pu 1991; Kibrik
1996; 2000). In this article, the cognitive-psychological model of Gundel et. al. (1993,
2001) that specifies this connection is assessed and formulated more precisely. Following
this analysis, I determine several cases of demonstrative NP use and summarize them
as a model of referential choice. This study combines corpus analysis with experimental
methods and presents the results obtained in three experiments. The experiments employ
a questionnaire method, and involve a forced choice task and a written text-continuation
exercise. The results of Experiment 1 demonstrate the effect of a time shift factor on
demonstrative noun phrase (NP) use; the results of Experiments 2 and 3 cast doubt on
the prevalent hypothesis that it is the prospective relevance of a discourse new referent
that stimulates demonstrative NP use when mentioned for the second time.
1. Introduction
In the last few decades, studies in referential choice1 have enjoyed great pop-
ularity. This article specifically examines the discourse use of demonstratives
in Russian. Despite considerable progress in explanation of full noun phrase
(NP) and pronoun use achieved today, the use of demonstratives has not re-
ceived proper attention. The question of what specific factors stimulate the use
of demonstrative noun phrases still remains open.
The subject of this study was restricted to proximal demonstratives, since
the factors leading to the use of proximal and distant demonstratives may be of
different nature. Proximal demonstratives have an important place in the model
of referential choice – in Russian they are the third most frequent anaphoric
device2 after non-demonstrative full NPs and personal pronouns. This article
is based on Russian data, and I cite only Russian examples below. It is quite
likely, however, that some results obtained here may be relevant for a more
general understanding of the use of demonstratives in discourse.
The current article presents a combination of corpus and experimental ap-
proaches. Unlike the previous studies of demonstratives in Russian, which used
a limited number of casual examples (Padučeva 1985, Boguslavskaja and Mu-
rav’jeva 1987), the present study involves several hundred examples from nat-
ural discourse and aims at describing all these examples. To verify the role of
some potential factors, several psycholinguistic experiments were conducted.
The term demonstrative NP, or briefly ètot X in this paper, is used to refer
to a non-pronominal NP in anaphoric use3 and consists of the following lexical
items: 1) proximal demonstrative ètot ‘this’ in adnominal use followed by a
head noun with or without attributes, e.g. ètot mal’čik ‘this boy’ or 2) proximal
demonstrative ètot ‘this one’ in nominal use with or without attributes.4 The
term “pronoun” will refer to the third-person pronouns (on).5 With respect to
non-demonstrative full NPs, I will henceforth use the term “plain NP” for the
sake of brevity. The paradigm of ètot is presented below in Table 1.
Some influential explanatory models of referential choice emerged within
the cognitive approach to discourse (Chafe 1976, Givón 1983, Gundel et. al.
1993, 2001, Ariel 1994, Grosz et. al. 1995). Within this approach it is claimed
that the choice of a referential expression is constrained by the speaker’s eval-
uation of the referent’s representation in the memory, or activation (salience),
in the mind of the listener. Gundel et. al. (1993: 275) argue that each “mem-
ory and/or attention state” in the Givenness Hierarchy (see below) “is a neces-
sary and sufficient condition for appropriate use of a different form or forms”,
and that demonstrative NPs correspond to the medium memory and/or attention
state.
6. The form èto in nominal use is a specialized device which has been excluded from
consideration in this work for the sake of simplicity. The nominal èto often denotes
events and situations rather than referents, e.g. Ja byl gotov k otvetu, i èto pridavalo
mne uverennosti. – I was prepared to answer, and this made me feel confident. Nom-
inal èto should not be confused with the neuter form of ètot. Nominal èto has been
studied by Padučeva (1985).
7. The accusative forms ètot/ètogo and èti/ètix correspond to inanimate and animate
objects, respectively.
34 Olga Krasavina
2) If not, what are the factors determining the use of a demonstrative NP?
The structure of this article is as follows. In Section 2 a connection between acti-
vation level and demonstrative NPs is investigated, and the results of the corpus
study are summarized. In Section 3 the cases that remained unexplained in the
corpus study are considered and the experiments that served to verify several
hypotheses are discussed. The final conclusions are reported in Section 4.
The corpus study used the texts of Russian authors such as F. Iskander, K Si-
monov, etc. (see the list of authors at the end of this article). All texts were
written fiction. The study sample consisted of 254 examples, out of which 217
were demonstrative NP uses and 37 were plain NP uses that are interchange-
able with a demonstrative NP. A demonstrative NP proved to be a relatively
infrequently used device compared to the basic referential devices – plain NPs
and pronouns (see Table 2 for the numbers). Examples that were functionally
marked and demanded a separate consideration were excluded. The excluded
cases mostly constituted lexicalized expressions like na ètot raz ‘this time’, na
ètot sčet ‘in this respect’, and occurrences of demonstrative NP in contexts of
internal and direct speech.8
Table 2. The frequency of demonstrative NP, plain NP, and pronoun uses per 1000
words (in a sample text).
9. Gundel et. al. (1993, 2001) often make judgements as to what memory and/or at-
tention state a referent may have at a certain case, on the basis of the context or
the referent’s properties, as quoted above. A general model of identifying cognitive
statuses for certain cases has not been proposed, though.
10. I do not use word-by-word glosses here since the details of the Russian examples are
irrelevant for the topic of this article.
36 Olga Krasavina
‘They didn’t believe him but they were very glad when he, having dis-
covered a tin in the pocket of his overcoat, suggested that they eat be-
fore the journey. In the tin there were sprats, and the three of them ate
these sprats without bread and water.’
Gundel et. al. (1993) claim that the referent of ètot must be at least familiar, so
this condition would also be met by the higher cognitive statuses (“activated”
or “in focus”). This claim is supported by my material (see (2) where ètot X is
used for a referent “in focus” in Gundel et. al. (1993) terms). It is not difficult
to find contexts, however, in which under a low activation level – where only
a plain NP is expected (“uniquely identifiable” or “referential” or “type identi-
fiable”) – a demonstrative NP is used, as in examples where the distance from
a demonstrative NP to its antecedent is high (more than three clauses)11 or as
in (3):
(2) Mne stalo stydno za sebja, potomu čto ni razu v žizni ja ne projavil
nastojaščego interesa k tomu, što on delal. Kak i vse my, pogloščennyj
svoimi zabotami, ja ne pridaval dolžnogo značenija žiznennoj celi ètogo
ognennogo mečtatelja [1].
‘I felt ashamed for myself because I didn’t express real interest in what
he was doing, not even once. Like all of us swallowed by everyday prob-
lems, I never attached due importance to the life goal of this fired-up
dreamer.’
(3) – Na sever sejčas nevozmožno – čerez šosse ne proskočim. Noči nado
ždat’ … I xoronit’ aby kak – grešno. My što – boimsja ix? Ili udiraem?
Ladno, davaj na xutor k Šandoru Borce.
(Beginning of a new chapter) Ètogo vengra xorošo znali vse okrest. On
razvodil xmel’, deržal neskol’ko korov … [7].
‘– [To go] to the north is impossible now – we’ll not make it through the
highway. We need to wait for the night … And to bury [him] improperly
is no good. We are not afraid of them, are we? Or are we running away?
Okay, let’s go to the farm of Shandor Borec.
[Beginning of a new chapter] This Hungarian was famous all around
(lit. This Hungarian [Obj] knew everyone [Subj] around). He cultivated
hops, had a few cows …’
In (3), the referent Hungarian was mentioned previously only once by means
of a proper name in the end of the preceding chapter in a non-prominent syntac-
11. I avoid citing such examples here, in order to save space: these examples are ex-
tremely lengthy.
Demonstratives and salience: Towards a functional taxonomy 37
Low activation is necessary and sufficient for appropriate use of a plain NP,
but it can also expand to referents that have middle or high activation. High
activation is necessary and sufficient for appropriate use of pronouns. Middle
activation is necessary and sufficient for appropriate use of a demonstrative NP
ètot X, yet ètot X can expand the scope of its possible uses in both directions,
that is, it can also be appropriately used for coding referents enjoying either
high or low activation. In other words, a pronoun cannot expand the scope of
its possible appropriate uses, while a demonstrative NP and a plain NP can – to
all other statuses.
12. Long distances (e.g. in clauses, sentences, over paragraph borders) are important
factors lowering the referent activation (cf. Fox 1987; Givón 1983, 1990; Kibrik
1996, 2000). Chapter borders may be an even stronger factor than paragraph borders.
38 Olga Krasavina
Now, it seems clear that activation level is irrelevant for the use of demon-
strative NPs. Rather demonstrative NPs appear to occur under very different
circumstances in terms of activation level. In this respect, they differ from plain
NPs and pronouns, assuming that the distribution of the latter is determined by
an activation level of a referent (Chafe 1994; Tomlin and Pu 1991; Kibrik 1996,
2000). Therefore, the question remains: what specific factors govern the use of
a demonstrative NP?
‘So far, he was retired and was listed as a military teacher in Tyrnyauz,
at school. From this school came Grisha as well. ’
(7) I tut že migom odna ruka lezet v ledjanoj sumrak za butylkoj, v to vremja
kak drugaja vytiraet trjapkoj prilavok, pološčet gromadnuju lituju
kružku s žul’ničeskim tolstym dnom, perevoračivaet ètu kružku i so
stukom stavit ee pered pokupatelem ([6]).
‘And immediately one hand reaches into the icy darkness for the bot-
tle, while the other wipes the counter with a cloth, rinses a huge cast
mug with a thick false bottom, turns this mug over and puts it loudly in
front of the client.’
In (4), nominal ètot points out that specifically this corpse is meant. Thus the
corpse in question is contrasted to those that could have been thrown out beside
the entrance. In (5), the negative evaluation of the Binomial theorem and the
speaker’s ironical attitude to its exaggerated importance to a math teacher is
expressed by means of ètot X. Russian does not use articles to denote whether
the NP refers to a unique object or any object of the sort. Consider (6), where at
its first mention, the school can be interpreted as referring either to some school
or to a specific school. Ètot X can only refer to a specific school. Moreover, ètot
can be substituted by ètot že ‘the same’, without a change of meaning. Thus the
use of ètot marks that the referents in the first and the second predications in (6)
are identical. In (7), all three devices are interchangeable – a demonstrative
NP, a pronoun, and a plain NP. The use of a demonstrative NP in this example
shows the speaker’s intention to identify the mug in question with the one that
is “huge”, “cast”, and “with a thick false bottom”, rather than some other mug.
Another class of demonstrative NP uses is that of substitutionally deter-
mined uses (41% of all cases). In these cases a demonstrative NP is used when
neither a pronoun nor a plain NP can be used for a certain reason. In other
words, some restricting factors prevent the use of both basic referential devices
(the use of the latter on the basis of the activation level is explained in detail in
Kibrik 1996; 2000). As a result, a demonstrative NP substitutes a correspond-
ing device, as for example, in (8). A plain NP cannot be used in (8) because of a
stylistic constraint in Russian on full repetition of a plain NP at short distances.
The distance at which this constraint functions requires further investigation.
If the activation level (see Kibrik 1996, 2000) typical for the examples of this
type is included (8), the value of this measurement would be high enough for
a pronoun to be used. The introductory character of the antecedent, however,
lowers the activation level. For this reason it is unlikely that a pronoun can be
used.
40 Olga Krasavina
(8) Ja raskryl knigu, kotoruju prines s soboj. Eto byl kakoj-to xrestom-
atijnyj učebnik dlja vtorogo ili tret’ego klassov s nebolšimi otryvkami
iz klassičeskix rasskazov i povestej. Ja stal gromko čitat’ èti otryvki
(*nebol’šie otryvki, ? otryvki, *ix) isključitel’no dlja togo, čtoby obratit’
vnimanie učitelej na beglost’ moego čtenija ([1]).
‘I opened the book that I brought with me. It was some text-
book for the second or the third form containing short extracts from
some classic short stories and novels. I began reading these extracts
(*the short extracts, ? the extracts, *them) loudly, in order to turn the
teachers’ attention to the fluency of my reading.’
Figure 2 below summarizes the model of referential choice which accounts for
functionally and substitutionally determined uses:
Step 1: This stage involves checking if there are any conditions for a demon-
strative NP to be used in one of its functions. If such conditions are present
and no blocking filters are present, a demonstrative NP is used (see examples
(4) – (7)). If there are no such conditions or if such conditions are present, but
blocking filters are also present, then one proceeds to Step 2.
Step 2: According to the activation level, either a plain NP or a pronoun is
chosen (see Kibrik 1996, 2000).
Demonstratives and salience: Towards a functional taxonomy 41
Step 3: If there are no filters that can block the use of a pronoun or a plain NP,
then a corresponding referential device is used. If such filters are present, then
one proceeds to Step 4.
Step 4: A substitutionally determined demonstrative NP is used (see (8)).
At this point a question may occur: what happens if not only the basic referential
devices are blocked, but a demonstrative NP as well? Other ways of referring
to the target object may exist. At this time no definitive answer to this question
is available.
Many claims have been made concerning the use of demonstrative NPs that
signal different kinds of shifts (cf. Kleiber 1988, Gundel 1993, Himmelmann
1996, Cornish 1999), with the only difference from plain NPs being that plain
NPs have a presupposition of uniqueness, whereas “the notion of contrast is
built into the conditions of use regulating demonstratives” (Cornish 1999:59).
In my study sample, 3% of demonstrative NPs occur under the condition of a
time shift.
It has also been repeatedly argued, on the other hand, that various kinds of
shifts are the typical conditions for plain NP use as contrasted to pronoun use
(cf. Walker 1998; Cornish 1999). Most likely such contradictions in conclu-
sions are due to the difference in unaccounted variables that could have had
different values in the texts tested. One needs to carefully consider all condi-
42 Olga Krasavina
tions that could have had any weight in referential choice and make the material
as uniform as possible.
In Section 3, I consider the cases where 1) a demonstrative NP is used to
indicate a topic shift, and 2) a demonstrative NP is used after a time shift. In the
first case, the animate and inanimate referents were considered separately, in
two different experiments: animate and inanimate referents can have different
activation levels.13 In the second case, referents were locations.
Since this study has been conducted in the production-oriented perspective,
the experimental tasks were modeled in such a way that utterance production
process was imitated. The questionnaire method was used. Experiments 1 and
2 involved a forced choice task, and Experiment 3 a written text-continuation
exercise. The following section is devoted to these experiments.
3. Experimental evidence
2) the predications preceding the one containing the demonstrative NP are con-
nected by the rhetorical relation of “sequence”14 (in terms of RST Theory;
see Mann, Matthiessen, and Thompson 1992).
The hypothesis tested in this experiment can be formulated as follows. Let us
assume that the context meets criteria 1) and 2). Then a demonstrative NP is
used after a time shift (Condition 1). If there is no time shift, mostly plain NPs
are used (Condition 2).
3.1.2. Participants
The participants were 18 to 32 years old. They were divided randomly into
two parts. Each part consisted of 16 people. Only individuals with no linguistic
background were admitted. This was meant to prevent problems connected with
the so-called “effect of study”, that is, the possibility that a participant would fig-
ure out exactly what the experimenter expected to see and would unconsciously
either help the experimenter to obtain the expected result, or prevent him from
doing so.
14. The sequence relation specifies that a succession relationship must exist between the
related spans.
44 Olga Krasavina
same text, so they had no chance to compare these two variants while making
the choice of a referential device.
Each experimental sheet consisted of four texts, one of which was the target
text with presence or absence of a time shift; the three others were filler-texts.
The filler-texts imitated the target texts. Each experimental text consisted of two
sentences. In the first sentence the target referent was mentioned for the first
time, and in the second sentence for the second time. At the second mention of
the referent the variants of choice (a plain NP and a demonstrative NP) were
suggested in parentheses. Thus the task involved a forced choice between these
variants.
Each participant received four test sheets, including two with a time shift
and two with no time shift. This yielded 4 (text types) × 32 (participants) = 128
test sheets. The test texts and filler texts were typed out in a random order on
the test sheets. The time for completing the task was not limited. Some texts
that were used for this experiment were original extracts from the corpus and
some were constructed for this experiment.
Below is an example of an experimental sheet. The target text is the second
one from the top. This text includes a time shift. The other texts are filler-texts.
Task: Below you see some extracts from Russian fiction. Please underline that variant
given in parentheses that you think sounds best in this context. Accomplish the tasks
in order. Proceed to the following task; do not return to any of the previous ones.
Nakonec vygljanulo solnce. (Ono, èto solnce) osvetilo komnatu, brosaja vyzov plox-
omu nastroeniju.
‘At last the sun came out of the clouds. (It, this sun) lightened the room, challenging
the bad frame of mind’.
Miša vošel v komnatu i podošel k stolu. Kogda-to za (ètim stolom, stolom) sidel ego
deduška.
‘Misha entered the room and came to the table. Some time ago his grandfather used
to sit at (this table, the table).
Vernuvšis’, ja zastal ix za jarostnym sporom. Ponabl’udav nemnogo za (ètoj scenoj,
nej), ja rešil vmešat’sja.
‘As I returned, I found them in a furious dispute. After watching (this scene, it) for a
while, I decided to interrupt’.
Odin student rešil podšutit’ nad svoim odnokursnikom. (Ètot student, on) napisal čto-
to na kločke bumagi i položil v sumku svoemu drugu.
‘Once a student wanted to make fun of his classmate. (This student, he) wrote some-
thing on a sheet of paper and dropped it into his friend’s bag’.
Figure 3. Example of an experimental sheet (translation from Russian)
Demonstratives and salience: Towards a functional taxonomy 45
ètot X plain NP
Condition 1 (time shift) 22 (69%) 10 (31%)
Condition 2 (no time shift) 11 (34%) 21 (66%)
3.2.2. Participants
The participants involved in this experiment were the same individuals who
were involved in Experiment 1. The subjects were subdivided in a random way
into two parts, 16 subjects each, so that each part received different versions
of the same texts. These two versions corresponded to the two conditions men-
tioned above.
46 Olga Krasavina
Each participant received a test sheet with 4 test texts on it, each 45–60 words
long: two texts according to Condition 1, and two according to Condition 2.
Filler material was not used. In the target place there were two variants of
choice. Participants were instructed to choose one, either a pronoun or a demon-
strative NP. This yielded 4 (texts) × 32 (participants) = 128 test texts.
Demonstratives and salience: Towards a functional taxonomy 47
Demonstrative NPs that were chosen noticeably more frequently under Con-
dition 1, and demonstrative NPs and pronouns that were chosen with an equal
frequency or with prevalence of pronouns under Condition 2, would be inter-
preted as support for the hypothesis.
The participants were given the test sheets for both experiments at the same
time. The test tasks were carried out consecutively, without any time break.
ètot X on
Condition 1 (thematic prominent) 8 (33%) 16 (66%)
Condition 2 (not thematic prominent) 6 (25%) 18 (75%)
Table 7. Summary frequency of ètot X and pronoun use at the second mention of the
referent under Condition 1 and 2.
ètot X on Total
Condition 1
or 14 (29%) 34 (70%) 48 (100%)
Condition 2
of the chosen devices). There must be either some additional, more powerful
factors that regulate the choice between a pronoun and a demonstrative NP in
this situation, probably depending on the speaker’s intention, the genre, etc., or
a different experimental design needs to be chosen, in order to ensure that the
participants understand the relevance of the target referent for the subsequent
discourse.
she lived, Zana dug a hole and put some ferns around it, and thus she
made herself quite a cozy bedroom.’
3.3.2. Participants
12 subjects took part in the experiment. The subjects had never taken part in this
kind of experiment. The participants met the same criteria as the participants in
Experiments 1 and 2.
Number %
*Plain NP at linear distance = 1 7 10
*Plain NP at linear distance > 1 9 13
Pronoun 10 14
Demonstrative NP 6 9
*Adverbial demonstrative 7 10
*Relative pronoun 8 11
*Target referent is not mentioned 23 33
Total 70 100
Table 10. Correlation of pronoun use and referent mention in the following context
(16 Total).
referent, demonstratives were still used (Table 10, lines 2 and 4), although the
number of pronouns used under this condition was higher (7 cases, or 44%) than
the number of demonstratives (4 cases, or 25%). At the same time, the number
of demonstratives on the whole (6 cases, or 38%) was lower than the number
of pronouns (10 cases, or 63%).
The statistical analysis shows that there is no significant difference between
pronoun or demonstrative NP use under two conditions (χ 2 (1) = 0.02 n.s.). This
indicates that the factor of “thematic prominence” in the subsequent discourse
of a new discourse referent does not affect the choice of demonstratives.
The number of demonstrative NPs turned out to be quite low. As can be seen
in Table 9, jama ‘hole’ was often referred to by means of adverbial demonstra-
tives and plain NPs at the linear distance of one clause despite the constraint on
full NP repetition at such distance. This constraint can be abolished in cases
where the referent is a location, as in the case considered in Experiment 1.
The high occurrence of adverbial demonstratives and plain NPs here can be
addressed in terms of referent semantics.
52 Olga Krasavina
4. Conclusions
The present study has focused on the use of an important, but little-studied,
referential device in Russian – the demonstrative NP ètot X. One of the most
essential points made in this study is the clarification of the connection between
demonstrative NPs and activation level. This connection turned out to be quite
weak: demonstrative NPs can be used under any activation level of a referent. It
was demonstrated that ètot X can encode the referents which have both higher
than “familiar” status (in line with the Gundel et. al. (1993, 2001) prediction)
and lower than “familiar” status. The latter can be interpreted as an important
addition to the theory of Gundel et. al. (1993, 2001). Using the corpus mate-
rial, two classes covering most of the demonstrative NP uses were singled out.
Demonstratives are used when neither of the basic referential devices can be
used. There are certain factors that block the use of a pronoun under a high acti-
vation level and of a plain NP under a low activation level of a referent, so that
the only possible referential form left is a demonstrative NP. These cases make
up the first class. Another class is represented by cases that can be accounted
for by one of the basic functions underlying the use of demonstratives: selection
from a set, identification, or pejorative function.
After the examination of several hypotheses concerning the character of the
factors that lead to the use of demonstrative NPs in experimental studies, it has
been proven that the demonstrative NP is a preferable referential device after
a temporal shift takes place. An important “negative” result was also obtained
during the study: the experiment indicates that the factor of further referent’s
relevance in the following context does not affect the choice between a demon-
strative NP and a pronoun.
This study calls for further empirical studies in the use of Russian demon-
stratives. The questions that remain to be answered are:
– What governs the referential choice in cases like those described in Experi-
ments 2 and 3?
– Within the class of substitutional uses, what happens if not only a pronoun
and a plain NP are blocked, but a demonstrative NP as well?
Additionally, the nature of constraints on the use of referential expressions re-
quires further investigation. For example, the distance from the antecedent at
which a plain NP is blocked needs to be clarified. Future work will include a
study of a larger data set, as well as reaction time experiments.
Demonstratives and salience: Towards a functional taxonomy 53
Acknowledgements
The research presented in this paper was supported by grant 03-06-80241a of
the Russian Fund of Fundamental Research.
Abbreviations
The literature from which the examples were taken (URL: http://www.lib.ru):
[1] Iskander F. “Školnyj val’s, ili Energija styda”;15
[2] Iskander F. “Stojanka čeloveka”;
[3] Simonov K. “Živye i mertvye”;
[4] Bykov V. “Obelisk”;
[5] Vizbor J. “Legenda sedogo El’brusa”;
[6] Kataev V. “Beleet parus odinokij”;
[7] Akimov I. “Legenda o malom garnizone”.
References
Ariel, Mira
1994 Interpreting Anaphoric Expressions: A cognitive versus a pragmatic
approach. In: Journal of Linguistics (30), 3–42.
Boguslavskaja, Olga and Muravjeva, Irina
1987 Mexanizm anaforičeskoj nominacii. In: Modelirovanie jazykovoj de-
jatel’nosti v intellektual’nyx sistemax (red. A.E. Kibrik, A.S. Narin-
jani). Moskva: Nauka, 78–127.
Chafe, Wallace
1976 Givenness, contrastiveness, definiteness, subjects, topics and point of
view. In: C.N. Li (ed.), Subject and topic. New York: Academic Press,
25–55.
Chafe, Wallace
1994 Discourse, Consciousness, and Time. The Flow and Displacement of
Conscious Experience in Speaking and Writing. Chicago: University
of Chicago Press.
Cornish, Fransis
1999 Anaphora, Discourse and Understanding. Evidence from English and
French. Oxford: Clarendon.
Fox, Barbara
1987 Discourse Structure and Anaphora. Cambridge: Cambridge Univer-
sity Press.
15. Examples from this and the other sources used in this study, were abridged as far as
possible.
54 Olga Krasavina
Andrey Y. Filchenko
1. Introduction
1. The work leading to this publication was supported by the NEH-NSF Documenting
Endangered Languages Fellowship, 2005–2006. Any views, findings, conclusions,
or recommendations expressed in this publication do not necessarily reflect those of
the National Endowment for the Humanities and the National Science Foundation.
58 Andrey Y. Filchenko
The empirical base for the discussion is a corpus of the Eastern Khanty nar-
ratives, collected and transcribed between 2000-2003, supplemented by some
Eastern Khanty texts published between 1900-1995.
In the discussion of the structural properties and information structuring of
the Eastern Khanty clauses, I will differentiate the grammatical relations from
the semantic roles of the arguments of propositions, and pragmatic functions of
the referents and pragmatic operations they are involved in.
Grammatical relations are henceforth indicated following Dixon (1994) as:
S – intransitive subject; A – transitive subject; O – transitive non subject.
The main semantic roles of the arguments of propositions (relevant for the
purpose of discussion) are generally defined after Van Valin and Lapolla (1997)
as either Agent or Target, each representing a host of semantic features, or en-
tailments (cf. also Rose, this vol., for a more in-depth discussion).
The notions of discourse-pragmatic status (salience), pragmatic operations,
topicality are central to the discussion of the features of the constructions in the
narrative. A referent is defined as pragmatically central if it shows the follow-
ing properties: it belongs to the presuppositional part of the proposition, it is
contextually accessible and active, in dislocation tests (“as for” and “about”)
it produces the target clause, it normally does not carry the clause accent, and
the rest of the proposition appears to carry a relation of “aboutness” towards it
(Strawson, 1964; Kuno, 1972; Gundel, 1976; Lambrecht, 1994). In the discus-
sion of the pragmatic structure of the data, I will also use the terminology and
premises of the Centering framework as in Grosz et al. (1995), cf. the Intro-
duction of this volume for a more detailed discussion of Centering. I will also
use less conventional terms foregrounded center and backgrounded center to
mark special pragmatic states resulting from the voice operations in the Eastern
Khanty parenthetical constructions (cf. Hinterhölzl and Petrova, this vol. for
similar notions). In this, by the backgrounded center in case of Eastern Khanty
parenthetical Agent-demoting constructions I will understand a primary topical
referent (Cb ), whose pragmatic status is temporary demoted (backgrounded)
(C[backgrnd] = C[n−1] (= C[n+1] ) =/ C[n+2] ), while another referent with a compet-
ing discourse status but with non-typical semantic role appears temporary pro-
moted (foregrounded) as a pragmatic center for an interval of 1–2 clauses, as in
foregrounded center (C[foregrnd] =/ C[n−1] (= C[n+1] ) =/ C[n+2] ).
Section (2) will provide a brief outline of the canonical Eastern Khanty
active-direct clause and consider the main features of organization of gram-
matical relations important for the discussion to follow. Section 3 will intro-
duce the Eastern Khanty agent demotion constructions, listing first the so called
“ergative constructions” (3.1.) immediately compared to the “agented passive
constructions” (3.2.). The description of the key morphosyntactic, semantic and
60 Andrey Y. Filchenko
Typical Eastern Khanty simple clause shows general tendency towards the SOV
pattern. Khanty unmarked neutral simple clause is verb-final where, generally,
subject precedes object. The position of the O constituent may vary contingent
upon its pragmatic properties, that is, brand new, inactive, unidentifiable O ref-
erents are always rigidly fixed in the SOV order. In cases of pragmatically active
and identifiable O referents other orders may result, OSV and SVO.
The semantic role of Agent is typically mapped to A grammatical relation,
while the semantic role mapped to O relation is typically the Target, an entity
saliently affected in the event. The semantic role mapped to the S grammatical
relation is understood as that of a single core NP of an intransitive verb.
↓
(1) a. mä m@n-l-im
1sg walk-prst-1sg
‘I walk’
↓ ↓
b. mä ajr1t-äm t1Gl-a qar1-mta-s-1m
1sg canoe-1sg/sg det-ill pull-intn-pst2-1sg/sg
‘I pulled my canoe here’
The referent with the semantic role of Agent appears clause-initially, expressed
by the argument in S/A relation marked by Nom. case that controls S/A-V agree-
ment inflection on the predicate. S/A-V agreement and O-V agreement refers to
Khanty single and double conjugation, i.e. co-referential agreement inflection
on the transitive predicate controlled by the S/A relation and O relation respec-
tively. Agreement controlled by S/A grammatical relation is obligatory (1a–b).
Agreement between the grammatical relation of O and transitive predicate is
contingent upon the pragmatic properties of the O (1b), i.e. pragmatic identifi-
ability and activation of this referent in the interlocutors’ discourse universe.
Parenthetical agent-demoting in Eastern Khanty 61
The argument in the S/A grammatical relation normally has all the tradi-
tional subjecthood properties, such as control over referential relations clause-
internally and -externally: control over embedded non-finite clauses; control
over zero anaphora across conjoined clauses; control over reflexivization; quan-
tifier movement control.
Eastern Khanty nouns in O relation are zero-marked for case, i.e. morpho-
logically indistinguishable from the Nominative. In the pronouns, the Accusa-
tive case has a marker /-t/. Thus: mä ämp tuG@m ‘I brought a dog’ vs. ämp mänt
por ‘a dog bit me’.
Pragmatically, a new referent is introduced or re-activated by a full NP or
a free pronoun in the S/A grammatical relation. Once the referent is identifi-
able as topic, its discourse salience is then expressed by elision and by verbal
agreement – a preferred topic expression (Filchenko 2006).
There is a strong correlation: Topic = S/A = ClauseInitial = Agent, that is
within a general typological information structural pattern (Lambrecht, 1994).
Topicality associates strongly with minimal morphological complexity, implic-
itness, while new information and focus correlate with morphological explicit-
ness. In Centering framework terms (Grosz et al. 1995), Eastern Khanty canon-
ical clause adheres well to the cross-linguistic prototypical correlation of for-
mal referring expressions to attention state and inference load, i.e. in center
continuation, a persisting preferred center Cp(n) is typically realized by a zero
pronoun and is strongly associated with grammatical relation of S/A and sub-
jecthood, clause-initiality and pronominalization. On the other hand, in center
shifts, preferred center Cp(n) that associates with C(n+1) is typically realized by
a morphologically explicit full NP.
2. The small caps are used to denote the clause constituent bearing the clausal accent.
62 Andrey Y. Filchenko
In (2a), the canonical active-direct clause with the elided 1sg. topic is followed
by the ergative (2b) with a new 3 pl referent. Counter to the preferred topic
expression pattern discussed in the section above, the topical Agent referent
‘they’ in (2b) appears coded by a free Loc-marked pronoun in S/A relation and
by S-V agreement inflection. The Message “Bear!” now has high discourse acti-
vation status, marked in (2b) by 3sg O-V agreement on the predicate. However,
the narrative resumes canonically in the immediately subsequent active-direct
clause (2c), where the topical status of the 1sg referent has the preferred topic
expression. That is, the demoted 1sg. topic referent of (2a) reappears expressed
in (2c) by elision and co-reference agreement inflection on the predicate, thus
retaining its overall discourse salience.
Superficially, nothing in the structure/grammar of these clauses, and their
immediate discourse environment, precludes the use of the canonical active-
direct construction type to express the same semantic content. These features
correlate to general pragmatics of Topic-comment (2a) vs. marked predicate-
focus (2b) with the Loc-marked S/A agent pronoun.
In respect to discourse information structuring, the ‘ergative’ events ap-
pear parenthetical, consequential (reactive) in their nature, representing a cause-
effect or action-consequence dependence upon the event in the preceding active-
direct clause: (2a) implies the projected effect of (2b).
Although the affect is often implicit, however, the affectedness of the Target
referent (expected in the typical transitive event) is never specified.
Thus, the Loc-marked ergative S/A referents, although mainly inherently agen-
tive (definite human/animate), are deprived, at least in part, of some of the sub-
jecthood properties: control/volition, which correlates with the fact of their in-
creased morphological complexity: oblique case marking of the Agent. This is
Parenthetical agent-demoting in Eastern Khanty 63
In the adjoined passive (5b), the Target referent ‘small dog’ is expressed by the
full NP in the S relation controlling 3sg verbal agreement, whereas the Agent
referent is expressed by the free Loc-marked pronoun in the non-S relation. The
topic in (5b) is the S argument ‘small dog’, whereas the 1sg agent is apparently
temporarily demoted to an oblique-like role. The relevance of the agent in the
proposition (5b) is still manifested through its overt presence, to minimize po-
tential ambiguity.
However, in the active-direct (6), the demoted 1sg agent reappears in the S/A
relation, remaining topical, coded appropriately by elision and predicate agree-
ment inflection, and controlling the S/A relation over the non-finite modifiers
(inflected for 1sg), thus unaffected discoursively by the passive demotion.
– passive manifests the Eastern Khanty tendency for Topic initiality, i.e. the
strongest alignment appears <pragmatic function = grammatical relation>,
overriding that of <pragmatic function = semantic role> or <semantic role =
grammatical relation>;
– type frequency in the narratives ∼ 13%.
The Eastern Khanty passives with overt Agent appear to resonate with the gen-
eral fundamental function of passives “having to do with defocusing of agents”
(Shibatani, 1985).
The structural properties of the Eastern Khanty voice constructions listed
above are not typologically unique, nor are they new to the language descrip-
tion. However, the exact identification of the motivation of their usage, i.e.
the functional explanation of these grammaticalized form-function correspon-
dences, is not accounted for. In the meanwhile, the explanation of the choice
of speakers’ strategies of mapping of the propositional-semantic content to the
structural features and discourse functions in the passive constructions may be
aided by the insights into the specifics of the cultural context typically correlat-
ing to frequent use of these constructions, passive voice talk.
3. In the Eastern Khanty cultural frame implies that the status of ‘wife’ is complete only
upon arrival and observing certain rituals at the new family location (new house), the
‘husband’s’ clan residence.
4. Eastern Khanty have patrilineal, patrilocal setting, where the oldest son normally
resides with his Father and inherits from him.
68 Andrey Y. Filchenko
Bear is an extremely significant cultural agent for many of the Siberian native
cultures, and Eastern Khanty in particular. From the available ethnographic ac-
counts (Tschernetsov 1974, Kulemzin 1984, Lukina 1990) and surviving oral
folk tradition (Filchenko 2009) it is known that behaviour towards the bear is
highly ritualized, with omnipresent taboos and restrictions. Having basically
equal status with a human, a bear is the biggest and most dangerous local preda-
tor, referred to as wont-iki ‘forest master’, or qaq1 wajaG ‘brother animal’, and
just wajaG ‘animal’, and almost never by a proper taboo term jiG ‘bear’. Bear
is the only animal who has a complete set of taboo terms for body parts un-
related to proper somatic nomenclature of humans and other animals: cf. kil
‘bear’s stomach’ vs. qon ‘human/animal stomach’); laGl’ip ‘bear’s tooth’ vs.
pöNk ‘human/animal tooth’, etc). Bear’s bones are never broken and are spe-
cially disposed of out of the reach of dogs and other animals, while the skulls
are kept by the hunters on house roofs (cf. photo below). The events of hunting
the bear and feasting over it are of extreme significance and ritual value, char-
acterized importantly by concealment of the identities of the hunters (mask (cf.
photo below) and nicknames) interpreted as a need to avoid possible retaliation
from the bear’s spirit (Kulemzin 1984, Filchenko 2007). On the other hand, the
ability to hunt bear successfully and thus provide for the community translates
into a prominent social status for the hunter (cf. photo, Figure 3).
Thus, within the Eastern Khanty cultural convention, on the one hand, there
is an apparent tendency towards considerable caution and avoidance of personal
association with the bear, and particularly affecting the bear. On the other hand,
there is an apparent social significance of the status of a successful bear hunter.
This special cultural status and behaviour conventions towards the bear corre-
late to the special treatment of it linguistically, in that a special pragmatically
motivated morphosyntactic operations are implemented aimed at demoting the
70 Andrey Y. Filchenko
status of the agent in the otherwise typical transitive event. In other words, the
culturally conditioned tendency towards avoidance of association with affect-
ing the bear is manifested in the linguistic avoidance of discourse salience of
the agent in the propositions describing this type of events (cf. also Claus, this
vol., outlining experiential effects on language processing).
The above, however, should not be seen as a statement that this construction
is restricted exclusively to the contexts involving the bear. In (15), the context
involves hunting for the biggest local fish, pike.
(15) a. mä sart wel-s-@m, ¨@ll¨@
1sg pike kill-pst2-1sg big
‘I caught a pike-fish, a big one’
b. Öll¨@ sart män-n@ löGöli-s-im
big pike 1sg-loc cut-pst2-1sg/sg
‘I prepared the big pike’
c. terkä-s-im iwes-n@
fry-pst2-1sg/sg stick-loc
‘I fried it on sticks’
Parenthetical agent-demoting in Eastern Khanty 71
Notice, that in the Agent-demoted (16a), the ‘incidental’ interpretation and re-
spective adverbial use is preferred to the ‘intentional’. However, in the reflexive
event with the active-direct transitive clause and canonical Agent coding (16b)
both, the ‘incidental’ and ‘intentional’ interpretations and respective adverbial
use are acceptable. Unlike (16b), which allows for volitional, purposeful event
of acting on oneself, the “ergative” voice in (16a) codes less intentional, defo-
cused Agent and typically has a ‘reading’ of the less volitional, unintentional
Event/Action
The “ergative” agent-demotion construction is also attested with modal ver-
bal predicates, particularly of cognition (17), which are in many features similar
to the already exemplified perception predicates (‘look / aim at’).
72 Andrey Y. Filchenko
In their features, these examples are consistent with the established pattern of
low Target affectedness and reduced/unapparent agentivity and control.
In referential terms, the preference for the contextual use of the construction
holds for the non SAP.
In (18), the 3sg non-SAP agentive referent is demoted in the context of bear
hunting, implying agent’s affecting the bear, but the agent’s control and volition
in the proposition is backgrounded by Loc case-marking.
In less culturally significant contexts, the “ergative” agent-demoting con-
structions are also attested, however, the general pragmatic tendency of agen-
tivity and salience avoidance holds consistently (19):
Notice, in the reply utterance (19b), the Agent is coded as the clause-initial argu-
ment marked by the Loc case, with the main pragmatic function being the agent
demotion, making it less volitional, controlling, affecting and, importantly less
topical for the given stretch of the discourse. The non-topicality, and rather
the focus pragmatic relation of the Agent argument towards the whole of the
proposition (19b), is an important feature, the one by which the presupposition
of (19a) is falsified, or made absent. Thus, it is not incidental or random that
the morphosyntactic properties of the Agent argument in (19b) are consistent
with those of the demoted Agent voice construction. On the other hand, the non-
topicality of the role of Agent here, its coding by the Loc-marked NP aligns it
with another marked Agent-demotion construction, the agented passive.
Parenthetical agent-demoting in Eastern Khanty 73
In some of the examples (20), the Target referent may be elided from ex-
plicit coding, being expressed only by predicate agreement inflection, whereas
Agent is overt and marked formally by the Loc case. These examples testifying
additionally to the Agent’s demotion from the core of the proposition towards
the periphery, while the elided Target, being elided, appears more topical, prag-
matically foregrounded.
canonical agent-demotion
Demotion of the Agent referent in both of the reviewed Eastern Khanty con-
structions appears an operation having to do with defocusing of the core par-
ticipant in the event, moving it towards the periphery of the semantic frame of
the event, typically manifested by an increase in morphological complexity of
the Agent referent, i.e. cross-linguistic examples of the passive constructions
(Shibatani, 1985).
Among the essential factors that underlie attested ergative, nominative or
split systems, it is often suspected that larger discourse pragmatic and/or se-
mantic considerations might be the key conditioning factors. Particularly the
degree of topicality/referentiality, as well as volition/control of referents may
affect the inclination in a language’s grammatical relations either towards erga-
tivity or nominativity. Morphological complexity of the Agent argument of the
Eastern Khanty “ergative” construction quite consistently correlates with pro-
totypical ergative continuity between S and O at the deeper level, i.e. A of the
“ergative” approximates the O of the active in its pragmatic and semantic fea-
tures: decreased topicality, low agentivity/control, approaching the semantics
of experiencer/undergoer.
In “ergative” clauses, the overt Loc-marked human/animate Agent, the low
transitivity of the morphologically active verbal predicate, predicate agreement
control, and its parenthetical character (one clause length followed by canonical
active-direct clauses with the continuing topic expressed by elision) signal tem-
porary pragmatic demotion of the low control/volition Agent in the consequen-
tial event, where the agentive nature of the Agent is de-emphasised, consistent
with specific cultural conventions and practices.
In passive clauses, such features as: demoted overt Loc-marked Agent, the
high semantic transitivity of the morphologically passive verb, promoted case-
unmarked Target controlling predicate agreement, and parenthetical character
(1–2 clause length followed by a canonical active-direct clause with the contin-
uing topic expressed by elision) communicate temporary pragmatic prominence
Parenthetical agent-demoting in Eastern Khanty 75
Conclusion
that a wide cognitive faculty, facilitating discourse coherence in the given cul-
tural context is at play in structuring the information and ultimately affecting the
linguistic form, governing the choice of referring expressions of the arguments
of propositions. The system’s specific grammatical resources consistently cor-
relate with identifiable and predictable pragmatic and semantic properties, man-
ifesting speaker’s choices in representing the salient entities across a stretch of
discourse.
The Eastern Khanty voice constructions typically code ultimately de-transi-
tive events with multiple referents potentially competing for discourse salience.
The “agented passive” and “ergative” constructions manifest parenthetical
shifts in salience of the agentive discourse referents, allowing for gradience in
discourse prominence, primary and secondary topicality. This is expressed by
promotion/demotion voice operations, oblique case marking and grammatical
relation shifts of agents.
The implications of the above for the issue of discourse salience are vari-
ous. What is salient in the discourse appears to be a matter of gradience and
underlyingly culturally motivated. The salient entity in a stretch of discourse is
a persistent center in a sequence of utterances. Centering is a multifactorial dis-
course phenomenon controlled by an interaction of cultural convention, prag-
matic, lexical semantic and syntactic features. “Ergative” and passive construc-
tions show that: i) center continuation sequencing is indeed preferred over cen-
ter retention sequencing; ii) Cp(n) does strongly associate with clause-initiality
and pronominalization, which confirms correlation of low morphological com-
plexity to high pragmatic salience; iii) Cp(n) ’s association with subjecthood or
grammatical relation of S is though strong, however, not impenetrable for such
factors as cultural preferences, speaker intentions and pragmatic pressures; iv)
in the non-canonical constructions S relation is indeed firmly tied with a rela-
tively high degree of discourse salience, albeit occasionally temporarily fore-
grounded compared to the primary persisting discourse center; v) in a stretch of
utterances more than one simultaneous discourse centers are possible, in which
case; vi) the choice of referring expression is determined by the speaker’s in-
tentions interacting with prevailing patterns in the linguistic system, such as:
word-order, clause-initiality of the center, association of semantic role(Agent) –
grammatical relation (S/A)–and grammatical role/function (Subject).
References
Filchenko, A.Y.
2009 Landscape Perception and Sacred Places amongst the Vasyugan
Khanty. In: ed. P.Jordan: Landscape and Culture in the Siberian
North. UCL Press.
Filtchenko, A.Y.
Field notes from ethno-linguistic research of Eastern Khanty. The
Field Archive of the Laboratory of Siberian Indigenous Languages
at TSPU. Tomsk.
Givon, T.
2001 Syntax. An Introduction. Amsterdam/Philadelphia: John Benjamins.
Grosz, B., A. Joshi and Sc. Weinstein
1995 Centering: A framework for modeling the local coherence of dis-
course. Computational Linguistics 21. pp. 203–225.
Gundel, J.K.
1976 Topic-comment structure and the use of tože and takže. Slavic. and
East European Journal 19. pp. 174–176.
Jordan, P. and Filtchenko A.
2005 Continuity and Change in Eastern Khanty Language and World-
view. In: “Rebuilding Identities: Pathways to Reform in Post-Soviet
Siberia” edit. Erich Kasten. Dietrich Reimer Verlag.
Karjalainen, K.F.
1927 Die Religion der Jugra-Völker. Parvoo.
Klimov, G.G.
1984 Nominativnoe i ergativnoe predlozhenia. Moscow.
Kulemzin, V.M.
1984 Celovek i priroda v verovanijakh Khantov. Tomsk.
Kulemzin, V.M.
1995 Mirovozzrencheskie aspekty ohoty i rybolovstva. In. V.I. Molodin,
N.V. Lukina, V.M. Kulemzin, E.P. Martinova, E. Schmidt, N.N. Fe-
dorova (eds.) Istoria i kultura Khantov. Tomsk. pp. 45–64.
Kulonen, U.-M.
1989 The Passive in Ob-Ugrian. Helsinki.
Lambrecht, K.
1994 Information Structure and Sentence From. Cambridge: CUP.
Li, C. (Ed.)
1977 Subject and Topic. London: Academic Press.
Lukina, N.V.
1990 Obshee i osobennoe v kulte medvedja u obskix ugrov. Obrjady naro-
dov severo-zapadnoj Sibiri. Tomsk.
Parenthetical agent-demoting in Eastern Khanty 79
Sarkany M.
1989 Female and Male in Myth and Reality. Uralic Mythology and Folk-
lore, Bp., Helsinki.
Shibatani, M.
1985 Passives and Related Constructions. Language. V-61, #4. pp. 821–
848.
Trask, R.L.
1979 On the origins of ergativity. In: Plank F. (ed), Ergativity. Towards a
theory of grammatical relations. London – New-York.
Tschernetsov, W.N.
1974 Bärenfest bei den Ob-Ugriern. Acta Ethnographica Academiae Scein-
tiarum Hungaricae, t.23 (3–4). Budapest.
Van Valin, R. and Lapolla, R.J.
1997 Syntax. Structure, meaning, and function. CUP.
Joint information value of syntactic and semantic
prominence for subsequent pronominal reference
Ralph L. Rose
1. Introduction
Many studies of discourse production and perception have observed that entities
evoked in subject position are treated somewhat differently than those evoked
in other positions when those entities are referred to subsequently. For instance,
consider the short discourse in (1).
(1) a. Luke i hit Max j
b. Then he i/#j ran home.
b’. Then #Luke/Max ran home.
While the pronoun in (1b) is ambiguous and could be interpreted as referring
to either luke or max, the preferred interpretation is luke, the subject of the
preceding sentence (cf., Hudson-D’Zmura and Tanenhaus 1997; Mathews and
Chodorow 1988). Similarly, repeated reference to luke by name as in (1b’)
is more marked than repeated reference to max by name (Gordon, Grosz, and
Gilliom 1993; Almor 1999; Almor and Eimas 2008). These observations are
from the hearer’s perspective, but even from the speaker’s perspective, sim-
ilar preferences have been observed. Brown (1983) observed that entities in-
troduced as subjects persisted longer than those introduced in other syntactic
positions: That is, there were more contiguous utterances in which the entity
was referred to again.
Many models of discourse production and processing capture these obser-
vations through two assumptions. First, the salience of entities evoked in a dis-
course determines how subsequent reference to those entities should be per-
formed or interpreted (Ariel 1988; Gundel, Hedberg, and Zacharski 1993; Gor-
don and Hendrick 1997, 1998). Second, syntactic information1 is a primary
or even the sole factor which determines salience (Grosz, Joshi, and Wein-
stein 1995; Lappin and Leass, 1994). Thus, according to this kind of model,
the first sentence in (1) introduces two entities into the discourse representa-
tion, luke and max. With respect to the syntactic prominence hierarchy shown
in (2), in this representation, luke is more salient because it was realized in
subject position while max is less salient having been realized in object po-
sition.
(2) subject > object > oblique > none
One problem with this account is that in such languages as English, syntactic
information is often conflated with semantic information. That is, syntactic sub-
jects are often semantic agents and carry more Proto-Agent entailments (e.g.,
sentience, volition; Dowty 1991), while syntactic objects are often semantic pa-
tients and carry more Proto-Patient entailments (e.g., undergo change-of-state,
causally affected). Thus, assuming a semantic prominence hierarchy as in (3)
(cf., thematic hierarchies in Dorr, Habash, and Traum 1998; Jackendoff 1972,
1990; Speas 1990), it could be the case that luke is more salient than max in
(1a) not because it is realized in subject position, but rather because it is realized
as an agent (of the hitting event).
(3) agent > patient > others
Along these lines, Stevenson et al. (2000) argues that semantic focusing is a
crucial factor helping to explain such inter-utterance coherence effects. How-
ever, Miltsakaki (2007) shows that semantic focusing alone is insufficient to
explain related phenomena in Greek and suggests that things may be more com-
plicated.
The purpose of this paper is to investigate syntactic and semantic promi-
nence. Specifically, I am seeking to answer the question, “What is the relative
contribution of syntactic prominence and semantic prominence to the salience
of entities evoked in a discourse?” I investigate this question with a corpus in-
vestigation which looks at coreference across adjacent utterances and the form
of referring expression (pronoun or description) used in subsequent reference.
The results are presented in terms of Information Theory (Shannon 1948) and
suggest that while syntactic and semantic prominence are comparably infor-
mative about the form of subsequent reference, taken together, syntactic and
semantic prominence are more informative than either is alone.
in Old High German, word order involving verb placement indicates the discourse
status of referents.
Joint value of syntactic and semantic prominence 83
In the next section, I describe the basic discourse model I assume in this
paper and then in Section 3, I describe the corpus used in this study. Section 4
contains an overview of Information Theory and particularly the concept of the
value of information. I report the results of the study in Section 5 along with
interleaved discussion.
2. Discourse model
In this paper, I assume a model of discourse processing in which the current ut-
terance is processed with respect to the context; that is, the representation of the
discourse so far (Kamp and Reyle 1993; Kehler 2002). I assume that the con-
text contains representations of the entities evoked in the discourse. Following
Karttunen (1976) and Heim (1982, 1983), I call them discourse referents (or
just referents for short). The set of referents is a partially-ordered list, the or-
der of which is determined by salience – “the degree of relative prominence of
a unit of information, at a specific point in time, compared to the other units
of information” (cf. the Introduction, this volume). A number of factors con-
tribute to salience including syntactic role and recency of linguistic expressions
in a discourse (see Hirst 1981 and Mitkov 2002 for an overview of these and
many other factors). Non-linguistic factors may also contribute to the salience
of referents including visual salience (e.g., Kelleher, this volume) and possibly
prominence within a mental simulation (e.g., Claus, this volume). However, in
this paper I will focus only on linguistic factors.
I take the highest ranking referent to be the most salient referent in the cur-
rent context. As such, if this referent is evoked in the current utterance, then
it should be done so pronominally (cf., Rule 2 of the Centering Framework of
Grosz, Joshi, and Weinstein 1995). This then is a useful metric for determining
which referents in the context are more salient than others (see Krasavina in this
volume for more extensive discussion of referential choice and how this works
in Russian).
This is the approach I use in the corpus analysis in order to examine which
referents are most salient and subsequently which syntactic and semantic fea-
tures are most informative for determining their salience. However one simpli-
fication I make is to assume that recency determines that all referents evoked
in the most recent utterance are more salient than those evoked in earlier ut-
terances. Thus, while inter-utterance coreference could conceivably span mul-
tiple utterances, the present study only considers coreference in adjacent utter-
ances.
84 Ralph L. Rose
The theoretical approach which I take in this study embodies the speaker’s
point of view in discourse processing. In other words, I am investigating what
the speaker takes as salient in the discourse and the encoding decisions made
as a result of that. However, I take salience to be a feature of discourse rep-
resentation which is ultimately used by both hearer and speaker in their re-
spective tasks. The precise way in which each uses salience may be different,
but I assume that they rely on the same core notion of salience in the process
of discourse production or perception (cf., Prince 1986; Blutner 1998, 2000;
though see Chiarcos in this volume for a detailed view of how speaker and
hearer salience may be distinguished).
3. Corpus design
3.1. Utterances
The texts were first parsed into sentence nodes, <s>, based on their appearance
in the text: word strings terminated by a period (except of course for periods
marking an abbreviation). The <s> nodes were further marked with a relatively
shallow parse based on clause relations. Each clause, <c>, contained at most
one <verb> child. The noun-phrase, <np>, and clausal arguments of a verb were
marked as siblings of the <verb>. The text shown in (4) was thus tagged as in
(5) (leaving out currently irrelevant details).
(4) John hit Matt. He told his teacher that John did so.
(5) <s>
<c>John hit Matt</c>
<punc>.</punc>
</s>
<s>
Joint value of syntactic and semantic prominence 85
3.3.1. FrameNet
Based on the Frame Semantics of Fillmore (1968, 1976), the FrameNet system
defines a large number of conceptual frames (e.g., intentionally affect, transitive
action), each of which incorporates a set of frame elements (i.e., thematic roles:
agent, patient, etc.) which participate in that frame. Each frame encompasses
a number of lexical items which invoke that frame and therefore define the
particular roles that the arguments of each item play. For instance, the verb
throw invokes the cause_motion frame and therefore takes several participants
including an agent, a theme, and a goal.
In the present study, for each <verb>, the semantic role of each <np> argu-
ment of that verb was determined by consulting the FrameNet database for the
frame which encompassed that verb and then assigning the respective frame
element labels to the <np> nodes. If a verb was not in the FrameNet database,
then the database was searched for a suitable alternative (e.g., via synonym or
hypernym relations). Thus, the sentence in (6) was tagged as in (8).
(8) <s>
<c>
<np semrole="agent">Ken</np>
<verb>threw</verb>
<np semrole="theme">the frisbee</np>
to
<np semrole="goal">Jaime</np>
</c>
<punc>.</punc>
</s>
3.3.2. Proto-roles
Dowty (1991) proposes an alternative view of the linking between lexical con-
ceptual structure and syntax through semantic entailments placed on arguments
by a verb. He posits two sets of Proto-role entailments as in (9).
(9) Proto-Agent entailments
– sentience
Joint value of syntactic and semantic prominence 87
– volition
– cause event or change-of-state
– undergo movement
Proto-Patient entailments
– undergo change-of-state
– causally affected
– incremental theme
– stationary
Under Dowty’s theory, arguments of a verb may carry any number of these en-
tailments. A selection principle then determines that the argument which carries
the most Proto-Agent entailments becomes the surface subject. The remain-
ing argument with the most Proto-Patient entailments becomes the object. Any
other arguments become obliques. It is important to notice then that under this
system, arguments may take on the Proto-Agent or Proto-Patient roles in vary-
ing degrees. With one verb, the argument realized as subject may carry all four
Proto-Agent entailments while with another verb, the argument realized as sub-
ject may carry only one or two. Furthermore, some crossover between the roles
is possible: An argument realized as a subject may carry some Proto-Patient en-
tailments while an argument realized as an object may carry some Proto-Agent
entailments.
In the corpus, Proto-role entailments for every <np> argument were marked.
The entailments associated with any particular verb were determined using a
series of linguistic tests described in Rose (2005). Thus, (6) was marked as
in (10).
(10) <s>
<c>
<np sentience="yes" volition="yes" stationary="yes">
Ken</np>
<verb>threw</verb>
<np movement="yes">the frisbee</np>
to
<np stationary="yes">Jaime</np>
</c>
<punc>.</punc>
</s>
Coreference was determined as cases where (in the coder’s opinion) the author
intended for two noun phrases to have the same extensional meaning (and in-
tended the reader to make the same interpretation). Anaphoric dependence was
not a sufficient cause to determine coreference. Hence, in Although John saw
the students raise their hands, his remained down, although his is anaphorically
dependent on their hands, these phrases were not taken as coreferent because
Joint value of syntactic and semantic prominence 89
4. Information Theory
The corpus analysis which follows makes use of one fundamental concept in
Information Theory (Shannon 1948): the value of information (hereafter, EIV).
EIV is based on the entropy, H – an estimate of the uncertainty of the outcome –
of a given probability space. H for a probability space with N possible outcomes
can be calculated as shown in (13) where P(n) is the probability of the n-th
outcome.
N
(13) − ∑ P(n) · log2 P(n)
n=1
For a given question in which all possible outcomes are equally likely (e.g.,
the flip of a fair coin), the entropy is very high. However, if we learn some
information, x, that causes one outcome to be far more likely to occur, then our
uncertainty will decrease: H will be reduced. The amount of entropy reduction
as a result of learning x, Hr (x), is thus calculated as the difference between the
initial entropy, H, and the conditional entropy H (x) (i.e., H given x).2
To illustrate, consider the following problem: If I open a novel to a random
page and point to a random letter on the page, what is the probability, P, that the
letter is “u”? Without any other information, P is simply the prior probability of
2. The value amounting to the reduction in entropy has also been referred as entropy
value (van Rooy 2004).
90 Ralph L. Rose
the occurrence of “u” in the language as a whole. Using this prior probability we
could calculate the entropy, H , of the problem. However, imagine we learn that
the preceding letter is “q”. Then we can be much more certain that the letter in
question is “u”. Thus, the conditional entropy, H (“q”), will be less – a reduction
in entropy.
Entropy reduction may be either positive or negative: learning that x is true
may make us more certain while learning that x is false may make us less certain
about some outcome. It is therefore useful to calculate the value of learning
whether or not x is true. In other words, it is useful to know what the overall
value of asking the question of whether x is true or false is. In Information
Theory, this value is estimated as the weighted sum of the entropy reductions
for all possible outcomes of x (here, true or false). This value is known as the
estimated information value, EIV. Formally, the EIV of learning whether or
not x is calculated using the formula shown in (14), where P(x) is the prior
probability of the occurrence of x.
A good illustration of information value comes from the game “Who am I?”
in which one person pretends to be some famous person and others must ask
yes/no questions to find the identity of the person. In this scenario, what is an
informative (i.e., having a large information value) first question assuming that
there is no bias in the choice of famous person? One candidate would be Are you
a male/female? In this case, both terms in the sum of (14) will be at a maximum
and thus EIV will be large. However, a question like Are you Albert Einstein?
will be much less informative: While the entropy reduction if the answer is yes,
Hr (x), is large, the probability the answer is yes, P(x), is very small. If the
answer is no then the converse is true. Thus both terms in the sum of (14) will
be small and EIV will be small. Of course, if after several questions we have
learned that the mystery person is male, is a scientist, lived in the 20th century,
and won a Nobel Prize, then the EIV would be much larger.
In the present study, I am investigating the information value of syntactic
and semantic prominence toward determining the salience of discourse refer-
ents. This is done by asking, for example, the following question: What is the
information value of learning whether or not a particular discourse referent was
a subject to the probability of its being pronominalized in subsequent refer-
ence? This information value, EIV(subject), can be calculated using the formu-
las above. Likewise, the information values for the other syntactic and semantic
features can be calculated. Finally, I will calculate the net information value,
EIVtot , for syntactic prominence as the total of the EIVs for the various syntac-
Joint value of syntactic and semantic prominence 91
tic features (i.e., EIV(subject), EIV(object), etc.). Similarly, I will calculate the
EIVtot for semantic prominence as the total of the EIVs for the various seman-
tic features. Therefore, the central question becomes whether either information
about syntactic prominence or semantic prominence is more informative (i.e.,
larger EIVtot ) than the other or if they are equally informative. A second ques-
tion is whether syntactic and semantic information together is more informative
than either is alone. These two questions are formally summarized in (15)–(16).
(15) Is the syntactic prominence EIVtot greater than, equal to, or less than
the semantic prominence EIVtot ?
(16) Is the joint syntactic and semantic prominence EIVtot greater than ei-
ther the syntactic or semantic prominence EIVtot ?
With respect to (15), if results show that syntactic and semantic prominence
are equally informative, then another question may be posed: Are syntactic and
semantic prominence redundant with each other or are they at least somewhat
independent but equally informative? An answer to this question may be found
by looking at the answer to (16). If the joint information value is higher than ei-
ther is alone, then they cannot be redundant and must therefore be independent.
In the corpus there are 291 cases of inter-utterance coreference. In 224 (77%)
of these coreference cases, the coreferent noun phrase in the latter utterance is
pronominalized. Thus, the entropy of pronominalization is calculated as shown
in (17) where P(pro) is the probability of pronominalization.
(17) H = −[P(pro) · log2 P(pro) + P(¬ pro) · log2 P(¬ pro)]
H = −[224/291 · log2 (224/291) + 67/29] · log2 (67/291)]
H = 0.778
This value serves as the baseline for entropy reduction: How much is entropy
reduced from H = 0.778 by learning some information about syntactic or se-
mantic prominence? In this section, I will present these results along with some
interleaved discussion. However, before presenting the results, it is necessary
to deal with one complication. The referents in the current context may have
been realized in multiple syntactic positions and semantic roles. For instance,
in (18), as a verbal argument, john has been realized as a subject and an object,
an experiencer and a recipient, and carries the entailments sentience, volition,
and stationary.
92 Ralph L. Rose
(18) <s>
<c>
<np id="JOHN" synrole="subject" semrole="experiencer"
sentience="yes" volition="yes">
John</np>
<verb>wants</verb>
<c>
<np>his father</np>
to
<verb>give</verb>
<np id="JOHN" synrole="object" semrole="recipient"
stationary="yes">
him</np>
<np>a bicycle</np>
</c>
</c>
<punc>.</punc>
</s>
x EIV(x)
subject 0.059
object 0.021
oblique 0.010
none 0.011
EIVtot 0.101
concept of the value of information may provide a useful method for quantify-
ing these theories.
It should be noted here that there is some evidence (Miltsaki 2003) that ref-
erents introduced in main clauses are more salient than those introduced in sub-
ordinate clauses regardless of grammatical role. In the present study, the syntac-
tic information was analyzed with respect to a hierarchical approach that distin-
guishes the prominence of referents according to level of embedding. However,
the information value of syntactic prominence under hierarchical marking was
only EIVtot = 0.060. Thus, for expository reasons, the details of this approach
were excluded from the present paper, but can be seen in Rose (2005).
Three results are notable. First, it is interesting that the perception roles in
group 2 are more informative than the agentive roles in group 1, in spite of
the fact that agentive roles are usually posited to be highest on many thematic
hierarchies. This suggests that sentience is more important to the salience of en-
tities evoked in a discourse than agentivity. This would seem to parallel other
results showing the importance of animacy to the salience of discourse entities
(Prat-Sala and Branigan 1999).
The second interesting result is that the total information value of semantic
prominence under FrameNet is equal to that of syntactic prominence. I will
discuss the implications of this below. Before that, I present the results for the
other semantic prominence approach used in this study.
The third result that requires some discussion is the moderately high EIV for
the group 7 roles. As noted above, these are roles that fall outside the scope of
the FrameNet system. A cursory examination of these cases in the corpus shows
that many are instances where a particular referent is being retained across a
sequence of utterances (in a manner not unlike the retain transition in Center-
Joint value of syntactic and semantic prominence 95
ing Theory of Grosz, Joshi, and Weinstein 1995) by being placed in a genitive
construction (e.g., John likes apples. His mother does too. So, he bought her a
whole bushel.). While interesting, these cases are not particularly relevant here
to the question of how semantic information influences salience in terms of the
FrameNet system and are therefore not discussed further.
5.2.2. Proto-roles
A particular discourse referent may carry more than one Proto-role entailment.
In order to avoid the overlap problems that this generates in the present anal-
ysis, I use a simple transformation. For each referent, I calculate a parameter
I call Proto-Agency as the total number of (unique) Proto-Agent entailments on
that referent minus the total number of Proto-Patient entailments. Thus, Proto-
Agency ranges in integer values from +4 to −4 (although in this corpus, there
were no instances of −4). For example, a referent which carries, say, three
Proto-Agent entailments and one Proto-Patient entailment within a single ut-
terance would be regarded as having a Proto-Agency value of 3 − 1 = 2. In
short, then, Proto-Agency might be regarded as a measure of how agent-like
(in Dowtian terms) a particular referent is entailed to be: The higher the value,
the more agent-like the referent is. Under this transformation, the results are as
shown in Table 3.
Table 3. Information Value of Semantic Prominence via Proto-role Entailments
Proto-Agency EIV(Proto-Agency)
+4 0.000
+3 0.001
+2 0.045
+1 0.005
0 0.045
−1 0.000
−2 0.001
−3 0.002
−4 ***
EIVtot 0.098
Results here show that semantic prominence with respect to Proto-roles is com-
parably informative to semantic prominence with respect to FrameNet as well
as to syntactic prominence.
96 Ralph L. Rose
EIVtot
syntactic role × FrameNet group 0.165
syntactic role × Proto-Agency 0.141
The joint information value of syntactic and semantic prominence is higher than
that of either factor alone. Thus, the results suggest that syntactic and semantic
prominence are not redundant with each other and that each provides at least
some unique information with respect to the pronominalization of subsequent
reference. This conclusion should be regarded as tentative, however, because
the differences noted above are not statistically confirmed. One attempt to eval-
uate the strength of these findings used a boot-strapping procedure in which the
original sample of inter-utterance coreference instances was resampled with re-
placement 10,000 times. Under this procedure, the differences between the joint
EIVs and the individual EIVs were not shown to be significant. However, this
procedure is suspect because measures of skewness and kurtosis show that the
bootstrap distribution is non-normal. It is clear that a larger data set will be
required to confirm this conclusion.
Joint value of syntactic and semantic prominence 97
One interesting result here, though, is the fact that the Proto-role informa-
tion is not quite as informative as the FrameNet group information when taken
together with syntactic role. This, however, could be a by-product of the trans-
formation on the Proto-role information described in Section 5.2.2. This trans-
formation is a mathematical convenience and glosses over semantic distinctions
between the various entailments. Perhaps a more sophisticated transformation
would result in a greater information value.
6. General Discussion
Under the discourse model presented above, the results presented in this cor-
pus analysis suggest that the salience of referents in a discourse is influenced
by both syntactic and semantic information: Taking both into account results
in greater predictive ability for the form of subsequent reference. These results
are thus in line with a view of discourse processing in which salience represents
information about discourse structure: the more salient a referent is in the cur-
rent context, the greater the information value about the structure of subsequent
discourse, particularly the form of referring expressions. Information Theory
thus potentially offers another view of the relative value of the different factors
known to affect discourse salience and may provide another means by which to
narrow down on which factors are most crucial.
The idea that syntactic and semantic information seem to be at least partly
independent in their influence on salience suggest that models of discourse
salience may benefit by including some account of semantic information as
distinct from syntactic information. This is especially relevant to modular ap-
proaches in which one module is responsible for structure while an indepen-
dent module is responsible for interpretation. The results here may be rele-
vant for determining how these modules interact for the purpose of determining
salience.
The improvement in the joint information value suggests that computational
implementations of discourse salience models (e.g., parameterized systems,
such as the Mental Salience Framework; Chiarcos, this volume) might see some
performance improvement by the inclusion of semantic prominence informa-
tion. If the assumption that salience is a core notion common to both speaker
and hearer is correct, then the present results would indicate that pronoun reso-
lution algorithms might also benefit from the inclusion of semantic prominence
as a contributing factor.
In this study, two different semantic systems were employed to evaluate
semantic prominence. The joint information values suggest that the FrameNet
98 Ralph L. Rose
system may be more informative than the Proto-role system. However, as noted
above, this difference may not be real. If it is real, then an interesting line of
future investigation would be to look more closely at the relationship between
salience and the notion of primitive semantic roles as assumed in Frame Seman-
tics. On the other hand, if the difference between the two systems turns out not to
be real, then there is a practical conclusion to make: Technologically speaking,
the Proto-role system is less cumbersome than the vast network of frames and
roles in FrameNet and therefore may be more efficient in the implementation
of mechanisms for discourse processing and salience.
Throughout this study, I have compared the semantic prominence hierar-
chy to thematic hierarchies used in syntactic-semantic linking theories. While
it would be very interesting if it were to turn out that these hierarchies are paral-
lel, there is no theoretical reason to presume that this is so. These two different
research areas use these hierarchies for completely different reasons so it would
not be problematic for the present study if the semantic prominence hierarchy
is different. In fact, the evidence from corpus analysis suggests that this might
be the case: In particular, it seems that roles entailing sentience (e.g., cognizer,
experiencer) are higher on the hierarchy than roles entailing agency. Hence,
validation of the semantic prominence hierarchy is required.3
The results presented in this corpus analysis are perhaps somewhat pre-
mature to conclude firmly that syntactic and semantic information contribute
independently to salience (though see Rose 2005, 2006 for converging evi-
dence from a series of psycholinguistic experiments using different experimen-
tal paradigms). One possibility that needs to be investigated is whether or not
semantic prominence has a broad effect across the whole range of linguistic
items or whether only certain verbs (or certain classes of verbs) influence the
salience of their arguments via semantic information.
Another angle for further investigation of the role of syntactic and seman-
tic prominence along the lines presented here might include looking at different
languages. In English – the language used in this study – syntactic and semantic
role is often conflated as noted early in this paper. However, in languages where
word order is more free such as Spanish or Japanese, the distinction between
syntactic and semantic prominence may be easier to observe.4 Such work may
provide a clearer view of the degree to which syntactic and semantic promi-
nence each determine the salience of discourse referents.
7. Conclusion
In this paper, I have sought to compare the relative information value of syn-
tactic and semantic prominence to the salience of discourse referents. Results
show that both contribute to salience. On its face, the fact that semantic promi-
nence contributes to salience is perhaps unsurprising. However, this research
has also shown ways that semantic prominence can be computed via traditional
thematic role labels or via Dowtian Proto-role entailment configurations. The
results further suggest how semantic prominence information may be combined
with syntactic prominence information to yield more informative measures of
salience.
Acknowledgments
The work presented here is based on my Ph.D. dissertation. I am indebted to
Stefan Kaufmann, my adviser, and Michael Dickey for discussions about the
theoretical background and data analysis. I am also grateful to two anonymous
reviewers for helpful comments on earlier drafts of this paper.
References
Almor, Amit
1999 Noun-phrase anaphora and focus: The informational load hypothesis.
Psychological Review 106: 748–765.
Almor, Amit and Eimas, Peter
2008 Focus and noun phrase anaphors in spoken language comprehension.
Language and Cognitive Processes, 23: 201—225.
Ariel, Mira
1988 Referring and accessibility. Journal of Linguistics 24: 65–87.
Baker, Collin, Fillmore, Charles, and Lowe, John
1998 The Berkeley FrameNet project. Proceedings of the 17th International
Conference on Computational Linguistics, 86–90.
Blutner, Reinhard
1998 Lexical pragmatics. Journal of Semantics 15: 115–162.
Blutner, Reinhard
2000 Some aspects of optimality in natural language interpretation. Journal
of Semantics 17: 189–216.
Brown, Cheryl
1983 Topic continuity in written English narrative. In: Givón, Talmy
(ed.), Topic Continuity in Discourse: A Quantitative Cross-Language
Study, 313–342. Amsterdam: John Benjamins.
Dorr, Bonnie, Habash, Nizar, and Traum, David
1998 A thematic hierarchy for efficient generation from lexical-conceptual
structure. Proceedings of the Third Conference of the Association for
Machine Translation in the Americas, 333–343.
Dowty, David
1991 Thematic proto-roles and argument selection. Language, 67: 547–
619.
Fillmore, Charles
1968 The case for case. In: Bach, Emmon and Harms, Robert (eds.), Uni-
versals in Linguistic Theory, 1–90. New York: Holt, Rhinehart and
Winston.
Fillmore, Charles
1976 Frame semantics and the nature of language. Annals of the New York
Academy of Sciences: Conference on the Origin and Development of
Language and Speech, 280: 20–32.
Gernsbacher, Morton Ann and Hargreaves, David
1988 Accessing sentence participants: The advantage of first mention. Jour-
nal of Memory and Language, 27: 699–717.
Gordon, Peter, Grosz, Barbara, and Gilliom, Laura
1993 Pronouns, names, and the centering of attention in discourse. Cogni-
tive Science, 17: 311–347.
Gordon, Peter and Hendrick, R.
1997 Intuitive knowledge of linguistic co-reference. Cognition, 62: 325–
370.
Gordon, Peter and Hendrick, R.
1998 The representation and processing of coreference in discourse. Cog-
nitive Science, 22: 389–424.
Grosz, Barbara, Joshi, Aravind, and Weinstein, Scott
1995 Centering: A framework for modeling the local coherence of dis-
course. Computational Linguistics, 21: 203–225.
Joint value of syntactic and semantic prominence 101
Gundel, Jeanette
1974 The role of topic and comment in linguistic theory. Ph.D. dissertation,
University of Texas, Austin.
Gundel, Jeanette, Hedberg, Nancy, and Zacharski, Ron
1993 Cognitive status and the form of referring expressions. Language, 69:
274–307.
Heim, Irene
1982 The semantics of definite and indefinite noun phrases. Ph.D. disserta-
tion, University of Massachusetts, Amherst.
Heim, Irene
1983 File change semantics and the familiarity theory of definiteness. In:
Bäuerle, Rainer, Schwarze, Christoph and von Stechow, Arnim (eds.),
Meaning, Use, and Interpretation of Language, 164–189. Berlin: W.
DeGruyter.
Hirst, Graeme
1981 Anaphora in Natural Language Understanding: A Survey. Berlin:
Springer-Verlag.
Hudson-D’Zmura, Susan and Tanenhaus, Michael
1997 Assigning antecedents to ambiguous pronouns: The role of the center
of attention as the default assignment. In: Walker, Marilyn, Joshi, Ar-
avind, and Prince, Ellen (eds.), Centering Theory in Discourse, 199–
226. Oxford, UK: Clarendon Press.
Jackendoff, Ray
1972 Semantic Interpretation in Generative Grammar. Cambridge, Mas-
sachusets: MIT Press.
Jackendoff, Ray
1990 Semantic Structures. Cambridge, Massachusetts: MIT Press.
Kamp, Hans and Reyle, Uwe
1993 From Discourse to Logic. Dordrecht: Kluwer Academic.
Karttunen, Lauri
1976 Discourse referents. In: McCawley, James, (ed.), Syntax and Seman-
tics, Vol. 7: Notes from the Linguistic Underground, 363–385. New
York: Academic Press.
Kehler, Andrew
2002 Coherence, Reference, and the Theory of Grammar. Stanford Univer-
sity, CA: CSLI Publications.
Lappin, Shalom and Leass, Herbert
1994 An algorithm for pronominal anaphora resolution. Computational
Linguistics, 20: 535–561.
102 Ralph L. Rose
Christian Chiarcos
It had been noticed very early that the choice among syntactically well-formed
expressions is by no means determined by semantic constraints alone. Consider
the following classical text (Grosz et al. 1995, ex. (5), shortened) with possible
truth-semantically equivalent variations as illustrated in (4’), (1’) and (5’):
(1) Terryte really goofs sometimes.
(2) Hete wanted Tonyto to join him on a sailing expedition.
(3) Hete called himto at 6AM.
(4) Tonyto was sick and furious at being woken up so early.
(5) Heto told Terryte to get lost and hung up.
(6) Of course, Terryte hadn’t intended to upset Tonyto .
106 Christian Chiarcos
2. Salience in discourse
1. The original definition as used by Koch and Itti is based on visual fields of neurons,
i.e. on areas within a scene, not entities. Though it seems that a similar dynamic
notion of salience is appropriate on a higher level of abstraction as well, researchers
on the interface of visual and linguistic salience modeled it in terms of size and ab-
solute position (Kelleher, this volume), denying the possibility of shifts of attention
at all. However, it seems to be generally accepted that this static notion of salience
is a heuristic approximation only, a generalization over a longer period of time de-
scribing the likelihood of an area to be salient.
108 Christian Chiarcos
2.2. Phenomenology
Here, I focus on three phenomena: Choice of referring expressions (REF), as-
signment of grammatical roles (GR) and word order effects (WO).
Salience is defined in terms of givenness or accessibility, it has been fre-
quently remarked that the more salient a discourse referent is, the less com-
plex, the less semantically rich and the less emphasized referring expressions
110 Christian Chiarcos
are expected. Especially, pronouns are expected to denote more salient referents
than full descriptions. Similarly, salience rankings of grammatical roles (Grosz
et al., 1995; Givón, 2001) follow a conventional hierarchy of markedness with
subject (nominative case) being more frequent and less phonologically com-
plex than direct object, etc. Following Givón (1995, p.25-69), markedness is
defined in terms of complexity and non-conventionality (inverse frequency).
So, for REF and GR, an iconic mapping can be assumed correlating surface
or empirical measures of markedness with the underlying degree of relative
salience:
markedness − +
REF pronoun full description
WO left-peripheral right-peripheral
GR subject non-subject
salience + −
Figure 1. Simplified markedness hierarchies and salience.
The notion of salience as defined here involves two different temporal or partici-
pant perspectives, and indeed, similar claims on multi-dimensionality have been
made for the levels of referring expressions (cf. deviations from iconic map-
ping; Ariel, 1990, p. 191ff.) and grammatical roles (cf. the aspects of “discourse
prominence”: indicating a horizon and fore-/backgrounding; Pustet, 1997). With
respect to word-order preferences, multiple factors have been considered related
to both dimensions of salience. On the one hand, it is claimed that given ele-
The Mental Salience Framework 111
ments tend to precede new elements (Sgall et al., 1986), on the other one, it has
been shown that – at least for German – this tendency is not absolute (Weber
and Müller, 2004). Instead, Strube and Hahn (1999) suggested that relative or-
dering has an effect on the accessibility in forthcoming discourse, is it thus a
device of foregrounding similar to grammatical roles.
3. Adequacy
1
ssal LRCT (r) = (3)
1 + RDana (r) + (1 − REF ana (r))
As demanded in the preceding section, the outcome is normalized so that 1 de-
notes the highest possible salience score. If no antecedent (anaphor) exists,
RDante (RDana ) is infinite, thus hsal LRCT (ssal LRCT ) converges against 0. For
Canonical Centering, the locality constraint can be implemented by replacing
RDante with 1/1/RDante .
As an alternative to the canonical salience ranking, in Functional Center-
ing (FCT, Strube and Hahn, 1999), the ordering of potential backward-looking
centers is replaced by a ranking based on information status, embedding depth
and relative word-order. Following Rambow’s (1993) account on Centering and
word order in German, I concentrate on word order as a determinant of the rank-
ing of forward-looking centers (WO ante ), further abbreviated as WOCT.3
1
hsal WO (r) = (4)
1 + RDante (r) + WOante (r)
Consider an utterance U n and a referring expression r with antecedent q in a preceding utterance U k and
anaphor s in a subsequent utterance U l (k < n < l).
properties of antecedent properties of anaphor(s)
∞ iff. r has no antecedent
RDante (r) =
n − k − 1 else
⎧ ∞ iff. no anaphor to r exists
⎪ 0 iff. r has no antecedent RDana (r) =
⎪
⎪ l − n − 1 else
⎪
⎪
⎪
⎨1 iff. q is subject ⎧
⎪
⎨0 iff. r has no anaphor
GRante (r) = 0.9 iff. q is direct object
⎪
⎪ REF ana (r) = 1.0 iff. s is pronominal
⎪
⎪0.8 iff. q is indirect object ⎪
⎪
⎪ ⎩
⎩ 0.5 else (i.e. s is a full description)
0.7 else
⎧ #mentions of r within the next 20 utterances
⎨0 iff. no antecedent TP(r) =
20
WOante (r) = #words in U k before q
⎩ else
#words in U k − #words in q
Considering Terry (te), we find that RDante (te) = 1 (his last mention was
in sentence (3)) and GRante (te) = 1.0 (subject in (3)). Inserting these values in
equation (1) using the parameter weights for hearer salience (hsal) in LRCT
reconstruction as summarized in Fig. 4, we achieve a formula identical to equa-
tion (2). Thus, hsal LRCT (te) can be calculated as 0.5. As the proposed pronomi-
nalization threshold is not met, we predict a nominal description. Accordingly,
hsal CCT (te) converges against 0, thus the coding preferences in Canonical Cen-
tering would be identical. The antecedent of Terry is sentence-initial in (3), so
hsal WO (te) is 0.5, too. In the topicality reconstruction, where referential dis-
tance is the only parameter of hearer salience, the same prediction is calculated,
too.
The corresponding parameter values for Tony (to) in (5) are: RDante (to) = 0
(last mention in (4)), GRante (to) = 1.0 (subject) and WOante (to) = 0 (sentence-
initial). So, hearer salience of Tony is calculated as 1 for LRCT, CCT, WOCT
and TOP equally. This exceeds the respective pronominalization threshold. As
the only possible interfering referent Terry has a sufficiently lower degree of
hearer salience, no restrictions arise from ambiguity avoidance strategies. So,
we can safely refer to Tony with a pronoun, just as in Grosz et al.’s original
example.
For speaker salience (ssal), we find anaphoric references to Terry and Tony
are in the directly following utterance (RDana = 0), both with full descriptions
(REF ana = 0.5), but only once in the forthcoming discourse (TP = 1/20). Thus,
speaker salience is identical for both Terry and Tony, in 1 in Centering recon-
structions 1 and 2039 in topicality reconstruction.
As grammatical roles resp. word order preferences are determined in Cen-
tering reconstruction by relative differences between speaker salience scores,
no preferences for GR or WO can be derived here. The same is true for WO
preferences in the topicality reconstruction. However, GR preferences in the
topicality reconstruction are calculated from the interaction (e.g. addition) of
hearer and speaker salience scores, but not from speaker salience alone. There-
fore, Tony’s score (hsal TOP (to) = 1) exceeds Terry’s (hsal TOP (te) = 0.5), and
we predict Tony to be preferred subject and Terry to be non-subject. In fact, the
opposite decision was taken in Grosz et al.’s constructed example. However,
this is very likely to be due to constraints from verbal semantics, as a more
agentive realization of Tony in a sentence semantically roughly equivalent to
ex. (6) would be rather odd (cf. ex. (6’)).
(6’) #Of course, Tonyto has not been intended to get upset by Terryte .
118 Christian Chiarcos
Figure 5 summarizes preferences for the whole text is summarized. Besides the
effects of a heuristic ambiguity resolution rule4 and partial indistinguishability,
few crucial deviations from the original coding decision have been found. Here,
(3) seems to be the most critical instance, where actual word order and gram-
matical role assignment deviate from both Centering and topicality preferences.
However, the interpretation of the pronouns in (3) depends on parallelism with
the previous utterance. A sentence like Heto was called (by himte ) at 6 AM. is
nearly incomprehensible. It would be necessary to use a nominal such as the
name for either Tony or both referents (as suggested by the theories).
Figure 5. Example: Parameter values, hearer salience (hsal), speaker salience (ssal) and
coding preferences.
4. In Fig. 5, full* means to use a non-pronominal form to avoid ambiguity if the abso-
lute salience score is sufficient for pronominalization, subject* and unspec indicate
that Tony and Terry are ranked equally, deviations from original are marked bold.
The Mental Salience Framework 119
indicate the ranking of forward-looking centers, and thus, it explains only the effect
of these grammatical devices, but not their assignment in discourse. Nevertheless, the
preference to keep the backward-looking center over a sequence of utterances (cf.
the notion of “preferred center”; Strube and Hahn, 1999) can be exploited to predict
the assignment of grammatical roles (Kibble and Power, 2004). It should be noted,
however, that these preference are deducted from preferences of transitions between
utterances only within the same discourse segment (Grosz and Sidner, 1986), and
that it is not clear to what degree these preferences extend towards the discourse as
a whole.
Similarly, Givón’s approach involves a differentiation between anaphoric and
cataphoric aspects of topicality, but he does not describe how these both dimensions
are interacting in the deduction of concrete coding decisions.
Therefore, like any practical application of a theoretical construct, a reconstruc-
tion within a formal framework relies on an interpretation which is maximally pre-
dictive in order to achieve concrete predictions, and thus, researchers are usually not
interested in equivalent reconstructions, but in reconstructions which involve a gain
in predictive power. However, such a reconstruction cannot be equivalent as it sys-
tematically violates the soundness criterion.
As an example, Beaver’s (2004) equivalence proof between Centering (as for-
mulated by Brennan et al. (1987)) and his reconstruction of Centering in Optimality
Theory represents in fact a proof of adequacy, as he claims that the reconstruction
Centering entails additional predictions that were not entailed from the original for-
mulation: “This declarativity means that COT is equally suited for generation or in-
terpretation. In contrast, the BFP algorithm is suited for interpretation only. It could
not be used to generate texts directly …”. As an alternative to partial equivalence
proofs as provided by Bearer, I suggest to distangle equivalence and adequacy and
focus on the adequacy between original formulation and the reconstruction rather
than on equivalence.
The Mental Salience Framework 121
constructions within the Mental Salience Framework are adequate, and thus, the
framework is capable to allow the reconstruction of two classical approaches.
5. Related research
separated from the assumptions that the speaker has about attentional states of
the hearer.
In this theoretical minimalism, the Mental Salience Framework shares a cer-
tain resemblance with Optimality Theory, which can also be viewed as a formal
apparatus within which existing theories such as Centering (Beaver, 2004) can
be reconstructed.9
Optimality Theory relies on the observation that grammars contain con-
straints on the well-formedness of linguistic structures, and often, these con-
straints are in conflict. The rapid and systematic resolution of such conflicts,
however, entails that constraints are not equal in their violability, and thus, the
existence of a ranking. According to OT, constraints are components of the uni-
versal grammar, and language-specific grammars are instantiations of the UG
in that they represent different possible rankings of universal constraints.
Formally, constraints in OT are conditions on the relationship between an
underlying form, or input, and a set of possible surface candidates, i.e. possible
output. For the generation of referring expressions, the input is an underspec-
ified logical form of an utterance, the output is a candidate utterance. The op-
timal candidate output is selected based on the ranking of violated constraints.
Given two candidate forms A and B, A is more optimal than B if the highest-
ranking constraint which is violated by B is not violated by A, and no violations
of higher-ranked constraints occur for A.
In his Centering in OT (COT), Beaver proposes a set of constraints which
capture the main ideas of Centering following Brennan et al. (1987, BFP):
– pro-top The topic is pronominalized. (Rule 1)
– cohere The topic of the current sentence is the topic of the previous one.
(dis-preference of shifts, Rule 2)
– align The topic is in subject position. (dis-preference of shifts, Rule 2)
Further, Beaver provides a constraint-based definition of the backward-looking
center (“topic”):
– one-sentence-window Only discourse entities mentioned in the previous
sentence are salient. (salience definition)
– arg-salience One discourse entity is more salient than another if the first
was referred to in a less oblique argument position than the second in the
same sentence. (salience definition)
9. The concrete claims by Optimality Theory are more rigid, but only concern the na-
ture of constraints as a component of Universal Grammar.
128 Christian Chiarcos
the formulation “if there are pronouns” involves a great abstraction, in that
is assumes that pronominalization is triggered only by salience (and agree-
ment filters). As noted in the sketch of the adequacy proof above, this assump-
tion predicts the same results as the original Centering rule only if the defini-
tion of agreement filters may extend beyond strict morpho-syntactic congru-
ency.
Further, aside from the critical cases identified above, we can construct an
example in which pro-top and Centering make different predictions about pro-
nominalization:
Marym watched how Sues crossed the street over to Harry’sh house.
(Mary, Sue > Harry)
(7) Shem/s wondered about the low traffic today.
(8) He/Harryh did not realize herm/s .
(9) Heh did not realize Marym /Sues .
(10) Harryh did not realize Marym /Sues .
The examples (7) to (10) are possible continuations of the first sentence. The
well-formedness of example (9) for both interpretations illustrates that Mary
and Sue are equally possible antecedents of a pronoun in the following sen-
tence, thus, a feminine pronoun would be ambiguous between Mary and Sue.
Therefore, (7) is fully ambiguous between both readings, and thus from cooper-
ativity considerations, we may conclude that it is not a feasible candidate output.
As a consequence, only (9) and (10) are to be considered by COT respectively
Centering. However, Harry is clearly more oblique than Mary and Sue in the
first utterance, and thus, it cannot be the backward-looking center. Therefore,
(9) violates both Rule 1 and pro-top. However, (10) violates pro-top, but does
not violate Rule 1. Therefore, if (8) is not available for reasons of ambiguity,
the Centering-optimal output is (10), whereas COT is indistinctive between (9)
and (10).
At this point, I would like to emphasize that because of the unconditional for-
mulation of pro-top in COT and the existence of the fam-def constraint, COT
makes predictions beyond the original Centering, and thus, must be deemed ad-
equate with respect to Centering, rather than equivalent. The equivalence proof
provided by Beaver is concerned with a subset of critical cases and only with
the differentiation between pronouns and the use of proper names. It is possi-
130 Christian Chiarcos
ble to construct a critical example in which Centering and COT make different
pronominalization predictions.10
Thus, Centering in OT and the reconstruction of Centering within the Mental
Salience Framework are comparable with respect to their adequacy (“equiva-
lence”).
However, the theoretical implications of an OT modeling cannot be underes-
timated. Essentially, all possible constraints must be part of the universal gram-
mar. Postulating a constraint like fam-def entails the assumption that definite
NPs form a universal syntactic category, which is clearly contradicted by the
existence of languages which have no explicit definiteness markers. Further, the
OT reconstruction of Centering, like the original formulation of Centering, are
inherently symbolic, categorial accounts, which are capable to predict a finite
and fixed set of possible categories of referring expressions. One of the central
criticisms of categorial accounts of givenness brought forward by Mira Ariel
(1990; 2001) states that the number of grammatical devices distinguished in
a specific language, is theoretically unlimited, and if all relevant distinctions
among referring expressions are to be captured in an extension of COT similar
to the familiarity criterion of definite NPs by the postulation of the correspond-
ing constraints, the formulation of these categories and their salience characteri-
zation in OT also entails that these categories are also present in universal gram-
mar, which is probably misleading. As opposed to this, in the Mental Salience
Framework, the number of possible referring expressions is not a priori limited,
but can be justified in terms of their salience characterization.
In the OT and in the anaphor resolution communities, further instantiations
of Centering in OT have been developed (Buchwald et al., 2002; Bouma, 2003;
Byron and Gegg-Harrison, 2004; Hardt, 2004). From these, the conceptual
motivations underlying the Recoverability Optimality Theory (ROT) model
(Buchwald et al., 2002) are very closely related to underlying insights of the
Mental Salience Framework. Both share a production perspective which leads
to the assumption of two discourse models, the model of the speaker’s private
intentions and the discourse model, the salience list or “common ground”. Both
share the assumption that cues from the following discourse must be considered
in order to generate referring expressions in a proper way. And, as well as the
Mental Salience Framework, ROT is a parameterized framework in the sense
that the set of constraints considered is subject to possible extensions. Indeed,
the Mental Salience Framework could be applied for the ranking of the current
and the following salience list, and thus serve as a complement to ROT with
respect to the concrete model of salience which is left unspecified so far.
Nevertheless, the Mental Salience Framework is less constrained in its the-
oretical implications and in its adaptive character. Especially, it supports lan-
guage-specific categories of referring expressions whose treatment in Optimal-
ity Theory is uncertain. In the best case, the integration of additional categories
of referring expressions only requires to associate them with certain salience
scores. Accordingly, the Mental Salience Framework is more oriented towards
a broad-scale practical application.
cludes an explicit model of attentional states of the speaker, and thus, it is more
specialized for the needs of Natural Language Generation.
Some details of Kibble’s approach remain abstract, and the adequacy of this ap-
proach has not been proven so far. However, with the exception of the reference
resolver which has no direct parallel in the Mental Salience Framework, its con-
cepts can be interpreted in terms of Kibble’s Game-theoretic framework. Hearer
salience is clearly a part of the discourse modeler, though a fully specified dis-
course model involves additional aspects beyond the modeling of attentional
states of the hearer. The strategies enumerated in the planner/content determi-
nation module are partly concerned with the assignment of grammatical roles.
134 Christian Chiarcos
Strategy (a) is concerned with perceptual salience only, but is roughly parallel to
the word order and grammatical role-strategies specified for speaker salience in
TOP. Strategies (b) and (c) involve an “alignment of salience rankings” with
utterances from the following discourse, and seemingly, this corresponds to
the extrapolation of speaker salience from coding decisions in the following
discourse according to the Centering reconstructions. Finally, the realizer cov-
ers the determination of coding preferences (from the linear combination of
salience scores) and their application.
Hence, the conceptions of the Mental Salience Framework seem to be closely
related to Kibble’s Game-theoretic reconstruction (“elimination”) of Centering,
and it might be regarded a more concrete framework for formulation of the
strategies suggested by Kibble.
References
Mira Ariel
2001 Accessibility theory: An overview. In T. Sanders, J. Schilperoord, and
W. Spooren, editors, Text Representation. Linguistic and psycholin-
guistic aspects, volume 8 of Human Cognitive Processing, pages 29–
87. John Benjamins, Amsterdam, Philadelphia.
Mira Ariel
1994 Interpreting anaphoric expressions: A cognitive versus a pragmatic
approach. Journal of Linguistics, 30:3–42.
David I. Beaver
2004 The optimization of discourse. Linguistics and Philosophy, 27(1).
Gerlof Bouma
2003 Doing Dutch pronouns automatically in Optimality Theory. In Pro-
ceedings of the EACL 2003 Workshop on The Computational Treat-
ment of Anaphora, Budapest.
Susan E. Brennan, Marilyn W. Friedman, and Carl J. Pollard
1987 A Centering approach to pronouns. In Proc. of the 25th Annual Meet-
ing of the Association for Computational Linguistics, pages 155–163,
Stanford, Cal., July 1987.
Adam Buchwald, Oren Schwartz, Amanda Seidl, and Paul Smolensky
2002 Recoverability Optimality Theory: Discourse anaphora in a bidirec-
tional framework. In Proceedings of the 6th Ws. on the Semantics and
Pragmatics of Dialogue (EDILOG), Edinburgh, Sep. 2002.
Donna K. Byron and Whitney Gegg-Harrison
2004 Evaluating Optimality Theory for pronoun resolution algorithm spec-
ification. In Proceedings of the Discourse Anaphora and Refer-
ence Resolution Conference (DAARC2004), pages 27–32, September
2004.
Karl Bühler
1934 Sprachtheorie. Die Darstellungsfunktion der Sprache. Gustav Fi-
scher, Stuttgart.
Wallace Chafe
1976 Giveness, contrastiveness, definiteness, subjects, topics, and point of
view. In Charles W. Li (ed.), Subject and Topic. Academic Press, New
York.
Wallace Chafe
1994 Discourse, Consiousness, and Time. The Flow and Displacement of
Conscious Experience in Speaking and Writing. University of Chi-
caogo Press, Chicago and London.
136 Christian Chiarcos
Christian Chiarcos
2009 Mental Salience and Grammatical Form. Toward a Framework for
Salience Metrics in Natural Language Generation. PhD thesis. Uni-
versität Potsdam, Germany.
Christian Chiarcos
2011 On the dimensions of discourse salience. Paper presented at the
DGFS-2011 Workshop Beyond Semantics. Corpus-based Investiga-
tions of Pragmatic and Discourse Phenomena. Göttingen, Feb. 2011.
C. Robin Clamons, Ann E. Mulkern, and Gerald Sanders
1993 Salience signaling in Oromo. Journal of Pragmatics, 19:519–536.
James Raymond Davis and Julia Hirschberg
1988 Assigning intonational features in synthesized spoken directions. In
Proc. ACL-1988, pages 187–193, Buffalo/NY.
Rachel Giora
1999 On the priority of salient meanings: Studies of literal and figurative
language. Journal of Pragmatics, 31:919–929.
Talmy Givón
2001 Syntax. John Benjamins, Amsterdam, 2nd edition (2 vols).
Talmy Givón
1983 Introduction. In Talmy Givón, editor, Topic Continuity in Discourse:
A Quantitative Cross-Language Study. John Benjamins, Amsterdam,
Philadelphia, 1983, pages 5–41.
Talmy Givón
1995 Functionalism and Grammar. John Benjamins, Amsterdam, Philadel-
phia.
Joseph H. Greenberg
1963 Some universals of grammar with particular reference to the order of
meaningful elements. In Joseph H. Greenberg, editor, Universals of
language, pages 73–113. MIT Press, Cambridge, Mass.
Barbara J. Grosz and Candace L. Sidner
1986 Attention, intentions, and the structure of discourse. Computational
Linguistics, 12: 175–204.
Barbara J. Grosz, Aravind K. Joshi, and Scott Weinstein
1995 Centering: A framework for modelling the local coherence of dis-
course. Computational Linguistics, 21(2):203–225.
Eva Hajičová, Ivana Kruijff-Korbayová, and Geert-Jan M. Kruijff
1998 Salience in dialogues. In Svetla Cmejrková, Jana Hoffmannová, Olga
Müllerová, and Jindra Svetlá, editors, Dialogue Analysis VI: Proc. of
the 5th Int. Congress of the Int. Assoc. of Dialogue Analysis, April
17–20 1996, Prague, Czech Republic, pages 381–393, Prague.
The Mental Salience Framework 137
Regina Pustet
1997 Diskursprominenz und Rollensemantik – Eine funktionale Typologie
von Partizipantensystemen. Lincom Europa.
Owen Rambow
1993 Pragmatic aspects of scrambling and topicalization in German. In
Workshop on Centering Theory in Naturally-Occurring Discourse.
Institute for Research in Cognitive Science, University of Pennsylva-
nia, Philadelphia, PA.
Petr Sgall, Eva Hajićová, and Jarmila Panevova
1986 The Meaning of the Sentence in its Semantic and Pragmatic Aspects.
Reidel, Dordrecht.
Michael Strube and Udo Hahn
1996 Functional Centering. In Proc. of 34th Ann. Meeting of the Associa-
tion for Computational Linguistics (ACL’96), pages 270–277, Santa
Cruz/CA, June 1996.
Michael Strube and Udo Hahn
1999 Functional Centering – Grounding referential coherence in informa-
tion structure. Computational Linguistics, 25(3):309–344.
Joel R. Tetreault
1999 Analysis of syntax based pronoun resolution methods. In Proceedings
of the 37th Annual Meeting of the Association for Computational Lin-
guistics (ACL’99), Maryland/MD.
Russel S. Tomlin
1995 Focal attention, voice, and word order. an experimental, cross-
linguistic study. In Mickey Noonan and Pamela Downing, editors,
Word Order in Discourse. John Benjamins, Amsterdam, Philadel-
phia, pages 517–554.
Amos Tversky
1977 Features of similarity. Psychological Review, 84(4):327–352.
Ielka van der Sluis and Emiel Krahmer
2001 Generating referring expressions in a multimodal context: An empi-
rical approach. In Proceedings 11th CLIN-meeting, Rodopi, Amster-
dam/Atlanta.
Andrea Weber and Karin Müller
2004 Word order variation in German main clauses: A corpus analysis. In
Proceedings of the 20th International Conference on Computational
Linguistics, Geneva.
Part II.
Beyond entities in discourse
Discourse-structural salience from a cross-linguistic
perspective: Coordination and its contribution to
discourse (structure)
Wiebke Ramm
1. Introduction
This paper approaches the topic of this volume from a cross-linguistic perspec-
tive, with Norwegian, German and English as example languages. Our aim is
to contribute to a clarification of discourse-structural concepts like the distinc-
tion between subordinating vs. coordinating discourse relations as described in
Segmented Discourse Representation Theory (SDRT, Asher and Vieu 2005)
or nucleus-satellite vs. multinuclear discourse relations in Rhetorical Structure
Theory (RST, Mann and Thompson 1988) and their relation to information-
structural (focus vs. background) and syntactic distinctions (coordination vs.
subordination) in a cross-linguistic perspective. Taking non-correspondences
regarding (clause or verb phrase) coordination in translation as an observa-
tional point of departure, we discuss the interpretation of coordinated structures
as compared to non-coordinated alternatives (sentence sequences and syntactic
subordination) with a view to the relative salience of the conjuncts in discourse.1
We are concerned with two types of translation discrepancy involving Norwe-
1. To some degree, the questions addressed in this contribution are similar to those ad-
dressed by Hinterhölzl and Petrova (this volume): both papers investigate the relation
between structural linguistic features – syntactic coordination vs. non-coordinated
structures in the present contribution, V1 vs. V2 word order in Hinterhölzl and
Petrova’s article – and how they show up in hierarchical discourse structure, as mod-
elled in discourse theories such as SDRT.
There is also a certain relationship to the contribution by Krasavina (this vol-
ume) in that questions of choice between different linguistic options and their im-
plications for the marking of salience are addressed: Krasavina deals with choices
between different types of referential expressions in Russian, the present paper ad-
dresses language-specific choices (showing up as translation mismatches) concern-
ing the linking of discourse units.
144 Wiebke Ramm
gian and German or English, namely coordinated clauses in the source language
(SL) translated as a sequence of sentences in the target language (TL) (Norwe-
gian > German, Section 3.1), and syntactic subordination (adjunction) rendered
as (verb phrase or clausal) coordination in the TL (German > Norwegian, Sec-
tion 3.2, and English > Norwegian/German, Section 3.3). Our data are taken
from three different parallel corpora, the Oslo Multilingual Corpus (OMC),2 as
well as from two smaller corpora of non-fictional texts.
It is a well-known fact that coordination, despite its apparent syntactic sym-
metry (the conjuncts belonging to the same syntactic category), may encode
or “explicate” an asymmetrical relation at the semantic-pragmatic level.3 What
we want to show is that coordination tends to be exploited somewhat differ-
ently in Norwegian than in English or German: Norwegian apparently uses co-
ordination more productively, as a kind of compensation for other grammatical
resources (e.g. adjunction or non-coordinated paratactic structures) used in En-
glish or German; and Norwegian also seems to be less constrained with respect
to what kind of discourse units the coordination marker can link as well as re-
garding the order of foregrounded and backgrounded information in a coordi-
nated structure.
From a theoretical viewpoint our observations raise interesting questions
about the correlation between syntactic coordination/subordination and coordi-
nating/subordinating discourse relations (cf. e.g. Asher and Vieu 2005) as well
as the status of the latter across languages. Our data suggest that either the use of
coordinating/subordinating discourse relations in Norwegian differs from their
use in German and English, or that syntactic coordination signalled by a co-
ordination marker (og/und/and) does not necessarily imply a coordinating dis-
course relation between the conjuncts, contrary to what Asher and Lascarides
(2003) and Asher and Vieu (2005), following Txurruka (2000), seem to assume.
A further – both theoretically and empirically interesting – implication of our
contrastive analyses is that they shed light on the backgrounding role/function
of (certain types of) adjuncts.
In Section 2 we give a brief overview of theoretical concepts to bear on our
topic. Section 3 presents and discusses our translational data. Our conclusions
are summarized in Section 4.
2. See http://www.hf.uio.no/iln/tjenester/sprak/korpus/flersprakligekorpus/omc/index.
html (visited 16 Sep 2010)
3. Asymmetry in coordination is taken up in several of the contributions in Fabricius-
Hansen and Ramm (eds., 2008), see the editor’s introduction (Fabricius-Hansen and
Ramm 2008: 7–11) for an overview.
Discourse-structural salience from a cross-linguistic perspective 145
2. General concepts
5. Asher (1999) discusses some aspects of the relation between sentential focus and
discourse focus. The issue of optional adjuncts, however, is not taken up here.
6. Since most of this paper was written before July 2007, recent work on the discourse
relation Background in SDRT, in particular Asher, Prévot and Vieu (2008), could
only partially be taken into account. In the following, we will comment (in notes)
where Asher, Prévot and Vieu (2008) deviate from what is said about the SDRT
relation Background here.
Discourse-structural salience from a cross-linguistic perspective 147
“matters” in that events from subsequent utterances will be related to it” (Asher
and Lascarides 2003: 166).8 Thus understood, the foreground-background dis-
tinction seems related to the distinction between Hauptstruktur (‘main struc-
ture’) and Nebenstruktur (‘side structure’) made by Klein and von Stutterheim
(1987) within the so-called quaestio model.
As a cover term for forward-looking and backward-looking Background,
i.e. as a discourse relation that can attach in both directions, the SDRT rela-
tion Background would also be similar to the discourse relation Background as
defined in RST (Mann and Thompson 1988). In RST Background is an (asym-
metric) nucleus-satellite relation – roughly corresponding to what is called a
subordinating discourse relation in SDRT but not formally defined – where the
function of what is presented in the satellite is to increase the reader’s ability
to comprehend what is presented in the nucleus (see definitions on the RST
webpage).9 Although the RST definition is not restricted to a particular order
of nucleus and satellite, in typical RST examples of the Background relation
the satellite precedes the nucleus (see examples on the RST webpage.)10 This
means that, after all, Background as a discourse relation is understood quite dif-
ferently in SDRT and RST.11 In any case, the SDRT notion of Background is
a narrower concept, being defined solely by way of temporal overlap between
an event(uality) and a state – which makes it problematic for the analysis of
non-narrative texts, i.e. texts that are not primarily structured by temporal rela-
tions.12
is encoded as an independent constituent; cf. I flew to Hawaii last month vs. I went
by plane to Hawaii last month (Talmy 2000 II: 128).
13. URL: http://www.sfu.ca/rst/02analyses/index.html (visited 7 Oct 2009)
14. Conjunction is not defined as a discourse relation on the “official” RST web site,
but is contained in the “extMT – extended Mann/Thompson” set of RST relations
in the RST tool developed by O’Donnell (URL: http://www.wagsoft.com/RSTTool/
index.html, visited 7 Oct 2009) which is widely used for text analyses across the RST
community. Although not precisely defined there either, according to the developer
of the tool (answer to a mail request, June 04) this relation is meant to cover con-
structions with and connectives. See also the related discussion on the definitions
of Conjunction and Disjunction on the RST mailing list, September 2006: URL:
http://lloyd.emich.edu/cgi-bin/wa?A1=ind0609&L=rstlist (visited 7 Oct 2009).
150 Wiebke Ramm
15. Lehmann (1988: 182) conceives subordination as a form of clause linkage, while
coordination is seen as a “relation of sociation combining two syntagms of the same
type and forming a syntagm which is again of the same type” and is thus not restricted
to hold on clause-level only.
16. Hypotaxis is understood by Lehmann (1988: 182) “as the subordination of a clause
in the narrow sense (which problably includes its finiteness)”, while parataxis refers
to the coordination of clauses, with no further restrictions “on the kind or structural
means of coordination. In particular, parataxis may be syndetic or asyndetic”.
17. Lehmann (1988: 210) points out that explicitness of linking has nothing to do with
parataxis vs. hypotaxis. As examples for linking devices with decreasing explicitness
152 Wiebke Ramm
inal texts are not translated by a corresponding coordination in German. Two general
translation strategies can be identified for these examples of translation mismatch: In
22,4 % of the Norwegian coordination examples the coordination marker is dropped,
and the discourse relation holding between the clauses is left implicit in the German
version (as in ex. (3), (4) and (8) in this paper); in the remaining 26,3 % of the mis-
match examples some alternative element (e.g., a pronominal adverb or a demon-
strative nominal phrase) is used to signal the discourse relation holding between the
clauses; this strategy is illustrated by ex. (5), (6), (7) and (9) in this paper. For some of
the non-correspondence examples in the corpus translation by coordination in Ger-
man certainly could have been an option. The frequency of this type of translation
mismatch in the corpus, however, should be seen as a strong indication for some
language-specific difference regarding the use of coordination.
19. In this and the following examples English glosses of the Norwegian and German text
examples keeping the word order particularities of the original languages are given
in single quotes. Due to the length of the examples, we refrained from presenting
interlinear glosses and idiomatic translations.
154 Wiebke Ramm
In (3a) the lack of a common topic between the two conjuncts seems to block
the use of coordination in the German translation (3b). A further problem is the
fact that the second conjunct alone is elaborated by the sentence following the
colon. In the translation the coordinated clauses are split into two separate sen-
tences which leads to a change of the discourse structure assigned to the text:
In the RST model, the German translation can be analysed as a Background
or Circumstance relation – with (3b[ii] ) as satellite, its nucleus covering (3b[iii] )
and (3b[iv] ) –, and the span (3b[ii] )–(3b[iv] ) functioning as Elaboration to (3b[i] ).
The analysis of the Norwegian original, however, would possibly have to as-
sign a (multinuclear) Conjunction (or Joint) relation to (3a[ii] ) and (3a[iii] ). But
where does this span attach to its discourse context? To the left (as Elabora-
tion or Background of (3a[i] ) – which does not fit very well), or to the right (as
Background)? But then – at least as a non-native speaker of Norwegian – one
runs into problems with how to coherently interpret the sentence following the
colon, since (3a[iv] ) clearly elaborates the second conjunct (3a[iii] ), but not the
first (3a[ii] ). Thus, the grouping of (3a[ii] ) and (3a[iii] ) as a joint, non-hierarchical
span leads to attachment problems with the following discourse segment.
Using the SDRT approach one runs into similar problems: In the Norwe-
gian version the reader probably first tries to interpret (3a[ii] ) as providing back-
ground information (Background1 in the sense of Asher and Lascarides 2003,
backward-looking Background in the sense of Asher, Prévot and Vieu 2008)20
or as adding some kind of explanation (Explanation being one of the subordi-
nating discourse relations in SDRT) to the preceding sentence (3a[i] ). But which
relation holds between (3a[ii] ) and (3a[iii] )? In English or German the use of the
coordination marker would presuppose the existence of some kind of common
topic between the linked elements, but obviously Norwegian is not that strict in
this respect. For the German version, an SDRT-style analysis is less problem-
atic: a relation of Background1 /backward-looking Background or Explanation
may be assigned between the independent sentence corresponding to the first
conjunct (3b[ii] ) and the sentence preceding it (3b[i] ), whereas the counterpart of
the second conjunct (3b[iii] ) can be interpreted as elaborating sentence (3b[i] ).
Jasinskaja’s (2009) suggestion to treat the coordination marker as a signal
that the current utterance (sentence) is not yet completed (i.e. does not conform
to the exhaustiveness condition) would not work properly for the interpreta-
tion of the Norwegian og-coordination either. The German version, however
does: the independent sentence corresponding to the first SL conjunct (3b[ii] )
20. A problem with the relation Background in this example could be the restriction that,
according to the definition in SDRT, it is restricted to hold between an event and a
state.
Discourse-structural salience from a cross-linguistic perspective 155
Similar problems occur in (4), where the second conjunct (4a[iii] ) should be sub-
ordinated (as an Explanation in SDRT, and as an Evidence satellite in RST) in
relation to the first conjunct (4a[ii] ), since the following sentence (4a[iv] ) obvi-
ously is related only to (4a[ii] ) and not to (4a[iii] ). This discourse representa-
tion is precisely what we get in the German translation (4b) – where the co-
ordination marker og (and) is replaced by a semicolon. But which discourse
structure should be assigned to the Norwegian version, where both SDRT and
RST would be urged to assign a coordinating/multinuclear discourse relation
to the sentential coordination, blocking the right frontier (in the SDRT frame-
work) or not providing an appropriate nucleus (in the RST framework) to attach
(4a[iv] )?
The two examples above illustrate that Norwegian seems to be less restricted
as to the types of elements that can be coordinated. They are evidence to the ef-
fect that the universality of the definition of discourse relations in theories like
SDRT or RST is questioned (cf. 3.4). Our examples show that at least the func-
tion of the coordination marker (og/und/and) is not precisely the same cross-
156 Wiebke Ramm
21. We are aware of the fact that the data presented in this paper are based on parallel
corpora only, i.e. we are comparing linguistic features of original texts with features
of their respective translations only, and that the properties of translations might
deviate from the properties of original texts of a language – e.g., in translations the
original language may “shine through” (cf. Teich 2003: 61) in some way. We have
not analysed the use of sentential coordination in comparable corpora of Norwegian
and German, but if we assume that there is at least some “shining through” from
the Norwegian SL texts regarding the use of coordination, it can be expected that
the differences of use are even clearer in comparable corpora. (see again note 18.
on the frequency of non-correspondence regarding coordination in Norwegian and
German).
Discourse-structural salience from a cross-linguistic perspective 157
Also in examples (5) and (6) a sentential coordination in the Norwegian original
text is translated by a sequence of independent sentences, but in these examples
a connective (pronominal adverb) explicitly signalling the temporal-causal re-
lation between the sentences corresponding to the two conjuncts in the Norwe-
gian version is added. The same relations (of “consequentiality”, cf. Section 2.3)
hold between the conjuncts in the Norwegian text, but here they have to be con-
textually inferred, i.e. they are less explicitly marked. A structurally equivalent
translation by a coordination would have been possible for both examples, i.e.
would not have been in contradiction with the discourse relations licensed by
und-coordination in German. However, the option chosen in the actual trans-
lation seems to be more natural with respect to the standard patterns of text
organisation in German.
(7) a. I Bergen hadde det mektige tyske Kontoret hindret tyskere i å ta
norsk borgerskap av frykt for at de skulle bli konkurrenter[i] . I 1560
måtte Kontoret oppgi denne politikken[ii] , og en stadig strøm av
tidligere hanseater tok i den følgende tida frivillig norsk borger-
skap[iii] . I 1766 ble den siste vintersitteren borger i Bergen, det var
Jochen Krämer fra Bremen[iv] .
‘In Bergen had the powerful Comptoir hindered the Germans to
acquire Norwegian citizenship because of fear that they might be-
come competitors[i] . In 1560 the Comptoir had to give up this poli-
tics[ii] , and a continuous stream of previous Hanseats acquired in
the following time deliberately Norwegian citizenship[iii] . In 1766
became the last winter-sitter a citizen of Bergen, it was Jochen
Kämer from Bremen[iv] .’
b. In Bergen hatte das mächtige Comptoir aus Angst vor deren
Konkurrenz Deutsche daran gehindert, norwegische Bürger zu
werden[i] . 1560 musste das Comptoir diese Politik aufgeben[ii] .
Während der darauf folgenden Zeit ließ ein ständiger Strom
ehemaliger Hanseaten sich freiwillig einbürgern[iii] . 1766 wurde
der letzte Wintersitzer, Jochen Krämer aus Bremen, Bürger von
Bergen[iv] .
‘In Bergen had the powerful Comptoir because of fear of com-
petition hindered Germans to acquire Norwegian citizenship[i] . In
1560 the Comptoir had to give up this politics[ii] . In the follow-
ing time acquired a continuous stream of previous hanseats delib-
erately Norwegian citizenship[iii] . In 1766 became the last winter-
sitter, Jochen Kämer from Bremen, a citizen of Bergen[iv] .’
Discourse-structural salience from a cross-linguistic perspective 159
her[iii] . Men det kan være interessant å peke på noen forhold som
kan ha betydning[iv] .
‘I will not try to answer the question[i] . Many have been engaged
to stress the uniqueness of the German university model[ii] , and
there exists big literature which it would take too long to discuss
here[iii] . But it might be interesting to point to some circumstances
that might have importance[iv] .’
b. Ich werde hier nicht versuchen, diese Frage zu beantworten[i] . Vie-
le Forscher haben sich damit beschäftigt, worin das Einmalige des
deutschen Universitätsmodells bestand[ii] . Zu diesem Thema liegt
eine umfangreiche Literatur vor, deren eingehendere Erörterung
hier zu weit führen würde[iii] . Doch dürfte es von Interesse sein,
auf einige Verhältnisse, die für die Beantwortung der Frage von
Bedeutung sein könnten, etwas genauer einzugehen[iv] .
‘I will not try to answer the question[i] . Many researchers have been
engaged to define what characterised the uniqueness of the German
university model[ii] . On this topic exists a vast amount of literature
the discussion of which would take too long here[iii] . But it might
be of interest, for some circumstances which are important for an-
swering the question, to go into more detail[iv] .’
Sentence splitting in the translations of (8) and (9) above seems so be motivated
by different preferences regarding whether the conjoined clauses are expected
to contribute to the incrementally constructed discourse representation as one
joint unit or whether it is also possible that only one of the conjuncts consti-
tutes a discourse relation with the preceding or following discourse units. In
(8a) the first conjunct (8a[ii] ) is in a (concessive) discourse relation to the pre-
vious sentence (8a[i] ), indicated by the sequence of the connectives riktignok
(‘although, indeed’) and men (‘but’), but it is not clear whether this concessive
relation also holds for the second conjunct (8a[iii] ). This indeterminacy does
not exist in the German version, where only the sentence corresponding to the
first conjunct (8b[ii] ) is in a concessive relation to the previous sentence (8b[i] )
(zwar ‘although, indeed’ – doch ‘but, however’). The sentence corresponding
to the second conjunct (8b[iii] ) rather implies an Evidence (RST) or Explanation
(SDRT) relation to the sentence corresponding to the first conjunct, signalled
by es ist nicht sicher (‘it is not sure’) and the (topicalised) adverbial in jedem
Fall (‘in any case’). Thus, again, the discourse relations holding between some
of the discourse units are more clearly inferable in the German version of the
text. In (9) the coordination in the Norwegian version leads to some indetermi-
nacy with respect to the interpretation (attachment in discourse structure) of the
Discourse-structural salience from a cross-linguistic perspective 161
following sentence: The following sentence (9a[iv] ), starting with men (‘but’),
is in a contrastive relation to the non-restrictive relative clause som det vil føre
for langt å gjøre rede for her (‘which it would take too long to discuss here’)
which is a part of the second conjunct (9a[iii] ), but this is somewhat blurred in the
Norwegian version due to the existence of the coordination which might imply
some joint contribution of the two conjuncts to the further development of the
discourse. By dropping the coordination in the German translation the sentence
corresponding to the second conjunct (9[iii] ) is interpreted as an elaboration of
the sentence corresponding to the first conjunct (9b[ii] ) – further emphasised by
adding zu diesem Thema (‘on this topic’) and placing it in sentence-initial po-
sition. The inference of the contrastive connection between the relative clause
deren eingehendere Erörterung hier zu weit führen würde (‘the discussion of
which would take too long here’) and the following sentence (9b[iv] ) is not dis-
turbed by the attempt to assign some joint relevance to the conjuncts of a sen-
tential coordination. So, as in (8), sentence splitting in the German translation
guarantees a clearer correlation between sentence boundaries and the attach-
ment of discourse units to the incrementally growing discourse representation.
These examples of Norwegian sentential coordination translated by non-
coordinated sequences of sentences in German indicate that sentential coordi-
nation serves somewhat different functions in discourse in the two languages:
1. The discourse relations compatible with (licensed by) coordination seem
to be more constraint in German than in Norwegian, as illustrated by (3)
and (4). In German the use of the coordination marker und appears to be
restricted to the types of relations (e.g. additive and temporal-causal) also
compatible with and in English (cf. Blakemore and Carston 2005, see Sec-
tion 2.3), whereas in Norwegian these constraints obviously are not taken
that seriously.
2. In cases where sentential coordination would be compatible with a discourse
relation this is often not the preferred realisation in German. Rather, an op-
tion is chosen which more explicitly signals the discourse relation holding
between adjacent discourse units, e.g. by using a connective as in (5) and (6).
3. German also seems to take und as a means to signal that two conjuncts
should be processed as a joint unit and jointly contribute to discourse struc-
ture more seriously than Norwegian does, as illustrated by (8) and (9).
Paratactic clause linkage with og seems to function as a kind of default se-
quentialisation strategy in Norwegian, which is applied without imposing too
much meaning to the coordination marker. In this way the use of og in writ-
ten Norwegian texts seems to be similar to the functions and (and its equiva-
lents in other languages) can take in oral narratives, i.e. signalling that the story
162 Wiebke Ramm
goes on without being too specific about the relation holding between the dis-
course units, or explicitly marking the transitions between discourse units (cf.
e.g. Schiffrin 1986 on the functions of and in (English) conversations). This
observation would also fit into the picture of written Norwegian as still being
under the pressure of the norms holding for oral language (see e.g. Torp and
Vikør 2000: Chapt. 14; Solfjeld 2000: 46–48). In German written genres, how-
ever, sentential coordination and sentence boundaries in general appear to be
taken more seriously as signals indicating the structuring of the discourse into
hierarchically and non-hierarchically related units.
tions were to be avoided […], and sterility was moved into the
foreground.’
(11) a. Als es feststand, daß die Alliierten nicht hier, sondern an der Ka-
nalküste landen würden, disponierte man um und schickte alle
Boote dorthin. Der Gegner, uns überhörend, faßte seine Beob-
achtungen präzise zusammen.
‘When it was clear that the Allies not here, but on the Channel
coast would land, we reorganized and sent all the boats there. The
opponent, us bugging, summarised his observations precisely.’
b. Da det nå ble klart at de allierte ikke ville lande her, men i Nor-
mandie, ble vi omdirigert dit. Motstanderne våre avlyttet våre
radiomeldinger og samlet omhyggelig sammen opplysninger.
‘As it now got clear that the Allies would not land here, but in
Normandy, we were redirected to-there. Our opponents bugged
our radio messages and gathered information carefully.’
In both examples a structurally equivalent Norwegian translation would not
have been possible or would at least have been stylistically marked. Although
both languages are V2, Norwegian is less open to place informationally “heavy”
constituents in Vorfeld position than German is, making it difficult to render the
prepositional phrase in (10a), which furthermore is based on nominalisations
(Interesse ‘interest’, Infektionsverhütung ‘infection prevention’), by a corre-
sponding prepositional phrase in Norwegian. Neither is it possible to translate
the present participle construction in (11a), uns überhörend (‘us bugging’), by
a corresponding participle construction in Norwegian. Choosing a coordinated
structure in (10) and (11) in the Norwegian translation can be see as a strat-
egy which tries to compensate for the lack of equivalent structural options or
preferences in Norwegian. The Norwegian versions are more sentential than the
original texts and exploit the inference mechanisms triggered by the coordina-
tive structure (cf. 2.3, Blakemore 2002) in order to gain a similar interpretation
as the German version. The syntactically downgraded function of the German
adjunct is “simulated” by the first conjunct which gets the discourse function
of “leading up to” the second, i.e. entering into a consequentiality relation with
the second conjunct. In this way coordination works as a backgrounding device,
establishing the second conjunct as part of the “main story” – equivalent to the
source text. The frequent use of coordination also illustrates the tendency that
Norwegian prefers to organize discourse paratactically where German tends to
use hypotactic/hierarchical structures (Fabricius-Hansen 1996).
164 Wiebke Ramm
ticularly good. But nevertheless these examples seem to give further evidence
for the hypothesis that coordination functions somewhat differently in Norwe-
gian than in German and English. The dispensability of a marker of the tempo-
ral overlap in (13c) indicates that Norwegian may be less biased to interpreting
clause/verb phrase coordination as a temporal sequence (in narration) than Ger-
man is. And (14c) and (15c) show that Norwegian possibly is also more open to
placing background(ed) information in the second conjunct, the position where
focused/foregrounded information is strongly preferred in German.
(13) a. He smiled slyly, nodding.
b. Er lächelte verstohlen und nickte dabei.
‘He smiled furtively and nodded thereby.’
c. Han smilte litt lurt og nikket.
‘He smiled somewhat slyly and nodded.’
(14) a. Tony went home, taking his tool box with him.
b. Tony griff nach seinem Werkzeugkasten und ging nach Hause.
‘Tony reached for his tool box and went home.’
c. Tony gikk hjem og tok med seg verktøykassen sin.
‘Tony went home and took his tool box with him.’
(15) a. Things suddenly got very tense in the bar and Dad drank heavily,
sweating..
b. Auf einmal wurde die Atmosphäre in der Bar äußerst angespannt,
und Papa schwitzte und trank immer mehr.
‘Suddenly the atmosphere got very tense in the bar, and Dad
sweated and drank more and more.’
c. Stemningen i baren ble plutselig meget spent, og pappa drakk tett
og svettet.
‘The atmosphere in the bar got suddenly very tense, and Dad
drank heavily and sweated.’
differ as regards the clause linkage type they realise (cf. Section 2.4) – the ad-
juncts in 3.3 being more sentential and less integrated than the adjuncts in 3.2.23
This means that more hierarchical upgrading (towards parataxis) and more sen-
tentialisation is required in (10) and (11) than in (12) to (15). This implies also
that – at least in (10) and (11) – the relations expressed in the SL vs. TL text
change their status from semantic relations holding between units/constituents
within a clause to discourse relations holding between propositions/clauses.
This can be viewed as a change in salience associated with a piece of informa-
tion in the SL vs. TL text – from non-propositional (or less propositional), con-
tributing to sentence semantics (in the first place) to propositional, contributing
to discourse semantics/structure.
In sum, discourse salience emerges as a concept that can have many facets
in a contrastive perspective. Such a multi-dimensional nature of salience (more
specifically, of “nuclearity”) – yet not in a contrastive perspective – has also
been argued for by Stede (2008), who demonstrates that discourse units may
be assigned salience on different levels of description and that various factors –
such as referential structure, thematic development, intentional structure or ex-
plicit linguistic markers – come into play here.
Another “factored” approach to discourse is pursued by Webber and her col-
leagues (e.g. Webber, Knott, and Joshi 1999, 2003). Working in the framework
of Tree-Adjoining Grammar, they distinguish between (discourse) relations that
are induced structurally by punctuation or (coordinating or subordinating) con-
junctions like and, although on the one hand, and relations that are established
by presupposition-bearing anaphoric adverbials like then, instead, otherwise
on the other hand. Whereas relations of the former type hold between the inter-
pretation of adjacent or conjoined discourse units, thus creating a (discourse)
structure in the strict sense, anaphoric adverbials signal “a relation between the
interpretation of their matrix clause and an entity in or derived from the dis-
course context” (Webber, Knott, and Joshi 2003: 547) which may cross such
structural dependencies. Webber, Knott, and Joshi suggest that this “factored”
approach may have “a better chance of providing a cross-linguistic account of
discourse than one that relies on a single premise” (Webber et al., 1999: Sect. 5).
Their approach does not, as we see it, offer an immediate solution to the specific
problems discussed in connection with examples (3) and (4) (Sect. 3.1). But the
proposed distinction between structurally induced discourse relations (triggered
by punctuation and conjunctions) creating discourse structure in the strict sense
23. The semantics and discourse properties of adjuncts of various kinds are taken up in
detail in Fabricius-Hansen and Haug (eds., in prep.). The problem of “competing
structures” across languages is particularly discussed in Chapter 5.
168 Wiebke Ramm
4. Conclusions
We have shown that special conditions seem to hold as regards the use of sen-
tential and verb phrase coordination with (counterparts of) and in Norwegian
as compared to German and English. In translations from German or English
into Norwegian, coordination is often used as a compensation for language-
specific – structural and stylistic – restrictions on hypotactic complexity at sen-
tence level (Sections 3.2 and 3.3). Apparently, Norwegian is also less con-
strained as to which kinds of (discourse) elements can be linked by the co-
ordination marker (Section 3.1) and in which order the conjuncts appear (Sec-
tion 3.3).
To put it the other way round, it appears that the function of the coordination
marker (og/und/and) is not precisely the same cross-linguistically, so that e.g.
syntactic coordination may be compatible with discourse relations like Back-
ground, Explanation or Elaboration in Norwegian, while blocked in German
or English. These observations cast some doubt on the cross-linguistic valid-
ity of the definition of discourse relations in theories like SDRT or RST. In
particular, they seem to challenge the assumption (see 2.3) that syntactic coor-
dination with (equivalents of) the connective and necessarily implies a coordi-
nating/multinuclear discourse relation.
The translation examples furthermore illustrate that salience can be express-
ed by various linguistic means and that these means may differ cross-linguisti-
cally (3.4). Salience may be assigned by the hierarchical vs. non-hierarchical
organisation of discourse in form of subordinating vs. coordinating discourse
relations holding between clauses/propositions. But salience can also manifest
itself by the choice of the size/granularity of the linguistic unit to communicate a
piece of information, in particular by the choice between propositions (clauses)
and linguistic units that do not have proposition status, e.g. as phrases. Finally,
also discourse segmentation into (complex) sentences separated by full stop (or
other “major” punctuation marks such as question mark and exclamation mark)
relates to the assignment of salience in discourse. Segmentation into sentences
provides the “temporal” dimension of discourse interpretation, by determining
which “portions” of information should be integrated into the incrementally
Discourse-structural salience from a cross-linguistic perspective 169
Acknowledgements
This article is a modified and extended version of Ramm and Fabricius-Hansen
(2005), and many of the ideas presented here have been developed in cooper-
ation with Cathrine Fabricius-Hansen. Moreover, the work has profited from
cooperation with Bergljot Behrens (Univ. of Oslo) and Kåre Solfjeld (Østfold
Univ. College, Halden) who have contributed with examples and helpful discus-
sions. I am also grateful to the Faculty of Humanities at the University of Oslo,
for supporting me by a PhD scholarship (2003–2006). The research has been
carried out in connection with the project SPRIK (Språk i kontrast / Languages
in Contrast)24 at the University of Oslo, Faculty of Humanities funded by the
Norwegian Research Council under project number 158447/530 (2003–2008).
References
Asher, Nicholas
1999 Discourse and the focus/background distinction. In: Peter Bosch and
Rob A. van der Sandt (eds.), Focus: Linguistic, Cognitive, and Com-
putational Perspectives, 247–267. Cambridge/New York: Cambridge
University Press.
Asher, Nicholas and Alex Lascarides
2003 Logics of Conversation. (Studies in Natural Language Processing.)
Cambridge/New York: Cambridge University Press.
Asher, Nicholas, Laurent Prévot and Laure Vieu
2007 Setting the background in discourse. Discours 1: 1–29.
URL: http://discours.revues.orig/index301.html
Asher, Nicholas and Laure Vieu, Laure
2005 Subordinating and coordinating discourse relations. Lingua 115: 591–
610.
Behrens, Bergljot
1998 Contrastive discourse: An interlingual approach to the interpretation
and translation of free ING-participial adjuncts. Ph.D. dissertation,
Department of Linguistics, University of Oslo.
Jasinskaja, Ekaterina
2009 Pragmatics and prosody of implicit discourse relations: The case of
restatement. Ph.D. dissertation, University of Tübingen.
Klein, Wolfgang and Christiane von Stutterheim
1987 Quaestio und referentielle Bewegung in Erzählungen. Linguistische
Berichte 109: 163–183.
Lambrecht, Knud
1994 Information Structure and Sentence Form: Topic, Focus, and the
Mental Representations of Discourse Referents. (Cambridge studies
in linguistics 71.) Cambridge: Cambridge University Press.
Lehmann, Christian
1988 Towards a typology of clause linkage. In: John Haiman and Sandra
A. Thompson (eds.), Clause Combining in Grammar and Discourse,
181–225. Amsterdam/Philadelphia: John Benjamins.
Mann, William C. and Sandra A. Thompson
1988 Rhetorical Structure Theory: Toward a functional theory of text orga-
nization. Text 8: 243–281.
Ramm, Wiebke
2010 Satzgrenzenveränderungen in der Übersetzung: Satzverbindung und
lokale Diskursorganisation im Norwegischen und Deutschen. Ph.D.
(submitted), University of Oslo.
Ramm, Wiebke and Cathrine Fabricius-Hansen
2005 Coordination and discourse-structural salience from a cross-linguistic
perspective. In: Manfred Stede, Christian Chiarcos, Michael Grabski
and Luuk Lagerwerf (eds.), Salience in Discourse: Multidisciplinary
Approaches to Discourse 2005, 119–128. Münster: Stichting/Nodus.
Rooth, Mats
1992 A theory of focus interpretation. Natural Language Semantics 1: 75–
116.
Sandström, Görel
1993 When-clauses and the temporal interpretation of narrative discourse.
Ph.D. dissertation, Department of General Linguistics, University of
Umeå.
Schiffrin, Deborah
1986 Functions of ‘and’ in discourse. Journal of Pragmatics 10: 41–66.
Solfjeld, Kåre
2000 Sententialität, Nominalität und Übersetzung. Eine empirische Unter-
suchung deutscher Sachprosatexte und ihrer norwegischen Überset-
zungen. Frankfurt M.: Peter Lang.
172 Wiebke Ramm
Solfjeld, Kåre
2004 Informationsspaltung nach links in Sachprosaübersetzungen Deutsch-
Norwegisch. In: Eva Lambertsson Björk and Sverre Vesterhus (eds.),
Kommunikasjon, 111–130. Halden: Høgskolen i Østfold.
Stede, Manfred
2008 RST revisited: Disentangling nuclearity. In: Cathrine Fabricius-Han-
sen and Wiebke Ramm (eds.), ‘Subordination’ versus ‘Coordination’
in Sentence and Text. A Cross-linguistic Perspective, (Studies in Lan-
guage Companion Series 98.) Amsterdam/New York: John Benja-
mins.
Talmy, Leonard
2000 Toward a Cognitive Semantics. Volume I: Concept Structuring Sys-
tems. Volume II: Typology and Process in Concept Structuring. Cam-
bridge, MA: The MIT Press.
Teich, Elke
2003 Cross-Linguistic Variation in System and Text. A Methodology for the
Investigation of Translations and Comparable Texts. (Text, Trans-
lation, Computational Processing 5.) Berlin/New York: Mouton de
Gruyter.
Torp, Arne and Lars S. Vikør
2000 Hovuddrag i norsk språkhistorie. Oslo: Gyldendal.
Txurruka, Isabel G.
2000 The semantics of ‘and’ in discourse. Technical Report ILCLI-00-LIC-
9, University of the Basque Country.
Vallduvi, Enric and Elisabeth Engdahl
1996 The cross-linguistic realization of information packaging. Linguistics
34: 459–519.
Vieu, Laure and L. Laurent Prévot
2004 Background in Segmented Discourse Representation Theory. In:
Workshop Segmented Discourse Representation Theory, 11th Con-
ference on Natural Language Processing (TALN), 485–494.
Webber, Bonnie, Alistair Knott, Matthew Stone and Aravind Joshi
1999 Discourse relations: A structural and presuppositional account using
lexicalised TAG. Paper presented at 1999 Meeting of the Association
for Computational Linguistics, College Park MD.
Webber, Bonnie, Alistair Knott, Matthew Stone and Aravind Joshi
2003 Anaphora and discourse Structure. Computational Linguistics 29:
545–587.
Rhetorical relations and verb placement
in Old High German
1. Introduction
The present paper approaches the issue of salience in discourse from the per-
spective of historical linguistics and the theory of language change. In particu-
lar, we are interested in discerning and describing linguistic phenomena which
are formal correlates of salience and related notions in the system of Old High
German (henceforth OHG). In particular, we are interested in finding out how
the expression of features related to salience influences the development of
novel forms and patterns in the history of German.
According to the common definition employed in this volume, salience re-
flects “the degree of relative prominence of a unit of information, at a specific
point in time, compared to other units of information” (Introduction, p. 2ff.).
A variety of linguistic factors which determine the referent’s current degree of
salience have been discussed in the literature, foremost cognitive status (given
vs. new), grammatical role (subject vs. non-subject) and animacy (animate vs.
non-animate). It is also claimed that there is a special matching relation between
the referent’s current degree of salience and the form of the linguistic expression
used to refer to it (Gundel et al. 1993) also called ‘referential choice’ (Krasav-
ina, this volume). At the same time, languages employ special strategies to mark
shifts in the degree of salience with respect to the preceding context, e.g. when
a referent with a lower degree of salience is promoted to a higher degree of
prominence at a particular stage of the discourse, also called ‘salience promo-
tion’ (see also Filchenko, Chiarcos, all this volume ). Addressing the issue of
referential choice and the form of anaphoric expressions in OHG, Petrova and
Solf (2010) have argued that salience promotion as a main principle governing
the use of demonstratives vs. personal pronouns in modern German (see Bosch
et al. 2003, Bosch and Umbach 2007), has applied already at the earliest stages
of the language.
Yet the use of anaphoric expressions is only one domain in which salience-
related features find a formal expression in the system of OHG. In the following
174 Roland Hinterhölzl and Svetlana Petrova
contribution, we will argue that pragmatic factors related to salience and dis-
course coherence take formal realization in syntax as well, more precisely in
the structure of the left periphery of main clauses in OHG. In particular, we
will focus on the principles determining the position of the finite verb in the
sentence. In this respect, the notion of salience and its realization are crucial
for the explanation of structural variation in the left periphery of main clauses
in OHG.
On the basis of evidence from the OHG Tatian, a major representative of the
OHG corpus (see section 2 below), we distinguish verb-initial (V1) and verb-
second (V2) as the two basic word order patterns at this particular stage of the
development of German. In approaching the principles governing the distribu-
tion and functional properties of these patterns, we first draw the attention to
the correlation between salience and syntactic position in the clause. Following
initial observations outlined in Hinterhölzl et al. (2005), we show that the po-
sitional realization of referring expressions in OHG is sensitive to the degree
of salience of the particular referent in the sense of givenness and accessibil-
ity in the discourse. So expressions referring to salient, i.e. pre-mentioned or
situationally inferable, referents are realized in clause-initial position followed
immediately by the finite verb, which results in V2 structures on the surface. In
contrast, non-salient, i.e. discourse-new referents are placed postverbally yield-
ing V1 on the surface. Following this, we conclude that V2 is used as a means
of marking prominence on the constituent placed in clause initial position and
separated from the rest of the utterance by the finite verb.
However, this correlation can be overwritten by discourse-structural factors,
as is evidenced by the occurrence of V1 orders with given discourse referents.
In some of the cases, the factors leading to V1 clearly pertain to discourse or-
ganization proper, i.e. they mark the beginning of a new chapter or episode
in the structure of the text. With Grüning and Kibrik (2005), we can assume
that referential distance across episode/paragraph boundaries lowers the status
of salience of the antecedent which results in postverbal realization of the re-
ferring expression. In this case, the process of ‘salience demotion’ takes place
(see also Filchenko, this volume). But in other cases, V1 with given referents
occurs within one and the same episode. In these cases, however, the sentence
conveys an especially important event or state which is crucial to the further
development of the discourse. In attempting to provide a unified account for
all cases of V1, we invoke the distinction between coordination vs. subordina-
tion in discourse as outlined in the Segmented Discourse Representation Theory
(SDRT, Asher and Lascarides 2003; see also Ramm, this volume). We analyze
the instances of V1 and V2 from the perspective of the features viewed as con-
stitutive for the definition of two basic types of rhetorical relations in discourse.
Rhetorical relations and verb placement in Old High German 175
The OHG corpus comprises texts of different length, genre, and quality of trans-
mission composed in the time between around 750 and 1050. Of course not all of
them are equally appropriate for syntactic research (cf. Fleischer 2006). One of
the largest prose texts from the beginning of the OHG period is the Tatian text,
a gospel harmony translated from Latin and written down in the scriptorium of
Fulda by at least 6 scribes. This text has been deliberately chosen for the purpose
of the present investigation. Although having been considered for a long time a
slavish word-for-word translation of the Latin original and therefore unsuitable
for any investigation on word order, this text has been rediscovered as a good
basis for research due to novel insights into the main principle of translation
applied in it. In the manuscript, as Figure 1 of the Appendix shows, the Latin
source and the OHG translation are attested as two juxtaposed columns. Only
recently, it has been observed that each line in the OHG text translates exactly
the same material found in the corresponding Latin line; departures from this
basic principle are extremely rare within the whole text. A new diplomatic edi-
tion made available by Masser (1994) reflects these major characteristics and
makes it possible to compare the source and target text, cf. Figure 2 of the Ap-
pendix. The translating technique applied in the Tatian text certainly imposes
restrictions on the possibility of rendering genuine word order patterns in the
translation (cf. Masser 1997 a and b), while the deviations from the Latin source
can be viewed as evidence for genuine OHG structures (cf. Dittmer and Dittmer
1998; Fleischer, Hinterhölzl and Solf 2008).
Therefore, we base our study on deviating examples from the Tatian text ex-
clusively. The corpus of the study comprises the complete sample of deviations
in constituent order found in the text portions of three scribes, a total of 1.658
176 Roland Hinterhölzl and Svetlana Petrova
clausal structures. These examples were fed into a corpus and annotated for
various morpho-syntactic and information-structural features by project B4 of
Collaborative Research Center (SFB) 632 “Information Structure” at Humboldt
University Berlin. The corpus is searchable via the ANNIS database (Chiarcos
et al., 2008; Zeldes et al., 2009) developed by project D1 of SFB 632 (Uni-
versity of Potsdam, Humboldt University Berlin). For more details concerning
the design of this corpus and the use of the ANNIS database see Petrova at al.
(2009).
1. The examples from the Isidor [I] text are cited by line number according to the edition
of Eggers (1964). The examples from the Tatian [T] text are cited by manuscript
page and line number according to Masser (1994). A slash in the Tatian examples
represents the end of line according to the manuscript. The inflected verb in both
OHG and Latin is underlined for clarity throughout the paper.
Rhetorical relations and verb placement in Old High German 177
Exploring the frequency of these word order types in the Tatian database de-
scribed in section 2 above, we discover a rather different picture. Here, mainly
V1 and V2 occur in considerable numbers against the structure of the Latin
original.3 Patterns in which the verb occurs in a position later then the second
one like in (4) are formed against the original only rarely, and cases with the
verb at the absolute end of the sentence as in (5) are mere exceptions:4
2. Here, we only briefly refer to some previous accounts on some of these patterns in
Isidor. First, we do not subscribe to the view expressed by Robinson (1994) who
claims that V1 represents a foreign pattern used exclusively in the translation of the
biblical citations rather than of the commentary parts of the treatise in order to signal
foreign speech. Rather, we regard V1 as a common Germanic pattern which abounds
both in the remaining texts of the OHG tradition as well as in all other early Germanic
languages, i.e. in Old English, Old Saxon and Old Norse. Second, with respect to
Vlate/Vend, we deny the view of Tomaselli (1995) reducing such examples to cases
involving pronominal or other prosodically light constituents which she explains as
clitics attached to the left of the verb after a full constituent in initial position. As our
example in (3) shows, Vlate/Vend in main clauses in Isidor also appears in sentences
with full constituents before the finite verb.
3. Note that the cases of V1 included in this statistics do not comprise elliptic non-initial
conjuncts sharing the subject of the preceding clause and therefore showing surface
V1-order.
4. In this example, the synthetic passive of the Latin original is represented by an ana-
lytic construction involving the finite form of the auxiliary sîn ‘be’ + Past Participle.
As the semantics of the Latin main verb is reflected in the OHG participle, the finite
178 Roland Hinterhölzl and Svetlana Petrova
Table 2. Frequency of word order patterns in main declarative clauses in Tatian formed
against the Latin original
From this we can conclude that a process towards stricter verb fronting in main
declarative clauses and a considerable reduction of the Vlate/end pattern has
taken place already within the OHG period. One question arises from this ob-
servation, namely whether the distribution of the main competing patterns, V1
and V2, obeys certain rules in the system that emerges in the Tatian, and if so,
what kind of principle may be made responsible for the choice of one pattern
over the other. This question will be addressed in the following section.
Turning to OHG, Axel shows that both operator movement as well as stylis-
tic fronting occur, while the third option, the placement of a base-generated ex-
pletive in Spec,CP has not emerged yet. As a consequence, sentences in which
neither operator movement nor stylistic fronting can apply remain as V1 (ana-
lyzed as the verb moving to C◦ with Spec,CP remaining empty). This implies
that the rule of V2 was not fully grammaticalized yet in OHG.
But what is then constitutive of the word order in OHG? To explain why
Spec,CP remains empty in OHG, Axel refers to the fact that in most of the cases
of V1, the sentence contains the adverbial tho ‘then’ in postverbal position tak-
ing the function of a narrative-expressive particle indicating sentence type just
like other particles, e.g. the interrogative particle inu/eno, the affirmative parti-
cle ia or the imperative particle nu etc. Once sentence type has been indicated by
the particle, the application of stylistic fronting is unnecessary leaving Spec,CP
empty in the corresponding cases.
Expressivity as a factor leading to V1 in early Germanic is known from a
number of previous works on the matter. In his very influential study, Fourquet
(1974) has put forward the idea that verb fronting in early Germanic is used to
highlight the entire contents of a sentence. Much earlier, Ries (1880, 19) had
observed for Old Saxon that V1 occurs in sentences reporting an outstandingly
important event or property. As for Old English, van Kemenade (1987, 44) re-
ports that in the Anglo-Saxon Chronicle, V1 is especially characteristic of one
particular section which is “famous for its lively narrative style”.
But expressivity, or stylistic vividness are rather vague terms when it comes
to differentiating the domains in which the two main patterns in declaratives
in OHG apply. All accounts mentioned before shift the attention to the broad
field of pragmatics as the source of additional factors influencing word order
in early Germanic. In this respect, they are representative of a long tradition of
research whose attempts in explaining this issue should be reconsidered from
the perspective of modern linguistic theory. In this respect, we want to ana-
lyze more thoroughly the functional domains in which the two main patterns of
OHG main-clause syntax occur in order to be able to isolate operational features
associated with each of them in OHG.
(8) [Jesus tells a parable about an unjust judge who was asked by a widow
to avenge her against her adversary]
uuas thar ouh sum uuitua/ In thero burgi
was there also certain widow in this town
‘There was a widow too in that city’ (T 201, 2)
Latin Vidua autem quaedam erat/ In ciuitate illa
The text also provides numerous examples of ‘minimal pairs’ where the initial
placement of the verb in the first sentence introducing new discourse referents
is immediately suspended for a V2 clause in the following utterance making a
statement on the referents just established. Consider the following small dis-
course:
(11) [the beginning of the story about the Nativity of John the Baptist]
a. uuas In tagun herodes […]/ sumer biscof […]/
was in days Herod.GEN some bishop
Inti quena Imo
and wife him.DAT
b. siu uuarun rehtiu beida fora gote
they were righteous both before God.DAT
182 Roland Hinterhölzl and Svetlana Petrova
‘In the days of Herod […], there was a certain priest […] and his
wife […]. They were both righteous before God’ (T 26, 3)
Latin a. Fuit in diebus herodis regis/ […] quidam sacerdos/ […]/
& uxor illi […]/ b. erant autem iusti ambo ante deum
This evidence provides significant points in favor of the interdependence be-
tween verb placement and information structure in OHG. It shows that new
referents follow the verb, while referents already salient in the context precede
it. What kind of generalization can we draw from these observations?
Looking at the data from the perspective of the model developed by Sasse
(1995), we discover that the sentences we are dealing with are typical represen-
tatives of the thetic vs. categorical type of judgments. By definition, categorical
sentences have a bipartite structure divided into a predication base, or topic of
the sentence and a comment on this topic. This is the case in (9), (10), and (11b).
Here, the finite verb separates from the rest of the utterance exactly that con-
stituent which provides the sentence topic (both in line with the familiarity as
well as the aboutness concept, for a discussion see Frey 2000, 137–138). By
contrast, the presentational sentences in (6) through (8) and in (11a) are typical
instances of the thetic type of judgments. The most significant feature of such
instances is that they represent “monominal predications” (Sasse 1995, 4) in
which no particular constituent is taken as the predication base of the utterance;
rather, the entire sentence, including all participants, is asserted as a unitary
whole.5
Therefore, we can conclude that the position of the finite verb in OHG is
firmly related to the realization of the topic-comment structure in a sentence.
As a rule, the finite verb separates the topic expression from the comment of the
utterance. In the most cases, this position is occupied by an expression referring
to the most salient referent in the context, which is either previously mentioned
or situationally accessible at that particular point of the discourse. Remarkably,
novel referents serving as the predication base of a categoric utterance also share
the positional properties of canonical (i.e. salient) topics in OHG. Consider the
bare plural fohún ‘foxes’ in (12) which is not previously established in the con-
text but is nevertheless placed in preverbal position. The sentence receives an
interpretation according to which it makes a statement about a set of individuals
of the denoted kind. Thus, the kind-refering bare plural in fohún is the aboutness
topic of the utterance:6
5. Drubig (1992) and Lambrecht (1994, 137–146) also argue that in thetic utterances
no topic-comment division applies.
6. In this respect, we follow Endriss and Hinterwimmer (2007) who argue that given-
ness is not necessary for topicality. They argue that novel constituents may provide
Rhetorical relations and verb placement in Old High German 183
(12) [a chain of coordinate conjuncts claims that every creature has a home
to stay over night except the Son of the Lord]
fohún habent loh
foxes have holes
‘The foxes have holes’ (T 85, 25)
Latin vulpes foueas habent
In case no topic-comment distinction applies, the verb moves to the position in
front of all arguments to indicate that none of them functions as the sentence
topic and that the entire proposition has to be interpreted as wide (sentence)
focus.
These observations are summarized in (13):
(13) a. [Vfin….DRnew …]FOCUS (V1)
b. [DR]TOP [Vfin……]COMMENT (V2)
Lenerz (1984, 151–153) and Ramers (2005, 81) also observe that V1 is typical
for presentational sentences in OHG. They conclude that V1 in OHG is used
when the sentence conveys discourse-new, or rhematic material only. Looking
at the examples above, we nevertheless discover that new information is estab-
lished only in the subject expressions, while the remaining part of the sentence
is given; see e.g. the adverbials In thero lantskeffi ‘in this country’ in (7), or
In thero burgi ‘in this town’ in (8). From this perspective, the notion that V1
occurs in all-new sentences cannot be maintained. Rather, verb fronting signals
that none of the constituents provided in the sentence takes over the function of
the sentences topic because no topic-comment division applies in these utter-
ances.
the aboutness topic of an utterance if the utterance allows for a topic-comment divi-
sion in which the respective constituent takes the role of the subject of the predica-
tion.
184 Roland Hinterhölzl and Svetlana Petrova
(14) [A Pharisee invites Jesus to dine in his house. Jesus enters the house
and sits down to eat. The Pharisee realizes that Jesus has not washed
his hand before dinner and criticizes him on that occasion]
bigonda ther phariseus […] quedan
began this Pharisee speak.INF
‘The Pharisee began to speak’ (T 126, 5–6)
Latin Phariseus autem coepit […] dicere
(15) [Jesus starts telling a parable on whether it is allowed to heal on Sab-
baths]
Quad her tho zi then giladoten/ ratissa
spoke he then to the invited parable
‘Then he told to the guests a parable’ (T 180, 9–10)
Latin Dicebat autem & ad Inuitatos/parabolam
The full definite expression ther phariseus ‘the Pharisee’ in (14) as well as the
personal pronoun her ‘he’ in (15) refer to entities already introduced in the
previous discourse. But although they display pragmatic properties of sentence
topics like givenness/accessibility, definiteness and referentiality, they fail to
occupy the topic position established in (13b) above.
To explain these data, we need to find a common basis to account for the
postverbal placement of both given and new referents in OHG. In our opinion,
this may be achieved if one broadens the account on information packaging
beyond the scope of the informational status of individual discourse referents
in the sentence and takes into consideration the discourse-functional role of the
utterance in the narrative structure of the text.
In (16b), the finite verb is shifted from the sentence final position in the Latin
source to the position between the topic and the comment in the OHG trans-
lation. Clearly, the sentence in (16b) provides additional information on the
discourse referent muoter ‘mother’ introduced by the preceding sentence. With
respect to the temporal relation of the two sentences, we can observe that the
event in (16b) overlaps with the event in (16a). Taken together, all these features
favor the identification of elaboration among (16b) and (16a) as the prototype
of the subordinating kind of linking in discourse.
In other parts of the text, we discover chains of utterances equally depending
on a higher unit in discourse structure. Consider (17b–e) which assign different
properties to the referent scrîbera ‘the scribes’ introduced in the opening sen-
tence (17a). V2 is established by the regular insertion of the pronominal subject
referring to the topic referent of the entire text portion (topic continuity):
(18) [an angel prophesies to Zacharias the near birth of his son, John the
Baptist, and explains that he will be a person of special qualities]
a. Inti manage in sineru giburti mendent
and many in his birth have joy
b. her ist uuârlihho mihhil fora druhtine
he is truly great before God.DAT
‘And many people will rejoice at his birth. For he will be great in
the eyes of the Lord’ (T 26, 29–30)
Latin & multi in natiuitate eius gaudebunt/ erit enim magnus coram
domino
The fact that V1 is used to indicate the beginning of a new episode is rather
suggestive for the role of this pattern in the structuring of the discourse. In
particular, it is clear that no elaboration on the discourse referents involved
in the sentences is at issue here. Rather, the information in the sentences un-
der scrutiny is part of the core scheme of the narrative, providing the basis for
further elaboration in the discourse.
(24) [Jesus has healed lots of people and performed many miracles]
Inti argieng thó úz thiu liumunt
and spread then out this fame
‘And this fame spread around’ (T 97, 5)
Latin & exiuit fama haec
Next to verbs of motion, verbs of saying form another group of stable V1 occur-
rences in sentences involving context-given referents. The instances indicate a
change of interlocutors in a dialogue sequence and therefore a shift in perspec-
tive. Consider (25):
instances involving V1, the adverbial thô is put independently of the original in
the position after the finite verb thus supporting V1 on the surface, see (6), (7) as
well as (24) through (27) above. However, in 122 of the 382 V2-cases included
in the database, the structure in (32b) occurs. We assume that this situation
shows the beginning of a process whereby the initial position in a sentence,
which was originally preserved for the most salient constituent of sentences
with a topic-comment division, was reanalyzed and extended by analogy to
adverbials used to link the sentence to the discourse situation established in the
previous discourse. Note that adverbials in anaphoric relation to a previously
mentioned location or goal share the positional properties of nominal referential
expressions as topics described so far. See thar ‘there’ referring to the pre-
established place of the wedding ceremony in (33):
(33) [at the Cannae wedding]
thar uuas thes heilantes muoter
there was the.GEN Saviour.GEN mother
‘The mother of the Saviour was also there’ (T 81, 15)
Latin erat mater ihesu & ibi
As a result of this unification process, the preverbal position cannot be identified
with any specific information-structural category anymore and is neutralized
leading to V2 in modern German declaratives.
Note that there was a different preference for one or the other structure in
(32) among the different scribes of the Tatian text. Although it has to be clarified
if the scribes are the actual translators of the text, we can detect some interesting
patterns. First, within the text portion supplied by the scribe ε , there is a 100 per
cent of consistency as to using the structure in (32b) in sentences indicating a
change of speaker in dialogue. The investigation of the same amount of text in
the portions of three different scribes reveal quite different preferences for V1
against thô+V2 in sentences with verbs of saying, namely 16 to 3 for scribe α ,
3 to 9 for scribe β and 1 to 12 for scribe ζ , respectively.
The fact that we encounter variation within one and the same functional
domain indicates a language change in progress. In the framework of Lightfoot
(1999), language change is viewed as a new type of parameter setting in the
internal grammar of young generations of speakers resulting from a shift in the
frequency relation of competing structures in the input data during language
acquisition. In this sense, the existence of competing structures in the domain
of sentences attributed to the coordinative type of discourse relations can be
viewed as a pre-condition and indication of language change.
Rhetorical relations and verb placement in Old High German 195
7. Conclusion
In the Old High German (OHG) Tatian text we find systematic variation be-
tween V1 and V2 clauses that is pragmatically driven. In particular, the distri-
bution of V1 and V2 clauses correlates with coordination and subordination as
the two basic types of discourse relations in the framework of SDRT by Asher
and Lascarides (2003). First, instances of V2 are regularly found in structures
providing additional descriptive or explanatory information on a discourse ref-
erent representing the topic of the sentences. These clauses provide additional
information about elements located higher in discourse structure. From this we
conclude that V2 correlates with elaboration and continuation, more precisely
with the realization of subordination in discourse structure.
In contrast, V1 comes in two main functions signaling main line sequential-
ity and progress in narration: i) it provides information which constitutes the
basis for a subsequent elaboration on a lower level of discourse hierarchy; or
ii) it signals that a previous chain of subordinative units is suspended and that
the discourse returns to the main line of the story. In both cases, we assign to
V1 properties of coordination in discourse.
Thus, we arrive at the conclusion that verb placement in the earliest stages
of German was governed by pragmatic, more precisely, by discourse-related
properties. Our main claim is that at a certain stage in the history of the Ger-
manic languages, the position of the verb was a means for distinguishing the
type of rhetorical relation which the sentence holds with respect to the previ-
ous context. In this way, word order and verb placement were involved in the
creation of dynamic text structure and discourse coherence.
Acknowledgements
This is a modified and extended version of Hinterhölzl and Petrova (2005).
The investigation was conducted during our work in Research Project B4 of
the Collaborative Research Center 632 “Information Structure” at Humboldt-
University Berlin8 and was presented at the International Workshop on Salience
in Discourse in Chorin, Germany, on 5th–8th Oct 2005. We are grateful to all
participants in the workshop for a fruitful exchange of ideas. For useful com-
ments and discussions we also thank Karin Donhauser, Milena Kühnast, Sonja
Linde, Michael Solf, Eva Schlachter as well as two anonymous reviewers.
Appendix
Figure 1. The beginning of Luke 2, 8 in the manuscript of the St. Gallen, Stiftsbibl. Cod.
56. Facsimile, pag. 35. In Sonderegger (2003, 130).
Figure 2. The same part of the text in the edition of Masser (1994, 85).
References
Filchenko, Andrey Y.
this vol. Parenthetical Agent-demoting Constructions in Eastern Khanty: Dis-
course Salience vis-à-vis Referring Expressions. this volume, 57–79.
Fleischer, Jürg
2006 Zur Methodologie althochdeutscher Syntaxforschung. Beiträge zur
Geschichte der deutschen Sprache und Literatur 128:25–69.
Fleischer, Jürg, Roland Hinterhölzl and Michael Solf
2008 Zum Quellenwert des AHD-Tatian für die Syntaxforschung: Über-
legungen auf der Basis von Wortstellungsphänomenen. Zeitschrift für
germanistische Linguistik 36:210–239.
Fourquet, Jean
1974 Genetische Betrachtungen über den deutschen Satzbau. In Studien
zur deutschen Literatur und Sprache des Mittelalters. Festschrift für
Hugo Moser zum 65. Geburtstag, eds. Werner Besch, Günther Jung-
bluth, Gerhard Meissburger and Eberhard Nellmann, 314–323. Berlin:
Erich Schmidt.
Frey, Werner
2000 Über die syntaktische Position der Satztopiks im Deutschen. ZAS Pa-
pers in Linguistics 20:137–172.
Grüning, André and Andrej A.Kibrik
2005 Modelling Referential Choice in Discourse: A Cognitive Calculative
Approach and a Neutral Network Approach. In Anaphora Process-
ing. Linguistic, Cognitive and Computations Modelling, eds. António
Branco, Tony McEnery and Ruslan Mitkov, 163–197. Amsterdam /
Philadelphia: John Benjamins Publishing Company.
Gundel, Jeanette K., Nancy Hedberg and Ron Zacharsky
1993 Cognitive status and the form of referring expressions in discourse.
Language 69 (2):274–307
Hinterhölzl, Roland and Svetlana Petrova
2005 Rhetorical Relations and Verb Placement in Early Germanic Lan-
guages. Evidence from the Old High German Tatian translation (9th
century). In Salience in Discourse. Multidisciplinary Approaches to
Discourse, eds. Manfred Stede, Christian Chiarcos, Michael Grabski
and Luuk Lagerwerf, 71–79. Münster: Stichting / Nodus.
Hinterhölzl, Roland, Svetlana Petrova and Michael Solf
2005 Diskurspragmatische Faktoren für Topikalität und Verbstellung in
der althochdeutschen Tatianübersetzung (9. Jh.). In Interdisciplinary
Studies on Information Structure (ISIS) 3, eds. Shinichiro Ishihara,
Michaela Schmitz and Anne Schwarz, 143–182. Potsdam: Univer-
sitätsverlag.
Rhetorical relations and verb placement in Old High German 199
Hopper, Paul J.
1979a Some Observations on the Typology of Focus and Aspect in Narrative
Language. Studies in Language 3.1:37–64.
Hopper, Paul J.
1979b Aspect and Foregrounding in Discourse. In Syntax and Semantics, ed.
Talmy Givón, 213–241. San Diego / New York / Berkeley / Boston /
London / Sydney / Tokyo / Toronto: Academic Press, INC.
Karttunen, Lauri
1976 Discourse Referents. In Syntax and Semantics 7: Notes from the Lin-
guistic Underground, ed. James McCawley, 363–385. New York /
San Francisco / London: Academic Press.
Kemenade, Ans van
1987 Syntactic Case and Morphological Case in the History of English.
Dordrecht: Foris Publications.
Krasavina, Olga
this vol. Demonstratives and salience: Towards a functionally taxonomy. this
volume, 31–55.
Krifka, Manfred
2007 Basic notions of information structure. In The Notions of Information
Structure, eds. Caroline Féry, Gisbert Fanselow and Manfred Krifka,
13–56. Potsdam: Universitätsverlag.
Lambrecht, Knud
1994 Information structure and sentence form. Topic, focus and the mental
representations of discourse referents. Cambridge: Cambridge Uni-
versity Press.
Lenerz, Jürgen
1984 Syntaktischer Wandel und Grammatiktheorie. Eine Untersuchung an
Beispielen aus der Sprachgeschichte des Deutschen. Tübingen: Max
Niemeyer.
Lightfoot, David
1999 The Development of Language. Acquistion, Change, and Evolution.
Malden / Oxford: Blackwell.
Mann, William C., and Sandra A.Thompson
1988 Rhetorical Structure Theory: Toward a functional theory of text orga-
nization. Text. An interdisciplinary journal for the study of discourse
8:243–281.
Masser, Achim ed.
1994 Die lateinisch-althochdeutsche Tatianbilingue Stiftsbibliothek St. Gal-
len Cod. 56. Göttingen: Vandenhoeck & Ruprecht.
200 Roland Hinterhölzl and Svetlana Petrova
Masser, Achim
1997a Syntaxprobleme im althochdeutschen Tatian. In Semantik der syntak-
tischen Beziehungen. Akten des Pariser Kolloquiums zur Erforschung
des Althochdeutschen 1994, ed. Yvon Desportes, 123–140. Heidel-
berg: Carl Winter.
Masser, Achim
1997b Wege zu gesprochenem Althochdeutsch. In Grammatica Ianua Ar-
tium. Festschrift für Rolf Bergmann zum 60. Geburtstag, eds. Elvira
Glaser and Michael Schlaefer, 49–70. Heidelberg: Carl Winter.
Molnár, Valéria
1993 Zur Pragmatik und Grammatik des TOPIK-Begriffs. In Wortstellung
und Informationsstruktur, ed. Marga Reis, 155–202. Tübingen: Max
Niemeyer.
Önnerfors, Olaf
1997 Verb-erst-Deklarativsätze. Grammatik und Pragmatik. Stockholm:
Almquist & Wiskell International.
Petrova, Svetlana
2006 A discourse-based approach to verb placement in early West-Germa-
nic. In Interdisciplinary Studies on Information Structure (ISIS) 5,
eds. Shinichiro Ishihara, Michaela Schmitz and Anne Schwarz, 153–
185. Potsdam: Universitätsverlag.
Petrova, Svetlana and Michael Solf
2008 Rhetorical relations and verb placement in early Germanic. A cross
linguistic study. In ‘Subordination’ vs. ‘coordination’ in sentence and
text – from a cross-linguistic perspective, eds. Cathrine Fabricius-
Hansen and Wiebke Ramm, 329–351. Amsterdam / Philadelphia:
John Benjamins Publishing Company.
Petrova, Svetlana and Michael Solf
2009 Die Entwicklung von Verbzweit im Fragesatz. Die Evidenz im Al-
thochdeutschen. Beiträge zur Geschichte der deutschen Sprache und
Literatur 131 (1):6–49.
Petrova, Svetlana and Michael Solf
2010 Pronominale Wiederaufnahme im ältesten Deutsch. Personal-
vs. Demonstrativpronomen im Althochdeutschen. In Historische
Textgrammatik und historische Syntax des Deutschen: Traditionen,
Innovationen, Perspektiven, ed. Arne Ziegler and Christian Braun,
339–365, Berlin: Walter de Gruyter.
Petrova, Svetlana, Michael Solf, Julia Ritz, Christian Chiarcos and Amir Zeldes
2009 Building and using a richly annotated interlinear diachronic corpus:
the case of Old High German Tatian. In Natural Language Process-
ing for Ancient Languages, eds. Joseph Denooz and Serge Rosmor-
Rhetorical relations and verb placement in Old High German 201
John D. Kelleher
1. Introduction
entered the model via the non-linguistic modalities, in this instance the visual
context of the dialog. By contrast, the references the engine and the boxcar in
13.3 are examples of anaphoric references. The reference the engine can be re-
solved relative to the linguistic context by binding it to the representation of an
engine introduced to the linguistic context by the resolution of 11.7. Similarly,
the reference the boxcar can be resolved relative to the linguistic context by
binding it to the representation of a boxcar introduced by the resolution of 11.7.
Most forms of referring expression have a preferred mode of interpretation,
for example, anaphoric, exophoric. For example, pronouns are typically in-
terpreted anaphorically. However, there is no one-to-one relationship between
form and mode of interpretation. For example, definite descriptions can be used
either anaphorically or exophorically. Indeed, the two most common cases of
definite descriptions in the TRAINS corpus of situated dialogue were anaphoric
and exophoric definites (Poesio 1993). One consequence of the one-to-many
relationship between referential form and mode of interpretation is that a multi-
modal reference resolution process should define a strategy to deal with cases
where different mode of interpretations are suggested for the same reference.
One solution, to this issue, is to define a preference ordering over the dif-
ferent interpretation rules. In Sect. 2 several reference resolution frameworks
that adopt such an approach are reviewed: Poesio (1993); Kievit et al. (2001);
Salmon-Alt and Romary (2001); Landragin and Romary (2003); Gorniak and
Roy (2004); Kelleher et al. (2005).
In contrast with these rule based approaches, in this paper, we develop a
probabilistic framework that addresses the issue of selecting between different
modes of interpretation through a saliency based modelling of the attentional
spread across the set of entities in the discourse domain. This attention based ap-
proach is inspired by psycholinguistic findings, see Sect. 2, that point to a strong
interaction between attention and linguistic reference. We will use the concept
of salience to describe the factors and associated processes that direct attention.
The framework consists of a set a salience models, one for each modality, and
an reference resolution process that for each entity in the discourse context com-
putes an overall attention score by integrating the scoring of that entity by each
of the salience models. The attention score assigned to an entity represents the
probability of that entity being the intended referent of the referring expression.
Thus, the entity with the highest overall attention score after the processing of
a referring expression is selected as the referent for that reference.
The paper is structured as follows: Section 2 reviews related work; Sect. 3
describes the reference resolution framework; Sect. 4 describes an implemen-
tation of the framework; Sect. 5 contains a worked example illustrating how
208 John D. Kelleher
the framework functions; the paper finishes, in Sect. 6, with conclusions and
future work.
2. Related work
1. The dialog participants mutually developed public view of what they are talking
about.
2. The antecedent of an anaphoric reference is the representation of the reference’s
referent that was introduced to the discourse model by a prior referring expression.
3. For alternate models see Hobbs (1985); Mann (1987); Asher and Lascarides (2003).
Visual salience and the other one 209
ordering being defined over the different modes of interpretation for each form
of reference. One issue with this approach is that the set of candidates consid-
ered during any one resolution step is constrained to the set of entities within
the sub-context the resolution step uses to construct the initial set of candidates.
As a result, the system cannot recognise situations where a reference may be
ambiguous between two entities in different sub-contexts, and, consequently, it
may resolve a reference incorrectly rather than initiate a clarification process.
Gorniak and Roy (2004) focus on the resolution of references containing
spatial descriptions. They propose a feed-forward filtering process to reference
resolution. In their framework, each lexical item in the system’s lexicon is asso-
ciated with one or more composer functions. A composer function takes one or
more candidate referents as input and filters this set of candidates by computing
how well each of the candidates fulfils the semantic model defined for the lexi-
cal term. Reference resolution is carried out by chaining the composer functions
associated with the lexical terms in the reference together, i.e. the filtered set of
candidates output by one composer function is used as the input set by the next
composer function in the chain. Gorniak and Roy note that this strategy can
fail if one of the composer functions excludes the target object from the set of
candidates. For example, when interpreting “the leftmost one in the front” the
composer for “leftmost” selects the leftmost objects in the scene, not including
the obvious example of “front” that is not a good example of “leftmost”.
The reference resolution frameworks presented in Salmon-Alt and Romary
(2001), Landragin and Romary (2003) and Kelleher et al. (2005) use the notion
of a reference domain. A reference domain is a structured contextual subset of
the multi-modal dialog context. Reference domains are created in the context
model due to perceptual or linguistic events or conceptual knowledge and are
intended to reflect the mental representation of the event they model. In these
frameworks the resolution process involves: (1) the construction of an under-
specified reference domain, using templates associated with the form of the
reference given; (2) the unification of this underspecified domain with a suit-
able reference domain within the context model; (3) the selection of one of the
elements within the unified reference domain to function as the referent. How-
ever, similar to the frameworks proposed in Kievit et al. (2001) and Gorniak
and Roy (2004), there is the potential for these frameworks to overcommit to a
particular subset of the context during the resolution process. As the resolution
process occurs within a sub-context, whose selection is at least partially driven
by the form of the reference being interpreted, if the wrong reference domain
is selected the intended target object and/or plausible distractor referents, that
may indicate the need for reference clarification, may be excluded from consid-
eration.
Visual salience and the other one 211
3. Approach
linguistic and visual salience models are integrated. The weighting factors α
and β accommodate the preferential relationship between the form of referring
expression and the mode of interpretation. These weights range in value be-
tween 0 and 1. As an example of the possible values these weights might take,
when resolving a pronominal reference setting α = 0.1 and β = 0.9 would pref-
erence the resolution process towards entities with a high linguistic salience.
Ideally, however, these weights should be derived from an empirical analysis
of the preferred mode of interpretation for each form of reference. The resulting
integrated salience scores are then used to rank the candidate referents during
the resolution process, with the candidate scoring the highest selected as the
referent.
IS = ((α ∗ VS + β ∗ LS))/2 (1)
The framework distinguishes three stages of salience integration.
Stage 1 This stage computes the salience of the entities in the context prior to
the processing of a given referring expression. It includes the basic visual
salience (i.e. the prominence of an entity due to bottom up visual cues)
and linguistic salience (i.e. the prominence of an entity due to previous
discourse) of an object.
Stage 2 This stage computes the salience of each entity within the context pro-
vided by the referring expression being resolved. For each entity two
salience scores result from this stage of processing: a reference relative
visual salience and a reference relative linguistic salience. These salience
scores are computed by integrating each entities basic visual salience and
basic linguistic salience with a rating of how well the entity fulfils the
description provided in the referring expression.
Stage 3 For each entity in the context this stage computes an overall attention
score by integrating the entity’s salience scores that resulted from Stage
2. This overall salience is computed using a weighted integration, where
the weights used reflect the biasing associated with different forms of ref-
erence toward a particular information source.
The flow of information during reference resolution is from Stage 1 to Stage 3.
Algorithm 1 lists the basic steps in reference resolution.
One advantage of this approach is that resolution process occurs within the
full multi-modal context of the dialog, in so far as the the referent is selected
from a full list of the objects in the multi-modal context ordered by a model
of integrated salience. Consequently, none of the objects in the context are ex-
cluded from consideration. As a result situations where the intended target of the
reference is erroneously excluded, due to an individual assumption within the
resolution process, are avoided. Also, the framework can recognise cross-modal
ambiguity by comparing the integrated salience of the primary candidate with
the integrated salience of all the other objects in the context. In these ambigu-
ous cases the initiation of a clarification dialog may be a better system response
rather than the selection of the primary candidate referent. By contrast, many of
the previous multi-modal resolution frameworks exclude entities in the multi-
modal context model from consideration before the selection of the referent. In
some cases, for example Kievit et al. (2001); Salmon-Alt and Romary (2001);
Landragin and Romary (2003); Kelleher et al. (2005), the initial set of candi-
dates referents is restricted to a sub-set of the context based on preferences with
respect to the mode of interpretation relative to the form of reference. In other
frameworks, for example Gorniak and Roy (2004), candidate referents are in-
crementally excluded from consideration as the resolution process progresses
due to the sequential manner that the semantics of the terms within the reference
are processed.
Moreover, from a functional perspective this approach has the advantage of
modularity and the potential to accommodate learning within the system. The
modularity of the framework stems from the fact that the only information re-
quired by the resolution process from each of the information sources (language
and vision) within the context are the salience scores for each entity. As a result,
the resolution process is, to a large extent, decoupled from the representations
and processes used within the linguistic and visual context models. The learn-
ing aspect of the system arises from the ease (relative to rule based approaches)
with which the integration weightings associated with a particular form of ref-
erence could be updated, for example, using machine learning techniques such
as reinforcement learning. Finally, from a cognitive perspective, an attention
based model fits the theoretical and psychological data that points to the role of
attention within human reference resolution.
4. Implementation
The resolution framework described in Sect. 3 has been used to update the LIVE
system’s (Kelleher et al. 2005) approach to reference resolution. In the follow-
214 John D. Kelleher
ing sections the data structures, salience algorithms and reference resolution
algorithms used by the updated system are described.
id = String value
visual salience
a) basic = [0 . . .1]
b) reference-relative = [0 . . .1]
linguistic salience
a) basic = [0 . . .1]
b) reference-relative = [0 . . .1]
integrated salience = [0 . . . 1]
Figure 3. A Coreference Class
New coreference classes are added to the context model as a result of visual pro-
cessing. Each time an object is detected in the visual scene the context model is
queried for the coreference class representing the object. If there is no corefer-
ence class for the object model a new coreference class is created and is assigned
the id used by the vision processing. The basic visual salience component is ini-
tialised to the value created by vision processing when the object was detected.
This is updated after each scene is rendered. All the other salience scores are
initialised to 0. These components are updated after each utterance has been
processed. Coreference classes are removed from the context model when both
their basic visual and linguistic saliences fall below a threshold (.0001). In the
following sections the algorithms that provide and use the information stored
in these structures are described.
computed in parallel across the visual field and these are then combined into
a single saliency map. Then a selection process deploys attention to locations
in decreasing order of salience. In Kelleher and van Genabith (2004) a simple
model of visual salience (based on object size and centrality relative to a focus
of visual attention) was presented. In this paper we adopt use this model to
capture the information entering the discourse through vision.
The visual salience algorithm uses a false-colouring technique. Each object
in the simulation is assigned a unique colour or vision-id. This colour differs
from the normal colours used to render the object in the world; hence the term
false colouring. Each frame is rendered twice: firstly, using the objects’ normal
colours, textures and shading, and secondly, using the vision-ids. The first ren-
dering is on screen (i.e. the user sees it), the second rendering may be off screen.
After each frame is rendered, a bitmap image of the false colour rendering is
created. The bitmap is then scanned and a list of the colours in the image is
created. Using this list the system can recognise which objects are visible and
which are not. Moreover, the system can identify, at the pixel level, the area
covered by each object in the scene. This pixel information is used to compute
the basic visual saliency of each object.
Mimicking the spread of visual acuity across the retina, the algorithm
weights each pixel in the image based on its distance from the point of visual fo-
cus. The weighting is computed using Equation 2. In this equation, D equals the
distance between the pixel being weighted and the point of focus, M equals the
maximum distance between the point of focus and any point on the border of the
image. The point of focus can be determined using eye tracking technology to
compute the user’s gaze at each scene rendering. However, if eye tracking is not
being used the point of focus defaults to the center of the image or to the center
of silhouette of the last object referred to. Algorithm 2 lists the procedure used
to compute basic visual saliency and to update the coreference classes. For each
scene processed the algorithm returns a list of objects in the scene each with a
relative salience between 0 and 1, with 1 representing maximum salience.
D
Weighting = 1 − (2)
M+1
to in that utterance. This set is partially ordered to reflect the relative promi-
nence of the referring expressions within the utterance. Grammatical roles are
a major factor here, so that subject > object > other. The central compo-
nent of the algorithm is a function sf that maps the objects in a domain D to the
set {0, . . . , 1}, with the intuition that 0 represents non-salience and 1 maximal
salience. Figure 4 defines the salience function sf used by the framework. The
algorithm assumes that in the initial situation s0 all the objects in the domain
are equally (not) salient: sf (s0 , d) = 0 for all d ∈ D.
It should be noted that, although this algorithm is inspired by the Centering
framework of Grosz et al. (1995) it does deviate from Centering is so far as
it is recursive and assigns salience scores to entities not mentioned in the im-
Algorithm 3 The basic linguistic salience algorithm. BLS = basic linguistic salience.
Let TotalDS = 0
for each coreference class CRi do
CRi .BLS = sf (sj , CRi )
TotalDS = TotalDS + CCi .BLS
endfor
for each coreference class CCi do
CRi .BLS = CRi .BLS/TotalDS
endfor
visual context and multiplication is used for integration in the linguistic con-
text. Consequently, an object’s reference relative visual salience will be > 0 if
it fulfils any of the selectional preferences in the description, and its linguistic
reference relative salience will be = 0 if it does not fulfill all of the selectional
preferences in the description. Algorithm 4 lists the algorithm for computing
the reference relative saliences.
nite descriptions versus pronominal references) being resolved and reflect the
preferential interpretation associated with each type of reference. For exam-
ple, in general, a pronoun is used to refer to a referent that is prominent within
the linguistic context. By contrast, a definite description can be used to refer
to an object from the visual scene and to previously mentioned objects. Ide-
ally, these weights would be set based on an empirical examination of a multi-
modal corpora. However, the system currently uses predefined weights for this
integration. When resolving a definite description visual and linguistic salience
are integrated evenly. When resolving a pronominal reference the integration
weightings used biases towards linguistic salience. Algorithm 5 defines the pro-
cedure used to construct the integrated context and select the reference. It also
defines the mechanism used to check for ambiguous references. This ambigu-
ity check uses a predefined confidence interval and simply checks that within
the context provided by the referring expression the integrated salience of the
object selected as the referent is sufficiently larger than the other objects in the
context to ensure that the reference is not ambiguous. In situations where the
ambiguity check fails the algorithm returns 0.
Algorithm 5 Constructing the integrated context and selecting the references. RVS =
reference relative visual salience, RLS = reference relative linguistic salience.
5. Worked Example
of the set of elements that other designates its referent as being excluded from
often requires information from prior discourse and the set of elements that the
referent may be selected from may contain elements from the visual context
that have not been mentioned previously.
It has long been observed that the nominal anaphor one can be resolved only
in relation to the discourse context. However, it gains new importance where
references of the form the green one or the other one get evaluated. For one
thing it is perfectly possible that while the resolution of the pronoun one is to
a noun used in the antecedent discourse, the referent of the noun phrase (NP)
belongs only to the visual context. Thus the red one can refer to a house that
is visible in the scene but has not yet been mentioned, although this is only
because the word house occurs in an earlier utterance that the red one can be
interpreted as the red house.
Neither of these phenomena effect the weightings used to integrate the vi-
sual and linguistic salience. However, one anaphora does effect the selectional
preference used in the computation of the overall salience of its NP and other
anaphora effects the selection of its NP’s referent once the overall saliencies
have been computed.
When interpreting a one-anaphoric NP, the pronoun one is resolved to the
most recent NP in the the discourse that referred to the referent with the highest
linguistic salience in the context model. It inherits all the selectional prefer-
ences specified in its antecedent NP that do not contradict selectional prefer-
ences specified in the one-anaphoric NP. For example, assuming the red house
on the right is the most recent reference in the discourse the reference the green
one is interpreted as the green house on the right.
Other also functions as an anaphor of sorts – it presupposes that the con-
text contains a set X of objects of the relevant kind and a proper subset Y of
X that its referent is excluded from. The referent of the NP containing other is
then selected from the difference between these two sets, X − Y . Consequently,
in order to resolve other anaphora the subset of the context that other specifies
its referent is excluded from (i.e. Y ) must be defined. In this framework this
subset is defined as the containing the element with the highest overall refer-
ential salience computed relative to the selectional preferences encoded in the
NP modified by other. Assuming a reasonable distribution of visual salience,
the updating of linguistic salience and the constraint that an object’s reference
based linguistic salience can only be > 0 if it fulfils all the selectional prefer-
ences encoded in the current NP imposes a partial ordering on the reference
resolution context:
222 John D. Kelleher
1. objects that have previously been mentioned and that fulfil the selectional
preferences encoded in the NP being interpreted (internally ordered by lin-
guistic and visual salience)
2. objects that fulfil the selectional preferences encoded in the NP being inter-
preted (internally ordered by visual salience)
3. other objects in the context (internally ordered by visual salience)
Consequently, by excluding the most salient element in the reference resolution
context the most recently mentioned object that fulfils the selectional prefer-
ences encoded in the NP is excluded. For example, during the interpretation of
a reference the other house the most recently mentioned house would be ex-
cluded from the set of candidate referents and the reference would be resolved
to the next most salient house in the context.
Table 2 lists the data computed by the framework during the different stages
of this interaction. Rows 1, 2 and 3 present the visual and linguistic saliencies
of the objects in Figure 5 before any commands are input; H2’s visual salience
is higher than H1 and H3’s because it is closer to the centre of the image, which
is the default point of visual focus in the visual saliency algorithm.
Rows 4, 5 and 6 present the data computed during the interpretation of the
blue house. This is a definite description so visual and linguistic salience are
weighted equally in the integration process. However, at this point, there has
been no previous discourse so each object’s overall salience is dependent on
the interaction between its visual salience and its visual f-score. H2 fulfils two
of the selectional preferences encoded in the reference blue and house. Con-
sequently, its visual f-score is 2. The other houses in the scene only fulfil the
type selectional preference and their visual f-scores are 1. Primarily due to the
difference in f-scores H2 achieves the highest overall salience and is selected
as the referent. The asterix in the overall salience column (os column) of row
5, H2’s row, highlights this.
Rows 7, 8, and 9 list the data computed when the reference it is processed.
Note, that H2 has a linguistic salience of 1 because it was selected as the referent
for the subject in the preceding utterance. The pronoun it does not encode any
selectional preferences. Consequently, all the objects visual and linguistic f-
scores are set to the default values. The biasing towards linguistic salience in
the computation of the overall saliencies is apparent in the dominance of H2’s
integrated salience and again it is selected as the referent.
Rows 10 to 12 list data computed when the red house on the right is pro-
cessed. There are three selectional preference in this reference red, house and
on the right and the visual and linguistic f-scores are computed accordingly. As
H2 does not fulfil all the selectional preferences its linguistic f-score is 0. This
Visual salience and the other one 223
results in its linguistic salience being discounted in the resolution process. Con-
sequently, H3 achieves the highest overall salience due to its dominant visual
f-score.
Rows 13 to 15 list the data computed during the processing of the other one.
As noted in Sect. 4.5, the pronoun one inherits its selectional preferences from
the most recent NP in the preceding utterance that referred to the referent with
the highest linguistic salience in the context model. This results in the reference
being interpreted as the other red house on the right. In Sect. 4.5, the effect of
the modifier other on its NP was also presented. It does not introduce a selec-
tional preference into the process of computing the overall saliencies, rather it
affects the selection of the referent once these saliencies have been computed.
Consequently, the overall saliencies are computed using the same selectional
preferences as the preceding reference: red, house and on the right. As a result,
H1, H2 and H3 have the same visual and linguistic f-scores. However, as H3
was selected as the referent of the preceding subject NP it now has the highest
linguistic salience in the context 0.8889. Moreover, it has the maximum visual
f-score. These two factors result in H3 achieving the highest overall salience.
However, due to the modifier other in the reference it is excluded as a candidate
referent. The next most salient object is H1 and it is selected as the referent. H1’s
higher overall ranking relative to H2 is due to two factors: (1) H2 did not ful-
fil all the selectional preferences so its linguistic salience was discounted from
the resolution process; (2) H1 fulfilled more of the selectional preferences than
H2 which resulted in its visual reference based salience being higher. Figure 5
illustrates the visual context at the end of the interaction.
Table 2. Salience scores computed during the example interaction. Acronyms: bvs =
basic visual salience, bls = basic linguistic salience, vfs = visual f-score, lfs = linguistic
f-score, rvs = reference based visual salience, rls = reference based linguistic salience,
os = overall salience.
bvs bls vfs lfs rvs rls os
Initial Context
1 H1 0.2812 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000
2 H2 0.4376 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000
3 H3 0.2812 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000
make the blue house bigger
4 H1 0.2812 0.0000 1.0000 0.0000 0.2556 0.0000 0.2556
5 H2 0.4376 0.0000 2.0000 1.0000 0.4887 0.0000 0.4887*
6 H3 0.2812 0.0000 1.0000 0.0000 0.2556 0.0000 0.2556
make it smaller
7 H1 0.2879 0.0000 0.0000 1.0000 0.2879 0.0000 0.1438
8 H2 0.4348 1.0000 0.0000 1.0000 0.4348 1.0000 0.7124*
9 H3 0.2879 0.0000 0.0000 1.0000 0.2879 0.0000 0.1438
make the red house on the right bigger
10 H1 0.2812 0.0000 2.0000 0.0000 0.3259 0.0000 0.3259
11 H2 0.4376 1.0000 1.0000 0.0000 0.2054 0.0000 0.2054
12 H3 0.2812 0.0000 3.0000 1.0000 0.4687 0.0000 0.4687*
make the other one bigger
13 H1 0.1990 0.0000 2.0000 0.0000 0.3141 0.0000 0.1571*
14 H2 0.3477 0.1111 1.0000 0.0000 0.1925 0.0000 0.0963
15 H3 0.4534 0.8889 3.0000 1.0000 0.4833 1.0000 0.7476
This paper presented an attention based reference resolution framework for vi-
sually situated discourse. The framework uses a weighted integration of visual
and linguistic attention to order the candidate referents within the context. The
candidate with the highest integrated attention score is taken to be the referent.
One advantage of this approach is that the resolution process occurs within the
full multimodal context. As a result situations where the intended target of the
reference is erroneously excluded, due to an individual assumption within the
resolution process, are avoided. Moreover, the system can recognise situations
where attentional cues from different modalities make a reference potentially
ambiguous. From a cognitive perspective the framework meshes well with psy-
Visual salience and the other one 225
cholinguistic results that point to the role of attention within human reference
resolution processes.
Finally, it should be noted that the framework as it currently stands is in-
tended to represent an abstract and preliminary attempt. Several issues need to
be addressed if it is to be used as a component within a dialog systems for less
constrained contexts. In particular, the use of predefined weights for salience in-
tegration is overly simplistic. This issue could be addressed by using a machine
learning algorithm, such as reinforcement learning, to automatically compute
these weights. The visual and linguistic salience algorithms should also be im-
proved. For dialog systems interfacing with virtual environments, the visual
salience algorithm should be extended to at least handle attentional cues such
as colour, motion and location of gaze. If the framework was to be used within a
real-world system, such as a robot dialog system, a computer vision saliency al-
gorithm, such as Itti and Koch (2000), could be adopted. The linguistic saliency
algorithm should also be revised and extended. In particular, the relationship be-
tween the framework’s model of local level attention and a more global model
of discourse structure, such as Grosz and Sidner’s focus stack model or Asher
and Lascarides’ SDRT framework, should be clarified. Fortunately, the mod-
ular nature of the framework makes such modifications possible without major
changes to the overall approach.
References
J.F. Allen and L.K. Schubert
1991 The TRAINS project. Technical report, University of Rochester, De-
partment of Computer Science.
H. Alshawi
1987 Memory and Context for Language Interpretation. Cambridge Uni-
versity Press, Cambridge, UK.
M. Ariel
1990 Accessing noun-phrase antecedents. London: Routeledge.
N. Asher and A. Lascarides
2003 Logics of Conversation. Cmabridge University Press.
I. Duwe and H. Strohner
1997 Towards a cognitive model of linguistic reference. Report: 97/1 – Si-
tuierte Künstliche Kommunikatoren 97/1, Univeristät Bielefeld.
P. Gorniak and D. Roy
2004 Grounded semantic composition for visual scenes. Journal of Artifi-
cial Intelligence Research, 21:429–470.
226 John D. Kelleher
Birgitta Bexten
1. Introduction
Hypertexts are special. Not only do they increasingly combine semiotic func-
tions and create new ways of associative writing and reading. In addition, their
mere tree- or networklike structure provides intellectual challenges for hyper-
text authors, readers and, of course, for discourse linguists. None of them can
completely rely on already existing, polished conventions. Accordingly, all of
them wrestle with new attempts to get a grip on text in hypertext. What are the
criteria for an appropriate text unit? How can I combine those units to form a
coherent whole? How can I mark – and recognise – various text connections,
various link types? How can I decide where to read on when I come across a
hyperlink? From a discourse linguistic point of view, one of the central ques-
tions is what a hypertext’s network structure does to the text. What remains
the same if one compares it to traditional linear texts? What changes? Using the
framework of Centering, in this paper, I demonstrate how hypertext differs from
linear text in its capability and its need to provide multiple so-called preferred
centers in a single utterance.
230 Birgitta Bexten
Hypertexts, narrative just as well as scientific ones, go beyond the scope of tra-
ditional, linear texts. While traditional texts usually only come up with a single
reading sequence,1 hypertexts are bound to provide a selection of different se-
quences. Without this possibility, neither treelike nor networklike hypertexts
would be possible. This quality affects the text itself just as well as the hyper-
text’s author and reader. As far as the text is concerned, it has to adapt to the
exceptional discourse structure. Due to the hyperlinks, the text splits up2 and
becomes plurilinear.3 Consequently, to form a coherent whole, not only the text
within every single hypertext node has to be coherent but also the connections
between the linked nodes. For the hypertext author, this means that he has to
arrange text and links very carefully. Especially so as he cannot tell for sure
which units the reader has already perceived. This is only possible in extremely
well-planned hypertexts. The easiest way to master a networked text surely is
to cut off direct pronominal connections between the information units and to
use referring nouns instead (Kuhlen 1991), but this way of granulating text is
surely not desirable for every kind of hypertext. Especially authors of fictional
hypertexts can fall back on direct utterance-connecting devices.
Surely, the author can add hypertext-specific coherence cues like overviews,
lists of currently available target units and so forth (Storrer 2002), but the read-
ability of the hypertext mainly depends on coherent connections between the
single units. The sequence of the hypertext units depends on each individual
reader’s decision to follow hyperlinks and this might vary every time the hy-
pertext is being read. From the reader’s point of view, this means that the pluri-
linear structure requires the reader to decide whether he wants to read on in the
current unit or in the link’s target unit.
The readers’s decision is influenced by his expectations about how the two
offered text strings might proceed. He can deduce clues about this from the ut-
terance he is reading at the very moment: usually the most salient entities of a
1. Even though some texts do offer various reading sequences via footnotes, but, in
contrast to hypertext units, footnotes are not constitutive for the text. (Nielsen 1995)
2. This description only applies to treelike hypertexts with a simple structure. In a more
complex, networklike hypertext, the text not only bifurcates, but also merges at var-
ious points. Even if the reader relinearises (parts of) the text while reading, the text
would structurally remain a network. For the purpose of this paper, however, the idea
of recombining text strings can be left aside as it depends more on the backward-
looking aspect text connections than on the here relevant forward-looking one.
3. For the concept of plurilinear texts see Harweg (1974) and for an application of his
model on hypertexts Bexten (2006)
Hypertext and multiple salience centers in the framework of Centering 231
given utterance are most likely to be the topic of the next one. What does this
mean for the two text strings in hypertext? As for the string announced by the
hyperlink, the reader’s curiosity can be satisfied quite easily even without the
need of following the link: he can expect the target unit to provide additional
information about the link-marked entity. Hyperlinks are of course quite salient
just because they differ from the surrounding text by their highlighted design:
conventionally, links are visually marked, e.g. by a different colour.4 In addition
to the link marks, there are also hints that concern the continuation of the cur-
rent unit’s text. This is because not only non-linguistic, paratextual 5 features can
increase an entity’s degree of salience but also linguistic ones. Which entities
might be central in the ongoing text, the reader can tell from their grammatical
role or from their information value, as has been widely described in Centering
Theory. Obviously, this kind of discourse marking does not only apply to pluri-
linear hypertext structures but also to traditional linear discourses. With regard
to hypertexts, however, I consider a combination of the two ways of salience
marking a decisive factor for the establishment of coherence. The first type of
salience facilitates coherent discourse processing in the current hypertext node,
and the second for cross-node discourse continuation.
What is of interest here, is the question in how far the available results from
research on linguistic salience can be used to describe paratextual salience.
To answer this question, it is necessary to find out, whether it is justifiable
to equate linguistic and paratextual salience. For that purpose, I consider the
framework of Centering a promising candidate, for it takes into account the
forward-looking, promise-making character of text. In other words: it detects
how the text enables the reader to conjecture about the text’s continuation. What
is more, the various approaches to Centering indicate that salience can depend
on varying factors. Therefore, to cover more than traditional linear texts, an inte-
gration of a paratextual way to mark salience is, as the following considerations
will point out, not only possible but desirable.
The basic Centering model, as originally developed by Grosz et al. (1995)
and expended for example by Strube & Hahn (1999) only covers questions
of linguistic salience. Therefore, though Centering is a thought-out method to
4. The question of labeling various link types will remain out of consideration here. For
an overview over research on the impact of labeled hyperlinks on the cognitive load
placed upon the reader see DeStefano & LeFevre (2007). For a model of bottom-up
visual salience in a situated-dialog context, see Kelleher (this volume).
5. Genette (1987) defines as paratext those linguistic elements (index, title, etc.) and
non-linguistic elements (fonts, illustrations, etc.) that accompany and present text in
a medium-specific way.
232 Birgitta Bexten
identify the most salient discourse entity, for the purpose of my argumentation,
its focus needs to be extended to non-linguistic ways of salience marking. Af-
ter a short exemplification of the hypertextual double salience, I will illustrate
the fundamental comparability of linguistic and non-linguistic salience in the
framework of Centering.6
6. For different adoptions of Centering Theory see for example Chiarcos, this volume,
who incorporates a variant of Centering Theory into his Mental Salience Frame-
work, and Kelleher, this volume, who computes basic linguistic salience by using an
algorithm that is inspired by the Centering framework.
7. Swigart and Strange (2002, “Discoveries” in “Mouth’s Journey 40.000 Years Ago.
Part 1.”).
8. Swigart and Strange (2002, “Continents” in “Mouth’s Journey 40.000 Years Ago.
Part 1.”)
Hypertext and multiple salience centers in the framework of Centering 233
too; the link marks on the other hand call the reader’s attention to the fact that
in the link’s target node he will find a subsequent text string about continents
as well.9 Naturally, the two types do not always coincide in one and the same
entity. If different entities are salient in the same utterance, one linguistic and
one paratextual, the reader can expect that not only the discourse but also the
topic bifurcates.
The linguistically and the paratextually marked entities are equal in as far
as both enable the reader to predict the ‘aboutness’ of subsequent utterances.
Hence, both can be regarded as preferred centers. In hypertext, their promis-
ing10 character is far more prominent than in linear texts: Linear texts allow the
reader simply to read on while they draw his attention more or less unnoticed
on the preferred centers. Hypertext forces the reader at least partly to give up
this comparatively passive role. With every paratextually marked hyperlink, the
hypertext obliges the reader’s attention to undergo a rapid oscillation between
between looking through and looking at the text (Bolter 1991), for hypertext
offers two different promises about two separated text strings by two different
kinds of salience.
Text bifurcations in hypertext are funded on the possibility to provide a sin-
gle utterance with various preferred centers. According to Grosz et al. (1995), in
principle, every utterance can have more than one preferred center. Centers are
constructs of the discourse: different discourse situations in which one and the
same utterance may be made, can lead to different centers. This multiplication
of centers is nevertheless not on the same level as multiple preferred centers
in hypertext. While in Grosz et al.’s argumentation it is a question of either
/ or, of various discourse situations, discourse in hypertext construes several
parallel discourse situations. In a networklike hypertext environment, i.e., in a
discourse with a multitude of bifurcations and mergers, some utterances on all
accounts structurally belong to different discourse situations at the same time.
Therefore, multiple preferred centers are an essential presupposition for hyper-
textuality.
9. Admittedly, not all connections between information units are as smooth as this one.
Especially in scientific hypertexts, they are rather rare. For the purpose of my argu-
mentation, however, a quantitative inventory is less important than highlighting the
possibilities. In addition, the discussion below will show, that the observations from
this neat example are transferable to less direct connections.
10. Genette (1987) ascribes all paratexts to some degree an illocutional character: they
inform, demand, promise, etc. Hammwöhner (1997), who describes the changes that
paratext undergoes in hypertext, points at the promising character of hypertext links.
234 Birgitta Bexten
11. Backward-looking centers are not the same as anaphora, e.g. pronouns. Grosz et al.
illustrate that every utterance can contain several pronouns while only a single entity
can function as backward-looking center.
236 Birgitta Bexten
12. The interpretation of the centers can differ between languages. In his research on
Eastern Kanthy, Filchenko (this volume) resorts to the use of the terms foregrounded
center and backgrounded center to mark special pragmatic states.
13. For a first attempt of Centering without backward-looking centers see Strube (1998).
14. The authors make reference to the terms of information status proposed by Prince
(1981) and (1992). The sets of discourse-old and discourse-new entities can be fur-
ther categorised in terms of Prince’s (1981) familiarity scale: hearer-old entities can
be split into evoked > unused while the hearer-new entities can be split into in-
ferable > containing inferable > anchored brand-new > brand-new. An
investigation that compares the relative contribution of syntactic and semantic promi-
nence to the salience of entities is presented by Rose in this volume. Rose’s corpus
analysis indicates that the prediction of subsequent reference is enhanced when join-
ing the information of two salience factors (syntactic and semantic prominence).
Hypertext and multiple salience centers in the framework of Centering 237
15. Nevertheless, I assume that Strube and Hahn’s extended version of the Cf ranking is
an applicable Centering model for hypertexts. It does at least allow for the fact that
pronominal anaphor resolution is hard to combine with the use of hypertext links.
16. This set consists of inferable, containing inferable, and anchored brand-new entities.
17. The difference between linguistic and non-linguistic salience is also described by
Kelleher, this volume, who provides an approach that integrates linguistic and visual
salience to account for situated reference resolution.
238 Birgitta Bexten
ness’ of subsequent utterances. The only difference is, that these utterances do
not occur in the same hypertext node as the link-marked entity itself but in the
link’s target node.
Thus, in hypertext, a single utterance can contain multiple preferred centers.
Speaking in terms of Grosz et al.’s model, each of the salient centers of an
utterance Un , i.e. the one that is ranked highest by grammatical – or functional
– criteria as well as the one that is purely salient by means of link marks, can
be connected to its own backward-looking center. This does not mean that, at
the same time, a single utterance Un+1 could have multiple backward-looking
centers. The Cb ’s belong to different utterances which, again, usually belong to
different hypertext nodes: one to the node in which the link occurs and one to
the target node.18
Link marks can be conceived of as a promise to the reader that he can find
more information about the link-marked entity in the target node. This promise
can only be kept if the target node contains a coreferent entity, or at least one
that shows some kind of connection with the link-marked entity. In other words:
whether the promise is kept, depends on a coherent connection between the hy-
pertext nodes. But even – and this is crucial here – if the readers’s prediction
does not come true, at the time he comes across the hypertext link, the link-
marked entity has to be regarded as a preferred center nevertheless. Even more:
such a case would support my argumention because it illustrates the impact
which the hints about the discourse continuation exert on the question of con-
nectivity. The link mark attracts the reader’s attention and offers him an addi-
tional way to read on. The discourse bifurcates. The reader has to decide which
sequence he wants to follow. When choosing the link, he relies on the link’s
promise that there is a coherent connection with the target node. When stick-
ing to in the current node, he can presume the discourse’s process by the usual,
syntactically or functionally marked preferred center of the current utterance.
In the remainder, I discuss some examples illustrating this phenomenon.
As already mentioned, it is not necessarily the preferred center which is
marked as hyperlink. I can think of two basic positions a hypertext link can have
in respect of the Cp of an utterance. Either the link corresponds to the preferred
forward looking center, as was the the case in example (1) – here repeated as (3)
–, or it does not, as in example (4).19 Naturally, both cases can occur in one and
the same utterance for there can be various link-marked entities at once. To il-
18. For a discussion of the question whether an utterance can have more than one
backward-looking center see Kruiff-Korbayová and Hajičová (1997)
19. Swigart and Strange (2002, ”Who’s on First” in ”The Granville Files. Present Day.
Part 2.”)
Hypertext and multiple salience centers in the framework of Centering 239
20. In Strube and Hahn’s model, forward-looking centers of the same type are ranked
according to their position in text. Thus, in (4)e the ones who is ranked higher than it.
240 Birgitta Bexten
Apart from these two basic cases, two more arrangements can be found. It
also occurs that, as in example (5),21 not a whole constituent is link-marked
but only, e.g, an adjective. In such a case, too, the link could or could not co-
incide with – or rather be part of – the preferred center. The other possibility is
that the link marks go beyond the borders of one constituent as is the case in
example (6).22
(5) a. Bei Simulationen handelt es sich um spezielle
By simulations be it itself about special
interaktive Programme, die dynamische Modelle von
interactive programs, which dynamic models of
Apparaten, Prozessen und Systemen abbilden.
devices, processes and systems represent.
b. Simulations are special interactive programs which represent dy-
namic models of devices, processes and systems.
(6) a. Er nimmt die Sonnenbrille ab, lässt seinen Blick ins
He takes the sunglasses off, lets his gaze in_the
Wolkenkratzergetümmel tauchen, sieht Details und nimmt
skyscraper_turmoil dive, sees details and takes
sein Schreiben wieder auf. Die Buchstaben rennen den
his writing again on. The letters run the
Ereignissen hinterher.
events after.
b. He takes off his sunglasses, lets his gaze dive into the turmoil of
skyscrapers, sees details and resumes his writing. The letters run
after the events.
In example (5), the link only promises more information about interaktive and
not about spezielle interaktive Programme.
In example (6) – apart from the linguistically marked forward-looking center
the letters –, the utterance as a whole functions as a preferred center.
In all examples, due to the links, the discourse forks into two distinct text
strings. In example (3), for instance, the reader can predict the Cp Continents
as being connected to two different subsequences of utterances. Whether this
prediction is right, the reader can only find out by reading on either in the cur-
rent hypertext node or in the target node. For the time being, he can only con-
sider the linguistic and paratextual marks of Continents as promises for such
connections.
Strictly speaking, Continents should therefore be analysed as two Cp ’s, a
linguistic center Cp 1 and paratextual center Cp 2. Each belongs to its own text
string. At first, both strings are combined in the same hypertext node. Then they
split up, and one half continues in the current node and the other in the link’s
target node. This phenomenon becomes more obvious in example (4). Here, the
two different Cp ’s are divided. The linguistically marked Cp 1, the ones who, is
connected to the discourse segment of the current node while the second Cp 2,
thinking, opens a discourse connection to the segment of another node.
At this point, one could argue that in hypertext not utterances are connected
but longer sub-texts and that the question of cross-node forward-looking cen-
ters therefore goes beyond traditional23 Centering. Nevertheless, for two rea-
sons, I consider the Centering model a reasonable approach to reveal coherent
discourse connections in hypertext.
On the one hand, one can doubt that Centering in hypertext necessarily is
about connected sub-texts. Especially – but not exclusively – in fictional24 hy-
pertext like the one in example (3), hypertext links are quite often constructed
as direct connections between utterances. Due to the network structure, the
linked utterances cannot occur as a visible linear sequence but have to be written
down in different hypertext nodes. Nevertheless, the structural linguistic con-
nection between the link-marked entity and the coreferent entity in the – usually
first – utterance in the target node supports an interpretation as a sequence of
utterances.
On the other hand, even if this is not the case, Centering can be used to
describe the role link marks play for the coherence of a discourse. Coherence,
in the Centering framework, can be seen as the discourse’s devices to support a
prediction about how it continues and of finding the prediction fulfilled. Both,
prediction and fulfillment, are founded in the discourse itself. The predicting
part of this progress, which is central here, is fundamentally affected by link
marks.
23. Except from traditional Centering theories that focus on local coherence, there are
attempts to concentrate on global discourse structure as well (e.g. Hahn and Strube
(1997) and Walker (2000)).
24. In non-fictional hypertexts, information retrieval is often more important than a co-
herent text. Links, then, target at text units that do not continue the text, but rather
define the link-marked words. Todesco (1997) even advises to write hypertexts in
such a lexicon-like manner. In this case, the hypertext nodes are rather independent
of each other. An interpretation of linked sub-texts, therefore, seems indeed more
appropriate.
242 Birgitta Bexten
The most logical properties to compare preferred centers and hyperlinks are
their form and their function.
Concerning their function, the preceding elaborations have already demon-
strated that both phenomena match considerably. Structurally, both function as
a forward-looking connecting element between two discourse elements. From
a pragmatic viewpoint, they function as ways to assess the text’s continuation.
But hyperlinks go beyond these functions. Apart from the fact that they rep-
resent an executable connection between virtual information units, they also
announce the bifurcation of the recent text string. The same is true for footnote
marks and links in lexicons, but only the latter always announce the same kind
of connected unit, namely relatively context independent information about the
marked word. Hyperlinks as well as footnote marks can be connected to a much
broader range of targets. The difference between them is, that footnotes can only
announce side strings of the text while hyperlinks often connect equally impor-
tant discourse units.26
25. I limit the argumentation here to those typographic attributes that map emphasis to
written language. On the one hand, taking into consideration typographic marks in
general, would go beyond the scope of this paper – just think of typographic indi-
cators of propositional macro-structures or the marking of technical terms. As far as
I know, there are no studies which concentrate on the effects those markings have
on connections between utterances. On the other hand, results from research on ver-
bal salience, which are even at hand in the framework of Centering, can be adopted.
Which marks are to be regarded as typographic reproduction of prosodic qualities
depends on the given discourse context (Wehde 2000). Therefore, it is impossible to
in- or exclude specific ways of marking.
26. For a further discussion of this matter see Bexten (2006)
Hypertext and multiple salience centers in the framework of Centering 243
For the matter of the form, the marked entity and the way it is marked can
be distinguished.27 First, I will concentrate on the marked entity itself.
The preferred centers and hyperlinks in the examples above both mark lin-
guistic entities. How large this entities are, is mutable for both kinds. For the
matter of linguistic centers, Grosz et al. (1995) allude that centers are seman-
tic objects and therefore must not be confounded with words, phrases, etc.
Also other discourse theories suggest that anaphora connections can be hetero-
syntactical, anaphora can resolve for example between phrases, subphrases or
even conjunctional subordinated clauses (Harweg 1979).28 Here the question is
whether centers as for their size are as flexible as hyperlinks.
In hypertext, every entity can theoretically be used as a link anchor, no mat-
ter how small or large it is. The examples have already shown that whole sen-
tences or mere adjectives can serve as a such.29 Whether a whole constituent or
just a noun or an adjective is link marked, depends on several factors. One thing
that plays a role is the link’s target: if it provides a definition or explication of
the marked word, the surrounding words most likely are not link-marked. This
kind of connection usually comes along with a change of dimension (Harweg
1979), which means that a concrete phenomenon, like the interactive character
of computer programs in example (5), is linked to a general explanation of the
concept of interactivity. Thus, especially non-fictional hypertexts show obvious
parallels to printed lexicons.30 In fictional texts, or generally: when a concrete
27. For pragmatical reasons I exclude links, which are placed outside the actual text,
and non-linguistic elements, like pictures, that function as link anchor. The fact that
non-linguistic elements can be used for thematic connections, points at the smooth
transition between the various medial elements, though.
28. In addition to the focus on traditional, linguistic centers, some theories extend the
concept to other conceptions of salience, esp. discourse-structural salience (as used
by Ramm and Hinterhölzl/Petrova (this volume). They describe salience for both
discourse referents and events/discourse segments.
29. And even smaller entities, like letters are link-marked at some websites. In alphabet-
ical lists, this even makes sense, in cases like the one of the proudly presented su-
perlink (http://www.stangl-taller.at/4711/SIEB.10/NETERATUR/LITTERATUR/
Litteratur.html), however, where nearly every letter works as a hyperlink but lacks
every thematic connection to its target, the function of hyperlinks is completely un-
dermined. The reader can by no means tell which website he will be sent to when
clicking on one of the links. These kinds of hyperlink only function on very exper-
imental websites where text-constitutive phenomena as coherence play a secondary
role.
30. The fact that the link marks in print lexicons conventionally follow the word in ques-
tion comes second here, especially as this kind of marks can be found in hyper-
244 Birgitta Bexten
referent should be referred to, it makes more sense to link-mark the whole con-
stituent.
How about linguistic preferred centers then? First, I will focus on their pos-
sibility to cross the borders of a single constituent. In the following dialogic
sequence of utterances31
(7) a. A: But you said, that you’ld agree!
b. B: I never said that!
the whole indirect quotation functions as a forward-looking center. To be a re-
alistic candidate32 for anaphora resolution in the next utterance, however, in
contrast to larger hyperlinks, such a linguistic center makes certain demands: it
has to be preceded by an appropriate preamble, like But you said. This preamble,
on the other hand, causes the whole quotation to have the value of a constituent.
As example (6) has shown, this is not necessary for hyperlinks.
The next question is, whether forward-looking centers can be smaller than
constituents? As already implied in the discussion of the verb thinking in ex-
ample (4), this seems possible, but here, too, certain demands have to be met.
Consider the following example:
(8) a. In his opinion, the blue sweater did not suit her.
b. Blue just wasn’t her colour.
In a., blue can only be judged as a preferred center if the thematic context allows
it and if, in addition, it is emphasised contrastively. What concerns the context,
it requires that several sweaters in various colours were mentioned before. As
for the emphasis, it could – leaving the context put of consideration – fall on
her or even on sweater just as well, which would cause the preferred center
to shift to these entities. Thus, emphasising as a paralinguistic instrument can
influence the salience hierarchy. The matter of an entities size and the ways
of marking it apparently overlap at this point. As this example indicates and
as Grosz et al. (1995) already have conceded, preferred centers can be marked
in various ways. Certainly, it does not necessarily have to be a small entity,
like an adjective, that is made more salient by means of emphasis. The same
texts, too. Grether (n.d.), for example, uses them to differ between intra- and extra-
hypertext links.
31. For an application of the Centering concept to dialogues see for example Byron &
Stent (1998).
32. Theoretically, every clause can be referred to by a meta statement, and therefore can
count as a forward looking center, but it has to be regarded as being ranked very low
in the salience hierarchy.
Hypertext and multiple salience centers in the framework of Centering 245
33. Her analysis is supported by findings outside Centering Theory. Caldwell (2002),
for example, underlines that in English syntax alone is not enough to foreground an
entity and names among other devices verbal emphasis.
246 Birgitta Bexten
their surroundings, while this does not necessarily apply for hyperlinks.34 On
the other hand, emphasis or typographic marks force both, the forward-looking
center and the reader’s attention, to shift. In contrast, in hypertext, the center of
attention only shifts in the reader’s mind but not in the discourse itself. In the
discourse segment, the first Cp remains where it is while, in addition, a second
Cp arises from the link marks.
The comparison of form and function of linguistic centers and hyperlinks
reveals many parallels. What is more, this comparison demonstrates that it is a
small step from mere linguistic to paralinguistic devices of salience marking.
As mentioned, the latter already are partly integrated in Centering approaches.
Against the background of the presented elaboration, including hyperlink marks
is only consequent.
The analysis of utterances in hypertexts shows that the limit of one preferred
forward-looking center per utterance can be exceeded. Apart from the normal
linguistically marked Cp , the writer can explicitly highlight additional Cp ’s. The
first Cp corresponds to the most salient forward-looking center usually analysed
in Centering Theory. The additional Cp ’s, the hypertext links, can be entities
with a low degree of linguistic salience. They usually are marked paratextually
by being, for example, underlined and coloured. Both kinds of Cp ’s are equal in
as far as both permit the reader to predict the ‘aboutness’ of subsequent utter-
ances. Therefore, both contribute to the coherence of discourse in a plurilinear
hypertext environment. In this paper, I proposed to describe the two types in-
tegratively in the Centering framework. While linguistically salient Cp ’s can
sufficiently be analysed with traditional Centering theories, I used Navarretta’s
description of explicit salience marking as theoretical background to show that
an integration of paralinguistic phenomena is perfectly reasonable. Hypertext
links are not necessarily salient in a linguistic way. Hence, in traditional Cen-
tering, they would not be regarded as preferred centers. Referring to less salient
discourse entities, in traditional Centering, would mean to risk the discourse to
become incoherent. Here, I pointed out that with regard to hypertext links this
interpretation is not accurate. Discourse in hypertext allows for rougher shifts
than traditional linear discourse because by highlighting discourse entities as
links, the writer warns the reader to prepare for the thematic shift. Without link
34. At least, this is true from a pure descriptive viewpoint. Keeping the question of co-
herent connections in mind, it is arguable whether it really is advisable to tap the full
potential of hyperlink placement.
Hypertext and multiple salience centers in the framework of Centering 247
marks, the reader could focus his attention only on the linguistically marked pre-
ferred center in the linear sequence of utterances in one hypertext node. Only
the paratextual marks tell him that there is a connection with another text string
and that he should shift his attention if he would want to follow it. Without link
marks there would be no such information and, what is more, there would be no
hypertext. The hypertext network depends on both kinds of preferred centers.
Hence, in contrast to linear texts, coherence in hypertext can only be described
sufficiently in a model that includes linguistic as well as non-linguistic, i.e. para-
textual, salience. In future research such a model could adapt results from Cen-
tering research to the analysis of hypertexts. It is, for example desirable to take
a closer look on the impact that the placement of hyperlinks in an utterance has
on cross-node coherence. Arguing on the assumption that every center shifting
between utterances entails a decrease of coherence, one could assume, that this
holds for cross-node coherence, too. Therefore, it would be plausible to let co-
incide the linguistic and the paratextual center as often as possible. On the other
hand, it is imaginable that this would confuse the reader, because the added in-
formation value would be less comprehensible. An integrative application of
Centering Theory, as suggested above, provides a framework to deal with these
questions.
References
Bexten, Birgitta
2006 Hypertext and Plurilinearity. Challenging an old-fashioned discourse
model. International Symposium. Discourse and Document. Schedae.
Prépublications de l’Université de Caen Basse Normandie. Press uni-
versitaires de Caen, 117–121.
Blumstengel, Astrid
1998 Entwicklung hypermedialer Lernsysteme. Ph.D. thesis, University of
Paderborn, Paderborn. http://dsor.uni-paderborn.de/forschung/publi
kationen/blumstengel-diss/.
Bolter, David J.
1991 Writing space: The computer, hypertext, and the history of writing.
Hilldale etc.: Lawrence Erlbaum Associates, Inc.
Byron, Donna K. and Stent, Amanda J.
1998 A preliminary model of Centering in dialog. Proceedings of the 36th
annual meeting on Association for Computational Linguistics, Mor-
riston : Association for Computational Linguists, 1475–1477.
Caldwell, Thomas P.
2002 Topic-Comment Effects in English. Meisei Review, 17: 49–69.
248 Birgitta Bexten
Chiarcos, Christian
this volume The Mental Salience Framework: Context-adequate generation of re-
ferring expressions. this volume, 105–139.
DeStefano, Diana and Jo-Anne LeFevre
2007 Cognitive load in hypertext reading. A review. Computing Human
Behaviour, 23: 1616–1641.
Genette, Gérard
1987 Seuils. Paris: Éditions du Seuil.
Grether, Reinhold
n.d. Die Weltrevolution nach Flusser. http://www.flusser.de/.
Grosz, Barbara J., Aravind K. Joshi, and Scott Weinstein
1995 Centering: A Framework for Modeling the Local Coherence of Dis-
course. Computational Linguistics, 21(2): 203–225.
Hahn, Udo and Michael Strube
1997 Centering in-the-Large: Computing Referential Discourse Segments.
Proceedings of the 35th Annual Meeting of the Association for Com-
putational Linguistics and the 8th Conference of the European Chap-
ter of the Association for Computational Linguistics, Madrid, 104–
111.
Hammwöhner, Rainer
1997 Offene Hypertextsysteme. Das Konstanzer Hypertextsystem (KHS)
im wissenschaftlichen und technischen Kontext. Universitäts-Verlag,
Konstanz (= Schriften zur Informationswissenschaft; 32).
Harweg, Roland
1974 Bifurcations de textes. Semiotika 12: 41–59.
Harweg, Roland
1979 [1968] Pronomina und Textkonstitution. München: Wilhelm Fink Verlag.
Hinterhölzl, Roland and Svetlana Petrova
this volume Rhetorical relations and verb placement in Old High German. this
volume, 173–201.
Kelleher, John D.
this volume Visual Salience and the Other One. this volume, 205–228.
Kruijff-Korbayová, Irina and Eva Hajicová
1997 Topics and Centers. A Comparision of the Salience-Based Approach
and the Centering Theory. The Prague Bulletin of Mathematical Lin-
guistics, 67: 25–50.
Kuhlen, Rainer
1991 Hypertext. Ein nicht-lineares Medium zwischen Buch und Wissens-
bank. Springer, Berlin, Heidelberg, New York etc. (Edition SEL-Stif-
tung).
Hypertext and multiple salience centers in the framework of Centering 249
Navarretta, Constanza
2002 Combining Information Structure and Centering-based Models
of Salience for Resolving Intersentential Pronominal Anaphora.
In: António Branco, Tony McEnery and Ruslan Mitkov, edi-
tors,Proceedings of DAARC 2002 – 4h Discourse Anaphora and
Anaphora Resolution Colloquium, Lisbon, September, 135–140.
Nielsen, Jakob
1995 Multimedia and hypertext: the internet and beyond. Boston u.a.: Aca-
demic Press Professional, Inc.
Ohler, Normen
1995 Die Quotenmaschine. http://home.ph-freiburg.de/kepser/qm/.
Prince, Ellen F.
1981 Toward a Taxonomy of Given-New Information. In: Peter Cole, edi-
tor, Radical Pragmatics. Academic Press, New York, 223–255.
Prince, Ellen F.
1992 The ZPG Letter: Subjects, Definiteness, and Information-Status. In:
William C. Mann and Sandra A. Thompson, editors. Discourse De-
scription: Diverse Analyses of a Fund Raising Text, John Benjamins
B.V., Philadelphia, Amsterdam, 295–325.
Ramm, Wiebke
this volume Discourse-structural salience from a cross-linguistic perspective: Co-
ordination and its contribution to discourse (structure). this volume,
143–172.
Rose, Ralph L.
this volume Joint Information Value of Syntactic and Semantic Prominence for
Subsequent Pronominal Reference. this volume, 81–103.
Scharl, Arno
2000 Evolutionary Web Development. Automated Analysis, Adaptive De-
sign, and Interactive Visualization of Commercial Web Information
Systems London: Springer.
Storrer, Angelika
2002 Coherence in Text and Hypertext. Document Design, 3(2):156–168.
Strube, Michael
1998 Never Look Back: An Alternative to Centering. Proceedings of the
17th International Conference on Computational Linguistics, Asso-
ciation for Computational Linguistics, Morristown, NJ, USA, 1251–
1257.
Strube, Michael and Udo Hahn
1999 Functional Centering: Grounding Referential Coherence in Informa-
tion Structure. Computational Linguistics, 25(3):309–344.
250 Birgitta Bexten
Berry Claus
1. Introduction
(1) John was preparing for a marathon in August. After doing a few warm-
up exercises, he put on (associated) / took off (dissociated) his sweat-
shirt and went jogging. He jogged halfway around the lake without too
much difficulty. Probe: sweatshirt
This finding received much attention and has been considered an elegant sup-
port for the notion that comprehenders construct situational representations
which guide comprehension. However, McKoon and Ratcliff claimed that there
is an alternative interpretation of the finding. They argued that the difference in
accessibility between the associated and dissociated condition can be explained
without assuming that comprehenders construct situational representations. Ac-
cording to their interpretation, the result does not reflect the spatial distance in
252 Berry Claus
1. It should be noted that this is also one of the arguments in Glenberg’s reply (see
Glenberg and Matthew, 1992) to McKoon and Ratcliff (1992).
A simulation view account of salience 253
other extra-linguistic factors, see Bexten, this volume, who addresses the is-
sue of hypertextual salience marking; Kelleher, this volume, who developed a
model of reference resolution in situated communication that takes into account
visual salience; see also Rose, this volume, for an account of pronominal ref-
erence that joins the information of syntactic and semantic prominence). The
theoretical point of departure is the simulation view of language comprehen-
sion. According to this view, language comprehension is tantamount to men-
tally simulating the experience of the described situation. It should be noted in
advance that the paper is not intended as providing a full account of salience.
Its purpose is to outline a simulation view account of salience which claims that
salience may derive from mental simulations constructed during narrative text
comprehension.
The next section will provide a brief overview of the simulation view of
language comprehension. Section 3 will give an account of how mental simula-
tions during language comprehension may affect salience. Section 4 will report
empirical support for the simulation view account of salience. Section 5 will
conclude with some final remarks.
Finally, there is also empirical evidence for the simulation view of language
from behavioural studies concerned with the representation of abstract infor-
mation, such as descriptions of non-physical transfer (Glenberg and Kaschak,
2002), desiderative sentence mood (Claus, 2008), or negation (Kaup, Lüdtke,
and Zwaan, 2006; Kaup, Yaxley, Madden, Zwaan, and Lüdtke, 2007).
Taken together, the findings from neuroscientific and behavioural studies
provide strong empirical support for the view that language comprehension in-
volves embodied mental simulations. The findings are difficult to align with
amodal theories of language comprehension. To be sure, amodal theories could
account for the findings by adding additional assumptions. However, such an
account would be a completely post hoc explanation.
Moreover, amodal theories suffer from two inherent problems. The trans-
duction problem refers to the lack of an account as to how amodal abstract
symbols emerge in the mind, that is, how perceptual experiences are transduced
into arbitrary symbols (Barsalou, 1999; see also Brooks, 1987). The reverse of
this problem is the symbol grounding problem. It pertains to the question as
to how amodal abstract symbols are mapped back onto the world, that is, how
the meaning of arbitrary symbols is grounded (Harnad, 1990). Both problems
do not exist in embodied theories of cognition which assume that meaning is
grounded in perception and action.
At present, the embodied account of language comprehension is not yet a
full-fledged theory. Most of the studies that investigated predictions of the sim-
ulation view were concerned with the processing and representation of narrated
concrete situations. The presently available evidence for the simulation view is
limited to narrative text comprehension. There are currently no studies within
256 Berry Claus
this framework which address the issue of expository texts. What is also still
lacking are substantial theoretical approaches and empirical evidence regard-
ing the question as to how embodied theories of language comprehension can
account for issues such as abstract concepts and function words. However, the
results of the above mentioned studies concerning abstract transfer, sentence
mood, and negation are promising with regard to future research within the
simulation view framework.
The next section considers what the simulation view of language compre-
hension in its present state can contribute to the issue of salience. The scope of
the considerations is limited to nonlinguistic aspects of the described situations
during the comprehension of narrative texts.
2. Indeed, there seems to be wide agreement across different theories on text compre-
hension that the most salient entities are part of the available working-memory repre-
sentation. However, there is disagreement on the question of which factors constrain
the available set of entities.
A simulation view account of salience 257
assumed to be experiential and perspectival. Let’s first consider these two char-
acteristics and then turn back to the question.
(4) John was on his way to school. He was terribly worried about the maths
lesson. He thought he might not be able to control the class again today.
It was not a normal part of a janitor’s duties.
When reading the first sentence, most people will simulate a pupil on his way
to school. From an experiential simulation view this can be attributed to the
fact that for most comprehenders, the majority of memory traces of way-to-
school experiences originate from their own school days. Hence, most com-
prehenders are led up the garden path by an experientially biased simulation,
resulting in difficulties when processing the third sentence. According to the
simulation view, a teacher, who would read the narrative in (4) might construct
a differentially biased simulation, resulting in processing difficulties with the
last sentence when John turns out to be a janitor.
Mental simulations of described situations are not only shaped by unique ex-
periences. In particular, they are also constrained by basic principles underlying
human experience of the world such as temporal and spatial organization.
Time plays a central role in how we experience the world. The temporal
dimension can be considered to be the most important one in structuring our
experiences (cf. Navon, 1978). In experiencing, we conceive time as continu-
ously extending from past to present to future. The present is mentally set off
against the past and the future. More precisely, the situation that exists at the
258 Berry Claus
(5) a. Bill was sitting in the living room reading the paper when John
came into the living room.
b. Bill was sitting in the living room reading the paper when John
went into the living room.
Reading times for the perspective-shift sentences were found to be prolonged
compared with reading times for the perspective-consistent sentences, indicat-
ing that the participants adopted the spatial point of view of the first character.
This conclusion is further bolstered by the additional finding that participants
made systematic errors in recalling the perspective-shift sentences by replacing
went by came.
However, spatial point of view is merely one type of perspective. There
is empirical evidence that comprehenders track the protagonist’s mental per-
spective as well. Studies concerning the representation of emotions suggest
that comprehenders infer unmentioned emotional states of protagonists (Gerns-
bacher, Goldsmith and Robertson, 1992; Gernsbacher and Robertson, 1992).
Other findings indicate that comprehenders also infer non-explicitly stated goals
of protagonists (Long and Golding, 1993; Poynor and Morris, 2003). In ad-
dition, there is evidence that comprehenders keep track of the protagonist’s
knowledge/ignorance (Barquero, 1999; de Vega, Díaz and León, 1997). A study
by Sanford, Clegg, and Majid (1998) suggests that states of affairs being men-
tioned in a narrative are generally mentally coded in terms of their significance
to the protagonist. The results indicate that a background information sentence,
such as (6), is interpreted as being experienced by the protagonist.
(6) The air was hot and sticky.
As mental simulations that are constructed during language comprehension are
assumed to be experiential in nature, they are biased by the comprehender’s
current and past personal experiences. Accordingly, mental simulations con-
structed during language comprehension should be biased also by the compre-
hender’s perspective. Indeed, empirical studies indicate that language compre-
hension is affected by the comprehender’s personality. Findings of Zwaan and
Truitt (1998) indicate that smokers and non-smokers differ with regard to pro-
cessing smoking-related sentences. A study by Holt and Beilock (2006) sug-
gests that novice and expert ice hockey players and novice and expert football
players construct different mental simulations of described hockey-specific sit-
uations and football-specific situations, respectively.
Let’s now turn back to the question as to what the simulation view implies
with regard to the issue as to which entities of a narrative constitute the salient
elements at a given moment during text comprehension.
260 Berry Claus
4. Empirical Evidence
This section will report findings which provide empirical support for the claim
that salience may derive from mental simulations during language comprehen-
sion. Before turning to these findings, some remarks on the measurement of
salience have to be made.
A simulation view account of salience 261
3. The majority of these studies tested mental accessibility by measuring reaction times
on a probe-recognition task. In a typical probe-recognition task experiment, partici-
pants read texts sentence by sentence at a self-paced rate. At a given moment (either
during or at the end of the text presentation), they are presented with a probe word.
Their task is to indicate as quickly and accurately as possible whether or not the word
was mentioned in the text.
262 Berry Claus
are less accessible (e.g., Anderson, Garrod, and Sanford, 1983; Bestgen and
Vonk, 1995; Carreiras, Carriedo, Alonso, and Fernández, 1997; Magliano and
Schleich, 2000; Speer and Zacks, 2005; Zwaan, 1996; Zwaan, Madden, and
Whitten, 2000).
In one of the experiments of the study by Carreiras and colleagues (1997,
Experiment 1), participants read short narratives and were tested for the mental
accessibility of a job description (e.g., economist) that was mentioned in the
narrative either in a sentence describing the protagonist’s current situation, as
in (7a), or in a comparable sentence referring to the protagonist’s past, as in (7b).
(7) a. Now she works as an economist for an international company.
b. Sometime in the past she worked as an economist for an interna-
tional company.
The job description was found to be more accessible when it was presented
as currently applying to the protagonist, as in (7a), compared to when it was
presented as not currently applying to the protagonist, as in (7b). Remarkably,
this effect was obtained even when the accessibility was tested immediately
after the manipulated sentence, that is, immediately after reading the sentence
in which the job description was introduced.
Findings by Zwaan, Madden, and Whitten (2000) also indicate that the pres-
ence or absence of states of affairs at the protagonist’s Now immediately affects
accessibility. Participants read sentence pairs such as (8a) or (8b).
(8) a. Thomas was programming his computer. When his drink spilled,
he continued.
b. Thomas was programming his computer. When his drink spilled,
he stopped.
After reading the second sentence, participants were tested for the accessibility
of the activity mentioned in the first sentence (e.g., programming). The activ-
ity proved to be more accessible when the second sentence stated that it was
still going on at the protagonist’s Now at the time of testing, as in (8a), com-
pared to when the second sentence stated a discontinuation of the activity, as
in (8b).
Additional evidence for the effect of temporal presence on mental accessibil-
ity stems from studies examining the role of verb aspect (Carreiras et al., 1997,
Experiment 3; Magliano and Schleich, 2000, Experiment 3). For example, in
the experiment by Magliano and Schleich (2000), participants were presented
with narratives containing a sentence which described an activity of the protag-
onist either in imperfective aspect, as in (9a), or in perfective aspect, as in (9b).
A simulation view account of salience 263
Hence, the sentence either implied that the activity was ongoing or that it was
completed.
The manipulation of verb aspect had an impact on the accessibility of the activ-
ity. The activity was more accessible when it was conveyed with an imperfec-
tive aspect compared to when it was conveyed with a perfective aspect.
The results of the studies reported so far indicate that states of affairs per-
taining to the protagonist’s Now are more easy to access than states of affairs
pertaining to the (far or distant) past. This finding provides strong empirical
evidence for the assumption that the available set of elements at a given time
during text processing is composed of those entities that make up the protago-
nist’s current situation.
Additional support for the assumption comes from studies which compared
the mental accessibility of entities that are present at the current situation to the
mental accessibility of spatially distant entities. Most of the studies which inves-
tigated the impact of spatial distance on mental accessibility employed an ex-
perimental paradigm introduced by Morrow, Greenspan, and Bower (1987) or
Rinck and Bower’s (1995) variant of this paradigm (e.g., Dutke, 2003; Haenggi,
Kintsch, and Gernsbacher, 1995; Morrow, Bower, and Greenspan, 1989; Rinck,
Hähnel, Bower, and Glowalla, 1997; Rinck, Williams, Bower, and Becker,
1996). Participants first memorize the layout of rooms in a building and the
various objects in the rooms. They then read a narrative containing several mo-
tion sentences, such as (10), describing the protagonist’s movement from one
room (source room) through an unmentioned path room to a goal room. After
the presentation of a motion sentence, reading is interrupted by a test of the
mental accessibility of the objects in the rooms.
The typical result of studies using this “Morrow paradigm” is that objects of
the goal room, that is, objects at the protagonist’s current location at the time
of testing, are more accessible than objects of the path room, which in turn are
more accessible than objects of the source room. This finding fits well with the
assumption that entities pertaining to the protagonist’s current Here are highly
salient. Yet, it is questionable whether the results from studies involving memo-
rizing a spatial layout before reading a text about it can be generalized to natural-
istic reading conditions. The layout learning may have directed the participants’
264 Berry Claus
attention to the spatial properties of the described world (cf. Zwaan, Radvansky,
Hilliard, and Curiel, 1998).4
This objection does not hold for the study by Glenberg and colleagues (1987)
which has been mentioned already in the introduction of this paper. Glenberg
and colleagues presented their participants with short narratives such as (1),
repeated here as (11), without especially encouraging them to attend to spatial
relations.
(11) John was preparing for a marathon in August. After doing a few warm-
up exercises, he put on (associated) / took off (dissociated) his sweat-
shirt and went jogging. He jogged halfway around the lake without too
much difficulty.
Participants were tested for the accessibility of an entity mentioned in the ma-
nipulated sentence (e.g., sweatshirt). It was found that the entity was more ac-
cessible after reading the associated version than after reading the dissociated
version. That is, the probed entity was more accessible when it pertained to the
protagonist’s current Here at the time of testing compared to when it was spa-
tial distant from the protagonist. According to the simulation view account of
salience this result can be attributed to the salience of the protagonist’s Here.
However, one might object that the difference in mental accessibility does
not reflect an effect of spatial presence. Indeed, the associated and dissociated
condition did not only differ with regard to spatial distance but also with re-
gard to the functional importance of the probed entities for the protagonist’s
current situation. Yet, an explanation of the results in terms of functionality (cf.
Radvanyky and Copeland, 2000) would by no means be incompatible with the
simulation view account of salience. As Glenberg and Mathew (1992) pointed
4. Findings from research on spatial text information suggest the conclusion that com-
prehenders do not normally construct spatial representations (at least not detailed
ones), unless such representations are necessary with regard to the specific task de-
mands or personal reading goals or are easy to construct. However, this conclusion
may be premature considering that in virtually all the studies the material was pre-
sented visually. Visual text presentation as opposed to auditory text presentation can
be expected to be disadvantageous with regard to representing spatial information
about a described situation (cf., Brooks, 1967; Eddy and Glass, 1981). As reading
already involves a spatial task, namely the control of eye movements, it should inter-
fere with the processing of spatial text information. Indeed, empirical results indicate
that comprehenders construct detailed spatial representations of described situations
with auditory text presentation but not with visual text presentation under conditions
where neither the instruction, nor the materials, nor the experimental task highlighted
spatial information (Claus and Kelter, 2009).
A simulation view account of salience 265
out in their reply to McKoon and Ratcliff (1992), “what make an object or event
salient are its relations to the other objects and events and our knowledge about
what those relations imply for further action”.
Mental simulations of described situations are assumed to be perspectival
and highly selective, as are representations of directly experienced situations.
Hence, the available set of entities at a given time during processing is assumed
to be determined by the protagonist’s spatial and mental perspective. Thus, it
should include, in particular, those entities which are functionally related to the
protagonist.
(13) Mr. Ranzini was sitting outside on his front stoop. He had lived on
this block for over 30 years. Next door was a local playground for the
children. Directly across the street was the mailbox that he used. As
usual, Mrs. Rosaldo was taking her poodle for a walk.
unblocked: Suddenly, a man on a bike rode up in front of Mr. Ranzini.
blocked: Suddenly, a large truck pulled up in front of Mr. Ranzini.
At the end of the narrative, participants had to respond to a probe question about
the critical entity (e.g., Was there a mailbox across the street?). Response la-
tencies to the probe question were found to be shorter after reading the un-
blocked version of the last sentence compared to after reading the blocked ver-
sion. This result suggests that entities which are visible to the protagonist are
mentally more accessible than entities which are occluded from the protago-
nist’s view.
Hence, there is some, albeit limited, support for the assumption, that the
available set of entities at a given time during narrative comprehension, is de-
termined by the protagonist’s perceptual perspective. Let’s now turn to studies
which demonstrate effects of the protagonist’s mental perspective on accessi-
bility.
Results of a study using the “Morrow paradigm” (see section 4.2) indicate
that the mental accessibility of entities does not only depend on the protago-
nist’s current spatial location but mainly on his or her mental location, that is, a
location pertaining to the protagonist’s thoughts (Morrow et al., 1989). Partici-
pants were presented with sentences, such as (14), describing the protagonist’s
thoughts which involved a particular room.
A simulation view account of salience 267
(14) He thought the library should be rearranged to make room for a dis-
play of current research.
It was found that after reading such a sentence, objects located in the room that
the protagonist was thinking about were more accessible than objects in any
other room.
Additional support for the impact of the protagonist’s mental perspective on
accessibility comes from studies concerning the significance of protagonists’
goals. Empirical findings of several studies indicate that protagonists’ goals are
highly accessible (e.g., Dopkins, Klin and Myers, 1993; Huitema, Dopkins, Klin
and Myers, 1993; Suh and Trabasso, 1993). Other findings suggest that this only
holds as long as the protagonist’s goal is not satisfied. Goal-related information
was found to be more accessible when it pertains to a failed goal compared to
when it pertains to a completed goal (Lutz and Radvansky, 1997; Radvansky,
and Curiel, 1998; Suh and Trabasso, 1993).5
In the study by Radvansky and Curiel, participants read narratives such as
(15). There were two versions of the narrative which differed with respect to
whether an initially mentioned goal of the protagonist (buying a retirement gift)
was failed or completed.
(15) Once there was a bank teller named Roy. Roy realized his boss was
retiring in four days. He wanted to give her a retirement gift. He went
to the department store.
failed goal: He couldn’t find anything nice enough. He felt dis-
couraged.
completed goal: He bought a nice big-screen TV for his boss. He felt
pretty good.
Accessibility of the protagonist’s (ongoing or completed) goal was tested by
measuring response latencies to a probe question (e.g., Had Roy wanted to buy
his boss a gift?). Response were found to be shorter after reading that the goal
was failed compared to after reading that the goal was satisfied.
The findings of the studies reported in this section fit well with the assump-
tion that the available set of entities is shaped by the protagonist’s point of view.
They indicate that the mental accessibility of an entity is affected by its relation
to the protagonist’s spatial and mental perspective.
5. This resembles an effect found for non-linguistic cognition. People remember un-
completed tasks better than completed tasks (Zeigarnik, 1927).
268 Berry Claus
5. Conclusion
The present paper took a look at the issue of salience from the perspective of the
simulation view of language comprehension. According to the simulation view,
extra-linguistic salience derives from the mental simulations constructed during
language comprehension. The available set of entities at a given moment during
processing is assumed to be determined by the mental simulation of the situa-
tion at the current narrative Now. The studies reported in the previous sections
provide empirical support for this assumption.
It should be noted, that the simulation view account of salience by no means
implies a denial of the impact of linguistic devices on salience. Mental simu-
lations of described situations are constructed through language. They are in-
structed by linguistic means. Hence, the simulation view account of salience is,
in principle, not incompatible with accounts of salience in terms of linguistic
factors. It would be interesting to see whether and how both types of accounts
might benefit from each other.
In future research, it needs to be clarified whether and how simulation-based
salience affects the resolution of referential expressions and how this interacts
with effects of linguistic salience. There are already some promising empirical
findings in this regard. Results of a sentence-completion study by Stevenson,
Crawley and Kleinman (1994) indicate a preference to resolve an ambiguous
pronoun with the thematic role that is associated with the consequences of the
precedingly described event (i.e., goal, patient) rather than with the thematic
role that is associated with the starting point (i.e., source, agent) – even when
order of mention/syntactic function is controlled for. Results of Morrow (1985)
also suggest that reference resolution depends more on the event structure than
on the linguistic surface structure. He investigated the resolution of ambiguous
definite noun phrases (e.g., He noticed the room was dark) after reading a sen-
tence that described that a character moved from one room to another room.
Antecedent choices were determined more by grammatical aspect and preposi-
tion that implied the mover’s current location (e.g., John walked past the living
room into the kitchen vs. John was walking past the living room to the kitchen)
than by the order of mention. Remarkably, an additional experiment indicated
that the referential interpretation of proximal and distal demonstratives (e.g.,
this room vs. that room) was affected by spatial and/or temporal distance in
the described world rather than by surface linguistic distance. However, Mor-
row’s findings as well as the findings by Stevenson and colleagues are based on
offline measures. In future studies on the effects of linguistic versus simulation-
based salience, it would be expedient to investigate which factors influence the
time course of reference resolution during online comprehension. For exam-
A simulation view account of salience 269
6. Overall, the proportion of pronoun uses was higher for subject referents than for
object referents.
7. Exceptions are, for example, focussed entities in situated language processing and
probably also nuclear implicit referents in dialogue utterances (for empirical evi-
dence, see Cornish, Garnham, Cowles, Fossard, and André, 2005).
270 Berry Claus
References
Buccino, Giovanni, Lucia Riggio, Gabriele Melli, Ferdinand Binkofski, Vittorio Gallese,
and Giacomo Rizzolatti
2005 Listening to action-related sentences modulates the activity of the mo-
tor system: A combined TMS and behavioral study. Cognitive Brain
Research 24: 355–363.
Carreiras, Manuel, Núria Carriedo, María A. Alonso, and Angel Fernández
1997 The role of verb tense and verb aspect in the foregrounding of infor-
mation during reading. Memory & Cognition 25: 438–446.
Claus, Berry
2008 Comprehending descriptions of non-factual desired situations: Dis-
course referents and motor actions. In Proceedings of the Workshop
Constraints in Discourse 3, Anton Benz, Peter Kühnlein, and Man-
fred Stede (eds.). Potsdam, Germany.
Claus, Berry and Stephanie Kelter
2006 Comprehending narratives containing flashbacks: Evidence for tem-
porally organized representations. Journal of Experimental Psychol-
ogy: Learning, Memory, and Cognition 32: 1031–1044.
Claus, Berry and Stephanie Kelter
2009 Embodied language comprehension: The processing of spatial infor-
mation during reading and listening. In Advances in Psychology Re-
search, Vol. 59, Alexandra M. Columbus (ed.), 1–44. Hauppauge,
NY: Nova Science.
Cornish, Francis, Alan Garnham, H. Wind Cowles, Marion Fossard, and Virginie André
2005 Indirect anaphora in English and French: A cross-linguistic study of
pronoun resolution. Journal of Memory and Language 52: 363–376.
de Vega, Manuel, José M. Díaz, and Immaculada León
1997 To know or not to know: Comprehending protagonists’ beliefs and
their emotional consequences. Discourse Processes 23: 169–192.
Dopkins, Stephen, Celia Klin, and Jerome L. Myers
1993 Accessibility of information about goals during the processing of nar-
rative texts. Journal of Experimental Psychology: Learning, Memory,
& Cognition 19: 70–80.
Dutke, Stephan
2003 Anaphor resolution as a function of spatial distance and priming: ex-
ploring the spatial distance effect in situation models. Experimental
Psychology 50: 270–284.
Eddy, John K., and Arnold L. Glass
1981 Reading and listening to high and low imagery sentences. Journal of
Verbal Learning and Verbal Behavior 20: 333–345.
272 Berry Claus
Filchenko, Andrey Y.
this vol. Parenthetical agent-demoting constructions in Eastern Khanty: Dis-
course salience vis-à-vis referring expressions. this volume, 57–79.
Franklin, Nancy and Barbara Tversky
1990 Searching imagined environments. Journal of Experimental Psychol-
ogy: General 119: 63–76.
Gernsbacher, Morton Ann, H. Hill Goldsmith, and Rachel R.W. Robertson
1992 Do readers mentally represent fictional characters’ emotional states?
Cognition & Emotion 6: 89–111.
Gernsbacher, Morton Ann and Rachel R.W. Robertson
1992 Knowledge activation versus sentence mapping when representing
fictional characters’ emotional states. Language and Cognitive Pro-
cesses 7: 353–371.
Glenberg, Arthur M.
1997 What memory is for. Behavioral and Brain Sciences 20: 1–55.
Glenberg, Arthur M., David Havas, Raymond Becker, and Mike Rinck
2005 Grounding language in bodily states: The case for emotion. In
Grounding cognition: The role of perception and action in memory,
language, and thinking, Diane Pecher and Rolf A. Zwaan (eds.), 115–
128. Cambridge: Cambridge University Press.
Glenberg, Arthur M. and Michael P. Kaschak
2002 Grounding language in action. Psychonomic Bulletin & Review 9:
558–565.
Glenberg, Arthur M., and Shashi Mathew
1992 When minimalism is not enough: Mental models in reading compre-
hension. Psycoloquy 3(64), reading-inference-2.1
Glenberg, Arthur M., Marion Meyer, and Karen Lindem
1987 Mental models contribute to foregrounding during text comprehen-
sion. Journal of Memory and Language 26: 69–83.
Glenberg, Arthur M., Marc Sato, and Luigi Cattaneo
2008 Use-induced motor plasticity affects the processing of abstract and
concrete language. Current Biology 18: R290–R291.
González, Julio, Alfonso Barros-Loscertales, Friedemann Pulvermüller, Vanessa Mese-
guer, Ana Sanjuán, Vicente Belloch, and César Ávila
2006 Reading cinnamon activates olfactory brain regions. NeuroImage 32:
906–912.
Graesser, Arthur C., Murray Singer, and Tom Trabasso
1994 Constructing inferences during narrative text comprehension. Psy-
chological Review 101: 371–395.
A simulation view account of salience 273
Ramm, Wiebke
this vol. Discourse-structural salience from a cross-linguistic perspective: Co-
ordination and its contribution to discourse (structure). this volume,
143–172.
Rinck, Mike and Gordon H. Bower
1995 Anaphora resolution and the focus of attention in situation models.
Journal of Memory and Language 34: 110–131.
Rinck, Mike, Andrea Hähnel, Gordon H. Bower, and Ulrich Glowalla
1997 The metrics of spatial situation models. Journal of Experimental Psy-
chology: Learning, Memory, and Cognition 23: 622–637.
Rinck, Mike and Ulrike Weber
2003 Who when where: An experimental test of the event-indexing model.
Memory & Cognition 31: 1284–1292.
Rinck, Mike, Pepper Williams, Gordon H. Bower, and Eni S. Becker
1996 Spatial situation models and narrative understanding: Some general-
izations and extensions. Discourse Processes 21: 23–55.
Rose, Ralph L.
this vol. Joint information value of syntactic and semantic prominence for sub-
sequent pronominal reference. this volume, 81–103.
Sanford, Anthony J., Michael Clegg, and Asifa Majid
1998 The influence of types of character on processing background infor-
mation in narrative discourse. Memory & Cognition 26: 1323–1329.
Sanford, Anthony J. and Simon C. Garrod
1981 Understanding written language: explorations of comprehension be-
yond the sentence. New York: John Wiley.
Speer, Nicole K. and Jeffrey M. Zacks
2005 Temporal changes as event boundaries: Processing and memory con-
sequences of narrative time shifts. Journal of Memory and Language
53: 125–140.
Stevenson, Rosemary J., Rosalind A. Crawley, and David Kleinman
1994 Thematic roles, focus and the representation of events. Language and
Cognitive Processes 9: 519–548.
Suh, Soyoung and Tom Trabasso
1993 Inferences during reading: Converging evidence from discourse anal-
ysis, talk-aloud protocols, and recognition priming. Journal of Mem-
ory and Language 32: 279–300.
Taylor, Larry J. and Rolf A. Zwaan
2008 Motor resonance and linguistic focus. Quarterly Journal of Experi-
mental Psychology 61: 896–904.
A simulation view account of salience 277
Language index
aboutness 35, 37, 59, 119, 128, 146– contrast (contrastivity, contrastiveness)
147, 182–183, 195, 229–242, 245– 16–17, 39, 41, 123, 161, 239, 244
246, 265–267 coordination 4, 13–15, 143–169, 174–
accessibility 4–5, 14, 19, 59, 71, 107– 175, 183, 185–186, 188, 192–195
111, 134, 174, 180–184, 251–252,
258, 261–267 definiteness 5, 45, 62, 128–130, 180,
184, 186, 205, 207–209, 219–220,
activation 5, 9–10, 31–42, 52, 59–60,
222, 268
62, 107, 109, 208, 254, 257
discourse prominence 9, 57, 76, 107,
agenthood (agentivity, see semantic
110–111, 115
roles) 10, 57, 60–65, 68, 71–76, 82,
distance (recency of mention) 16, 18,
86–87, 93–95, 117
32–33, 35–37, 39, 51–52, 83, 113–
animacy (animate, inanimate) 33, 42, 117, 174, 215, 221–223, 242, 251–
45–50, 62, 74, 94, 173, 191 252, 261, 263–264, 268
discourse-new (see new) 15, 31, 174, persistence (thematic prominence) 46–
183, 236 48, 51, 61, 76, 81, 115, 122
discourse-old (see given) 15, 229, pronominalization 6–8, 61, 76, 90–92,
232, 236, 239 96, 113, 115–118, 120–121, 127–
hearer-new (see new) 119, 236–237 130, 132
hearer-old (see given) 236–237 property salience 125–126
feature salience 125–126 prosody 17, 63, 107, 229, 150, 229, 242,
focusing (semantic focusing) 10, 17, 82, 245
145, 165
referential choice (choice of referring
given (givenness) 3, 5, 9, 15, 32, 34, 35, expressions) 9, 11, 31–52, 76, 83,
105, 107–109, 111, 115, 123, 130, 105–134, 269, 173
134, 146, 173–174, 180, 182–184, referentiality 5, 35–37, 74, 88, 184,
190–191, 193, 258 253–254, 268
grammatical roles (syntactic relations) relevance 1, 9, 31, 35, 38, 47–49, 52,
5–8, 11, 33, 57, 59–66, 73–74, 76, 59, 64, 106–107, 134, 146, 149–150,
81–97, 105–134, 144, 173, 216, 231, 161, 180, 230, 252
235, 269
selective attention 16
hearer salience 11, 84, 107–134 semantic roles (thematic roles) 10–11,
17, 57, 59–60, 63–67, 76, 81–99,
identifiability 35–36, 60, 71
150, 167, 268–269
importance 1, 4, 45–46, 94, 107–108,
socio-cultural factors 66–73
115, 122–123, 134, 174, 179, 192,
speaker salience 11, 84, 107–134
242, 260, 264
subordination 4, 13–15, 93, 143–169,
imposed salience 9, 109
174–175, 185–188, 195, 243, 258
inherent salience 9, 109
markedness 81, 109–112, 163, 229, 232, topicality 5, 9, 59, 61–64, 71–77, 105,
233, 235, 238, 240–241, 244–247 107, 111, 115–118, 120, 126, 134,
182, 252
new (newness, newsworthiness) 3, 15, typographic marks 4, 17, 229, 242, 245–
31, 45–46, 51, 60–62, 107, 111, 119, 246
128, 134, 146, 150, 173–174, 180–
184, 186, 190–193, 236–237, 256 visual salience 16, 18, 83, 107, 123,
nuclearity (nucleus-satelite relationships 126, 205–225, 231, 237, 253
in RST) 14, 143, 146, 148–149,
154–155, 166–168, 185 word order (order of mention) 8, 11, 57,
60, 76, 81–82, 98, 105–134, 143–
perceptual salience 15–17, 109, 133– 144, 153, 159, 164, 168, 174–192,
134, 205, 208, 258 195, 236, 245, 268
Subject index 281
Subject index
active-direct voice 59–65, 71, 74–75, Centering transitions 6–8, 94, 113, 120,
106 128, 131, 235–236, 243
attention 1, 5–7, 9, 11, 16–17, 32, 34– coherence 3, 6–8, 10, 12, 19, 76, 82,
35, 61, 105–109, 111, 115, 119, 111, 113, 229, 132, 134, 174–175,
122–124, 127, 131, 133–134, 148, 180, 185, 195, 229–231, 237, 241,
179, 207–209, 211–215, 224–225, 243, 246–247, 256
233–234, 236–239, 245–247, 264 common ground 5, 108–109, 123, 131,
208–209
Background (discourse structure) 145– continuity
150, 154, 156, 168 continuation (discourse relation)
background (information structure) 3, 186–187, 195
110, 143, 145–146, 149, 180 continue (Centering transition) 6–8,
backgrounding (salience demotion, 61, 65, 74, 76, 133, 234, 236
downgrading, defocusing) 10, 57, topic continuity 150–151, 187
59–62, 64–66, 68–69, 71–76, 99,
110, 144–146, 151, 162–166, 174, deixis 17, 126, 258
184–185 demonstrative 9–10, 31–52, 123–124,
backward-looking (anaphoric aspects of 153, 173, 205, 268
discourse processing) 3, 5–11, 108, discourse relations (coherence relations,
111, 113–114, 120–121, 127–131, rhetorical relations) 1, 4, 11–15, 43,
147–148, 154, 234–236, 238 144–155, 159–164, 166, 168, 173–
bridging 131, 181 195
Discourse Representation Theory (DRT)
center of attention (see focus of atten- 12, 14, 253
tion) 6, 106, 131, 239, 245 discourse structure 2–4, 11–15, 97, 143–
center (Centering) 5–8, 19, 59, 61, 63, 169, 174–175, 183–190, 195, 208,
65, 76, 113–114, 120–121, 127–131, 225, 230, 241, 243
216, 232–241, 245–247
backgrounded center 59, 65, 236 ergative voice construction 10–11, 58,
backward-looking center 5–8, 113– 59, 61–63, 68–76
114, 120–121, 127–131, 234–
236, 238 focus (information structure) 3–4, 10,
foregrounded center 59, 63, 65, 236 17, 61–62, 72, 92, 143, 146, 162,
forward-looking centers 5–7, 15, 18– 178, 180, 183, 193
19, 113–114, 116, 120, 132, 234– focus (of attention) 1, 5, 9, 18, 35–36,
237, 239–241, 244–246 106, 145, 208, 211, 214–215, 222,
preferred center 6–7, 61, 63, 113, 234, 239, 247
120–121, 229–247 focus stack model 208, 225
Centering Theory 2–3, 5–10, 18–19, 59, foreground 132, 145, 147–148
61, 76, 83, 105, 111, 113–122, 126– foregrounding (salience promotion) 57,
134, 216, 229–247 59, 63–65, 73–76, 107, 109, 110–
282 Index
reference resolution (anaphor resolution) topic 3, 5, 8, 10, 35, 42, 59, 61–66, 74–
4, 8, 14, 17–18, 97, 113, 128, 130, 75, 92, 127–128, 145, 147, 178, 180,
133, 205–225, 236–237, 243–244, 182–184, 187, 192, 194–195, 231,
253, 268–269 233–236, 252
referent (discourse referent) 2–3, 6–9,
18, 31–39, 42, 44–52, 57, 59–65, 68, visual-world paradigm 16, 269