Word Sense Disambiguation of Modals (Ext. Abstract)

The document discusses word sense disambiguation of the modal verb "must" in English. It finds that "must" has two main senses - a deontic (obligation) sense and an epistemic (certainty) sense. Two factors are found to help disambiguate between the senses: 1) the verbal construction used with "must", and 2) the person of the subject. An initial experiment using only the verbal construction factor achieved a 81% accuracy rate at automatic disambiguation, which improved slightly to 83% when also adding the subject person factor.

Uploaded by

zerertrty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views

Word Sense Disambiguation of Modals (Ext. Abstract)

Uploaded by

zerertrty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

WORD SENSE DISAMBIGUATION OF MODALS 1.

introduction

(ext. abstract)

The work reported on in this abstract concerns the question of the ambiguity of modal verbs in English. Most modals are ambiguous in that they have more than one distinct sense. The questions addressed in this paper are: What are the senses of modals? What are the disambiguation factors? Are they the same as for lexical disambiguation (as reported in among others Ide and Vronis 1999 and Stevenson and Wilks 2001)? What is the context needed for a successful disambiguation?

In this abstract, I will limit myself to the modal must. Its senses can be clearly defined and the disambiguation factors are relatively clear. In the final version of the paper, other modals such as may, can, and should will be discussed as well. 2. modals and meaning In this abstract I will assume that modals have two distinct senses, exemplified in () below. (1) a. b. John must go to school. John must be in school.

When queried, most speakers of English agree that the modal must has a different meaning in (1a) than it has in (1b). The meaning of must in (1a) is commonly referred to as an instance of deontic modality (the term root modality is also used, see e.g., Coates 1983), while (1b) uses the epistemic sense of must. While finer distinctions are possible (for instance, Palmer 1990 distinguishes between deontic and dynamic modality), the present study will assume that English modals are two-way ambiguous (have two distinct senses that need to be disambiguated). For the moment, this disregards those modals that have more meanings (such as can which also has an ability reading). The question is: if a modal such as must is ambiguous, why do people assign without hesitation a deontic reading to (1a) and an epistemic reading to (1b)? It is quite possible to construct scenarios where sentence (1a) receives an epistemic interpretation and (1b) a deontic one. These readings are not salient, however, and the question is: why not? What about the context makes must in (1a) a deontic modal, but an epistemic one in (1b)? 3. analysis of the corpus Before proceeding to the WSD stage, the corpus was analyzed by hand according to a number of features. The results are reported in De Haan 2003. Two such features will be exemplified here, verbal construction and person of subject. Other factors which may possibly be relevant include negation, subjective vs. objective reading of the modal, and semantic status of the main verb. It is not obvious that knowing the precise meaning of the main verb would help, but it is conceivable that a Vendlerian division of accomplishment, achievement, stative, and activity verbs might be helpful. Such a task has not been added here, but it could easily be done. There were 520 sentences containing the modal must in the Switchboard corpus, of which 11 were ungrammatical or irrelevant to the present study. These were discarded. Of the remaining 509 sentences, 66 were deontic, 412 epistemic, and 31 were indeterminate, which means that even within context it is not possible to assign a definitive interpretation to the modal. The 509 sentences were analyzed on their verbal collocations. An example is shown in (2): (2) Thats the only place I was sore, and I thought, well, I must not be doing them right [S272]

This sentence has the verbal form must be V-ing (where V signifies the main verb) and is coded accordingly. The full list of distributions is shown in Table 1. Table 1 Occurrences of must with verbal complements in the Switchboard corpus Deontic Epistemic Be +V- part. 4 V 54 55 Have V 3 34 Have been V 70 Be 2 195 Be + V-ing 9 Have got + V-ed 2 No V 3 5 Have + V-ed 41 Have been +V-ing 1

Indeterminate 1 7 4 5 1 13

As can be seen from the data in Table 1, this parameter serves very well to disambiguate between the two sense of must. There are only two constructions that are heavily ambiguous, namely must V (with an almost even split between the two senses) and must without accompanying main verb. Perhaps unsurprisingly, the latter construction had the highest number of indeterminate cases. The second parameter looked at is the person of the subject. The results are shown in Table 2 below. Note that no distinction was made between second person singular and plural because the form you is ambiguous in itself. As is to be expected most persons occur more with epistemic modals than with deontic ones due to the overwhelming number of epistemic sentences in the corpus. The one exception to this is the first person singular, which has an overwhelming preference for deontic must. Impersonal subjects include constructions like it must be or there must be. Table 2 Correlation of person and modality in the Switchboard corpus Deontic Epistemic Indeterminate 1 SG 39 14 1 2 SG/PL 10 67 5 3 SG 6 166 13 1 PL 2 13 1 3 PL 7 59 10 No overt subject 1 29 0 Impersonal subject 1 64 1 66 412 31 Total These numbers show that these two parameters are possible disambiguation criteria and a system was designed based on these features. 4. the WSD results The corpus used for this study is the Switchboard corpus, a corpus of spoken American English, which was tagged with POS tags according to the CLAWS C7 scheme. Under this scheme, all modals, excluding catenative modals such as ought to, receive the POS tag VM. This is irrespective of its modal sense, and is just meant as a syntactic tag. The first 100 occurrences of must in the Switchboard that had an unambiguous meaning were hand tagged with their respective sense (this excludes the indeterminate cases). Thus an epistemic instance was coded with the tag VME and a deontic one was coded with VMD. This was used as a training corpus and the system then proceeded to analyze the verbal structure of each sentence based on its POS analysis. The system was therefore not provided with the numbers of Table 1 and 2, but was expected to deduce the information from the linguistic material itself. This turned out to be impossible with the subject, since the

system did not have access to the syntactic structure of the sentence at this point. The subject was hand coded at the second stage. After all errors were smoothed out (this includes programming errors and POS errors) the system, based on just the first criterion, verbal collocation, already had a successful disambiguation rate of about 81%. With the second feature, subject person, added in, the success rate climbed marginally to about 83% (this parameter was only used on the must V and must without main verb constructions, since these were the constructions with the highest level of ambiguity). The main problem lies as expected in the must V construction, which proves very difficult to disambiguate automatically. Also, the system assigns an interpretation to any sentence which is indeterminate. Since these interpretations cannot be checked for accuracy, they bring down the success rate, albeit marginally. The system is currently rerunning the examples, but now Treebank information is added to give more syntactic information. It is still too soon to report on the results of these experiments (but they will be included in the final paper). Current work is also focused on finding the combination of criteria that will yield the highest degree of accuracy. 5. preliminary conclusions The results from this experiment show that a disambiguating system based on a single parameter, verbal collocation, already drastically improves on randomly assigning a meaning. For an automated system that has to make a determination very quickly this might suffice, but it will still make an error 20% of the time on average. For a system which aims to mimic human behavior, more is obviously needed. An additional problem is that must behaves differently in different forms of the language. According to the data in Biber et al. 1999, must is predominately epistemic in the spoken language, but predominately deontic in the written language. This is confirmed in my data: the verb must is epistemic in 79% of all cases in the Switchboard corpus, but only 16% of the time in the written Brown corpus. Must is the only modal for which this is true. It is not quite clear how to deal with that. It is of course possible to construct a training corpus based on a mixture of sentences from the spoken and written language and use these numbers as a basis. This disregards the fact that the choice of speech style itself is a disambiguating factor and can be used as such. In this case we need two sets of data, one for each style and the situation determines which set will be used. References Biber, Douglas, et al. (eds.). 1999. Longman Grammar of Spoken and Written English. London: Longman. Coates, Jennifer. 1983. The semantics of the modal auxiliaries. London: Croom Helm. De Haan, Ferdinand. 2003. must-constructions. Manuscript, University of Arizona. Fellbaum, Christiane (ed.). 1999. WordNet: an electronic lexical database. Cambridge, MA: MIT Press. Ide, Nancy; Jean Vronis. 1998. Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art. Computational Linguistics 24(1), 1-40. Palmer, Frank R. 1990. Modality and the English modals, second edition. London: Longman. Stevenson, Mark; Yorick Wilks. 2001. The Interaction of Knowledge Sources in Word Sense Disambiguation. Computational Linguistics 27(3), 321-49. SWITCHBOARD CORPUS. Linguistic Data Consortium.

Pinout dd13 dd15
89% (19)
Pinout dd13 dd15
10 pages
A Study of The Semantic Function of Modality
No ratings yet
A Study of The Semantic Function of Modality
22 pages
Mood, Modality and Modal Verbs
100% (1)
Mood, Modality and Modal Verbs
34 pages
Cambridge Delta: Focus On Teaching & Learning Grammar
100% (1)
Cambridge Delta: Focus On Teaching & Learning Grammar
12 pages
Introduction to Proof in Abstract Mathematics
From Everand
Introduction to Proof in Abstract Mathematics
Andrew Wohlgemuth
5/5 (1)
Book Review of Modality and Its Interact
No ratings yet
Book Review of Modality and Its Interact
6 pages
Diploma Thesis
No ratings yet
Diploma Thesis
62 pages
3modal Verbs and Modal Adverbs in Chinese
No ratings yet
3modal Verbs and Modal Adverbs in Chinese
28 pages
Corpus-Based Study of The Modal Verbs in The Spoken and Academic Genres of The Corpus of Contemporary American English
No ratings yet
Corpus-Based Study of The Modal Verbs in The Spoken and Academic Genres of The Corpus of Contemporary American English
28 pages
PAPER - Modal Verbs of Obligation in MELC
100% (1)
PAPER - Modal Verbs of Obligation in MELC
15 pages
DF 2 A 33661 A 97 Fdda 1073
No ratings yet
DF 2 A 33661 A 97 Fdda 1073
20 pages
2012 Theperilsoftranslating Englishmodals
No ratings yet
2012 Theperilsoftranslating Englishmodals
21 pages
50 Most Challenging Algebra Problems!
From Everand
50 Most Challenging Algebra Problems!
Andrei Besedin
No ratings yet
Types of Modal Verbs and Their Uses
No ratings yet
Types of Modal Verbs and Their Uses
3 pages
WAMM Avila Mello
No ratings yet
WAMM Avila Mello
6 pages
Modals and Modality in English
No ratings yet
Modals and Modality in English
15 pages
Modal and Modality by Besga PDF
No ratings yet
Modal and Modality by Besga PDF
15 pages
Modality
No ratings yet
Modality
34 pages
24-Article Text-17-1-10-20151206
No ratings yet
24-Article Text-17-1-10-20151206
10 pages
Jorge Arus Hita
No ratings yet
Jorge Arus Hita
17 pages
EN Modality Meanings in Students Argumentat
No ratings yet
EN Modality Meanings in Students Argumentat
6 pages
9783110895339.1
No ratings yet
9783110895339.1
18 pages
Typological Approaches To Modality
No ratings yet
Typological Approaches To Modality
44 pages
Mood, Modality and Modal Verbs
100% (2)
Mood, Modality and Modal Verbs
31 pages
Semantic Modeling In Formal English
From Everand
Semantic Modeling In Formal English
Dr. Ir. Andries Van Renssen
No ratings yet
Coreference: Fundamentals and Applications
From Everand
Coreference: Fundamentals and Applications
Fouad Sabry
No ratings yet
Mood, Modality and Modal Verbs: The Functional Categories of The Verb-Spring Term 2011 - Ileana Baciu
No ratings yet
Mood, Modality and Modal Verbs: The Functional Categories of The Verb-Spring Term 2011 - Ileana Baciu
37 pages
Mood, Modality and Modal Verbs
100% (1)
Mood, Modality and Modal Verbs
42 pages
Immediate download (Ebook) The Semantic Field of Modal Certainty: A Corpus-Based Study of English Adverbs. by A. M. Simon,Vandenbergen, Karin Aijmer ISBN 9783110196177, 3110196174 ebooks 2024
100% (3)
Immediate download (Ebook) The Semantic Field of Modal Certainty: A Corpus-Based Study of English Adverbs. by A. M. Simon,Vandenbergen, Karin Aijmer ISBN 9783110196177, 3110196174 ebooks 2024
81 pages
DLP-8-Q1-M3 Sept. 18-22, 2023 MODALS
No ratings yet
DLP-8-Q1-M3 Sept. 18-22, 2023 MODALS
11 pages
Difficulties in Teaching Modal Verbs PDF
No ratings yet
Difficulties in Teaching Modal Verbs PDF
14 pages
Mood and Modality (2nd Edition) : SIL Electronic Book Reviews 2004-010
No ratings yet
Mood and Modality (2nd Edition) : SIL Electronic Book Reviews 2004-010
3 pages
DLP-8-Q1-Module 3
No ratings yet
DLP-8-Q1-Module 3
8 pages
MODALITY
No ratings yet
MODALITY
21 pages
Modals Verbs
No ratings yet
Modals Verbs
8 pages
Guía de Inglés Iv - Sem-3-2016 PDF
No ratings yet
Guía de Inglés Iv - Sem-3-2016 PDF
41 pages
Upplementary Eading Ore About Modality
No ratings yet
Upplementary Eading Ore About Modality
8 pages
Download Full The Semantic Field of Modal Certainty A Corpus Based Study of English Adverbs A. M. Simon PDF All Chapters
100% (10)
Download Full The Semantic Field of Modal Certainty A Corpus Based Study of English Adverbs A. M. Simon PDF All Chapters
81 pages
Lexical Encoding of Verbs in English and Bulgarian Rositsa Dekova
No ratings yet
Lexical Encoding of Verbs in English and Bulgarian Rositsa Dekova
8 pages
The_Use_of_Modal_Verb_in_English_Writing_Corpus_Vi
No ratings yet
The_Use_of_Modal_Verb_in_English_Writing_Corpus_Vi
7 pages
Urdu Model Verbs
No ratings yet
Urdu Model Verbs
8 pages
Prototext-metatext translation shifts: A model with examples based on Bible translation
From Everand
Prototext-metatext translation shifts: A model with examples based on Bible translation
Bruno Osimo
No ratings yet
Valentina Ilca Modals
100% (2)
Valentina Ilca Modals
40 pages
FON University - First Private University - Skopje Faculty of Applied Foreign Languages - Skopje
No ratings yet
FON University - First Private University - Skopje Faculty of Applied Foreign Languages - Skopje
10 pages
Seminar Paper
No ratings yet
Seminar Paper
10 pages
MA Thesis Teaching Modals
100% (5)
MA Thesis Teaching Modals
89 pages
Modul 6 Questions Direct and Inderect, Question Tags-Rev3
No ratings yet
Modul 6 Questions Direct and Inderect, Question Tags-Rev3
28 pages
Modulating Grammar Through Modality: A Discourse Approach: ELIA I, 2000
No ratings yet
Modulating Grammar Through Modality: A Discourse Approach: ELIA I, 2000
18 pages
Grammar 2 Modals Lesson & Exercises Version B
No ratings yet
Grammar 2 Modals Lesson & Exercises Version B
9 pages
Lewis Proposal
No ratings yet
Lewis Proposal
13 pages
Function Words
No ratings yet
Function Words
8 pages
Modal Verbs
No ratings yet
Modal Verbs
8 pages
A Study On Modality in English-Medium Research Articles: Ton Nu My Nhat, Nguyen Thi Dieu Minh
No ratings yet
A Study On Modality in English-Medium Research Articles: Ton Nu My Nhat, Nguyen Thi Dieu Minh
19 pages
(SFL) Modal Construction
No ratings yet
(SFL) Modal Construction
10 pages
Epistemic Modality and Deontic Modality 2011
No ratings yet
Epistemic Modality and Deontic Modality 2011
24 pages
First Order Logic: Fundamentals and Applications
From Everand
First Order Logic: Fundamentals and Applications
Fouad Sabry
No ratings yet
Question Tags
No ratings yet
Question Tags
1 page
A Contrastive Study of Turkish and English Modality With Reference To Speech Act Theory
No ratings yet
A Contrastive Study of Turkish and English Modality With Reference To Speech Act Theory
22 pages
Sample Proposal
No ratings yet
Sample Proposal
9 pages
Tojdac v080SSE305
No ratings yet
Tojdac v080SSE305
10 pages
Mfa2 Morphology of Verbs 1 2020
No ratings yet
Mfa2 Morphology of Verbs 1 2020
4 pages
Jet Engine Report - Merged
No ratings yet
Jet Engine Report - Merged
16 pages
Teacher Value Added Report
No ratings yet
Teacher Value Added Report
2 pages
Lkitn235b02, C06 - Phase Ii Main
No ratings yet
Lkitn235b02, C06 - Phase Ii Main
14 pages
LabReport2
No ratings yet
LabReport2
2 pages
Sewing Tools
100% (1)
Sewing Tools
2 pages
K 78210250
No ratings yet
K 78210250
2 pages
Study On Behavior of RC Framed Corner Joints Using Fem Technique
No ratings yet
Study On Behavior of RC Framed Corner Joints Using Fem Technique
4 pages
Igneous Rocks
No ratings yet
Igneous Rocks
34 pages
Fms With Arena
No ratings yet
Fms With Arena
6 pages
Subject Verb Agreement (Answer With Clues)
No ratings yet
Subject Verb Agreement (Answer With Clues)
3 pages
Case Cx210b Cx290b Diagnostic
100% (64)
Case Cx210b Cx290b Diagnostic
5 pages
Oral Pathology, Lecture 1
No ratings yet
Oral Pathology, Lecture 1
64 pages
Core_v6.0_Vol0
No ratings yet
Core_v6.0_Vol0
212 pages
ABB Ability™ System 800xa
No ratings yet
ABB Ability™ System 800xa
8 pages
Dustbin Assembly Coloured
No ratings yet
Dustbin Assembly Coloured
2 pages
Vacuum Casting
No ratings yet
Vacuum Casting
2 pages
Reports: What Are Schur Complements, Anyway? David Carlson
No ratings yet
Reports: What Are Schur Complements, Anyway? David Carlson
19 pages
Systematics of NDE Reliability - A Practical Point of View
No ratings yet
Systematics of NDE Reliability - A Practical Point of View
6 pages
12 Class
No ratings yet
12 Class
5 pages
Mathematical Modeling of Electrical Systems: Nader Sadegh
No ratings yet
Mathematical Modeling of Electrical Systems: Nader Sadegh
10 pages
Cruise Control (1GD-FTV, 2GD-FTV), ECT and A/T Indicator (1GD-FTV, 2GD-FTV), Engine Control (1GD-FTV, 2GD-FTV)
100% (1)
Cruise Control (1GD-FTV, 2GD-FTV), ECT and A/T Indicator (1GD-FTV, 2GD-FTV), Engine Control (1GD-FTV, 2GD-FTV)
7 pages
The Purloined Letter - Edgar Allan Poe
No ratings yet
The Purloined Letter - Edgar Allan Poe
10 pages
Funai Led32 h9000m lc9 PDF
No ratings yet
Funai Led32 h9000m lc9 PDF
69 pages
Wa0082
No ratings yet
Wa0082
4 pages
Assign-1 130601 SUR
No ratings yet
Assign-1 130601 SUR
4 pages
Jhulelal Institute of Technology
No ratings yet
Jhulelal Institute of Technology
9 pages
Report Simulation Parking System - Group8 PDF
No ratings yet
Report Simulation Parking System - Group8 PDF
53 pages
Bha - TPN DZ.06 (TPN 223)
No ratings yet
Bha - TPN DZ.06 (TPN 223)
22 pages

Word Sense Disambiguation of Modals (Ext. Abstract)

Uploaded by

Word Sense Disambiguation of Modals (Ext. Abstract)

Uploaded by

WORD SENSE DISAMBIGUATION OF MODALS 1.

You might also like