Propositions of Conventions For RDF: (Postcript Version)
Propositions of Conventions For RDF: (Postcript Version)
RDF is a general model and representing a piece of information in RDF/XML can be done in many
lexical/syntactic/structural/ontological ways. Unfortunately, these different representations often cannot
be automatically compared with each other and therefore retrieved, merged or reused. We cannot expect
the metadata providers to follow the same schemas and this would not be prevent incomparable
syntactic/structural variations. Metadata providers (including schema creators) need to follow
conventions. Here are some propositions.
Similarly, writing statements using only nouns, compound nouns or verb nominal forms makes these
statements more explicit. Furthermore, with this convention, the number of lexical and structural
possibilities to express these statements is significantly reduced (i.e. the choices of classes, properties
and ways they can be combined are reduced). Therefore, there is a stronger possibility that statements
can be automatically matched, and thus retrieved, merged or reused.
Why Nouns?
Metadata providers often use names of attributes/characteristics (e.g. of physical characteristics such as
mass and color) as relation names. This practice would not be a problem if all attributes could be
represented as properties and organized via subPropertyOf relations. Unfortunately, after exploring this
option with WordNet [WN], we realized that relatively few attributes can be used as relations.
Therefore, we introduced the class AttributeOrMeasure and classified the top-level WordNet attribute
categories and measure categories under it (it is sometimes difficult to distinguish these two notions, e.g.
Color is an attribute but Red and its corresponding wave length may be seen as a measure).
Though we also provided a few relations such as mass and length, these relations that could be
decomposed/defined using the combination of an instance of AttributeOrMeasure and the relation
attribute (plus possibly the relation ) should be considered as exceptions.
A few role nouns, such as child, creator and driver are also used as relations. However, these
relations are not basic (they refer to processes) and, except for those that are very commonly used,
should be avoided.
As a simple example, instead of writing that a resource X has for type DirectFlight OR IndirectFlight, it
seems better to declare X as an instance of a type Flight that has DirectFlight and IndirectFlight as
exclusive subtypes (i.e. types that cannot have common subtypes or instances). Exclusion links between
types (or between entire statements) are the kinds of negations that can be handled efficiently, and are
included in many expressive but efficient logic models, e.g. Courteous logic on which the BRML is
based.
Additional properties (e.g. "atLeast", "atMost" and "part") would be interesting to specify some
restrictions on the quantification. Here is an example.
<!-- At least 2% of persons like most of cats -->
<forall id="baz" atLeast="2%" var="p" rdf:about="#p">
<if><rdf:type resource="#Person"/>
<then><forall part="most" var="c" about="#c">
<if><rdf:type rdf:resource="#Cat"/>
<then><objectOf><Liking><agent>#p</agent></Liking>
</objectOf></then>
</if>
</forall>
</then>
</if></forall>
Such a construct permits the definition of rules on the instances of a class, or in other words, to associate
definitions to that class. Without restricting properties (e.g. "atLeast", "atMost" and "part"), the
definition specifies relations "necessarily" connected to all instances of that class (that is, necessary
conditions of membership to the class). Using part="most", typical relations can be defined, but more
precision is achieved with percentages (e.g. part="75%" or atLeast="75%").
[RDFSchema] also permits one to define some restrictions on the use of a class by directly connecting
classes via relations. Though this method is convenient for a few well-known special cases
(generalization relations, exclusion relations and relation signatures), the semantics of such connections
is unknown for other cases. Assume for example that two classes Airplane and Wing are connected by a
relation "part". Does this mean that "any airplane has for part a wing" or "any wing is part of a plane" or
"a wing is part of any plane" or "any airplane has for part all the wings"? We propose the first
interpretation be adopted (i.e. the source of the relation is universally quantified and the destination
existentially quantified).
The properties "atLeast" and "atMost" permit the delimitation of intervals. Here is an example.
<!-- Tom is the creator of 10 to 20 documents, including http://www.foo.org/bar -->
<Set rdf:ID="s" atLeast="10" atMost="20"/>
<rdf:li rdf:resource="http://www.foo.org/bar"/>
</Set>
<rdf:Description rdf:aboutEach="#s">
<rdf:type rdf:resource="#Document"/>
<creator><Person rdf:ID="Tom"/></creator>
</rdf:Description>
This last example could also be represented using the relations minimalSize and maximalSize which
are part of the 120 basic relations of our top-level ontology. However, like conventions, if such common
and basic relations are not adopted as standards, the comparison of RDF metadata (and therefore their
retrieval, merge and reuse) will remain problematic.
Acknowledgments
Many thanks to Dr OLivier Corby and Pr Peter Eklund for their readings and corrections of this article.
References
[BernersLee99]
The Semantic Toolbox: Building Semantics on top of XML-RDF
http://www.w3.org/DesignIssues/Toolbox.html
[BRML]
Business Rules Markup Language
http://www.oasis-open.org/cover/brml.html
[CGs]
Conceptual Graphs
http://concept.cs.uah.edu/CG/Standard.html
[CYC]
The Upper Cyc® Ontology
http://www.cyc.com/cyc-2-1/cover.html
[DC]
Dublin Core Metadata Initiative
http://purl.org/dc/
[KIF]
Knowledge Interchange Format (KIF)
http://logic.stanford.edu/kif/kif.html
[MCF/XML]
Meta Content Framework Using XML
http://www.w3.org/TR/NOTE-MCF-XML/#secA.
[RDFMS]
Resource Description Framework (RDF) Model and Syntax, W3C Recommendation, 22 February
1999
http://www.w3.org/TR/1999/REC-rdf-syntax-19990222
[RDFSchema]
Resource Description Framework (RDF) Schema Specification 1.0, W3C Candidate
Recommendation 27 March 2000
http://www.w3.org/TR/2000/CR-rdf-schema-20000327
[WN]
WordNet - a Lexical Database for English
http://cogsci.princeton.edu/~wn/