Chap1 - Introduction To DSS and XML
Chap1 - Introduction To DSS and XML
Semistructured Data
3rd year LMD
Department of computer science
University of Laghouat
Academic year 2023/2024
Different types of Data
o Digital data can be classified into three forms:
Unstructured
Semi-structured
Structured
Unstructured Data
o This is the data which does not conform to a data model or
is not in a form which can be used easily by a computer
program.
o Videos, audio, and binary data files might not have a specific
structure. They’re assigned to unstructured data.
Structured Data
o Structured data refers to the data which has a pre-defined
data model/schema/structure.
o This is the data which does not conform to a data model but
has some structure.
o the code between the <?xml and the ?> is called an XML
declaration.
o Just as in HTML, they begin with <!-- and end with the first
occurrence of -->.
o Note that you may not use entity references inside a CDATA
marked section.
XML Namesapces
XML namespace is a collection of XML elements and
attributes identified by a URI.
<BOOKS>
<b:BOOK xmlns:b="www.book.com"
xmlns:m="urn:Finance:Money">
<b:TITLE>Creepy Crawlies</b:TITLE>
<b:PRICE m:currency="US Dollar">22.95</b:PRICE>
</b:BOOK>
</BOOKS>
Example
<doc xmlns:m="http://www.w3.org/1998/MathML"
xmlns:xlink="http://www.w3.org/1999/xlink">
<title>My Document</title>
<body xmlns="http://www.w3.org/1999/xhtml">
<p xlink:href="alternate.xml">I am a paragraph containing some mathematics:
<m:math><m:mi>x</m:mi></m:math>
</p>
</body>
</doc>
o The 'href' attribute is in the namespace associated with the 'xlink' prefix declared on the
'doc' element.
o The 'math' element is in the namespace associated with the 'm' prefix declared on the
Well Formed XML Documents &
Valid XML Documents
o An XML document with correct syntax is called "Well
Formed".
o The XML Parser is designed to read the XML and create a way
for programs to use XML.