Semantic Web
Semantic Web
U N D E RSTA NDING I N B R I E F
INTRODUCTION
W EB OF D OCUMENTS V S . W EB OF D ATA
Dealing with things sensibly and realistically in a way that is based on practical rather than
theoretical considerations.
POLYSEMY: If you get some result for your search and get some other
result as well with different meaning having same or similar name.
A B C D
A B C D
Problems:
From where do we get the required (personal) information?
Personalization vs Data Security
Dbpedia stores & publishes the result in RDF and a few other formats.
It also hosts a community effort to define extractors for the data, that can be used
well beyond Wikipedia.
It provides a number of services around the extracted data, like DBpedia mobile, a
SPARQL endpoint, a faceted browser, a number of mappings to external ontologies,
an ontology itself, etc.
Web of Data:
>31 billion Facts
>500 million Links
(Oct 2011)
USER USER
Personal
Assistant
Intelligent
Infrastructure
Presentation Retrieval Service Services
Service (e.g. (e.g. Google)
Firefox)
www documents
Ankur Biswas www documents 4/4/2016 31
3 Generations of Web Documents
1st Generation 2nd Generation 3rd Generation
Netbots
Java Script/ Applets Information Extraction
Presentation Planning
HTML / CSS
Database Access User Model
Template Based Machine Learning
Generation Online Layout
ID Name Homepage
id_xyz Ghosh, Amitav http://www.amitavghosh.com
London a:author
Harper Collins
a:name
a:homepage
Data export does not necessarily mean physical conversion of the data
relations can be generated on-the-fly at query time
via SQL bridges
scraping HTML pages
extracting data from Excel sheets
etc.
1
ID Titre Traducteur Original
2
ISBN 2020286682 Le Palais des Miroirs $A12$ ISBN 0-00-6511409-X
3
6
ID Auteur
7
ISBN 0-00-6511409-X $A11$
8
10
Nom
11
Ghosh, Amitav
12
Besse, Christianne
Ankur Biswas 4/4/2016 44
2nd: export your second set of data
http://isbn/000651409X Le palais des miroirs
f:auteur
http://isbn/2020386682
f:traducteur
f:nom
f:nom
Ghosh, Amitav
Besse, Christianne
2000
Same URI!
London
a:author
a:name
a:homepage
Le palais des miroirs
f:auteur
Ghosh, Amitav
http://www.amitavghosh.com
http://isbn/2020386682
f:traducteur
f:nom
f:nom
Ghosh, Amitav
Besse, Christianne
London
a:author
Harper Collins f:original
a:name f:auteur
a:homepage
http://isbn/2020386682
f:traducteur
f:nom
f:nom
Ghosh, Amitav
Besse, Christianne
London
a:author http://isbn/2020386682
Harper Collins f:auteur
f:auteur
r:type f:traducteu
f:traducteur
r:type
r:type r
a:name
a:name
a:homepage http://foaf/Person
f:nom a:homepage
f:nom
f:nom
Besse, Christianne
Ghosh, Amitav
http://www.amitavghosh.com
London
a:author
a:author http://isbn/2020386682
Harper Collins f:auteur
r:type f:traducteu
f:traducteur
r
a:name r:type
a:homepage http://foaf/Person
f:nom
f:nom
r:type
w:isbn
Besse, Christianne
Ghosh, Amitav http://www.amitavghosh.com
w:reference http://dbpedia.org/../The_Glass_Palace
foaf:name
w:author_of
http://dbpedia.org/../Amitav_Ghosh
w:born_in http://dbpedia.org/../Kolkata
http://dbpedia.org/../The_Hungry_Tide
w:author_of
w:author_of w:long
w:lat
http://dbpedia.org/../The_Calcutta_Chromosome
Ankur Research
Author Theme
Doc.html
rdf: type
bb: renee-j-miller foaf: Person
Renee J. Miller
foaf: name
dbpedia: Toront0
foaf: Friend of a Friend foaf: based_near
Ankur Biswas 4/4/2016 58
A Simple RDF Example (in RDF/XML)
rdf: type
bb: renee-j-miller foaf: Person
Renee J. Miller
foaf: name
dbpedia: Toront0
foaf: based_near
<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:foaf="http://xmlns.com/foaf/spec/#"
xmlns:bb="http://data.bibbase.org/ontology/">
<rdf:Description rdf:about="http://.../author/renee-j-miller/">
<rdf:type rdf:resource="http://xmlns.com/foaf/spec/#term_Person"/>
<foaf:name xml:lang=en">Rene J. Miller</foaf:name>
<foaf:based_near
rdf:resource="http://dbpedia.org/resource/Toronto"/>
</rdf:Description>
</rdf:RDF>
Renee J. Miller
foaf: name
dbpedia: Toront0
foaf: based_near
<http://data.bibbase.org/author/renee-j-miller/>
rdf:type foaf:person .
foaf:name Rene J. Miller@en ;
foaf:based_near <http://dbpedia.org/resource/Toronto>
Renee J. Miller
foaf: name
dbpedia: Toront0
foaf: based_near
<http://data.bibbase.org/author/renee-j-miller/>
rdf:type foaf:person .
foaf:name Rene J. Miller@en ;
foaf:based_near [ rdf: type foaf:Place;
foaf:name Toronto] .
Result
?name
Rene J. Miller
SPARQL Endpoint
SPARQL Endpoint
Applications
SPARQL Processor
Triple Store Database
NLP Technique
SQL RDF
RDF Graph Relational Database
make your stuff available on the Web (whatever format) under an open
license
make it available as structured data (e.g., Excel instead of image scan of a
table)
make it available in a non-proprietary open format (e.g., CSV as well as of
Excel)
use URIs to denote things, so that people can point at your stuff
As a publisher
Its simple to publish.
You do not have explain repeatedly to others that they can use your data.
As a publisher
You have fine-granular control over the data items and can optimize their access (load balancing,
caching, etc.)
Other data publishers can now link into your data, promoting it to 5 star!
You typically invest some time slicing and dicing your data.
Youll need to assign URIs to data items and think about how to represent the data.
You need to either find existing patterns to reuse or create your own.
As a publisher
You make your data discoverable.
You increase the value of your data.
Your own organization will gain the same benefits from the links as the consumers.
Youll need to invest resources to link your data to other data on the Web.
You may need to repair broken or incorrect links.
Ankur Biswas 4/4/2016 83
Applications
Data integration (e.g., see project Optique http://www.optique-
project.eu/)
E-government (e.g., open data)
E-commerce
Tourism
Medicine
Biology
Earth Observation (see the work of my group in projects TELEIOS
http://www.earthobservatory.eu/ and LEO
http://www.linkedeodata.eu/ ).
Scientific Papers:
Maedche, Alexander. Ontology learning for the semantic web. Vol. 665. Springer Science & Business Media,
2012.
Schmachtenberg, Max, Christian Bizer, and Heiko Paulheim. "Adoption of the linked data best practices in
different topical domains." The semantic webISWC 2014. Springer International Publishing, 2014. 245-260.
Websites
http://dbpedia.org/snorql/
http://5stardata.info/en/
Ankur Biswas 4/4/2016 85
Thank You
Ankur Biswas 4/4/2016 86