Module 1
Module 1
the Internet:
– Introduction
– The Syntactic Web
– The Semantic Web
– How the Semantic Web Will Work.
• Ontology in Computer Science
– Defining the Term Ontology
– Differences among Taxonomies
– Thesauri and Ontologies
• Classifying Ontologies
• Web Ontologies
– Web Ontology Description Languages
– Ontology - Categories and Intelligence.
Bibliography
The Semantic Web, Scientific American, May 2001,
Tim Berners-Lee, James Hendler and Ora Lassila.
Breitman, K.K., Casanova, M.A., & Truszkowski, W.
(2007) Semantic web: Concepts, Technologies and
Applications. Springer Verlag, London
http://www.w3.org/
Antoniou, G., Van Harmelen, F. `(2004) “A Semantic
web Primer”(see library or Pdf copy)
Where we are Today: the Syntactic Web
6
Impossible(?) via the Syntactic Web…
• Complex queries involving background knowledge
– Find information about “animals that use sonar but are not either
bats or dolphins” , e.g.,
• Locating information in data repositories Barn
Owl
– Travel enquiries
– Prices of goods and services
– Results of human genome experiments
• Finding and using “web services”
– Visualise surface interactions between two proteins
• Delegating complex tasks to web “agents”
– Book me a holiday next weekend somewhere warm, not too far
away, and where they speak French or English
7
What is the Problem?
• Consider a typical web page:
• Markup consists of:
– rendering information
(e.g., font size and
colour)
– Hyper-links to related
content
• Semantic content is
accessible to humans but
not (easily) to
computers…
8
What information can we see…
WWW2002
The eleventh international world wide web conference
Sheraton waikiki hotel
Honolulu, hawaii, USA
7-11 may 2002
1 location 5 days learn interact
Registered participants coming from
australia, canada, chile denmark, france, germany, ghana, hong kong,
india, ireland, italy, japan, malta, new zealand, the netherlands, norway,
singapore, switzerland, the united kingdom, the united states, vietnam,
zaire
Register now
On the 7th May Honolulu will provide the backdrop of the eleventh
international world wide web conference. This prestigious event …
Speakers confirmed
Tim berners-lee
9
What information can a machine see…
WWW2002
The eleventh international world wide web conference
Sheraton waikiki hotel
Honolulu, hawaii, USA
7-11 may 2002
1 location 5 days learn interact
Registered participants coming from
australia, canada, chile denmark, france, germany, ghana, hong kong, india,
ireland, italy, japan, malta, new zealand, the netherlands, norway, singapore,
switzerland, the united kingdom, the united states, vietnam, zaire
Register now
On the 7th May Honolulu will provide
10
Background
• Human are capable of using the Web to carry out
tasks such as;
– Finding the German word for “Morning”
– Searching and reserving books from library
– Searching for a low price for a DVD
– etc.
• However, a computer cannot accomplish the
tasks without human direction because;
– Web pages are designed to be read by human, not
machines
11
11
Background
• A complex user query can be a big problem in finding
information on the Web.
– A query for “professor in faculty of engineering in universities in
the United States” may return results from the Web pages from
the other countries other than the United States.
• Most documents on the Web not only contain text
– also immense amount of images, sounds, video and other
multimedia files,
– these files are meaningless to computers.
• Semantic Web is a vision of information that is understandable by
computers, so that they can perform more of the tedious works
involved in finding, sharing, and combining information on the
Web.
12
12
Background
• Semantic Web will overcome such the
problems by making the Web not only
human-understandable but also machine-
understandable.
• Semantic Web is a vision of information that
is understandable by computers, so that they
can perform more of the tedious works
involved in finding, sharing, and combining
information on the Web.
13
13
Background
• Tim Berners-Lee expressed the vision of the
Semantic Web as follows:
I have a dream for the Web [in which computers] become capable
of analyzing all the data on the Web – the content, links, and
transactions between people and computers. A ‘Semantic Web’,
which should make this possible, has yet to emerge, but when it
does, the day-to-day mechanisms of trade, bureaucracy, and our
daily lives will be handled by machines talking to machines. The
‘intelligent agents’ people have touted for ages will finally
materialize.
14
14
Semantic Web
• It derives from W3C1 director Sir Tim Berners-Lee's vision
of the Web as a universal medium for data, information,
and knowledge exchange.
1. W3C (World Wide Web Consortium) is a forum for information, commerce, communication, and
collective understanding. W3C was founded in Oct. 1994. 15
15
Semantic Web (Cont’d)
• Statements are built with syntax rules
– The syntax of a language defines the rules for
building the language statements. But how can
syntax become semantic?
16
16
Semantic Web (Cont’d)
• Semantic Web (SW), proposed by W3C, is one of the most
promising and accepted approaches to make the Web
content becomes more machine-readable so that
intelligent agents can retrieve and process information
readily [Dong and Dan 05].
17
17
A Semantic Web — First Steps
Make web resources more accessible to automated processes
• Extend existing rendering markup with semantic markup
– Metadata annotations that describe content/function of web
accessible resources
• Use Ontologies to provide vocabulary for annotations
– “Formal specification” is accessible to machines
18
Why is there a need for the Semantic Web?
• Knowledge Management
– With the large number of documents made available online
by organizations, several document management systems
have entered the market. However, these systems have
severe weaknesses [Fensel et al. 03]:
• Searching information: Existing keyword-based search retrieves
irrelevant information that uses keyword in a context other than the
one in which the searcher is interested.
• Extracting information: Human browsing and reading is currently
required to extract relevant information from information sources, as
automatic agents lack the common sense knowledge required to
extract such information from textual representations and fail to
integrate information spread over different sources.
19
19
Why is there a need for the Semantic Web?
(Cont’d)
• Maintenance: Maintaining weakly-structured text sources is a
difficult and time-consuming activity when such sources become
large.
• Automatic document generation: Adaptive Web sites that enable a
dynamic reconfiguration of information according to user profiles or
other relevant aspects would be very useful.
22
22
How Will the Semantic Web Work?
• In order to organize Web content, AI
researchers proposed a series of conceptual
models.
• The central idea is to categorize information in
a standard way.
– Similar to the solution used to classify living beings
– Biologists use a well-defined taxonomy. Likewise,
computer scientists are looking for similar model
to help structure Web content
23
Themes related to the Semantic Web [Breitman et al. 07]
• Metadata
Concepts • Ontologies
• Ontology Languages
• Web Services
Semantic
Web
Applications Technologies
• Methodologies for
• Software agents
Ontology development
• Semantic desktop • Tools for Ontology
• Geospatial semantic web
development
• Ontology sources
24
How Will the Semantic Web Work?
• Metadata
– Metadata is “data about data”
– They serve to index Web pages and Web sites
in the Semantic Web
– Allowing other computers to acknowledge
what a Web page is about
25
How Will the Semantic Web Work?
• Ontologies
– In computer science, ontologies were adopted
in AI to facilitate knowledge sharing and reuse
[Fensel 01]
– Becoming widespread in areas:
• Intelligent information integration
• Cooperative information systems
• Agent-based software engineering
• E-commerce
26
How Will the Semantic Web Work?
• Ontologies
27
27
How Will the Semantic Web Work?
• Ontology Languages
– Designed to define ontologies
– They are sometimes called:
• Lightweight ontology languages
• Web-based ontology languages
• Markup ontology languages
– RDF (Resource Description Language)
– OWL (Web Ontology Language)
28
How Will the Semantic Web Work?
• Web Services
– Web services will be greatly improved if
semantics is added to the present Web
resources
– Computer will be able to:
• Make doctor appointments
• Synchronize with our agenda
• Find new suppliers for products we consume
• Make travel arrangements
29
How Will the Semantic Web Work?
• Applications of Semantic Web
– Personal Agent in Semantic Web
• Responsible for capturing user preferences, searching
for information on available resources, etc. to provide
answer that meet a user’s query
– Semantic desktop application
– Ontology applications in Art
• Cataloguing online cultural heritage or online museums
– The Hermitage Museum Web site
www.hermitagemuseum.org
30
The semantic web:
some definitions
An ontology is a set of concepts - such as things, events, and relations that are
specified in some way in order to create an agreed-upon vocabulary for exchanging
information. (Tom Gruber, an AI specialist at Stanford University.)
– “explicit” means that “the type of concepts used and the constraints on their use are
explicitly defined”;
– “formal” refers to the fact that “it should be machine readable”;
– “shared” refers to the fact that the knowledge represented in an ontology are agreed
upon and accepted by a group”;
– “conceptualization” refers to an abstract model that consists the relevant concepts and
the relationships that exists in a certain situation
Cook?
You mean
–chef
–information about how to cook something,
–or simply a place, person, business or some other entity with "cook" in its name.
The problem is that the word “rice“ or “cook” has no meaning, or semantic content, to the
computer.
World without ontology = Ambiguity
Example (2)
Ontologies have been proposed to solve the problems that arise from
using different terminology to refer to the same concept or using the
same term to refer to different concepts.
Howard Beck and Helena Sofia Pinto
Motivation (2)
Ontologies provide the required knowledge and representation to search the web in a database fashion
through implicit Boolean search.
What do ontologies look like?
Example: Car-Ad Ontology
Year Price
1..* 1..*
Textual
Benefits of Ontology
• To facilitate communications among people and organisations
aid to human communication and shared understanding by specifying meaning
• To facilitate communications among systems with out semantic ambiguity. i,e to achieve
inter-operability
• Top-level ontologies
• describes very general notions which are independent of a particular problem or domain
• are applicable across domains and includes vocabulary related to things, events, time, space, etc
• Domain ontologies
• knowledge represented in this kind of ontologies is specific to a particular domain such as forestry,
fishery, etc.
• They provide vocabularies about concepts in a domain and their relationships or about the theories
governing the domain.
Depending on the wide range of tasks to which the ontologies are put ontologies can vary in their
complexity
Ontologies range from simple taxonomies to highly tangled networks including constraints associated
with concepts and relations.
•Light-weight Ontology
• concepts
• ‘is-a’ hierarchy among concepts
• relations between concepts
•Heavy-weight Ontology
• cardinality constraints
• taxonomy of relations
• Axioms (restrictions)
Thesauri and Ontology
Similarities
• Both serve the same purpose, namely to provide a shared conceptualisation about a specific
part of the world to different users in order to facilitate an efficient communication of complex
knowledge.
• Both disciplines are based on concept systems representing highly complex knowledge
independent of any language.
• Both are concerned about covering a broad range of terminology used in a particular domain,
and in understanding the relationships among these terms.
• Both utilize a hierarchical organization to group terms into categories and subcategories.
• Little possibility of re-use due to inherent semantic ambiguity and lack of the explicitness of
their semantics .
• Developed for human use. They lack of expressive mechanisms to represent, maintain, and
reason about complex knowledge in an explicit form- interpretation is left for humans.
(Source:http://www.xmluk.org/slides/magic-circle_2002/wilson/XML_UK_SW_Thes/all.htm)
Problems with Thesaurus Modelling
BT/NT relations-AGROVOC
Thesauri have not been constructed with purely defined semantics. It is common for BT/NT
relations within a thesauri to include at least: • MAIZE
• NT dent maize
NT flint maize
•subtype of (e.g. soil/ subsoil) NT popcorn
NT soft maize
•instance (e.g. Development Agency/IDRC)) NT sweet corn
NT waxy maize
•part of (e.g. soil/top soil) • SOIL
•
•role (e.g. Development Agency/Voluntary agency) NT top soil
NT subsoil
•property of (e.g. maize/sweet corn)
• Development Agencies
• NT development banks
NT voluntary agencies
NT IDRC
Problems with Thesaurus Modelling
Equivalence relations – UF, USE
• •DEVELOPMENT AGENCIES
UF/USE - between the descriptor and the non-descriptor (s).
• UF aid institutions 1
• Associative relationship can represent:
1-–Similar but not necessarily identical concept
genuine synonymy, or identical meanings;
– near- synonymy, or similar meanings;
– In some thesaurus, antonym, or opposite meanings; ( eg. Eurovoc)
Problems with Thesaurus Modelling
Associative relations- RT
The RT associative relation is more even open to interpretation than the hierarchical relation
For some thesaurus, it can contain:
– cause and effect
– agency or instrument
– hierarchy - where polyhierarchy has not been allowed the missing hierarchical relationships are
replaced by associative relationships
– sequence in time or space
– constituent elements
– characteristic feature
– object of an action, process or discipline
– location
– similarity (in cases where two near-synonyms have been included as descriptors)
– antonym
RT in AGROVOC
•1- cause
Degradation
and effect
2- characteristic feature
•3- location
RT chemical reactions1
RT discoloration
RT hydrolysis
RT shrinkage
• MAIZE
• RT corn flour
RT corn starch 2
RT zea mays
• IDRC
• BT development agencies
RT canada 3
Thesauri and Ontology
how to migrate
• Ontologies are natural successors of thesauri particular for information retrieval and
knowledge management.
A abstract model
WC3-consortium
“Ontology is a term borrowed from philosophy that refers to
the science of describing the kinds of entities in the world and
how they are related “
Ontologies and Ontology Representations
• “Ontology” – a word borrowed from philosophy
• – But we are necessarily building logical systems
• • “Physical symbol systems”
• – Simon, H. A. (1969, 1981). The Sciences of the Artificial, MIT Press
• • “Concepts” and “Ontologies”/ “conceptualisations” in their
• original sense are psychosocial phenomena
• – We don’t really understand them
• • “Concept representations” and “Ontology representations” are
• engineering artefacts
• – At best approximations of our real concepts and conceptualisations
• (ontologies)
• • And we don’t even quite understand what we are approximating
What Is An Ontology?
• Word borrowed by computing for the
• explicit description of the conceptualisation of a domain:
• – concepts
• – properties and attributes of concepts
• – constraints on properties and attributes
• – Individuals (often, but not always)
• • An ontology defines
• – a common vocabulary
• – a shared understanding
Ontology
– Ontology categorizes concepts (which are defined by a set of
common properties) into classes based on common
characteristics
Lightweight Heavyweight
• Concepts, atomic types • Metaclasses
• Is-a hierarchy • Type constraints on
• Relationships between relations
Concepts • Cardinality constraints
• Taxonomy of relations
• Reified statements
• Axioms
• Semantic entailments
• Expressiveness
• Inference systems
A simple ontology: Animals
Why Develop an Ontology?
Animals Plants
Vertebrates Invertebrates
Ontology of People and their Roles
Typically, we want a richer ontology with more relationships
between concepts:
Employee Contractor
advises
funds
Structure of an Ontology
Ontologies typically have two distinct components:
• Names for important concepts and relationships in the domain
– Elephant is a concept whose members are a kind of animal
– Herbivore is a concept whose members are exactly those
animals who eat only plants or parts of plants
• Background knowledge/constraints on the domain
– Adult_Elephants weigh at least 2,000 kg
– No individual can be both a Herbivore and a Carnivore
Why develop an ontology?
• To define web resources more precisely and make them
more amenable to machine processing
• To make domain assumptions explicit
– Easier to change domain assumptions
– Easier to understand and update legacy data
• To separate domain knowledge from operational knowledge
– Re-use domain and operational knowledge separately
• A community reference for applications
• To share a consistent understanding of what information
means
Types of Ontologies [Guarino, 98]
Describe the
vocabulary related
to a generic Describe the
domain by vocabulary
specializing the related to a
concepts generic task or
introduced in the activity by
top-level ontology. specializing the
top-level
ontologies.