Semantically-enhanced information retrieval model based on multiple merged and dynamically enriched ontologies

Mohammed A. M. Maree
Ontologies are used in various Information Retrieval (IR) and Artificial Intelligence (AI) domains and applications, such as the Semantic Web (SW), Question Answering (QA), knowledge representation and management, Query Expansion (QE), and Natural Language Processing (NLP). They aim to provide a commonly agreed-upon understanding of domains across different communities. In addition, they define concepts and constraints on their use within a specific domain in a formal and [...] manner. Hence, they are considered the key element in enabling interoperability between heterogeneous systems and across various applications. However, the decentralized process of ontology development and the differences in viewpoints between ontology engineers have resulted in the so-called "semantic heterogeneity" problem between ontologies. In this context, conflicts in semantic relations, as well as other mismatches, can be found between the concepts of ontologies that are developed to encode knowledge about the same domain. For example, two or more domain-specific ontologies may use different terms to refer to the same concept, or use the same term to refer to different concepts. To overcome semantic heterogeneity and achieve interoperability between heterogeneous systems, we need to resolve the semantic conflicts and other semantic mismatches between similar and overlapping ontologies. Another key challenge is how to maintain and dynamically enrich the merged ontologies and keep them up to date. To do this, we present a dynamic ontology enrichment model, which integrates semantic and statistical relatedness measures to enrich ontologies with semantically related concepts and instances. Additionally, this thesis explains how ontological background knowledge (represented by multiple merged and further enriched ontologies from various domains) can be reused to support semantic search and retrieval capabilities, namely in a meta-search environment on the Web. It [...]
