Showing 1–50 of 52 results for author: Horrocks, I

  1. arXiv:2406.10964  [pdf, other]

    cs.AI

    Ontology Embedding: A Survey of Methods, Applications and Resources

    Authors: Jiaoyan Chen, Olga Mashkova, Fernando Zhapa-Camacho, Robert Hoehndorf, Yuan He, Ian Horrocks

    Abstract: Ontologies are widely used for representing domain knowledge and meta data, playing an increasingly important role in Information Systems, the Semantic Web, Bioinformatics and many other domains. However, logical reasoning that ontologies can directly support are quite limited in learning, approximation and prediction. One straightforward solution is to integrate statistical analysis and machine l…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: A comprehensive survey on ontology embedding

  2. arXiv:2402.17897  [pdf, other]

    cs.CL cs.IR

    A Language Model based Framework for New Concept Placement in Ontologies

    Authors: Hang Dong, Jiaoyan Chen, Yuan He, Yongsheng Gao, Ian Horrocks

    Abstract: We investigate the task of inserting new concepts extracted from texts into an ontology using language models. We explore an approach with three steps: edge search which is to find a set of candidate locations to insert (i.e., subsumptions between concepts), edge formation and enrichment which leverages the ontological structure to produce and enhance the edge candidates, and edge selection which…

    Submitted 4 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 20 pages, 3 figures, accepted for ESWC 2024

    ACM Class: I.2.7; I.2.4

  3. arXiv:2401.11374  [pdf, other]

    cs.CL cs.AI cs.LG

    Language Models as Hierarchy Encoders

    Authors: Yuan He, Zhangdie Yuan, Jiaoyan Chen, Ian Horrocks

    Abstract: Interpreting hierarchical structures latent in language is a key limitation of current language models (LMs). While previous research has implicitly leveraged these hierarchies to enhance LMs, approaches for their explicit encoding are yet to be explored. To address this, we introduce a novel approach to re-train transformer encoder-based LMs as Hierarchy Transformer encoders (HiTs), harnessing th…

    Submitted 20 January, 2024; originally announced January 2024.

  4. arXiv:2312.11297  [pdf, other]

    cs.DB cs.AI

    Optimised Storage for Datalog Reasoning

    Authors: Xinyue Zhang, Pan Hu, Yavor Nenov, Ian Horrocks

    Abstract: Materialisation facilitates Datalog reasoning by precomputing all consequences of the facts and the rules so that queries can be directly answered over the materialised facts. However, storing all materialised facts may be infeasible in practice, especially when the rules are complex and the given set of facts is large. We observe that for certain combinations of rules, there exist data structures…

    Submitted 19 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 19 pages

  5. arXiv:2309.07172  [pdf, other]

    cs.AI cs.CL cs.LG

    Exploring Large Language Models for Ontology Alignment

    Authors: Yuan He, Jiaoyan Chen, Hang Dong, Ian Horrocks

    Abstract: This work investigates the applicability of recent generative Large Language Models (LLMs), such as the GPT series and Flan-T5, to ontology alignment for identifying concept equivalence mappings across ontologies. To test the zero-shot performance of Flan-T5-XXL and GPT-3.5-turbo, we leverage challenging subsets from two equivalence matching datasets of the OAEI Bio-ML track, taking into account c…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted at ISWC 2023 (Posters and Demos)

  6. arXiv:2307.03067  [pdf, other]

    cs.AI cs.CL cs.LG cs.LO

    DeepOnto: A Python Package for Ontology Engineering with Deep Learning

    Authors: Yuan He, Jiaoyan Chen, Hang Dong, Ian Horrocks, Carlo Allocca, Taehun Kim, Brahmananda Sapkota

    Abstract: Integrating deep learning techniques, particularly language models (LMs), with knowledge representation techniques like ontologies has raised widespread attention, urging the need of a platform that supports both paradigms. Although packages such as OWL API and Jena offer robust support for basic ontology processing features, they lack the capability to transform various types of information withi…

    Submitted 8 March, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted by the Semantic Web Journal

  7. Ontology Enrichment from Texts: A Biomedical Dataset for Concept Discovery and Placement

    Authors: Hang Dong, Jiaoyan Chen, Yuan He, Ian Horrocks

    Abstract: Mentions of new concepts appear regularly in texts and require automated approaches to harvest and place them into Knowledge Bases (KB), e.g., ontologies and taxonomies. Existing datasets suffer from three issues, (i) mostly assuming that a new concept is pre-discovered and cannot support out-of-KB mention discovery; (ii) only using the concept label as the input along with the KB and thus lacking…

    Submitted 1 September, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 5 pages, 1 figure, accepted for CIKM 2023. The dataset, data construction scripts, and baseline implementation are available at https://zenodo.org/record/8228005 (Zenodo) and https://github.com/KRR-Oxford/OET (GitHub)

    ACM Class: I.2.7

  8. arXiv:2306.04814  [pdf, ps, other]

    cs.AI

    Revisiting Inferential Benchmarks for Knowledge Graph Completion

    Authors: Shuwen Liu, Bernardo Cuenca Grau, Ian Horrocks, Egor V. Kostylev

    Abstract: Knowledge Graph (KG) completion is the problem of extending an incomplete KG with missing facts. A key feature of Machine Learning approaches for KG completion is their ability to learn inference patterns, so that the predicted facts are the results of applying these patterns to the KG. Standard completion benchmarks, however, are not well-suited for evaluating models' abilities to learn patterns,…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted by the 20th International Conference on Principles of Knowledge Representation and Reasoning (KR 2023)

  9. arXiv:2305.06854  [pdf, other]

    cs.DB cs.AI

    Enhancing Datalog Reasoning with Hypertree Decompositions

    Authors: Xinyue Zhang, Pan Hu, Yavor Nenov, Ian Horrocks

    Abstract: Datalog reasoning based on the seminaïve evaluation strategy evaluates rules using traditional join plans, which often leads to redundancy and inefficiency in practice, especially when the rules are complex. Hypertree decompositions help identify efficient query plans and reduce similar redundancy in query answering. However, it is unclear how this can be applied to materialisation and incremental…

    Submitted 15 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  10. Reveal the Unknown: Out-of-Knowledge-Base Mention Discovery with Entity Linking

    Authors: Hang Dong, Jiaoyan Chen, Yuan He, Yinan Liu, Ian Horrocks

    Abstract: Discovering entity mentions that are out of a Knowledge Base (KB) from texts plays a critical role in KB maintenance, but has not yet been fully explored. The current methods are mostly limited to the simple threshold-based approach and feature-based classification, and the datasets for evaluation are relatively rare. We propose BLINKout, a new BERT-based Entity Linking (EL) method which can ident…

    Submitted 1 September, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 11 pages, 3 figures, accepted for CIKM 2023

    ACM Class: I.2.7

  11. arXiv:2302.06761  [pdf, other]

    cs.CL cs.AI cs.LO

    Language Model Analysis for Ontology Subsumption Inference

    Authors: Yuan He, Jiaoyan Chen, Ernesto Jiménez-Ruiz, Hang Dong, Ian Horrocks

    Abstract: Investigating whether pre-trained language models (LMs) can function as knowledge bases (KBs) has raised wide research interests recently. However, existing works focus on simple, triple-based, relational KBs, but omit more sophisticated, logic-based, conceptualised KBs such as OWL ontologies. To investigate an LM's knowledge of ontologies, we propose OntoLAMA, a set of inference-based probing tas…

    Submitted 8 May, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted at Findings of ACL 2023; OntoLAMA Datasets are available at: https://huggingface.co/datasets/krr-oxford/OntoLAMA (Huggingface) or https://doi.org/10.5281/zenodo.6480540 (Zenodo)

  12. arXiv:2301.11118  [pdf, other]

    cs.AI cs.LG cs.LO

    Dual Box Embeddings for the Description Logic EL++

    Authors: Mathias Jackermeier, Jiaoyan Chen, Ian Horrocks

    Abstract: OWL ontologies, whose formal semantics are rooted in Description Logic (DL), have been widely used for knowledge representation. Similar to Knowledge Graphs (KGs), ontologies are often incomplete, and maintaining and constructing them has proved challenging. While classical deductive reasoning algorithms use the precise formal semantics of an ontology to predict missing facts, recent years have wi…

    Submitted 25 March, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Updated license information

  13. arXiv:2205.09901  [pdf, other]

    cs.LG

    Cardinality-Minimal Explanations for Monotonic Neural Networks

    Authors: Ouns El Harzli, Bernardo Cuenca Grau, Ian Horrocks

    Abstract: In recent years, there has been increasing interest in explanation methods for neural model predictions that offer precise formal guarantees. These include abductive (respectively, contrastive) methods, which aim to compute minimal subsets of input features that are sufficient for a given prediction to hold (respectively, to change a given prediction). The corresponding decision problems are, howe…

    Submitted 2 May, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

  14. arXiv:2205.03447  [pdf, ps, other]

    cs.AI cs.LG q-bio.GN

    Machine Learning-Friendly Biomedical Datasets for Equivalence and Subsumption Ontology Matching

    Authors: Yuan He, Jiaoyan Chen, Hang Dong, Ernesto Jiménez-Ruiz, Ali Hadian, Ian Horrocks

    Abstract: Ontology Matching (OM) plays an important role in many domains such as bioinformatics and the Semantic Web, and its research is becoming increasingly popular, especially with the application of machine learning (ML) techniques. Although the Ontology Alignment Evaluation Initiative (OAEI) represents an impressive effort for the systematic evaluation of OM systems, it still suffers from several limi…

    Submitted 22 July, 2023; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: Accepted paper (Best Resource Paper Candidate) in the 21st International Semantic Web Conference (ISWC-2022); Bio-ML Dataset: https://doi.org/10.5281/zenodo.6510086

  15. arXiv:2202.09791  [pdf, other]

    cs.AI cs.CL

    Contextual Semantic Embeddings for Ontology Subsumption Prediction

    Authors: Jiaoyan Chen, Yuan He, Yuxia Geng, Ernesto Jimenez-Ruiz, Hang Dong, Ian Horrocks

    Abstract: Automating ontology construction and curation is an important but challenging task in knowledge engineering and artificial intelligence. Prediction by machine learning techniques such as contextual semantic embedding is a promising direction, but the relevant research is still preliminary especially for expressive ontologies in Web Ontology Language (OWL). In this paper, we present a new subsumpti…

    Submitted 18 March, 2023; v1 submitted 20 February, 2022; originally announced February 2022.

    Comments: Accepted by World Wide Web Journal

  16. arXiv:2112.10006  [pdf, other]

    cs.LG cs.AI

    Zero-shot and Few-shot Learning with Knowledge Graphs: A Comprehensive Survey

    Authors: Jiaoyan Chen, Yuxia Geng, Zhuo Chen, Jeff Z. Pan, Yuan He, Wen Zhang, Ian Horrocks, Huajun Chen

    Abstract: Machine learning especially deep neural networks have achieved great success but many of them often rely on a number of labeled samples for supervision. As sufficient labeled training data are not always ready due to e.g., continuously emerging prediction targets and costly sample annotation in real world applications, machine learning with sample shortage is now being widely investigated. Among a…

    Submitted 3 December, 2022; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: A survey on Zero-shot and Few-shot Learning with Knowledge Graph. It has collected 96 ZSL and FSL papers on this topic, with 11 figures and 4 tables

  17. arXiv:2112.02682  [pdf, other]

    cs.AI cs.CL cs.LG

    BERTMap: A BERT-based Ontology Alignment System

    Authors: Yuan He, Jiaoyan Chen, Denvar Antonyrajah, Ian Horrocks

    Abstract: Ontology alignment (a.k.a ontology matching (OM)) plays a critical role in knowledge integration. Owing to the success of machine learning in many domains, it has been applied in OM. However, the existing methods, which often adopt ad-hoc feature engineering or non-contextual word embeddings, have not yet outperformed rule-based systems especially in an unsupervised setting. In this paper, we prop…

    Submitted 3 May, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: Full version (with appendix) of the accepted paper in 36th AAAI Conference on Artificial Intelligence 2022

  18. On Event-Driven Knowledge Graph Completion in Digital Factories

    Authors: Martin Ringsquandl, Evgeny Kharlamov, Daria Stepanova, Steffen Lamparter, Raffaello Lepratti, Ian Horrocks, Peer Kröger

    Abstract: Smart factories are equipped with machines that can sense their manufacturing environments, interact with each other, and control production processes. Smooth operation of such factories requires that the machines and engineering personnel that conduct their monitoring and diagnostics share a detailed common industrial knowledge about the factory, e.g., in the form of knowledge graphs. Creation an…

    Submitted 8 September, 2021; originally announced September 2021.

    Journal ref: 2017 IEEE International Conference on Big Data (BIGDATA)

  19. arXiv:2107.00369  [pdf, ps, other]

    cs.LO cs.AI

    Computing CQ lower-bounds over OWL 2 through approximation to RSA

    Authors: Federico Igne, Stefano Germano, Ian Horrocks

    Abstract: Conjunctive query (CQ) answering over knowledge bases is an important reasoning task. However, with expressive ontology languages such as OWL, query answering is computationally very expensive. The PAGOdA system addresses this issue by using a tractable reasoner to compute lower and upper-bound approximations, falling back to a fully-fledged OWL reasoner only when these bounds don't coincide. The…

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 26 pages, 1 figure

  20. arXiv:2103.00070  [pdf, ps, other]

    cs.AI cs.LG

    Knowledge-aware Zero-Shot Learning: Survey and Perspective

    Authors: Jiaoyan Chen, Yuxia Geng, Zhuo Chen, Ian Horrocks, Jeff Z. Pan, Huajun Chen

    Abstract: Zero-shot learning (ZSL) which aims at predicting classes that have never appeared during the training using external knowledge (a.k.a. side information) has been widely investigated. In this paper we present a literature review towards ZSL in the perspective of external knowledge, where we categorize the external knowledge, review their methods and compare different external knowledge. With the l…

    Submitted 10 May, 2021; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: Accepted by IJCAI'21 Survey Track

  21. arXiv:2009.14654  [pdf, other]

    cs.AI

    OWL2Vec*: Embedding of OWL Ontologies

    Authors: Jiaoyan Chen, Pan Hu, Ernesto Jimenez-Ruiz, Ole Magnus Holter, Denvar Antonyrajah, Ian Horrocks

    Abstract: Semantic embedding of knowledge graphs has been widely studied and used for prediction and statistical analysis tasks across various domains such as Natural Language Processing and the Semantic Web. However, less attention has been paid to developing robust methods for embedding OWL (Web Ontology Language) ontologies which can express a much wider range of semantics than knowledge graphs and have…

    Submitted 25 January, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

  22. Correcting Knowledge Base Assertions

    Authors: Jiaoyan Chen, Xi Chen, Ian Horrocks, Ernesto Jimenez-Ruiz, Erik B. Myklebust

    Abstract: The usefulness and usability of knowledge bases (KBs) is often limited by quality issues. One common issue is the presence of erroneous assertions, often caused by lexical or semantic confusion. We study the problem of correcting such assertions, and present a general correction framework which combines lexical matching, semantic embedding, soft constraint mining and semantic consistency checking.…

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: Accepted by The Web Conference (WWW) 2020

    ACM Class: I.2

  23. Datalog Reasoning over Compressed RDF Knowledge Bases

    Authors: Pan Hu, Jacopo Urbani, Boris Motik, Ian Horrocks

    Abstract: Materialisation is often used in RDF systems as a preprocessing step to derive all facts implied by given RDF triples and rules. Although widely used, materialisation considers all possible rule applications and can use a lot of memory for storing the derived facts, which can hinder performance. We present a novel materialisation technique that compresses the RDF triples so that the rules can some…

    Submitted 29 August, 2019; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: CIKM 2019

  24. arXiv:1906.11180  [pdf, other]

    cs.AI cs.CL

    Canonicalizing Knowledge Base Literals

    Authors: Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks

    Abstract: Ontology-based knowledge bases (KBs) like DBpedia are very valuable resources, but their usefulness and usability is limited by various quality issues. One such issue is the use of string literals instead of semantically typed entities. In this paper we study the automated canonicalization of such literals, i.e., replacing the literal with an existing entity from the KB or with a new entity that i…

    Submitted 26 June, 2019; originally announced June 2019.

    Journal ref: International Semantic Web Conference (ISWC) 2019

  25. arXiv:1906.10261  [pdf, ps, other]

    cs.DB cs.DC cs.LO

    Datalog Materialisation in Distributed RDF Stores with Dynamic Data Exchange

    Authors: Temitope Ajileye, Boris Motik, Ian Horrocks

    Abstract: Several centralised RDF systems support datalog reasoning by precomputing and storing all logically implied triples using the well-known seminaive algorithm. Large RDF datasets often exceed the capacity of centralised RDF systems, and a common solution is to distribute the datasets in a cluster of shared-nothing servers. While numerous distributed query answering techniques are known, distributed s…

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: 16 pages, ISWC conference

  26. arXiv:1906.00781  [pdf, other]

    cs.DB cs.IR cs.LG

    Learning Semantic Annotations for Tabular Data

    Authors: Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks, Charles Sutton

    Abstract: The usefulness of tabular data such as web tables critically depends on understanding their semantics. This study focuses on column type prediction for tables without any meta data. Unlike traditional lexical matching-based methods, we propose a deep prediction model that can fully exploit a table's contextual semantics, including table locality features learned by a Hybrid Neural Network (HNN), a…

    Submitted 30 May, 2019; originally announced June 2019.

    Comments: 7 pages

    Journal ref: IJCAI 2019

  27. arXiv:1811.02304  [pdf, ps, other]

    cs.AI cs.DB

    Modular Materialisation of Datalog Programs

    Authors: Pan Hu, Boris Motik, Ian Horrocks

    Abstract: The seminaïve algorithm can materialise all consequences of arbitrary datalog rules, and it also forms the basis for incremental algorithms that update a materialisation as the input facts change. Certain (combinations of) rules, however, can be handled much more efficiently using custom algorithms. To integrate such algorithms into a general reasoning approach that can handle arbitrary rules, we…

    Submitted 13 November, 2018; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: Accepted at AAAI 2019

  28. arXiv:1811.01304  [pdf, other]

    cs.CL cs.AI

    ColNet: Embedding the Semantics of Web Tables for Column Type Prediction

    Authors: Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks, Charles Sutton

    Abstract: Automatically annotating column types with knowledge base (KB) concepts is a critical task to gain a basic understanding of web tables. Current methods rely on either table metadata like column name or entity correspondences of cells in the KB, and may fail to deal with growing web tables with incomplete meta information. In this paper we propose a neural network based column type annotation frame…

    Submitted 14 November, 2018; v1 submitted 3 November, 2018; originally announced November 2018.

    Comments: AAAI 2019

  29. arXiv:1808.02291  [pdf, other]

    cs.AI cs.DB cs.LO

    The Window Validity Problem in Rule-Based Stream Reasoning

    Authors: Alessandro Ronca, Mark Kaminski, Bernardo Cuenca Grau, Ian Horrocks

    Abstract: Rule-based temporal query languages provide the expressive power and flexibility required to capture in a natural way complex analysis tasks over streaming data. Stream processing applications, however, typically require near real-time response using limited resources. In particular, it becomes essential that the underpinning query language has favourable computational properties and that stream p…

    Submitted 15 November, 2018; v1 submitted 7 August, 2018; originally announced August 2018.

  30. arXiv:1807.08372  [pdf, other]

    cs.AI cs.LG

    Knowledge-based Transfer Learning Explanation

    Authors: Jiaoyan Chen, Freddy Lecue, Jeff Z. Pan, Ian Horrocks, Huajun Chen

    Abstract: Machine learning explanation can significantly boost machine learning's application in decision making, but the usability of current methods is limited in human-centric explanation, especially for transfer learning, an important machine learning branch that aims at utilizing knowledge from one learning domain (i.e., a pair of dataset and prediction task) to enhance prediction model training in ano…

    Submitted 22 July, 2018; originally announced July 2018.

    Comments: Accepted by International Conference on Principles of Knowledge Representation and Reasoning, 2018

  31. arXiv:1805.01396  [pdf, ps, other]

    cs.AI cs.LO

    Consequence-based Reasoning for Description Logics with Disjunction, Inverse Roles, Number Restrictions, and Nominals

    Authors: David Tena Cucala, Bernardo Cuenca Grau, Ian Horrocks

    Abstract: We present a consequence-based calculus for concept subsumption and classification in the description logic ALCHOIQ, which extends ALC with role hierarchies, inverse roles, number restrictions, and nominals. By using standard transformations, our calculus extends to SROIQ, which covers all of OWL 2 DL except for datatypes. A key feature of our calculus is its pay-as-you-go behaviour: unlike existi…

    Submitted 3 May, 2018; originally announced May 2018.

  32. arXiv:1804.09473  [pdf, other]

    cs.AI cs.LO

    Stratified Negation in Limit Datalog Programs

    Authors: Mark Kaminski, Bernardo Cuenca Grau, Egor V. Kostylev, Boris Motik, Ian Horrocks

    Abstract: There has recently been an increasing interest in declarative data analysis, where analytic tasks are specified using a logical language, and their implementation and optimisation are delegated to a general-purpose query engine. Existing declarative languages for data analysis can be formalised as variants of logic programming equipped with arithmetic function symbols and/or aggregation, and are t…

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: 14 pages; full version of a paper accepted at IJCAI-18

  33. arXiv:1711.04013  [pdf, other]

    cs.AI cs.DB

    Stream Reasoning in Temporal Datalog

    Authors: Alessandro Ronca, Mark Kaminski, Bernardo Cuenca Grau, Boris Motik, Ian Horrocks

    Abstract: In recent years, there has been an increasing interest in extending traditional stream processing engines with logical, rule-based, reasoning capabilities. This poses significant theoretical and practical challenges since rules can derive new information and propagate it both towards past and future time points; as a result, streamed query answers can depend on data that has not yet been received,…

    Submitted 15 November, 2018; v1 submitted 10 November, 2017; originally announced November 2017.

  34. arXiv:1711.03987  [pdf, ps, other]

    cs.DB cs.AI

    Optimised Maintenance of Datalog Materialisations

    Authors: Pan Hu, Boris Motik, Ian Horrocks

    Abstract: To efficiently answer queries, datalog systems often materialise all consequences of a datalog program, so the materialisation must be updated whenever the input facts change. Several solutions to the materialisation update problem have been proposed. The Delete/Rederive (DRed) and the Backward/Forward (B/F) algorithms solve this problem for general datalog, but both contain steps that evaluate ru…

    Submitted 20 November, 2017; v1 submitted 10 November, 2017; originally announced November 2017.

    Comments: AAAI 2018

  35. arXiv:1705.07105  [pdf, other]

    cs.AI

    The Bag Semantics of Ontology-Based Data Access

    Authors: Charalampos Nikolaou, Egor V. Kostylev, George Konstantinidis, Mark Kaminski, Bernardo Cuenca Grau, Ian Horrocks

    Abstract: Ontology-based data access (OBDA) is a popular approach for integrating and querying multiple data sources by means of a shared ontology. The ontology is linked to the sources using mappings, which assign views over the data to ontology predicates. Motivated by the need for OBDA systems supporting database-style aggregate queries, we propose a bag semantics for OBDA, where duplicate tuples in the…

    Submitted 19 May, 2017; originally announced May 2017.

  36. arXiv:1705.06927  [pdf, other]

    cs.AI cs.LO

    Foundations of Declarative Data Analysis Using Limit Datalog Programs

    Authors: Mark Kaminski, Bernardo Cuenca Grau, Egor V. Kostylev, Boris Motik, Ian Horrocks

    Abstract: Motivated by applications in declarative data analysis, we study $\mathit{Datalog}_{\mathbb{Z}}$---an extension of positive Datalog with arithmetic functions over integers. This language is known to be undecidable, so we propose two fragments. In $\mathit{limit}~\mathit{Datalog}_{\mathbb{Z}}$ predicates are axiomatised to keep minimal/maximal numeric values, allowing us to show that fact entailmen…

    Submitted 12 November, 2017; v1 submitted 19 May, 2017; originally announced May 2017.

    Comments: 23 pages; full version of a paper accepted at IJCAI-17; v2 fixes some typos and improves the acknowledgments

  37. arXiv:1607.05351  [pdf, other]

    cs.AI cs.DB

    Towards Analytics Aware Ontology Based Access to Static and Streaming Data (Extended Version)

    Authors: Evgeny Kharlamov, Yannis Kotidis, Theofilos Mailis, Christian Neuenstadt, Charalampos Nikolaou, Özgür Özcep, Christoforos Svingos, Dmitriy Zheleznyakov, Sebastian Brandt, Ian Horrocks, Yannis Ioannidis, Steffen Lamparter, Ralf Möller

    Abstract: Real-time analytics that requires integration and aggregation of heterogeneous and distributed streaming and static data is a typical task in many industrial scenarios such as diagnostics of turbines in Siemens. OBDA approach has a great potential to facilitate such tasks; however, it has a number of limitations in dealing with analytics that restrict its use in important industrial applications.…

    Submitted 15 August, 2016; v1 submitted 18 July, 2016; originally announced July 2016.

  38. arXiv:1602.04498  [pdf, other]

    cs.AI

    Extending Consequence-Based Reasoning to SRIQ

    Authors: Andrew Bate, Boris Motik, Bernardo Cuenca Grau, František Simančík, Ian Horrocks

    Abstract: Consequence-based calculi are a family of reasoning algorithms for description logics (DLs), and they combine hypertableau and resolution in a way that often achieves excellent performance in practice. Up to now, however, they were proposed for either Horn DLs (which do not support disjunction), or for DLs without counting quantifiers. In this paper we present a novel consequence-based calculus fo…

    Submitted 23 February, 2016; v1 submitted 14 February, 2016; originally announced February 2016.

  39. arXiv:1505.00212  [pdf, other]

    cs.DB cs.DS

    Combining Rewriting and Incremental Materialisation Maintenance for Datalog Programs with Equality

    Authors: Boris Motik, Yavor Nenov, Robert Piro, Ian Horrocks

    Abstract: Materialisation precomputes all consequences of a set of facts and a datalog program so that queries can be evaluated directly (i.e., independently from the program). Rewriting optimises materialisation for datalog programs with equality by replacing all equal constants with a single representative; and incremental maintenance algorithms can efficiently update a materialisation for small changes i…

    Submitted 1 May, 2015; originally announced May 2015.

    Comments: All proofs contained in the appendix. 7 pages + 4 pages appendix. 7 algorithms and one table with evaluation results

  40. arXiv:1411.5313  [pdf, other]

    cs.AI cs.LO

    Ontology Module Extraction via Datalog Reasoning

    Authors: Ana Armas Romero, Mark Kaminski, Bernardo Cuenca Grau, Ian Horrocks

    Abstract: Module extraction - the task of computing a (preferably small) fragment M of an ontology T that preserves entailments over a signature S - has found many applications in recent years. Extracting modules of minimal size is, however, computationally hard, and often algorithmically infeasible. Thus, practical techniques are based on approximations, where M provably captures the relevant entailments,…

    Submitted 20 November, 2014; v1 submitted 19 November, 2014; originally announced November 2014.

    Comments: 13 pages. To appear in AAAI-15

  41. arXiv:1411.3622  [pdf, ps, other]

    cs.DB cs.AI cs.DC

    Handling owl:sameAs via Rewriting

    Authors: Boris Motik, Yavor Nenov, Robert Piro, Ian Horrocks

    Abstract: Rewriting is widely used to optimise owl:sameAs reasoning in materialisation based OWL 2 RL systems. We investigate issues related to both the correctness and efficiency of rewriting, and present an algorithm that guarantees correctness, improves efficiency, and can be effectively parallelised. Our evaluation shows that our approach can reduce reasoning times on practical data sets by orders of ma…

    Submitted 13 November, 2014; originally announced November 2014.

    Comments: This is the technical report supporting the AAAI 2015 Conference submission with the same title

  42. arXiv:1406.4110  [pdf]

    cs.DB cs.AI

    Acyclicity Notions for Existential Rules and Their Application to Query Answering in Ontologies

    Authors: Bernardo Cuenca Grau, Ian Horrocks, Markus Krötzsch, Clemens Kupke, Despoina Magka, Boris Motik, Zhe Wang

    Abstract: Answering conjunctive queries (CQs) over a set of facts extended with existential rules is a prominent problem in knowledge representation and databases. This problem can be solved using the chase algorithm, which extends the given set of facts with fresh facts in order to satisfy the rules. If the chase terminates, then CQs can be evaluated directly in the resulting set of facts. The chase, howev…

    Submitted 3 February, 2014; originally announced June 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 47, pages 741-808, 2013

  43. arXiv:1401.4604  [pdf]

    cs.AI cs.LO

    Completeness Guarantees for Incomplete Ontology Reasoners: Theory and Practice

    Authors: Bernardo Cuenca Grau, Boris Motik, Giorgos Stoilos, Ian Horrocks

    Abstract: To achieve scalability of query answering, the developers of Semantic Web applications are often forced to use incomplete OWL 2 reasoners, which fail to derive all answers for at least one query, ontology, and data set. The lack of completeness guarantees, however, may be unacceptable for applications in areas such as health care and defence, where missing answers can adversely affect the applicat…

    Submitted 18 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 43, pages 419-476, 2012

  44. arXiv:1401.3485  [pdf]

    cs.LO cs.AI

    Hypertableau Reasoning for Description Logics

    Authors: Boris Motik, Rob Shearer, Ian Horrocks

    Abstract: We present a novel reasoning calculus for the description logic SHOIQ^+ - a knowledge representation formalism with applications in areas such as the Semantic Web. Unnecessary nondeterminism and the construction of large models are two primary sources of inefficiency in the tableau-based reasoning calculi used in state-of-the-art reasoners. In order to reduce nondeterminism, we base our calculus o…

    Submitted 15 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 36, pages 165-228, 2009

  45. arXiv:1304.1402  [pdf, ps, other]

    cs.AI

    Computing Datalog Rewritings beyond Horn Ontologies

    Authors: Bernardo Cuenca Grau, Boris Motik, Giorgos Stoilos, Ian Horrocks

    Abstract: Rewriting-based approaches for answering queries over an OWL 2 DL ontology have so far been developed mainly for Horn fragments of OWL 2 DL. In this paper, we study the possibilities of answering queries over non-Horn ontologies using datalog rewritings. We prove that this is impossible in general even for very simple ontology languages, and even if PTIME = NP. Furthermore, we present a resolution…

    Submitted 8 April, 2013; v1 submitted 4 April, 2013; originally announced April 2013.

    Comments: 14 pages. To appear at IJCAI 2013

  46. arXiv:1303.7430  [pdf, other]

    cs.AI cs.DB cs.LO

    Introducing Nominals to the Combined Query Answering Approaches for EL

    Authors: Giorgio Stefanoni, Boris Motik, Ian Horrocks

    Abstract: So-called combined approaches answer a conjunctive query over a description logic ontology in three steps: first, they materialise certain consequences of the ontology and the data; second, they evaluate the query over the data; and third, they filter the result of the second phase to eliminate unsound answers. Such approaches were developed for various members of the DL-Lite and the EL families o…

    Submitted 1 April, 2013; v1 submitted 29 March, 2013; originally announced March 2013.

    Comments: Extended version of a paper to appear at AAAI-13

  47. arXiv:1201.4089  [pdf, ps, other]

    cs.AI cs.LO

    A Description Logic Primer

    Authors: Markus Krötzsch, Frantisek Simancik, Ian Horrocks

    Abstract: This paper provides a self-contained first introduction to description logics (DLs). The main concepts and features are explained with examples before syntax and semantics of the DL SROIQ are defined in detail. Additional sections review light-weight DL languages, discuss the relationship to the Web Ontology Language OWL and give pointers to further reading.

    Submitted 3 June, 2013; v1 submitted 19 January, 2012; originally announced January 2012.

    ACM Class: I.2.4; F.4.1

  48. Conjunctive Query Answering for the Description Logic SHIQ

    Authors: Birte Glimm, Ian Horrocks, Carsten Lutz, Ulrike Sattler

    Abstract: Conjunctive queries play an important role as an expressive query language for Description Logics (DLs). Although modern DLs usually provide for transitive roles, conjunctive query answering over DL knowledge bases is only poorly understood if transitive roles are admitted in the query. In this paper, we consider unions of conjunctive queries over knowledge bases formulated in the prominent DL SHI…

    Submitted 31 October, 2011; originally announced November 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 31, pages 157-204, 2008

  49. Reasoning with Very Expressive Fuzzy Description Logics

    Authors: I. Horrocks, J. Z. Pan, G. Stamou, G. Stoilos, V. Tzouvaras

    Abstract: It is widely recognized today that the management of imprecision and vagueness will yield more intelligent and realistic knowledge-based applications. Description Logics (DLs) are a family of knowledge representation languages that have gained considerable attention in the last decade, mainly due to their decidability and the existence of reasoning algorithms with empirically high performance. In this…

    Submitted 31 October, 2011; originally announced November 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 30, pages 273-320, 2007

  50. arXiv:cs/0005014  [pdf, ps, other]

    cs.LO cs.AI

    Practical Reasoning for Expressive Description Logics

    Authors: Ian Horrocks, Ulrike Sattler, Stephan Tobies

    Abstract: Description Logics (DLs) are a family of knowledge representation formalisms mainly characterised by constructors to build complex concepts and roles from atomic ones. Expressive role constructors are important in many applications, but can be computationally problematical. We present an algorithm that decides satisfiability of the DL ALC extended with transitive and inverse roles, role hierarch…

    Submitted 10 May, 2000; originally announced May 2000.

    Comments: This paper appeared in the Proceedings of LPAR'99

    ACM Class: F.4.1, I.2.3, I.2.4