Showing 1–25 of 25 results for author: Jimenez-Ruiz, E

  1. arXiv:2310.07417  [pdf, ps, other]

    cs.AI cs.LG cs.SC

    What can knowledge graph alignment gain with Neuro-Symbolic learning approaches?

    Authors: Pedro Giesteira Cotovio, Ernesto Jimenez-Ruiz, Catia Pesquita

    Abstract: Knowledge Graphs (KG) are the backbone of many data-intensive applications since they can represent data coupled with its meaning and context. Aligning KGs across different domains and providers is necessary to afford a fuller and integrated representation. A severe limitation of current KG alignment (KGA) algorithms is that they fail to articulate logical thinking and reasoning with lexical, stru…

    Submitted 11 October, 2023; originally announced October 2023.

  2. Knowledge Graphs for the Life Sciences: Recent Developments, Challenges and Opportunities

    Authors: Jiaoyan Chen, Hang Dong, Janna Hastings, Ernesto Jiménez-Ruiz, Vanessa López, Pierre Monnin, Catia Pesquita, Petr Škoda, Valentina Tamma

    Abstract: The term life sciences refers to the disciplines that study living organisms and life processes, and include chemistry, biology, medicine, and a range of other related disciplines. Research efforts in life sciences are heavily data-driven, as they produce and consume vast amounts of scientific data, much of which is intrinsically relational and graph-structured. The volume of data and the comple…

    Submitted 20 December, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: 33 pages, 1 figure, camera-ready version, accepted for Transactions on Graph Data and Knowledge (TGDK)

    ACM Class: I.2.4; J.3

  3. arXiv:2305.13258  [pdf, other]

    cs.AI

    NeSy4VRD: A Multifaceted Resource for Neurosymbolic AI Research using Knowledge Graphs in Visual Relationship Detection

    Authors: David Herron, Ernesto Jiménez-Ruiz, Giacomo Tarroni, Tillman Weyde

    Abstract: NeSy4VRD is a multifaceted resource designed to support the development of neurosymbolic AI (NeSy) research. NeSy4VRD re-establishes public access to the images of the VRD dataset and couples them with an extensively revised, quality-improved version of the VRD visual relationship annotations. Crucially, NeSy4VRD provides a well-aligned, companion OWL ontology that describes the dataset domain. It…

    Submitted 22 May, 2023; originally announced May 2023.

  4. arXiv:2302.06761  [pdf, other]

    cs.CL cs.AI cs.LO

    Language Model Analysis for Ontology Subsumption Inference

    Authors: Yuan He, Jiaoyan Chen, Ernesto Jiménez-Ruiz, Hang Dong, Ian Horrocks

    Abstract: Investigating whether pre-trained language models (LMs) can function as knowledge bases (KBs) has raised wide research interests recently. However, existing works focus on simple, triple-based, relational KBs, but omit more sophisticated, logic-based, conceptualised KBs such as OWL ontologies. To investigate an LM's knowledge of ontologies, we propose OntoLAMA, a set of inference-based probing tas…

    Submitted 8 May, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted at Findings of ACL 2023; OntoLAMA Datasets are available at: https://huggingface.co/datasets/krr-oxford/OntoLAMA (Huggingface) or https://doi.org/10.5281/zenodo.6480540 (Zenodo)

  5. arXiv:2211.00192  [pdf, other]

    cs.DB

    AI Assistants: A Framework for Semi-Automated Data Wrangling

    Authors: Tomas Petricek, Gerrit J. J. van den Burg, Alfredo Nazábal, Taha Ceritli, Ernesto Jiménez-Ruiz, Christopher K. I. Williams

    Abstract: Data wrangling tasks such as obtaining and linking data from various sources, transforming data formats, and correcting erroneous records, can constitute up to 80% of typical data engineering work. Despite the rise of machine learning and artificial intelligence, data wrangling remains a tedious and manual task. We introduce AI assistants, a class of semi-automatic interactive tools to streamline…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: Accepted for publication in IEEE Transactions on Knowledge and Data Engineering

  6. arXiv:2210.15985  [pdf, other]

    cs.AI cs.LG q-bio.QM

    Understanding Adverse Biological Effect Predictions Using Knowledge Graphs

    Authors: Erik Bryhn Myklebust, Ernesto Jimenez-Ruiz, Jiaoyan Chen, Raoul Wolf, Knut Erik Tollefsen

    Abstract: Extrapolation of adverse biological (toxic) effects of chemicals is an important contribution to expand available hazard data in (eco)toxicology without the use of animals in laboratory experiments. In this work, we extrapolate effects based on a knowledge graph (KG) consisting of the most relevant effect data as domain-specific background knowledge. An effect prediction model, with and without ba…

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Under review. 29 pages

  7. Query-based Industrial Analytics over Knowledge Graphs with Ontology Reshaping

    Authors: Zhuoxun Zheng, Baifan Zhou, Dongzhuoran Zhou, Gong Cheng, Ernesto Jiménez-Ruiz, Ahmet Soylu, Evgeny Kharlamov

    Abstract: Industrial analytics that includes among others equipment diagnosis and anomaly detection heavily relies on integration of heterogeneous production data. Knowledge Graphs (KGs) as the data format and ontologies as the unified data schemata are a prominent solution that offers high quality data integration and a convenient and standardised way to exchange data and to layer analytical applications o…

    Submitted 22 September, 2022; originally announced September 2022.

  8. arXiv:2205.03447  [pdf, ps, other]

    cs.AI cs.LG q-bio.GN

    Machine Learning-Friendly Biomedical Datasets for Equivalence and Subsumption Ontology Matching

    Authors: Yuan He, Jiaoyan Chen, Hang Dong, Ernesto Jiménez-Ruiz, Ali Hadian, Ian Horrocks

    Abstract: Ontology Matching (OM) plays an important role in many domains such as bioinformatics and the Semantic Web, and its research is becoming increasingly popular, especially with the application of machine learning (ML) techniques. Although the Ontology Alignment Evaluation Initiative (OAEI) represents an impressive effort for the systematic evaluation of OM systems, it still suffers from several limi…

    Submitted 22 July, 2023; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: Accepted paper (Best Resource Paper Candidate) in the 21st International Semantic Web Conference (ISWC-2022); Bio-ML Dataset: https://doi.org/10.5281/zenodo.6510086

  9. arXiv:2202.09791  [pdf, other]

    cs.AI cs.CL

    Contextual Semantic Embeddings for Ontology Subsumption Prediction

    Authors: Jiaoyan Chen, Yuan He, Yuxia Geng, Ernesto Jimenez-Ruiz, Hang Dong, Ian Horrocks

    Abstract: Automating ontology construction and curation is an important but challenging task in knowledge engineering and artificial intelligence. Prediction by machine learning techniques such as contextual semantic embedding is a promising direction, but the relevant research is still preliminary especially for expressive ontologies in Web Ontology Language (OWL). In this paper, we present a new subsumpti…

    Submitted 18 March, 2023; v1 submitted 20 February, 2022; originally announced February 2022.

    Comments: Accepted by World Wide Web Journal

  10. A Simple Standard for Sharing Ontological Mappings (SSSOM)

    Authors: Nicolas Matentzoglu, James P. Balhoff, Susan M. Bello, Chris Bizon, Matthew Brush, Tiffany J. Callahan, Christopher G Chute, William D. Duncan, Chris T. Evelo, Davera Gabriel, John Graybeal, Alasdair Gray, Benjamin M. Gyori, Melissa Haendel, Henriette Harmse, Nomi L. Harris, Ian Harrow, Harshad Hegde, Amelia L. Hoyt, Charles T. Hoyt, Dazhi Jiao, Ernesto Jiménez-Ruiz, Simon Jupp, Hyeongsik Kim, Sebastian Koehler, et al. (19 additional authors not shown)

    Abstract: Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, ar…

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: Corresponding author: Christopher J. Mungall <cjmungall@lbl.gov>

  11. Prediction of Adverse Biological Effects of Chemicals Using Knowledge Graph Embeddings

    Authors: Erik B. Myklebust, Ernesto Jiménez-Ruiz, Jiaoyan Chen, Raoul Wolf, Knut Erik Tollefsen

    Abstract: We have created a knowledge graph based on major data sources used in ecotoxicological risk assessment. We have applied this knowledge graph to an important task in risk assessment, namely chemical effect prediction. We have evaluated nine knowledge graph embedding models from a selection of geometric, decomposition, and convolutional models on this prediction task. We show that using knowledge gr…

    Submitted 30 March, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-40, 2022

    ACM Class: I.2

  12. arXiv:2009.14654  [pdf, other]

    cs.AI

    OWL2Vec*: Embedding of OWL Ontologies

    Authors: Jiaoyan Chen, Pan Hu, Ernesto Jimenez-Ruiz, Ole Magnus Holter, Denvar Antonyrajah, Ian Horrocks

    Abstract: Semantic embedding of knowledge graphs has been widely studied and used for prediction and statistical analysis tasks across various domains such as Natural Language Processing and the Semantic Web. However, less attention has been paid to developing robust methods for embedding OWL (Web Ontology Language) ontologies which can express a much wider range of semantics than knowledge graphs and have…

    Submitted 25 January, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

  13. arXiv:2003.05370  [pdf, other]

    cs.AI

    Dividing the Ontology Alignment Task with Semantic Embeddings and Logic-based Modules

    Authors: Ernesto Jiménez-Ruiz, Asan Agibetov, Jiaoyan Chen, Matthias Samwald, Valerie Cross

    Abstract: Large ontologies still pose serious challenges to state-of-the-art ontology alignment systems. In this paper we present an approach that combines a neural embedding model and logic-based modules to accurately divide an input ontology matching task into smaller and more tractable matching (sub)tasks. We have conducted a comprehensive evaluation using the datasets of the Ontology Alignment Evaluatio…

    Submitted 25 February, 2020; originally announced March 2020.

    Comments: Accepted to the 24th European Conference on Artificial Intelligence (ECAI 2020). arXiv admin note: text overlap with arXiv:1805.12402

    ACM Class: I.2

  14. Correcting Knowledge Base Assertions

    Authors: Jiaoyan Chen, Xi Chen, Ian Horrocks, Ernesto Jimenez-Ruiz, Erik B. Myklebust

    Abstract: The usefulness and usability of knowledge bases (KBs) is often limited by quality issues. One common issue is the presence of erroneous assertions, often caused by lexical or semantic confusion. We study the problem of correcting such assertions, and present a general correction framework which combines lexical matching, semantic embedding, soft constraint mining and semantic consistency checking.…

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: Accepted by The Web Conference (WWW) 2020

    ACM Class: I.2

  15. arXiv:1908.10128  [pdf, other]

    cs.AI cs.IR

    TERA: the Toxicological Effect and Risk Assessment Knowledge Graph

    Authors: Erik Bryhn Myklebust, Ernesto Jimenez-Ruiz, Jiaoyan Chen, Raoul Wolf, Knut Erik Tollefsen

    Abstract: Ecological risk assessment requires large amounts of chemical effect data from laboratory experiments. Due to experimental effort and animal welfare concerns it is desired to extrapolate data from existing sources. To cover the required chemical effect data several data sources need to be integrated to enable their interoperability. In this paper we introduce the Toxicological Effect and Risk Asse…

    Submitted 12 December, 2019; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: Submitted to a conference

  16. Knowledge Graph Embedding for Ecotoxicological Effect Prediction

    Authors: Erik Bryhn Myklebust, Ernesto Jimenez-Ruiz, Jiaoyan Chen, Raoul Wolf, Knut Erik Tollefsen

    Abstract: Exploring the effects a chemical compound has on a species takes a considerable experimental effort. Appropriate methods for estimating and suggesting new effects can dramatically reduce the work needed to be done by a laboratory. In this paper we explore the suitability of using a knowledge graph embedding approach for ecotoxicological effect prediction. A knowledge graph has been constructed fro…

    Submitted 11 November, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Journal ref: In: Ghidini C. et al. (eds) The Semantic Web - ISWC 2019. ISWC 2019. Lecture Notes in Computer Science, vol 11779. Springer, Cham

  17. arXiv:1906.11180  [pdf, other]

    cs.AI cs.CL

    Canonicalizing Knowledge Base Literals

    Authors: Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks

    Abstract: Ontology-based knowledge bases (KBs) like DBpedia are very valuable resources, but their usefulness and usability is limited by various quality issues. One such issue is the use of string literals instead of semantically typed entities. In this paper we study the automated canonicalization of such literals, i.e., replacing the literal with an existing entity from the KB or with a new entity that i…

    Submitted 26 June, 2019; originally announced June 2019.

    Journal ref: International Semantic Web Conference (ISWC) 2019

  18. arXiv:1906.00781  [pdf, other]

    cs.DB cs.IR cs.LG

    Learning Semantic Annotations for Tabular Data

    Authors: Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks, Charles Sutton

    Abstract: The usefulness of tabular data such as web tables critically depends on understanding their semantics. This study focuses on column type prediction for tables without any meta data. Unlike traditional lexical matching-based methods, we propose a deep prediction model that can fully exploit a table's contextual semantics, including table locality features learned by a Hybrid Neural Network (HNN), a…

    Submitted 30 May, 2019; originally announced June 2019.

    Comments: 7 pages

    Journal ref: IJCAI 2019

  19. arXiv:1901.08547  [pdf]

    cs.LG cs.AI stat.ML

    Human-centric Transfer Learning Explanation via Knowledge Graph [Extended Abstract]

    Authors: Yuxia Geng, Jiaoyan Chen, Ernesto Jimenez-Ruiz, Huajun Chen

    Abstract: Transfer learning, which aims at utilizing knowledge learned from one problem (source domain) to solve another different but related problem (target domain), has attracted wide research attention. However, the current transfer learning methods are mostly uninterpretable, especially to people without ML expertise. In this extended abstract, we briefly introduce two knowledge graph (KG) based framework…

    Submitted 20 January, 2019; originally announced January 2019.

    Comments: In AAAI-19 Workshop on Network Interpretability for Deep Learning

  20. arXiv:1811.01304  [pdf, other]

    cs.CL cs.AI

    ColNet: Embedding the Semantics of Web Tables for Column Type Prediction

    Authors: Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks, Charles Sutton

    Abstract: Automatically annotating column types with knowledge base (KB) concepts is a critical task to gain a basic understanding of web tables. Current methods rely on either table metadata like column name or entity correspondences of cells in the KB, and may fail to deal with growing web tables with incomplete meta information. In this paper we propose a neural network based column type annotation frame…

    Submitted 14 November, 2018; v1 submitted 3 November, 2018; originally announced November 2018.

    Comments: AAAI 2019

  21. arXiv:1805.12402  [pdf, ps, other]

    cs.AI

    Breaking-down the Ontology Alignment Task with a Lexical Index and Neural Embeddings

    Authors: Ernesto Jimenez-Ruiz, Asan Agibetov, Matthias Samwald, Valerie Cross

    Abstract: Large ontologies still pose serious challenges to state-of-the-art ontology alignment systems. In the paper we present an approach that combines a lexical index, a neural embedding model and locality modules to effectively divide an input ontology matching task into smaller and more tractable matching (sub)tasks. We have conducted a comprehensive evaluation using the datasets of the Ontology Align…

    Submitted 31 May, 2018; originally announced May 2018.

  22. arXiv:1208.3148  [pdf, other]

    cs.AI

    Evaluating Ontology Matching Systems on Large, Multilingual and Real-world Test Cases

    Authors: Christian Meilicke, Ondrej Sváb-Zamazal, Cássia Trojahn, Ernesto Jiménez-Ruiz, José-Luis Aguirre, Heiner Stuckenschmidt, Bernardo Cuenca Grau

    Abstract: In the field of ontology matching, the most systematic evaluation of matching systems is established by the Ontology Alignment Evaluation Initiative (OAEI), which is an annual campaign for evaluating ontology matching systems organized by different groups of researchers. In this paper, we report on the results of an intermediary OAEI campaign called OAEI 2011.5. The evaluations of this campaign ar…

    Submitted 15 August, 2012; originally announced August 2012.

    Comments: Technical Report of the OAEI 2011.5 Evaluation Campaign

  23. arXiv:1012.1659  [pdf, other]

    cs.AI cs.LO

    First steps in the logic-based assessment of post-composed phenotypic descriptions

    Authors: Ernesto Jimenez-Ruiz, Bernardo Cuenca Grau, Rafael Berlanga, Dietrich Rebholz-Schuhmann

    Abstract: In this paper we present a preliminary logic-based evaluation of the integration of post-composed phenotypic descriptions with domain ontologies. The evaluation has been performed using a description logic reasoner together with scalable techniques: ontology modularization and approximations of the logical difference between ontologies.

    Submitted 7 December, 2010; originally announced December 2010.

    Comments: in Adrian Paschke, Albert Burger, Andrea Splendiani, M. Scott Marshall, Paolo Romano: Proceedings of the 3rd International Workshop on Semantic Web Applications and Tools for the Life Sciences, Berlin, Germany, December 8-10, 2010

    Report number: SWAT4LS 2010

    ACM Class: J.3

  24. arXiv:1012.1609  [pdf, other]

    cs.IR

    Building conceptual spaces for exploring and linking biomedical resources

    Authors: R. Berlanga, E. Jimenez-Ruiz, V. Nebot

    Abstract: The establishment of links between data (e.g., patient records) and Web resources (e.g., literature) and the proper visualization of such discovered knowledge is still a challenge in most Life Science domains (e.g., biomedicine). In this paper we present our contribution to the community in the form of an infrastructure to annotate information resources, to discover relationships among them, and t…

    Submitted 7 December, 2010; originally announced December 2010.

    Comments: in Adrian Paschke, Albert Burger, Andrea Splendiani, M. Scott Marshall, Paolo Romano: Proceedings of the 3rd International Workshop on Semantic Web Applications and Tools for the Life Sciences, Berlin, Germany, December 8-10, 2010

    Report number: SWAT4LS 2010

    ACM Class: J.3

  25. arXiv:cs/0609144  [pdf]

    cs.DB

    The Management and Integration of Biomedical Knowledge: Application in the Health-e-Child Project (Position Paper)

    Authors: E. Jimenez-Ruiz, R. Berlanga, I. Sanz, R. McClatchey, R. Danger, D. Manset, J. Paraire, A. Rios

    Abstract: The Health-e-Child project aims to develop an integrated healthcare platform for European paediatrics. In order to achieve a comprehensive view of children's health, a complex integration of biomedical data, information, and knowledge is necessary. Ontologies will be used to formally define this domain knowledge and will form the basis for the medical knowledge management system. This paper intro…

    Submitted 26 September, 2006; originally announced September 2006.

    Comments: 6 pages; 2 figures. Proceedings of the 1st International Workshop on Ontology content and evaluation in Enterprise

    ACM Class: H.2.4; J.3