Showing 1–50 of 94 results for author: Auer, S

  1. arXiv:2407.02977  [pdf, other]

    cs.CL cs.AI cs.IT

    Large Language Models as Evaluators for Scientific Synthesis

    Authors: Julia Evans, Jennifer D'Souza, Sören Auer

    Abstract: Our study explores how well the state-of-the-art Large Language Models (LLMs), like GPT-4 and Mistral, can assess the quality of scientific summaries or, more fittingly, scientific syntheses, comparing their evaluations to those of human annotators. We used a dataset of 100 research questions and their syntheses made by GPT-4 from abstracts of five related papers, checked against human quality rat…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 4 pages, forthcoming as part of the KONVENS 2024 proceedings https://konvens-2024.univie.ac.at/

  2. arXiv:2407.02409  [pdf, other]

    cs.CL

    Effective Context Selection in LLM-based Leaderboard Generation: An Empirical Study

    Authors: Salomon Kabongo, Jennifer D'Souza, Sören Auer

    Abstract: This paper explores the impact of context selection on the efficiency of Large Language Models (LLMs) in generating Artificial Intelligence (AI) research leaderboards, a task defined as the extraction of (Task, Dataset, Metric, Score) quadruples from scholarly articles. By framing this challenge as a text generation objective and employing instruction finetuning with the FLAN-T5 collection, we int…

    Submitted 6 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2406.04383

  3. arXiv:2406.07257  [pdf, other]

    cs.CL cs.AI

    Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway

    Authors: Hamed Babaei Giglou, Tilahun Abedissa Taffa, Rana Abdullah, Aida Usmanova, Ricardo Usbeck, Jennifer D'Souza, Sören Auer

    Abstract: This paper introduces a scholarly Question Answering (QA) system on top of the NFDI4DataScience Gateway, employing a Retrieval Augmented Generation-based (RAG) approach. The NFDI4DS Gateway, as a foundational framework, offers a unified and intuitive interface for querying various scientific databases using federated search. The RAG-based scholarly QA, powered by a Large Language Model (LLM), faci…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 13 pages main content, 16 pages overall, 3 Figures, accepted for publication at NSLP 2024 workshop at ESWC 2024

  4. arXiv:2406.04383  [pdf, other]

    cs.CL cs.AI

    Exploring the Latest LLMs for Leaderboard Extraction

    Authors: Salomon Kabongo, Jennifer D'Souza, Sören Auer

    Abstract: The rapid advancements in Large Language Models (LLMs) have opened new avenues for automating complex tasks in AI research. This paper investigates the efficacy of different LLMs (Mistral 7B, Llama-2, GPT-4-Turbo and GPT-4.o) in extracting leaderboard information from empirical AI research articles. We explore three types of contextual inputs to the models: DocTAET (Document Title, Abstract, Experim…

    Submitted 8 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2404.10317  [pdf, other]

    cs.AI

    LLMs4OM: Matching Ontologies with Large Language Models

    Authors: Hamed Babaei Giglou, Jennifer D'Souza, Felix Engel, Sören Auer

    Abstract: Ontology Matching (OM) is a critical task in knowledge integration, where aligning heterogeneous ontologies facilitates data interoperability and knowledge sharing. Traditional OM systems often rely on expert knowledge or predictive models, with limited exploration of the potential of Large Language Models (LLMs). We present the LLMs4OM framework, a novel approach to evaluate the effectiveness of…

    Submitted 23 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 8 pages, 1 figure, accepted to ESWC 2024 Special Track on LLMs for Knowledge Engineering (https://2024.eswc-conferences.org/call-for-papers-llms/)

  6. arXiv:2404.08443  [pdf, other]

    cs.DL cs.IR

    Toward FAIR Semantic Publishing of Research Dataset Metadata in the Open Research Knowledge Graph

    Authors: Raia Abu Ahmad, Jennifer D'Souza, Matthäus Zloch, Wolfgang Otto, Georg Rehm, Allard Oelen, Stefan Dietze, Sören Auer

    Abstract: Search engines these days can serve datasets as search results. Datasets get picked up by search technologies based on structured descriptions on their official web pages, informed by metadata ontologies such as the Dataset content type of schema.org. Despite this promotion of the content type dataset as a first-class citizen of search results, a vast proportion of datasets, particularly research…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 8 pages, 1 figure, published in the Joint Proceedings of the Onto4FAIR 2023 Workshops

    Journal ref: In Joint Proceedings of the Onto4FAIR 2023 Workshops: Collocated with FOIS 2023 and SEMANTICS 2023. pp.23-31. https://hal.science/hal-04312604

  7. arXiv:2401.13365  [pdf, other]

    cs.DL

    Organizing Scientific Knowledge From Energy System Research Using the Open Research Knowledge Graph

    Authors: Oliver Karras, Jan Göpfert, Patrick Kuckertz, Tristan Pelser, Sören Auer

    Abstract: Engineering sciences, such as energy system research, play an important role in developing solutions to technical, environmental, economic, and social challenges of our modern society. In this context, the transformation of energy systems into climate-neutral systems is one of the key strategies for mitigating climate change. For the transformation of energy systems, engineers model, simulate and…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 1st NFDI4Energy Conference

  8. arXiv:2401.10040  [pdf, other]

    cs.CL cs.AI cs.DL cs.IT

    Large Language Models for Scientific Information Extraction: An Empirical Study for Virology

    Authors: Mahsa Shamsabadi, Jennifer D'Souza, Sören Auer

    Abstract: In this paper, we champion the use of structured and semantic content representation of discourse-based scholarly communication, inspired by tools like Wikipedia infoboxes or structured Amazon product descriptions. These representations provide users with a concise overview, aiding scientists in navigating the dense academic landscape. Our novel automated approach leverages the robust text generat…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 8 pages, 6 figures, Accepted as Findings of the ACL: EACL 2024

  9. arXiv:2312.01065  [pdf, other]

    cs.DL

    Scholarly Knowledge Graph Construction from Published Software Packages

    Authors: Muhammad Haris, Sören Auer, Markus Stocker

    Abstract: The value of structured scholarly knowledge for research and society at large is well understood, but producing scholarly knowledge (i.e., knowledge traditionally published in articles) in structured form remains a challenge. We propose an approach for automatically extracting scholarly knowledge from published software packages by static analysis of their metadata and contents (scripts and data)…

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: 10 pages, 5 figures. arXiv admin note: text overlap with arXiv:2212.07921

  10. arXiv:2309.08042  [pdf, other]

    cs.CV cs.AI

    Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved

    Authors: Yao Sun, Anna Kruspe, Liqiu Meng, Yifan Tian, Eike J Hoffmann, Stefan Auer, Xiao Xiang Zhu

    Abstract: Crowdsourced platforms provide huge amounts of street-view images that contain valuable building information. This work addresses the challenges in applying Scene Text Recognition (STR) in crowdsourced street-view images for building attribute mapping. We use Flickr images, particularly examining texts on building facades. A Berlin Flickr dataset is created, and pre-trained STR models are used for…

    Submitted 14 September, 2023; originally announced September 2023.

  11. An Approach to Evaluate User Interfaces in a Scholarly Knowledge Communication Domain

    Authors: Denis Obrezkov, Allard Oelen, Sören Auer

    Abstract: The amount of research articles produced every day is overwhelming: scholarly knowledge is getting harder to communicate and easier to get lost. A possible solution is to represent the information in knowledge graphs: structures representing knowledge in networks of entities, their semantic types, and relationships between them. But this solution has its own drawback: given its very specific task,…

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 5 pages, 2 figures

    Journal ref: 19th IFIP TC13 International Conference, York, UK, August 28 - September 1, 2023, Proceedings, Part IV

  12. arXiv:2308.05074  [pdf, other]

    cs.CY cs.AI cs.CV

    Drones4Good: Supporting Disaster Relief Through Remote Sensing and AI

    Authors: Nina Merkle, Reza Bahmanyar, Corentin Henry, Seyed Majid Azimi, Xiangtian Yuan, Simon Schopferer, Veronika Gstaiger, Stefan Auer, Anne Schneibel, Marc Wieland, Thomas Kraft

    Abstract: In order to respond effectively in the aftermath of a disaster, emergency services and relief organizations rely on timely and accurate information about the affected areas. Remote sensing has the potential to significantly reduce the time and effort required to collect such information by enabling a rapid survey of large areas. To achieve this, the main challenge is the automatic extraction of re…

    Submitted 9 August, 2023; originally announced August 2023.

  13. arXiv:2307.16648  [pdf, other]

    cs.AI cs.CL cs.IT cs.LG

    LLMs4OL: Large Language Models for Ontology Learning

    Authors: Hamed Babaei Giglou, Jennifer D'Souza, Sören Auer

    Abstract: We propose the LLMs4OL approach, which utilizes Large Language Models (LLMs) for Ontology Learning (OL). LLMs have shown significant advancements in natural language processing, demonstrating their ability to capture complex language patterns in different knowledge domains. Our LLMs4OL paradigm investigates the following hypothesis: Can LLMs effectively apply their language pattern capturi…

    Submitted 2 August, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: 15 pages main content, 27 pages overall, 2 Figures, accepted for publication at ISWC 2023 research track

  14. arXiv:2306.16791  [pdf, other]

    cs.SE cs.DL

    Divide and Conquer the EmpiRE: A Community-Maintainable Knowledge Graph of Empirical Research in Requirements Engineering

    Authors: Oliver Karras, Felix Wernlein, Jil Klünder, Sören Auer

    Abstract: [Background.] Empirical research in requirements engineering (RE) is a constantly evolving topic, with a growing number of publications. Several papers address this topic using literature reviews to provide a snapshot of its "current" state and evolution. However, these papers have never built on or updated earlier ones, resulting in overlap and redundancy. The underlying problem is the unavailabi…

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: Accepted for publication at the 17th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM 2023)

  15. arXiv:2306.10620  [pdf, other]

    cs.SE

    A Metadata-Based Ecosystem to Improve the FAIRness of Research Software

    Authors: Patrick Kuckertz, Jan Göpfert, Oliver Karras, David Neuroth, Julian Schönau, Rodrigo Pueblas, Stephan Ferenz, Felix Engel, Noah Pflugradt, Jann M. Weinand, Astrid Nieße, Sören Auer, Detlef Stolten

    Abstract: The reuse of research software is central to research efficiency and academic exchange. The application of software enables researchers with varied backgrounds to reproduce, validate, and expand upon study findings. Furthermore, the analysis of open source code aids in the comprehension, comparison, and integration of approaches. Often, however, no further use occurs because relevant software cann…

    Submitted 18 June, 2023; originally announced June 2023.

  16. arXiv:2305.12900  [pdf, other]

    cs.CL cs.AI cs.DL cs.IT

    Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph

    Authors: Jennifer D'Souza, Moussab Hrou, Sören Auer

    Abstract: There have been many recent investigations into prompt-based training of transformer language models for new text genres in low-resource settings. The prompt-based training approach has been found to be effective in generalizing pre-trained or fine-tuned models for transfer to resource-scarce settings. This work, for the first time, reports results on adopting prompt-based training of transformers…

    Submitted 11 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 14 pages, 1 figure, accepted for publication as a short paper at DEXA 2023 (https://www.dexa.org/dexa2023)

  17. arXiv:2305.11068  [pdf, other]

    cs.CL cs.AI

    ORKG-Leaderboards: A Systematic Workflow for Mining Leaderboards as a Knowledge Graph

    Authors: Salomon Kabongo, Jennifer D'Souza, Sören Auer

    Abstract: The purpose of this work is to describe the Orkg-Leaderboard software designed to extract leaderboards defined as Task-Dataset-Metric tuples automatically from large collections of empirical research papers in Artificial Intelligence (AI). The software can support both the main workflows of scholarly publishing, viz. as LaTeX files or as PDF files. Furthermore, the system is integrated with the Op…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: NA. arXiv admin note: text overlap with arXiv:2109.13089

  18. Evaluating BERT-based Scientific Relation Classifiers for Scholarly Knowledge Graph Construction on Digital Library Collections

    Authors: Ming Jiang, Jennifer D'Souza, Sören Auer, J. Stephen Downie

    Abstract: The rapid growth of research publications has placed great demands on digital libraries (DL) for advanced information management technologies. To cater to these demands, techniques relying on knowledge-graph structures are being advocated. In such graph-based pipelines, inferring semantic relations between related scientific concepts is a crucial step. Recently, BERT-based pre-trained models have…

    Submitted 3 May, 2023; originally announced May 2023.

    Journal ref: International Journal on Digital Libraries (2022)

  19. arXiv:2303.16835  [pdf, other]

    cs.CL cs.AI cs.LG

    Zero-shot Entailment of Leaderboards for Empirical AI Research

    Authors: Salomon Kabongo, Jennifer D'Souza, Sören Auer

    Abstract: We present a large-scale empirical investigation of the zero-shot learning phenomena in a specific recognizing textual entailment (RTE) task category, i.e. the automated mining of leaderboards for Empirical AI Research. The prior reported state-of-the-art models for leaderboards extraction formulated as an RTE task, in a non-zero-shot setting, are promising with above 90% reported performances. Ho…

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: 5 pages, 1 figure. Accepted for publication at JCDL 2023 - Late Breaking Results and Datasets track (https://2023.jcdl.org/calls/papers/#paper_types), official citation forthcoming

  20. arXiv:2303.15113  [pdf, other]

    cs.AI

    Describing and Organizing Semantic Web and Machine Learning Systems in the SWeMLS-KG

    Authors: Fajar J. Ekaputra, Majlinda Llugiqi, Marta Sabou, Andreas Ekelhart, Heiko Paulheim, Anna Breit, Artem Revenko, Laura Waltersdorfer, Kheir Eddine Farfar, Sören Auer

    Abstract: In line with the general trend in artificial intelligence research to create intelligent systems that combine learning and symbolic components, a new sub-area has emerged that focuses on combining machine learning (ML) components with techniques developed by the Semantic Web (SW) community - Semantic Web Machine Learning (SWeML for short). Due to its rapid growth and impact on several communities…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Preprint of a paper in the resource track of the 20th Extended Semantic Web Conference (ESWC'23)

  21. arXiv:2303.03882  [pdf, other]

    cs.CY cs.HC

    A Next-Generation Digital Procurement Workspace Focusing on Information Integration, Automation, Analytics, and Sustainability

    Authors: Jan-David Stütz, Oliver Karras, Allard Oelen, Sören Auer

    Abstract: Recent events such as wars, sanctions, pandemics, and climate change have shown the importance of proper supply network management. A key step in managing supply networks is procurement. We present an approach for realizing a next-generation procurement workspace that aims to facilitate resilience and sustainability. To achieve this, the approach encompasses a novel way of information integration,…

    Submitted 22 March, 2023; v1 submitted 17 February, 2023; originally announced March 2023.

    Comments: Accepted for publication at the 25th International Conference on Enterprise Information Systems (ICEIS'23)

  22. arXiv:2212.07921  [pdf, other]

    cs.DL

    Scholarly Knowledge Extraction from Published Software Packages

    Authors: Muhammad Haris, Markus Stocker, Sören Auer

    Abstract: A plethora of scientific software packages are published in repositories, e.g., Zenodo and figshare. These software packages are crucial for the reproducibility of published research. As an additional route to scholarly knowledge graph construction, we propose an approach for automated extraction of machine actionable (structured) scholarly knowledge from published software packages by static anal…

    Submitted 15 December, 2022; originally announced December 2022.

  23. MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles

    Authors: Mohamad Yaser Jaradeh, Markus Stocker, Sören Auer

    Abstract: Information extraction from scholarly articles is a challenging task due to the sizable document length and implicit information hidden in text, figures, and citations. Scholarly information extraction has various applications in exploration, archival, and curation services for digital libraries and knowledge management systems. We present MORTY, an information extraction technique that creates st…

    Submitted 11 December, 2022; originally announced December 2022.

    Comments: Published as a short paper in ICADL 2022

  24. arXiv:2211.12223  [pdf, other]

    cs.DL cs.HC

    KGMM -- A Maturity Model for Scholarly Knowledge Graphs based on Intertwined Human-Machine Collaboration

    Authors: Hassan Hussein, Allard Oelen, Oliver Karras, Sören Auer

    Abstract: Knowledge Graphs (KG) have gained increasing importance in science, business and society in the last years. However, most knowledge graphs were either extracted or compiled from existing sources. There are only relatively few examples where knowledge graphs were genuinely created by an intertwined human-machine collaboration. Also, since the quality of data and knowledge graphs is of paramount imp…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted as a full paper at the ICADL 2022: International Conference on Asian Digital Libraries 2022

  25. arXiv:2210.02034  [pdf, other]

    cs.DL cs.AI

    Clustering Semantic Predicates in the Open Research Knowledge Graph

    Authors: Omar Arab Oghli, Jennifer D'Souza, Sören Auer

    Abstract: When semantically describing knowledge graphs (KGs), users have to make a critical choice of a vocabulary (i.e. predicates and resources). The success of KG building is determined by the convergence of shared vocabularies so that meaning can be established. The typical lifecycle for a new KG construction can be defined as follows: nascent phases of graph construction experience terminology diverge…

    Submitted 5 October, 2022; originally announced October 2022.

  26. Persistent Identification and Interlinking of FAIR Scholarly Knowledge

    Authors: Muhammad Haris, Markus Stocker, Sören Auer

    Abstract: We leverage the Open Research Knowledge Graph - a scholarly infrastructure that supports the creation, curation, and reuse of structured, semantic scholarly knowledge - and present an approach for persistent identification of FAIR scholarly knowledge. We propose a DOI-based persistent identification of ORKG Papers, which are machine-actionable descriptions of the essential information published in…

    Submitted 19 September, 2022; originally announced September 2022.

  27. Plumber: A Modular Framework to Create Information Extraction Pipelines

    Authors: Mohamad Yaser Jaradeh, Kuldeep Singh, Markus Stocker, Sören Auer

    Abstract: Information Extraction (IE) tasks are commonly studied topics in various domains of research. Hence, the community continuously produces multiple techniques, solutions, and tools to perform such tasks. However, running those tools and integrating them within existing infrastructure requires time, expertise, and resources. One pertinent task here is triples extraction and linking, where structured…

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: pre-print for WWW'21 demo of ICWE PLUMBER publication

  28. Open Research Knowledge Graph: A System Walkthrough

    Authors: Mohamad Yaser Jaradeh, Allard Oelen, Manuel Prinz, Markus Stocker, Sören Auer

    Abstract: Despite improved digital access to scholarly literature in the last decades, the fundamental principles of scholarly communication remain unchanged and continue to be largely document-based. Scholarly knowledge remains locked in representations that are inadequate for machine processing. The Open Research Knowledge Graph (ORKG) is an infrastructure for representing, curating and exploring scholarl…

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: Pre-print for TPDL 2019 demo

  29. arXiv:2205.07627  [pdf, other]

    cs.DB cs.AI

    KnowGraph-PM: a Knowledge Graph based Pricing Model for Semiconductors Supply Chains

    Authors: Nour Ramzy, Soren Auer, Javad Chamanara, Hans Ehm

    Abstract: Semiconductor supply chains are described by significant demand fluctuation that increases as one moves up the supply chain, the so-called bullwhip effect. To counteract, semiconductor manufacturers aim to optimize capacity utilization, to deliver with shorter lead times and exploit this to generate revenue. Additionally, in a competitive market, firms seek to maintain customer relationships while…

    Submitted 13 May, 2022; originally announced May 2022.

  30. arXiv:2205.06499  [pdf, other]

    cs.DB

    MARE: Semantic Supply Chain Disruption Management and Resilience Evaluation Framework

    Authors: Nour Ramzy, Soren Auer, Hans Ehm, Javad Chamanara

    Abstract: Supply Chains (SCs) are subject to disruptive events that potentially hinder the operational performance. Disruption Management Process (DMP) relies on the analysis of integrated heterogeneous data sources such as production scheduling, order management and logistics to evaluate the impact of disruptions on the SC. Existing approaches are limited as they address DMP process steps and corresponding…

    Submitted 13 May, 2022; originally announced May 2022.

  31. arXiv:2205.06484  [pdf, other]

    cs.DB

    SENS: Semantic Synthetic Benchmarking Model for integrated supply chain simulation and analysis

    Authors: Nour Ramzy, Soren Auer, Hans Ehm, Javad Chamanara

    Abstract: Supply Chain (SC) modeling is essential to understand and influence SC behavior, especially for increasingly globalized and complex SCs. Existing models address various SC notions, e.g., processes, tiers and production, in an isolated manner limiting enriched analysis granted by integrated information systems. Moreover, the scarcity of real-world data prevents the benchmarking of the overall SC pe…

    Submitted 13 May, 2022; originally announced May 2022.

  32. TinyGenius: Intertwining Natural Language Processing with Microtask Crowdsourcing for Scholarly Knowledge Graph Creation

    Authors: Allard Oelen, Markus Stocker, Sören Auer

    Abstract: As the number of published scholarly articles grows steadily each year, new methods are needed to organize scholarly knowledge so that it can be more efficiently discovered and used. Natural Language Processing (NLP) techniques are able to autonomously process scholarly articles at scale and to create machine readable representations of the article content. However, autonomous NLP methods are by f…

    Submitted 9 May, 2022; originally announced May 2022.

  33. arXiv:2203.14617  [pdf, other]

    cs.DL

    Enriching Scholarly Knowledge with Context

    Authors: Muhammad Haris, Markus Stocker, Sören Auer

    Abstract: Leveraging a GraphQL-based federated query service that integrates multiple scholarly communication infrastructures (specifically, DataCite, ORCID, ROR, OpenAIRE, Semantic Scholar, Wikidata and Altmetric), we develop a novel web widget based approach for the presentation of scholarly knowledge with rich contextual information. We implement the proposed approach in the Open Research Knowledge Graph…

    Submitted 28 March, 2022; originally announced March 2022.

  34. arXiv:2203.14579  [pdf, ps, other]

    cs.CL cs.DL cs.IR cs.LG

    Computer Science Named Entity Recognition in the Open Research Knowledge Graph

    Authors: Jennifer D'Souza, Sören Auer

    Abstract: Domain-specific named entity recognition (NER) on Computer Science (CS) scholarly articles is an information extraction task that is arguably more challenging for the various annotation aims that can beset the task and has been less studied than NER in the general domain. Given that significant progress has been made on NER, we believe that scholarly domain-specific NER will receive increasing att…

    Submitted 14 November, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: 15 pages, Accepted for publication as a short paper in 24th International Conference on Asia-Pacific Digital Libraries (ICADL 2022, https://icadl.net/icadl2022/)

  35. arXiv:2203.14574  [pdf, other]

    cs.DL cs.AI cs.IR

    The Digitalization of Bioassays in the Open Research Knowledge Graph

    Authors: Jennifer D'Souza, Anita Monteverdi, Muhammad Haris, Marco Anteghini, Kheir Eddine Farfar, Markus Stocker, Vitor A. P. Martins dos Santos, Sören Auer

    Abstract: Background: Recent years are seeing a growing impetus in the semantification of scholarly knowledge at the fine-grained level of scientific entities in knowledge graphs. The Open Research Knowledge Graph (ORKG) https://www.orkg.org/ represents an important step in this direction, with thousands of scholarly contributions as structured, fine-grained, machine-readable data. There is a need, however,…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 12 pages, 5 figures, In Review at DeXa 2022 https://www.dexa.org/dexa2022

  36. SmartReviews: Towards Human- and Machine-actionable Representation of Review Articles

    Authors: Allard Oelen, Markus Stocker, Sören Auer

    Abstract: Review articles are a means to structure state-of-the-art literature and to organize the growing number of scholarly publications. However, review articles are suffering from numerous limitations, weakening the impact the articles could potentially have. A key limitation is the inability of machines to access and process knowledge presented within review articles. In this work, we present SmartRev…

    Submitted 30 November, 2021; originally announced November 2021.

  37. arXiv:2111.15182  [pdf, other]

    cs.AI cs.CL cs.DL cs.LG

    Easy Semantification of Bioassays

    Authors: Marco Anteghini, Jennifer D'Souza, Vitor A. P. Martins dos Santos, Sören Auer

    Abstract: Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. We propose a solution for automatically semantifying biological assays. Our solution contrasts the problem of automated semantification as labeling versus clustering where the two methods are on opposite ends of the method complex…

    Submitted 2 December, 2021; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: 12 pages, 5 figures, Accepted for Publication in AIxIA 2021 (https://aixia2021.disco.unimib.it/home-page)

  38. Triple Classification for Scholarly Knowledge Graph Completion

    Authors: Mohamad Yaser Jaradeh, Kuldeep Singh, Markus Stocker, Sören Auer

    Abstract: Scholarly Knowledge Graphs (KGs) provide a rich source of structured information representing knowledge encoded in scientific publications. With the sheer volume of published scientific literature comprising a plethora of inhomogeneous entities and relations to describe scientific concepts, these KGs are inherently incomplete. We present exBERT, a method for leveraging pre-trained transformer lang…

    Submitted 23 November, 2021; originally announced November 2021.

  39. arXiv:2111.06827  [pdf]

    cs.CV cs.AI cs.LG

    NRC-GAMMA: Introducing a Novel Large Gas Meter Image Dataset

    Authors: Ashkan Ebadi, Patrick Paul, Sofia Auer, Stéphane Tremblay

    Abstract: Automatic meter reading technology is not yet widespread. Gas, electricity, or water accumulation meters reading is mostly done manually on-site either by an operator or by the homeowner. In some countries, the operator takes a picture as reading proof to confirm the reading by checking offline with another operator and/or using it as evidence in case of conflicts or complaints. The whole process…

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: 12 pages, 7 figures, 1 table

  40. arXiv:2110.09036  [pdf]

    cs.CL cs.AI cs.IR cs.SC

    Ranking Facts for Explaining Answers to Elementary Science Questions

    Authors: Jennifer D'Souza, Isaiah Onando Mulang', Soeren Auer

    Abstract: In multiple-choice exams, students select one answer from among typically four choices and can explain why they made that particular choice. Students are good at understanding natural language questions and based on their domain knowledge can easily infer the question's answer by 'connecting the dots' across various pertinent facts. Considering automated reasoning for elementary science question…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 25 pages, 5 figures, accepted for publication in NLE

  41. arXiv:2109.13089  [pdf, other]

    cs.CL cs.AI cs.DL

    Automated Mining of Leaderboards for Empirical AI Research

    Authors: Salomon Kabongo, Jennifer D'Souza, Sören Auer

    Abstract: With the rapid growth of research publications, empowering scientists to keep oversight over the scientific progress is of paramount importance. In this regard, the Leaderboards facet of information organization provides an overview on the state-of-the-art by aggregating empirical results from various studies addressing the same research challenge. Crowdsourcing efforts like PapersWithCode among o…

    Submitted 31 August, 2021; originally announced September 2021.

  42. arXiv:2109.05857  [pdf, other]

    cs.DL

    Federating Scholarly Infrastructures with GraphQL

    Authors: Muhammad Haris, Kheir Eddine Farfar, Markus Stocker, Sören Auer

    Abstract: A plethora of scholarly knowledge is being published on distributed scholarly infrastructures. Querying a single infrastructure is no longer sufficient for researchers to satisfy information needs. We present a GraphQL-based federated query service for executing distributed queries on numerous, heterogeneous scholarly infrastructures (currently, ORKG, DataCite and GeoNames), thus enabling the inte…

    Submitted 13 September, 2021; originally announced September 2021.

  43. arXiv:2109.00199  [pdf, ps, other]

    cs.IR cs.CL cs.DL

    Pattern-based Acquisition of Scientific Entities from Scholarly Article Titles

    Authors: Jennifer D'Souza, Soeren Auer

    Abstract: We describe a rule-based approach for the automatic acquisition of salient scientific entities from Computational Linguistics (CL) scholarly article titles. Two observations motivated the approach: (i) noting salient aspects of an article's contribution in its title; and (ii) pattern regularities capturing the salient terms that could be expressed in a set of rules. Only those lexico-syntactic pat…

    Submitted 17 September, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: 8 pages, Accepted for publication in ICADL 2021 as a short paper

  44. arXiv:2108.05085  [pdf, other]

    cs.DL cs.SE

    Researcher or Crowd Member? Why not both! The Open Research Knowledge Graph for Applying and Communicating CrowdRE Research

    Authors: Oliver Karras, Eduard C. Groen, Javed Ali Khan, Sören Auer

    Abstract: In recent decades, there has been a major shift towards improved digital access to scholarly works. However, even now that these works are available in digital form, they remain document-based, making it difficult to communicate the knowledge they contain. The next logical step is to extend these works with more flexible, fine-grained, semantic, and context-sensitive representations of scholarly k…

    Submitted 11 August, 2021; originally announced August 2021.

    Comments: Accepted for publication at 2021 IEEE 29th International Requirements Engineering Conference Workshops (REW)

  45. Demonstration of Faceted Search on Scholarly Knowledge Graphs

    Authors: Golsa Heidari, Ahmad Ramadan, Markus Stocker, Sören Auer

    Abstract: Scientists always look for the most accurate and relevant answer to their queries on the scholarly literature. Traditional scholarly search systems list documents instead of providing direct answers to the search queries. As data in knowledge graphs are not acquainted semantically, they are not machine-readable. Therefore, a search on scholarly knowledge graphs ends up in a full-text search, not a…

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 2 pages, 1 figure, WWW 2021 Demo. arXiv admin note: substantial text overlap with arXiv:2107.05447

  46. EduCOR: An Educational and Career-Oriented Recommendation Ontology

    Authors: Eleni Ilkou, Hasan Abu-Rasheed, Mohammadreza Tavakoli, Sherzod Hakimov, Gábor Kismihók, Sören Auer, Wolfgang Nejdl

    Abstract: With the increased dependence on online learning platforms and educational resource repositories, a unified representation of digital learning resources becomes essential to support a dynamic and multi-source learning experience. We introduce the EduCOR ontology, an educational, career-oriented ontology that provides a foundation for representing online learning resources for personalised learning…

    Submitted 13 July, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Accepted in the 20th International Semantic Web Conference (ISWC2021)

    ACM Class: E.2; I.2.4

  47. arXiv:2107.05447  [pdf, other]

    cs.DL

    Leveraging a Federation of Knowledge Graphs to Improve Faceted Search in Digital Libraries

    Authors: Golsa Heidari, Ahmad Ramadan, Markus Stocker, Sören Auer

    Abstract: Scientists always look for the most accurate and relevant answers to their queries in the literature. Traditional scholarly digital libraries list documents in search results, and therefore are unable to provide precise answers to search queries. In other words, search in digital libraries is metadata search and, if available, full-text search. We present a methodology for improving a faceted sear…

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 12 pages, 4 figures, TPDL 2021 conference

  48. arXiv:2107.03816  [pdf, other]

    cs.DL

    SmartReviews: Towards Human- and Machine-actionable Reviews

    Authors: Allard Oelen, Markus Stocker, Sören Auer

    Abstract: Review articles summarize state-of-the-art work and provide a means to organize the growing number of scholarly publications. However, the current review method and publication mechanisms hinder the impact review articles can potentially have. Among other limitations, reviews only provide a snapshot of the current literature and are generally not readable by machines. In this work, we identify the…

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: 1 figure

  49. arXiv:2106.07385  [pdf, other]

    cs.CL cs.AI cs.DL cs.IR cs.LG

    SemEval-2021 Task 11: NLPContributionGraph -- Structuring Scholarly NLP Contributions for a Research Knowledge Graph

    Authors: Jennifer D'Souza, Sören Auer, Ted Pedersen

    Abstract: There is currently a gap between the natural language expression of scholarly publications and their structured semantic content modeling to enable intelligent content search. With the volume of research growing exponentially every year, a search feature operating over semantically structured content is compelling. The SemEval-2021 Shared Task NLPContributionGraph (a.k.a. 'the NCG task') tasks par…

    Submitted 15 October, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 13 pages, 5 figures, 8 tables

    Journal ref: Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), (pp. 364-376), ACL

  50. arXiv:2102.10966  [pdf, other]

    cs.CL cs.IR

    Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines

    Authors: Mohamad Yaser Jaradeh, Kuldeep Singh, Markus Stocker, Andreas Both, Sören Auer

    Abstract: In the last decade, a large number of Knowledge Graph (KG) information extraction approaches were proposed. Albeit effective, these efforts are disjoint, and their collective strengths and weaknesses in effective KG information extraction (IE) have not been studied in the literature. We propose Plumber, the first framework that brings together the research community's disjoint IE efforts. The Plum…

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Accepted in ICWE 2021