
Showing 1–5 of 5 results for author: Otto, W

  1. arXiv:2404.08443  [pdf, other]

    cs.DL cs.IR

    Toward FAIR Semantic Publishing of Research Dataset Metadata in the Open Research Knowledge Graph

    Authors: Raia Abu Ahmad, Jennifer D'Souza, Matthäus Zloch, Wolfgang Otto, Georg Rehm, Allard Oelen, Stefan Dietze, Sören Auer

    Abstract: Search engines these days can serve datasets as search results. Datasets get picked up by search technologies based on structured descriptions on their official web pages, informed by metadata ontologies such as the Dataset content type of schema.org. Despite this promotion of the content type dataset as a first-class citizen of search results, a vast proportion of datasets, particularly research…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 8 pages, 1 figure, published in the Joint Proceedings of the Onto4FAIR 2023 Workshops

    Journal ref: In Joint Proceedings of the Onto4FAIR 2023 Workshops: Collocated with FOIS 2023 and SEMANTICS 2023. pp.23-31. https://hal.science/hal-04312604

  2. arXiv:2404.05587  [pdf, ps, other]

    cs.CL

    Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models

    Authors: Wolfgang Otto, Sharmila Upadhyaya, Stefan Dietze

    Abstract: This paper describes our participation in the Shared Task on Software Mentions Disambiguation (SOMD), with a focus on improving relation extraction in scholarly texts through generative Large Language Models (LLMs) using single-choice question-answering. The methodology prioritises the use of in-context learning capabilities of GLMs to extract software-related entities and their descriptive attrib…

    Submitted 19 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted at: 1st Workshop on Natural Scientific Language Processing and Research Knowledge Graphs (NSLP 2024) Co-located with Extended Semantic Web Conference (ESWC 2024)

    ACM Class: I.2.7

  3. arXiv:2311.09860  [pdf, other]

    cs.CL

    GSAP-NER: A Novel Task, Corpus, and Baseline for Scholarly Entity Extraction Focused on Machine Learning Models and Datasets

    Authors: Wolfgang Otto, Matthäus Zloch, Lu Gan, Saurav Karmakar, Stefan Dietze

    Abstract: Named Entity Recognition (NER) models play a crucial role in various NLP tasks, including information extraction (IE) and text understanding. In academic writing, references to machine learning models and datasets are fundamental components of various computer science publications and necessitate accurate models for identification. Despite the advancements in NER, existing ground truth datasets do…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 10 pages, 1 figure, Accepted at EMNLP2023-Findings

  4. arXiv:1906.04484  [pdf, other]

    cs.DL cs.IR

    EXmatcher: Combining Features Based on Reference Strings and Segments to Enhance Citation Matching

    Authors: Behnam Ghavimi, Wolfgang Otto, Philipp Mayr

    Abstract: Citation matching is a challenging task due to different problems such as the variety of citation styles, mistakes in reference strings and the quality of identified reference segments. The classic citation matching configuration used in this paper is the combination of blocking technique and a binary classifier. Three different possible inputs (reference strings, reference segments and a combinat…

    Submitted 11 June, 2019; originally announced June 2019.

  5. arXiv:1903.11693  [pdf]

    cs.DL

    Highly cited references in PLOS ONE and their in-text usage over time

    Authors: Wolfgang Otto, Behnam Ghavimi, Philipp Mayr, Rajesh Piryani, Vivek Kumar Singh

    Abstract: In this article, we describe highly cited publications in a PLOS ONE full-text corpus. For these publications, we analyse the citation contexts concerning their position in the text and their age at the time of citing. By selecting the perspective of highly cited papers, we can distinguish them based on the context during citation even if we do not have any other information source or metrics. We…

    Submitted 9 October, 2019; v1 submitted 27 March, 2019; originally announced March 2019.

    Comments: 6 pages, 3 figures, revised research-in-progress paper accepted at the 17th International Conference on Scientometrics & Informetrics (ISSI 2019), Rome, Italy