Skip to main content

Showing 1–50 of 86 results for author: Lehmann, J

  1. arXiv:2406.04670  [pdf, other

    cs.CL cs.AI

    MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources

    Authors: Dongkyu Lee, Chandana Satya Prakash, Jack FitzGerald, Jens Lehmann

    Abstract: Leveraging external knowledge is crucial for achieving high performance in knowledge-intensive tasks, such as question answering. The retrieve-and-read approach is widely adopted for integrating external knowledge into a language model. However, this approach suffers from increased computational cost and latency due to the long context length, which grows proportionally with the number of retrieve… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ACL2024-Findings

  2. arXiv:2405.02750  [pdf, other

    cs.CL cs.AI

    Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

    Authors: Zheng Zhao, Emilio Monti, Jens Lehmann, Haytham Assem

    Abstract: Large language models (LLMs) tend to inadequately integrate input context during text generation, relying excessively on encoded prior knowledge in model parameters, potentially resulting in generated text with factual inconsistencies or contextually unfaithful content. LLMs utilize two primary knowledge sources: 1) prior (parametric) knowledge from pretraining, and 2) contextual (non-parametric)… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted to NAACL 2024

  3. arXiv:2404.12788  [pdf, other

    cs.CL

    REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking

    Authors: Nacime Bouziani, Shubhi Tyagi, Joseph Fisher, Jens Lehmann, Andrea Pierleoni

    Abstract: Extracting structured information from unstructured text is critical for many downstream NLP applications and is traditionally achieved by closed information extraction (cIE). However, existing approaches for cIE suffer from two limitations: (i) they are often pipelines which makes them prone to error propagation, and/or (ii) they are restricted to sentence level which prevents them from capturing… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL Industry Track 2024

  4. arXiv:2402.13703  [pdf, other

    cs.CL

    Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions?

    Authors: Alexander Arno Weber, Klaudia Thellmann, Jan Ebert, Nicolas Flores-Herr, Jens Lehmann, Michael Fromm, Mehdi Ali

    Abstract: The adaption of multilingual pre-trained Large Language Models (LLMs) into eloquent and helpful assistants is essential to facilitate their use across different language regions. In that spirit, we are the first to conduct an extensive study of the performance of multilingual models on parallel, multi-turn instruction-tuning benchmarks across a selection of the most-spoken Indo-European languages.… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 22 pages, 7 figures

  5. arXiv:2402.01539  [pdf, ps, other

    cs.FL

    Backward Responsibility in Transition Systems Using General Power Indices

    Authors: Christel Baier, Roxane van den Bossche, Sascha Klüppelholz, Johannes Lehmann, Jakob Piribauer

    Abstract: To improve reliability and the understanding of AI systems, there is increasing interest in the use of formal methods, e.g. model checking. Model checking tools produce a counterexample when a model does not satisfy a property. Understanding these counterexamples is critical for efficient debugging, as it allows the developer to focus on the parts of the program that caused the issue. To this en… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  6. arXiv:2305.12416  [pdf, other

    cs.IR

    Direct Fact Retrieval from Knowledge Graphs without Entity Linking

    Authors: Jinheon Baek, Alham Fikri Aji, Jens Lehmann, Sung Ju Hwang

    Abstract: There has been a surge of interest in utilizing Knowledge Graphs (KGs) for various natural language processing/understanding tasks. The conventional mechanism to retrieve facts in KGs usually involves three steps: entity span detection, entity disambiguation, and relation classification. However, this approach requires additional labels for training each of the three subcomponents in addition to p… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  7. Retention Is All You Need

    Authors: Karishma Mohiuddin, Mirza Ariful Alam, Mirza Mohtashim Alam, Pascal Welke, Michael Martin, Jens Lehmann, Sahar Vahdati

    Abstract: Skilled employees are the most important pillars of an organization. Despite this, most organizations face high attrition and turnover rates. While several machine learning models have been developed to analyze attrition and its causal factors, the interpretations of those models remain opaque. In this paper, we propose the HR-DSS approach, which stands for Human Resource (HR) Decision Support Sys… ▽ More

    Submitted 26 August, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted at CIKM 2023 Applied Research Track

  8. arXiv:2210.04373  [pdf, other

    cs.CL

    Contrastive Representation Learning for Conversational Question Answering over Knowledge Graphs

    Authors: Endri Kacupaj, Kuldeep Singh, Maria Maleshkova, Jens Lehmann

    Abstract: This paper addresses the task of conversational question answering (ConvQA) over knowledge graphs (KGs). The majority of existing ConvQA methods rely on full supervision signals with a strict assumption of the availability of gold logical forms of queries to extract answers from the KG. However, creating such a gold logical form is not viable for each potential question in a real-world scenario. H… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: 31st ACM International Conference on Information and Knowledge Management (CIKM 2022)

  9. arXiv:2208.06734  [pdf, other

    cs.CL

    An Answer Verbalization Dataset for Conversational Question Answerings over Knowledge Graphs

    Authors: Endri Kacupaj, Kuldeep Singh, Maria Maleshkova, Jens Lehmann

    Abstract: We introduce a new dataset for conversational question answering over Knowledge Graphs (KGs) with verbalized answers. Question answering over KGs is currently focused on answer generation for single-turn questions (KGQA) or multiple-tun conversational question answering (ConvQA). However, in a real-world scenario (e.g., voice assistants such as Siri, Alexa, and Google Assistant), users prefer verb… ▽ More

    Submitted 13 August, 2022; originally announced August 2022.

  10. arXiv:2208.02743  [pdf, other

    cs.CL cs.AI

    Integrating Knowledge Graph embedding and pretrained Language Models in Hypercomplex Spaces

    Authors: Mojtaba Nayyeri, Zihao Wang, Mst. Mahfuja Akter, Mirza Mohtashim Alam, Md Rashad Al Hasan Rony, Jens Lehmann, Steffen Staab

    Abstract: Knowledge Graphs, such as Wikidata, comprise structural and textual knowledge in order to represent knowledge. For each of the two modalities dedicated approaches for graph embedding and language models learn patterns that allow for predicting novel structural knowledge. Few approaches have integrated learning and inference with both modalities and these existing ones could only partially exploit… ▽ More

    Submitted 16 August, 2023; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: ISWC2023 version

  11. arXiv:2206.13354  [pdf, other

    cs.CL cs.AI

    Transformer with Tree-order Encoding for Neural Program Generation

    Authors: Klaudia-Doris Thellmann, Bernhard Stadler, Ricardo Usbeck, Jens Lehmann

    Abstract: While a considerable amount of semantic parsing approaches have employed RNN architectures for code generation tasks, there have been only few attempts to investigate the applicability of Transformers for this task. Including hierarchical information of the underlying programming language syntax has proven to be effective for code generation. Since the positional encoding of the Transformer can on… ▽ More

    Submitted 30 May, 2022; originally announced June 2022.

    Comments: This paper was authored in late 2020 and early 2021 for the most part

    MSC Class: 68T07; 68T50 ACM Class: I.2.7

  12. arXiv:2204.09149  [pdf, other

    cs.CL cs.AI

    DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation

    Authors: Md Rashad Al Hasan Rony, Ricardo Usbeck, Jens Lehmann

    Abstract: Task-oriented dialogue generation is challenging since the underlying knowledge is often dynamic and effectively incorporating knowledge into the learning process is hard. It is particularly challenging to generate both human-like and informative responses in this setting. Recent research primarily focused on various knowledge distillation methods where the underlying relationship between the fact… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: Accepted by the North American Chapter of the Association for Computational Linguistics (NAACL) 2022

  13. arXiv:2203.09183  [pdf, other

    cs.CL cs.AI

    RoMe: A Robust Metric for Evaluating Natural Language Generation

    Authors: Md Rashad Al Hasan Rony, Liubov Kovriguina, Debanjan Chaudhuri, Ricardo Usbeck, Jens Lehmann

    Abstract: Evaluating Natural Language Generation (NLG) systems is a challenging task. Firstly, the metric should ensure that the generated hypothesis reflects the reference's semantics. Secondly, it should consider the grammatical quality of the generated sentence. Thirdly, it should be robust enough to handle various surface forms of the generated sentence. Thus, an effective evaluation metric has to be mu… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted by the Association for Computational Linguistics (ACL) 2022

  14. arXiv:2203.04703  [pdf, other

    cs.AI cs.LG

    LEMON: LanguagE ModeL for Negative Sampling of Knowledge Graph Embeddings

    Authors: Md Rashad Al Hasan Rony, Mirza Mohtashim Alam, Semab Ali, Jens Lehmann, Sahar Vahdati

    Abstract: Knowledge Graph Embedding models have become an important area of machine learning.Those models provide a latent representation of entities and relations in a knowledge graph which can then be used in downstream machine learning tasks such as link prediction. The learning process of such models can be performed by contrasting positive and negative triples. While all triples of a KG are considered… ▽ More

    Submitted 15 October, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

  15. Time-aware Graph Neural Networks for Entity Alignment between Temporal Knowledge Graphs

    Authors: Chengjin Xu, Fenglong Su, Jens Lehmann

    Abstract: Entity alignment aims to identify equivalent entity pairs between different knowledge graphs (KGs). Recently, the availability of temporal KGs (TKGs) that contain time information created the need for reasoning over time in such TKGs. Existing embedding-based entity alignment approaches disregard time information that commonly exists in many large-scale KGs, leaving much room for improvement. In t… ▽ More

    Submitted 13 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: Accepted at EMNLP2021

  16. Geometric Algebra based Embeddings for Static and Temporal Knowledge Graph Completion

    Authors: Chengjin Xu, Mojtaba Nayyeri, Yung-Yu Chen, Jens Lehmann

    Abstract: Recent years, Knowledge Graph Embeddings (KGEs) have shown promising performance on link prediction tasks by mapping the entities and relations from a Knowledge Graph (KG) into a geometric space and thus have gained increasing attentions. In addition, many recent Knowledge Graphs involve evolving data, e.g., the fact (\textit{Obama}, \textit{PresidentOf}, \textit{USA}) is valid only from 2009 to 2… ▽ More

    Submitted 25 February, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: There are some theorem mistakes in the Appendix section need to be fixed. And we are still trying to solve them. We submitted the Arxiv version for providing the supplementary analysis, but now we hope to withdraw the current version to avoid misleading the readers from Arxiv

  17. arXiv:2112.07606  [pdf, ps, other

    cs.CL cs.AI

    Semantic Answer Type and Relation Prediction Task (SMART 2021)

    Authors: Nandana Mihindukulasooriya, Mohnish Dubey, Alfio Gliozzo, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Ricardo Usbeck, Gaetano Rossiello, Uttam Kumar

    Abstract: Each year the International Semantic Web Conference organizes a set of Semantic Web Challenges to establish competitions that will advance state-of-the-art solutions in some problem domains. The Semantic Answer Type and Relation Prediction Task (SMART) task is one of the ISWC 2021 Semantic Web challenges. This is the second year of the challenge after a successful SMART 2020 at ISWC 2020. This yea… ▽ More

    Submitted 10 January, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    ACM Class: F.4.1; I.2.4; I.2.7

  18. arXiv:2112.01989  [pdf, other

    cs.CL cs.AI cs.LG

    Survey on English Entity Linking on Wikidata

    Authors: Cedric Möller, Jens Lehmann, Ricardo Usbeck

    Abstract: Wikidata is a frequently updated, community-driven, and multilingual knowledge graph. Hence, Wikidata is an attractive basis for Entity Linking, which is evident by the recent increase in published papers. This survey focuses on four subjects: (1) Which Wikidata Entity Linking datasets exist, how widely used are they and how are they constructed? (2) Do the characteristics of Wikidata matter for t… ▽ More

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: Disclaimer: Cedric Möller, Jens Lehmann, Ricardo Usbeck, 2021. The definitive, peer reviewed and edited version of this article is published in the Semantic Web Journal, Special issue: Latest Advancements in Linguistic 3 Linked Data, 2021

  19. Geolog: Scalable Logic Programming on Spatial Data

    Authors: Tobias Grubenmann, Jens Lehmann

    Abstract: Spatial data is ubiquitous in our data-driven society. The Logic Programming community has been investigating the use of spatial data in different settings. Despite the success of this research, the Geographic Information System (GIS) community has rarely made use of these new approaches. This has mainly two reasons. First, there is a lack of tools that tightly integrate logical reasoning into sta… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: In Proceedings ICLP 2021, arXiv:2109.07914

    Journal ref: EPTCS 345, 2021, pp. 191-204

  20. arXiv:2107.04894  [pdf, other

    cs.LG

    Improving Inductive Link Prediction Using Hyper-Relational Facts

    Authors: Mehdi Ali, Max Berrendorf, Mikhail Galkin, Veronika Thost, Tengfei Ma, Volker Tresp, Jens Lehmann

    Abstract: For many years, link prediction on knowledge graphs (KGs) has been a purely transductive task, not allowing for reasoning on unseen entities. Recently, increasing efforts are put into exploring semi- and fully inductive scenarios, enabling inference over unseen and emerging entities. Still, all these approaches only consider triple-based \glspl{kg}, whereas their richer counterparts, hyper-relatio… ▽ More

    Submitted 10 July, 2021; originally announced July 2021.

  21. arXiv:2107.03297  [pdf, other

    cs.AI cs.CL cs.DL cs.LG

    Trans4E: Link Prediction on Scholarly Knowledge Graphs

    Authors: Mojtaba Nayyeri, Gokce Muge Cil, Sahar Vahdati, Francesco Osborne, Mahfuzur Rahman, Simone Angioni, Angelo Salatino, Diego Reforgiato Recupero, Nadezhda Vassilyeva, Enrico Motta, Jens Lehmann

    Abstract: The incompleteness of Knowledge Graphs (KGs) is a crucial issue affecting the quality of AI-based services. In the scholarly domain, KGs describing research publications typically lack important information, hindering our ability to analyse and predict research dynamics. In recent years, link prediction approaches based on Knowledge Graph Embedding models became the first aid for this issue. In th… ▽ More

    Submitted 3 July, 2021; originally announced July 2021.

  22. arXiv:2106.13316  [pdf, other

    cs.CL

    VOGUE: Answer Verbalization through Multi-Task Learning

    Authors: Endri Kacupaj, Shyamnath Premnadh, Kuldeep Singh, Jens Lehmann, Maria Maleshkova

    Abstract: In recent years, there have been significant developments in Question Answering over Knowledge Graphs (KGQA). Despite all the notable advancements, current KGQA systems only focus on answer generation techniques and not on answer verbalization. However, in real-world scenarios (e.g., voice assistants such as Alexa, Siri, etc.), users prefer verbalized answers instead of a generated response. This… ▽ More

    Submitted 28 June, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2021

  23. arXiv:2106.07472  [pdf, ps, other

    cs.LG math.OC stat.ML

    Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation

    Authors: Anas Barakat, Pascal Bianchi, Julien Lehmann

    Abstract: Actor-critic methods integrating target networks have exhibited a stupendous empirical success in deep reinforcement learning. However, a theoretical understanding of the use of target networks in actor-critic methods is largely missing in the literature. In this paper, we reduce this gap between theory and practice by proposing the first theoretical analysis of an online target-based actor-critic… ▽ More

    Submitted 22 February, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 50 pages

    Journal ref: AISTATS 2022

  24. arXiv:2105.11407  [pdf, other

    cs.CL

    VANiLLa : Verbalized Answers in Natural Language at Large Scale

    Authors: Debanjali Biswas, Mohnish Dubey, Md Rashad Al Hasan Rony, Jens Lehmann

    Abstract: In the last years, there have been significant developments in the area of Question Answering over Knowledge Graphs (KGQA). Despite all the notable advancements, current KGQA datasets only provide the answers as the direct output result of the formal query, rather than full sentences incorporating question context. For achieving coherent answers sentence with the question's vocabulary, template-ba… ▽ More

    Submitted 24 May, 2021; originally announced May 2021.

  25. arXiv:2104.14994  [pdf, other

    cs.IR cs.MM

    GeoWINE: Geolocation based Wiki, Image,News and Event Retrieval

    Authors: Golsa Tahmasebzadeh, Endri Kacupaj, Eric Müller-Budack, Sherzod Hakimov, Jens Lehmann, Ralph Ewerth

    Abstract: In the context of social media, geolocation inference on news or events has become a very important task. In this paper, we present the GeoWINE (Geolocation-based Wiki-Image-News-Event retrieval) demonstrator, an effective modular system for multimodal retrieval which expects only a single image as input. The GeoWINE system consists of five modules in order to retrieve related information from var… ▽ More

    Submitted 4 May, 2021; v1 submitted 30 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in: International ACM SIGIR Conference on Research and Development in Information Retrieval 2021

  26. arXiv:2104.05003  [pdf, other

    cs.AI

    Multiple Run Ensemble Learning with Low-Dimensional Knowledge Graph Embeddings

    Authors: Chengjin Xu, Mojtaba Nayyeri, Sahar Vahdati, Jens Lehmann

    Abstract: Among the top approaches of recent years, link prediction using knowledge graph embedding (KGE) models has gained significant attention for knowledge graph completion. Various embedding models have been proposed so far, among which, some recent KGE models obtain state-of-the-art performance on link prediction tasks by using embeddings with a high dimension (e.g. 1000) which accelerate the costs of… ▽ More

    Submitted 30 May, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted by the 2021 International Joint Conference on Neural Networks (IJCNN 2021)

  27. arXiv:2104.01569  [pdf, other

    cs.CL

    Conversational Question Answering over Knowledge Graphs with Transformer and Graph Attention Networks

    Authors: Endri Kacupaj, Joan Plepi, Kuldeep Singh, Harsh Thakkar, Jens Lehmann, Maria Maleshkova

    Abstract: This paper addresses the task of (complex) conversational question answering over a knowledge graph. For this task, we propose LASAGNE (muLti-task semAntic parSing with trAnsformer and Graph atteNtion nEtworks). It is the first approach, which employs a transformer architecture extended with Graph Attention Networks for multi-task neural semantic parsing. LASAGNE uses a transformer model for gener… ▽ More

    Submitted 24 June, 2021; v1 submitted 4 April, 2021; originally announced April 2021.

    Comments: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL 2021)

  28. arXiv:2103.16289  [pdf, other

    cs.CL

    Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers

    Authors: Debanjan Chaudhuri, Md Rashad Al Hasan Rony, Jens Lehmann

    Abstract: Generating knowledge grounded responses in both goal and non-goal oriented dialogue systems is an important research challenge. Knowledge Graphs (KG) can be viewed as an abstraction of the real world, which can potentially facilitate a dialogue system to produce knowledge grounded responses. However, integrating KGs into the dialogue generation process in an end-to-end manner is a non-trivial task… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: 16 pages, 3 figures, accepted at ESWC 2021

  29. arXiv:2103.07771  [pdf, other

    cs.CL

    ParaQA: A Question Answering Dataset with Paraphrase Responses for Single-Turn Conversation

    Authors: Endri Kacupaj, Barshana Banerjee, Kuldeep Singh, Jens Lehmann

    Abstract: This paper presents ParaQA, a question answering (QA) dataset with multiple paraphrased responses for single-turn conversation over knowledge graphs (KG). The dataset was created using a semi-automated framework for generating diverse paraphrasing of the answers using techniques such as back-translation. The existing datasets for conversational question answering over KGs (single-turn/multi-turn)… ▽ More

    Submitted 13 March, 2021; originally announced March 2021.

    Comments: 18th Extended Semantic Web Conference 2021 (ESWC'2021) - Resources Track

  30. arXiv:2103.07766  [pdf, other

    cs.CL

    Context Transformer with Stacked Pointer Networks for Conversational Question Answering over Knowledge Graphs

    Authors: Joan Plepi, Endri Kacupaj, Kuldeep Singh, Harsh Thakkar, Jens Lehmann

    Abstract: Neural semantic parsing approaches have been widely used for Question Answering (QA) systems over knowledge graphs. Such methods provide the flexibility to handle QA datasets with complex queries and a large number of entities. In this work, we propose a novel framework named CARTON, which performs multi-task semantic parsing for handling the problem of conversational question answering over a lar… ▽ More

    Submitted 24 June, 2021; v1 submitted 13 March, 2021; originally announced March 2021.

    Comments: 18th Extended Semantic Web Conference 2021 (ESWC'2021) - Research Track

  31. arXiv:2101.09969  [pdf, other

    cs.CL

    CHOLAN: A Modular Approach for Neural Entity Linking on Wikipedia and Wikidata

    Authors: Manoj Prabhakar Kannan Ravi, Kuldeep Singh, Isaiah Onando Mulang', Saeedeh Shekarpour, Johannes Hoffart, Jens Lehmann

    Abstract: In this paper, we propose CHOLAN, a modular approach to target end-to-end entity linking (EL) over knowledge bases. CHOLAN consists of a pipeline of two transformer-based models integrated sequentially to accomplish the EL task. The first transformer model identifies surface forms (entity mentions) in a given text. For each mention, a second transformer model is employed to classify the target ent… ▽ More

    Submitted 8 February, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: accepted in EACL 2021 (full paper)

  32. arXiv:2012.00555  [pdf, ps, other

    cs.AI cs.CL cs.IR

    SeMantic AnsweR Type prediction task (SMART) at ISWC 2020 Semantic Web Challenge

    Authors: Nandana Mihindukulasooriya, Mohnish Dubey, Alfio Gliozzo, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Ricardo Usbeck

    Abstract: Each year the International Semantic Web Conference accepts a set of Semantic Web Challenges to establish competitions that will advance the state of the art solutions in any given problem domain. The SeMantic AnsweR Type prediction task (SMART) was part of ISWC 2020 challenges. Question type and answer type prediction can play a key role in knowledge base question answering systems providing insi… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  33. arXiv:2011.00983  [pdf, other

    cs.LO

    Out of Control: Reducing Probabilistic Models by Control-State Elimination

    Authors: Tobias Winkler, Johannes Lehmann, Joost-Pieter Katoen

    Abstract: State-of-the-art probabilistic model checkers perform verification on explicit-state Markov models defined in a high-level programming formalism like the PRISM modeling language. Typically, the low-level models resulting from such program-like specifications exhibit lots of structure such as repeating subpatterns. Established techniques like probabilistic bisimulation minimization are able to expl… ▽ More

    Submitted 12 November, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: full version including proofs, 33 pages

  34. arXiv:2010.06684  [pdf, other

    cs.LG cs.AI

    Motif Learning in Knowledge Graphs Using Trajectories Of Differential Equations

    Authors: Mojtaba Nayyeri, Chengjin Xu, Jens Lehmann, Sahar Vahdati

    Abstract: Knowledge Graph Embeddings (KGEs) have shown promising performance on link prediction tasks by mapping the entities and relations from a knowledge graph into a geometric space (usually a vector space). Ultimately, the plausibility of the predicted links is measured by using a scoring function over the learned embeddings (vectors). Therefore, the capability in preserving graph characteristics inclu… ▽ More

    Submitted 18 October, 2020; v1 submitted 13 October, 2020; originally announced October 2020.

  35. arXiv:2010.01029  [pdf, other

    cs.CL cs.AI cs.LG

    TeRo: A Time-aware Knowledge Graph Embedding via Temporal Rotation

    Authors: Chengjin Xu, Mojtaba Nayyeri, Fouad Alkhoury, Hamed Shariat Yazdi, Jens Lehmann

    Abstract: In the last few years, there has been a surge of interest in learning representations of entitiesand relations in knowledge graph (KG). However, the recent availability of temporal knowledgegraphs (TKGs) that contain time information for each fact created the need for reasoning overtime in such TKGs. In this regard, we present a new approach of TKG embedding, TeRo, which defines the temporal evolu… ▽ More

    Submitted 24 October, 2020; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: This paper is accepted by COLING2020

  36. arXiv:2010.00989  [pdf, other

    cs.LG cs.AI stat.ML

    Knowledge Graph Embeddings in Geometric Algebras

    Authors: Chengjin Xu, Mojtaba Nayyeri, Yung-Yu Chen, Jens Lehmann

    Abstract: Knowledge graph (KG) embedding aims at embedding entities and relations in a KG into a lowdimensional latent representation space. Existing KG embedding approaches model entities andrelations in a KG by utilizing real-valued , complex-valued, or hypercomplex-valued (Quaternionor Octonion) representations, all of which are subsumed into a geometric algebra. In this work,we introduce a novel geometr… ▽ More

    Submitted 22 March, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: This paper is accepted by COLING2020

  37. arXiv:2009.10847  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Message Passing for Hyper-Relational Knowledge Graphs

    Authors: Mikhail Galkin, Priyansh Trivedi, Gaurav Maheshwari, Ricardo Usbeck, Jens Lehmann

    Abstract: Hyper-relational knowledge graphs (KGs) (e.g., Wikidata) enable associating additional key-value pairs along with the main triple to disambiguate, or restrict the validity of a fact. In this work, we propose a message passing based graph encoder - StarE capable of modeling such hyper-relational KGs. Unlike existing approaches, StarE can encode an arbitrary number of additional information (qualifi… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: Accepted to EMNLP 2020

  38. arXiv:2009.00106  [pdf, other

    cs.CL cs.AI

    PNEL: Pointer Network based End-To-End Entity Linking over Knowledge Graphs

    Authors: Debayan Banerjee, Debanjan Chaudhuri, Mohnish Dubey, Jens Lehmann

    Abstract: Question Answering systems are generally modelled as a pipeline consisting of a sequence of steps. In such a pipeline, Entity Linking (EL) is often the first step. Several EL models first perform span detection and then entity disambiguation. In such models errors from the span detection phase cascade to later steps and result in a drop of overall accuracy. Moreover, lack of gold entity spans in t… ▽ More

    Submitted 31 August, 2020; originally announced September 2020.

  39. arXiv:2008.11450  [pdf, other

    cs.LG stat.ML

    Training Multimodal Systems for Classification with Multiple Objectives

    Authors: Jason Armitage, Shramana Thakur, Rishi Tripathi, Jens Lehmann, Maria Maleshkova

    Abstract: We learn about the world from a diverse range of sensory information. Automated systems lack this ability as investigation has centred on processing information presented in a single form. Adapting architectures to learn from multiple modalities creates the potential to learn rich representations of the world - but current multimodal systems only deliver marginal improvements on unimodal approache… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

    Journal ref: Proceedings of the 1st International Workshop on Cross-lingual Event-centric Open Analytics co-located with the 17th Extended Semantic Web Conference (ESWC 2020)

  40. MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities

    Authors: Jason Armitage, Endri Kacupaj, Golsa Tahmasebzadeh, Swati, Maria Maleshkova, Ralph Ewerth, Jens Lehmann

    Abstract: In this paper, we introduce the MLM (Multiple Languages and Modalities) dataset - a new resource to train and evaluate multitask systems on samples in multiple modalities and three languages. The generation process and inclusion of semantic data provide a resource that further tests the ability for multitask systems to learn relationships between entities. The dataset is designed for researchers a… ▽ More

    Submitted 4 September, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

    Journal ref: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 2967-2974. 2020

  41. Evaluating the Impact of Knowledge Graph Context on Entity Disambiguation Models

    Authors: Isaiah Onando Mulang', Kuldeep Singh, Chaitali Prabhu, Abhishek Nadgeri, Johannes Hoffart, Jens Lehmann

    Abstract: Pretrained Transformer models have emerged as state-of-the-art approaches that learn contextual information from text to improve the performance of several NLP tasks. These models, albeit powerful, still require specialized knowledge in specific scenarios. In this paper, we argue that context derived from a knowledge graph (in our case: Wikidata) provides enough signals to inform pretrained transf… ▽ More

    Submitted 30 August, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: to appear in proceedings of CIKM 2020

    ACM Class: I.1.2

    Journal ref: CIKM 2020

  42. arXiv:2007.14175  [pdf, ps, other

    cs.LG cs.AI stat.ML

    PyKEEN 1.0: A Python Library for Training and Evaluating Knowledge Graph Embeddings

    Authors: Mehdi Ali, Max Berrendorf, Charles Tapley Hoyt, Laurent Vermue, Sahand Sharifzadeh, Volker Tresp, Jens Lehmann

    Abstract: Recently, knowledge graph embeddings (KGEs) received significant attention, and several software libraries have been developed for training and evaluating KGEs. While each of them addresses specific needs, we re-designed and re-implemented PyKEEN, one of the first KGE libraries, in a community effort. PyKEEN 1.0 enables users to compose knowledge graph embedding models (KGEMs) based on a wide rang… ▽ More

    Submitted 30 July, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

  43. arXiv:2007.09668  [pdf, other

    cs.LG stat.ML

    Improving the Long-Range Performance of Gated Graph Neural Networks

    Authors: Denis Lukovnikov, Jens Lehmann, Asja Fischer

    Abstract: Many popular variants of graph neural networks (GNNs) that are capable of handling multi-relational graphs may suffer from vanishing gradients. In this work, we propose a novel GNN architecture based on the Gated Graph Neural Network with an improved ability to handle long-range dependencies in multi-relational graphs. An experimental analysis on different synthetic tasks demonstrates that the pro… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

  44. arXiv:2006.13365  [pdf, other

    cs.LG cs.AI stat.ML

    Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework

    Authors: Mehdi Ali, Max Berrendorf, Charles Tapley Hoyt, Laurent Vermue, Mikhail Galkin, Sahand Sharifzadeh, Asja Fischer, Volker Tresp, Jens Lehmann

    Abstract: The heterogeneity in recently published knowledge graph embedding models' implementations, training, and evaluation has made fair and thorough comparisons difficult. In order to assess the reproducibility of previously published results, we re-implemented and evaluated 21 interaction models in the PyKEEN software package. Here, we outline which results could be reproduced with their reported hyper… ▽ More

    Submitted 1 November, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

  45. IQA: Interactive Query Construction in Semantic Question Answering Systems

    Authors: Hamid Zafar, Mohnish Dubey, Jens Lehmann, Elena Demidova

    Abstract: Semantic Question Answering (SQA) systems automatically interpret user questions expressed in a natural language in terms of semantic queries. This process involves uncertainty, such that the resulting queries do not always accurately match the user intent, especially for more complex and less common questions. In this article, we aim to empower users in guiding SQA systems towards the intended se… ▽ More

    Submitted 25 June, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Journal ref: Journal of Web Semantics Volume 64, October 2020, 100586

  46. arXiv:2006.04986  [pdf, other

    cs.AI cs.LG

    5* Knowledge Graph Embeddings with Projective Transformations

    Authors: Mojtaba Nayyeri, Sahar Vahdati, Can Aykul, Jens Lehmann

    Abstract: Performing link prediction using knowledge graph embedding models has become a popular approach for knowledge graph completion. Such models employ a transformation function that maps nodes via edges into a vector space in order to measure the likelihood of the links. While mapping the individual nodes, the structure of subgraphs is also transformed. Most of the embedding models designed in Euclide… ▽ More

    Submitted 14 March, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Accepted in AAAI 2021

  47. arXiv:2006.04556  [pdf, other

    cs.AI cs.DB cs.LG

    Unveiling Relations in the Industry 4.0 Standards Landscape based on Knowledge Graph Embeddings

    Authors: Ariam Rivas, Irlán Grangel-González, Diego Collarana, Jens Lehmann, Maria-Esther Vidal

    Abstract: Industry~4.0 (I4.0) standards and standardization frameworks have been proposed with the goal of \emph{empowering interoperability} in smart factories. These standards enable the description and interaction of the main components, systems, and processes inside of a smart factory. Due to the growing number of frameworks and standards, there is an increasing need for approaches that automatically an… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

    Comments: 15 pages, 7 figures, DEXA2020 Conference

  48. arXiv:2002.11143  [pdf, other

    cs.CL

    End-to-End Entity Linking and Disambiguation leveraging Word and Knowledge Graph Embeddings

    Authors: Rostislav Nedelchev, Debanjan Chaudhuri, Jens Lehmann, Asja Fischer

    Abstract: Entity linking - connecting entity mentions in a natural language utterance to knowledge graph (KG) entities is a crucial step for question answering over KGs. It is often based on measuring the string similarity between the entity label and its mention in the question. The relation referred to in the question can help to disambiguate between entities with the same label. This can be misleading if… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

  49. arXiv:2001.11985  [pdf, other

    cs.CL cs.AI cs.LG

    Pretrained Transformers for Simple Question Answering over Knowledge Graphs

    Authors: D. Lukovnikov, A. Fischer, J. Lehmann

    Abstract: Answering simple questions over knowledge graphs is a well-studied problem in question answering. Previous approaches for this task built on recurrent and convolutional neural network based architectures that use pretrained word embeddings. It was recently shown that finetuning pretrained transformer networks (e.g. BERT) can outperform previous approaches on various natural language processing tas… ▽ More

    Submitted 31 January, 2020; originally announced January 2020.

  50. arXiv:2001.11499  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Estimating and abstracting the 3D structure of bones using neural networks on X-ray (2D) images

    Authors: Jana Čavojská, Julian Petrasch, Nicolas J. Lehmann, Agnès Voisard, Peter Böttcher

    Abstract: In this paper, we present a deep-learning based method for estimating the 3D structure of a bone from a pair of 2D X-ray images. Our triplet loss-trained neural network selects the most closely matching 3D bone shape from a predefined set of shapes. Our predictions have an average root mean square (RMS) distance of 1.08 mm between the predicted and true shapes, making it more accurate than the ave… ▽ More

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: 13 pages, 5 figures, 1 table, submitted to Communications Biology

    Journal ref: Communications biology, 2020, 3(1), pp.1-13