Skip to main content

Showing 1–13 of 13 results for author: Gashteovski, K

  1. arXiv:2406.12494  [pdf, other

    cs.CL

    LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization

    Authors: Masafumi Enomoto, Kunihiro Takeoka, Kosuke Akimoto, Kiril Gashteovski, Masafumi Oyamada

    Abstract: Open-Domain Multi-Document Summarization (ODMDS) is crucial for addressing diverse information needs, which aims to generate a summary as answer to user's query, synthesizing relevant content from multiple documents in a large collection. Existing approaches that first find relevant passages and then generate a summary using a language model are inadequate for ODMDS. This is because open-ended que… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 13 pages, 3 figures

  2. arXiv:2404.06411  [pdf, other

    cs.AI cs.CL

    AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents

    Authors: Luca Gioacchini, Giuseppe Siracusano, Davide Sanvito, Kiril Gashteovski, David Friede, Roberto Bifulco, Carolin Lawrence

    Abstract: The advances made by Large Language Models (LLMs) have led to the pursuit of LLM agents that can solve intricate, multi-step reasoning tasks. As with any research pursuit, benchmarking and evaluation are key corner stones to efficient and reliable progress. However, existing benchmarks are often narrow and simply compute overall task success. To face these issues, we propose AgentQuest -- a framew… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted at the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2024)

  3. arXiv:2311.06647  [pdf, other

    cs.CL

    Robust Text Classification: Analyzing Prototype-Based Networks

    Authors: Zhivar Sourati, Darshan Deshpande, Filip Ilievski, Kiril Gashteovski, Sascha Saralajew

    Abstract: Downstream applications often require text classification models to be accurate and robust. While the accuracy of the state-of-the-art Language Models (LMs) approximates human performance, they often exhibit a drop in performance on noisy data found in the real world. This lack of robustness can be concerning, as even small perturbations in the text, irrelevant to the target task, can cause classi… ▽ More

    Submitted 17 June, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

  4. arXiv:2310.14909  [pdf, other

    cs.CL cs.AI cs.LG

    Linking Surface Facts to Large-Scale Knowledge Graphs

    Authors: Gorjan Radevski, Kiril Gashteovski, Chia-Chien Hung, Carolin Lawrence, Goran Glavaš

    Abstract: Open Information Extraction (OIE) methods extract facts from natural language text in the form of ("subject"; "relation"; "object") triples. These facts are, however, merely surface forms, the ambiguity of which impedes their downstream usage; e.g., the surface phrase "Michael Jordan" may refer to either the former basketball player or the university professor. Knowledge Graphs (KGs), on the other… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  5. arXiv:2307.00524  [pdf, other

    cs.CL

    Large Language Models Enable Few-Shot Clustering

    Authors: Vijay Viswanathan, Kiril Gashteovski, Carolin Lawrence, Tongshuang Wu, Graham Neubig

    Abstract: Unlike traditional unsupervised clustering, semi-supervised clustering allows users to provide meaningful structure to the data, which helps the clustering algorithm to match the user's intent. Existing approaches to semi-supervised clustering require a significant amount of feedback from an expert to improve the clusters. In this paper, we ask whether a large language model can amplify an expert'… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

  6. arXiv:2305.14163  [pdf, other

    cs.CL cs.LG

    Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection

    Authors: David Dukić, Kiril Gashteovski, Goran Glavaš, Jan Šnajder

    Abstract: Event detection is a crucial information extraction task in many domains, such as Wikipedia or news. The task typically relies on trigger detection (TD) -- identifying token spans in the text that evoke specific events. While the notion of triggers should ideally be universal across domains, domain transfer for TD from high- to low-resource domains results in significant performance drops. We addr… ▽ More

    Submitted 1 February, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at EACL 2024 Findings

  7. arXiv:2208.11024  [pdf, other

    cs.AI

    KGxBoard: Explainable and Interactive Leaderboard for Evaluation of Knowledge Graph Completion Models

    Authors: Haris Widjaja, Kiril Gashteovski, Wiem Ben Rim, Pengfei Liu, Christopher Malon, Daniel Ruffinelli, Carolin Lawrence, Graham Neubig

    Abstract: Knowledge Graphs (KGs) store information in the form of (head, predicate, tail)-triples. To augment KGs with new knowledge, researchers proposed models for KG Completion (KGC) tasks such as link prediction; i.e., answering (h; p; ?) or (?; p; t) queries. Such models are usually evaluated with averaged metrics on a held-out test set. While useful for tracking progress, averaged single-score metrics… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

  8. arXiv:2207.04447  [pdf, other

    cs.CL

    Human-Centric Research for NLP: Towards a Definition and Guiding Questions

    Authors: Bhushan Kotnis, Kiril Gashteovski, Julia Gastinger, Giuseppe Serra, Francesco Alesiani, Timo Sztyler, Ammar Shaker, Na Gong, Carolin Lawrence, Zhao Xu

    Abstract: With Human-Centric Research (HCR) we can steer research activities so that the research outcome is beneficial for human stakeholders, such as end users. But what exactly makes research human-centric? We address this question by providing a working definition and define how a research pipeline can be split into different stages in which human-centric components can be added. Additionally, we discus… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

  9. arXiv:2205.12749  [pdf, other

    cs.AI cs.HC

    A Human-Centric Assessment Framework for AI

    Authors: Sascha Saralajew, Ammar Shaker, Zhao Xu, Kiril Gashteovski, Bhushan Kotnis, Wiem Ben Rim, Jürgen Quittek, Carolin Lawrence

    Abstract: With the rise of AI systems in real-world applications comes the need for reliable and trustworthy AI. An essential aspect of this are explainable AI systems. However, there is no agreed standard on how explainable AI systems should be assessed. Inspired by the Turing test, we introduce a human-centric assessment framework where a leading domain expert accepts or rejects the solutions of an AI sys… ▽ More

    Submitted 1 July, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted as submission to ICML 2022 Workshop on Human-Machine Collaboration and Teaming

  10. arXiv:2110.08144  [pdf, other

    cs.CL cs.AI

    milIE: Modular & Iterative Multilingual Open Information Extraction

    Authors: Bhushan Kotnis, Kiril Gashteovski, Daniel Oñoro Rubio, Vanesa Rodriguez-Tembras, Ammar Shaker, Makoto Takamoto, Mathias Niepert, Carolin Lawrence

    Abstract: Open Information Extraction (OpenIE) is the task of extracting (subject, predicate, object) triples from natural language sentences. Current OpenIE systems extract all triple slots independently. In contrast, we explore the hypothesis that it may be beneficial to extract triple slots iteratively: first extract easy slots, followed by the difficult ones by conditioning on the easy slots, and theref… ▽ More

    Submitted 25 April, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  11. arXiv:2109.07464  [pdf, other

    cs.CL

    AnnIE: An Annotation Platform for Constructing Complete Open Information Extraction Benchmark

    Authors: Niklas Friedrich, Kiril Gashteovski, Mingying Yu, Bhushan Kotnis, Carolin Lawrence, Mathias Niepert, Goran Glavaš

    Abstract: Open Information Extraction (OIE) is the task of extracting facts from sentences in the form of relations and their corresponding arguments in schema-free manner. Intrinsic performance of OIE systems is difficult to measure due to the incompleteness of existing OIE benchmarks: the ground truth extractions do not group all acceptable surface realizations of the same fact that can be extracted from… ▽ More

    Submitted 13 April, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

  12. arXiv:2109.06850  [pdf, other

    cs.CL cs.AI

    BenchIE: A Framework for Multi-Faceted Fact-Based Open Information Extraction Evaluation

    Authors: Kiril Gashteovski, Mingying Yu, Bhushan Kotnis, Carolin Lawrence, Mathias Niepert, Goran Glavaš

    Abstract: Intrinsic evaluations of OIE systems are carried out either manually -- with human evaluators judging the correctness of extractions -- or automatically, on standardized benchmarks. The latter, while much more cost-effective, is less reliable, primarily because of the incompleteness of the existing OIE benchmarks: the ground truth extractions do not include all acceptable variants of the same fact… ▽ More

    Submitted 13 April, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

  13. arXiv:1904.12324  [pdf, other

    cs.CL

    OPIEC: An Open Information Extraction Corpus

    Authors: Kiril Gashteovski, Sebastian Wanner, Sven Hertling, Samuel Broscheit, Rainer Gemulla

    Abstract: Open information extraction (OIE) systems extract relations and their arguments from natural language text in an unsupervised manner. The resulting extractions are a valuable resource for downstream tasks such as knowledge base construction, open question answering, or event schema induction. In this paper, we release, describe, and analyze an OIE corpus called OPIEC, which was extracted from the… ▽ More

    Submitted 28 April, 2019; originally announced April 2019.

    Comments: In Proceedings of the Conference of Automatic Knowledge Base Construction (AKBC) 2019

    Journal ref: In Proceedings of the Conference of Automatic Knowledge Base Construction (AKBC) 2019