Skip to main content

Showing 1–34 of 34 results for author: Bos, J

  1. arXiv:2407.01899  [pdf, other

    cs.CL

    Scope-enhanced Compositional Semantic Parsing for DRT

    Authors: Xiulin Yang, Jonas Groschwitz, Alexander Koller, Johan Bos

    Abstract: Discourse Representation Theory (DRT) distinguishes itself from other semantic representation frameworks by its ability to model complex semantic and discourse phenomena through structural nesting and variable binding. While seq2seq models hold the state of the art on DRT parsing, their accuracy degrades with the complexity of the sentence, and they sometimes struggle to produce well-formed DRT re… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2404.12698  [pdf, other

    cs.CL

    Neural Semantic Parsing with Extremely Rich Symbolic Meaning Representations

    Authors: Xiao Zhang, Gosse Bouma, Johan Bos

    Abstract: Current open-domain neural semantics parsers show impressive performance. However, closer inspection of the symbolic meaning representations they produce reveals significant weaknesses: sometimes they tend to merely copy character sequences from the source text to form symbolic concepts, defaulting to the most frequent word sense based in the training distribution. By leveraging the hierarchical s… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: This manuscript has been submitted to Computational Linguistics journal on 2024-03-15

  3. arXiv:2404.08354  [pdf, other

    cs.CL

    Gaining More Insight into Neural Semantic Parsing with Challenging Benchmarks

    Authors: Xiao Zhang, Chunliu Wang, Rik van Noord, Johan Bos

    Abstract: The Parallel Meaning Bank (PMB) serves as a corpus for semantic processing with a focus on semantic parsing and text generation. Currently, we witness an excellent performance of neural parsers and generators on the PMB. This might suggest that such semantic processing tasks have by and large been solved. We argue that this is not the case and that performance scores from the past on the PMB are i… ▽ More

    Submitted 7 May, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  4. arXiv:2310.02053  [pdf, other

    cs.CL

    Controlling Topic-Focus Articulation in Meaning-to-Text Generation using Graph Neural Networks

    Authors: Chunliu Wang, Rik van Noord, Johan Bos

    Abstract: A bare meaning representation can be expressed in various ways using natural language, depending on how the information is structured on the surface level. We are interested in finding ways to control topic-focus articulation when generating text from meaning. We focus on distinguishing active and passive voice for sentences with transitive verbs. The idea is to add pragmatic information such as t… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  5. arXiv:2306.09725  [pdf, other

    cs.CL

    Discourse Representation Structure Parsing for Chinese

    Authors: Chunliu Wang, Xiao Zhang, Johan Bos

    Abstract: Previous work has predominantly focused on monolingual English semantic parsing. We, instead, explore the feasibility of Chinese semantic parsing in the absence of labeled data for Chinese meaning representations. We describe the pipeline of automatically collecting the linearized Chinese meaning representation data for sequential-to sequential neural networks. We further propose a test suite desi… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: NATURAL LOGIC MEETS MACHINE LEARNING IV Workshop

  6. arXiv:2306.00124  [pdf, other

    cs.CL

    Pre-Trained Language-Meaning Models for Multilingual Parsing and Generation

    Authors: Chunliu Wang, Huiyuan Lai, Malvina Nissim, Johan Bos

    Abstract: Pre-trained language models (PLMs) have achieved great success in NLP and have recently been used for tasks in computational semantics. However, these tasks do not fully benefit from PLMs since meaning representations are not explicitly included in the pre-training stage. We introduce multilingual pre-trained language-meaning models based on Discourse Representation Structures (DRSs), including me… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted by ACL2023 findings

  7. arXiv:2305.08414  [pdf, other

    cs.CL cs.AI

    What's the Meaning of Superhuman Performance in Today's NLU?

    Authors: Simone Tedeschi, Johan Bos, Thierry Declerck, Jan Hajic, Daniel Hershcovich, Eduard H. Hovy, Alexander Koller, Simon Krek, Steven Schockaert, Rico Sennrich, Ekaterina Shutova, Roberto Navigli

    Abstract: In the last five years, there has been a significant focus in Natural Language Processing (NLP) on developing larger Pretrained Language Models (PLMs) and introducing benchmarks such as SuperGLUE and SQuAD to measure their abilities in language understanding, reasoning, and reading comprehension. These PLMs have achieved impressive results on these benchmarks, even surpassing human performance in… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 9 pages, long paper at ACL 2023 proceedings

  8. arXiv:2301.08277  [pdf, other

    cs.DL

    LaTeX, metadata, and publishing workflows

    Authors: Joppe W. Bos, Kevin S. McCurley

    Abstract: The field of scientific publishing that is served by LaTeX is increasingly dependent on the availability of metadata about publications. We discuss how to use LaTeX classes and BibTeX styles to curate metadata throughout the life cycle of a published article. Our focus is on streamlining and automating much of publishing workflow. We survey the various options and drawbacks of the existing approac… ▽ More

    Submitted 9 April, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

  9. arXiv:2109.07078  [pdf, other

    cs.CV cs.RO

    DSOR: A Scalable Statistical Filter for Removing Falling Snow from LiDAR Point Clouds in Severe Winter Weather

    Authors: Akhil Kurup, Jeremy Bos

    Abstract: For autonomous vehicles to viably replace human drivers they must contend with inclement weather. Falling rain and snow introduce noise in LiDAR returns resulting in both false positive and false negative object detections. In this article we introduce the Winter Adverse Driving dataSet (WADS) collected in the snow belt region of Michigan's Upper Peninsula. WADS is the first multi-modal dataset fe… ▽ More

    Submitted 30 October, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  10. arXiv:2103.16020  [pdf, other

    eess.IV cs.CV cs.GR cs.LG

    Machine learning method for light field refocusing

    Authors: Eisa Hedayati, Timothy C. Havens, Jeremy P. Bos

    Abstract: Light field imaging introduced the capability to refocus an image after capturing. Currently there are two popular methods for refocusing, shift-and-sum and Fourier slice methods. Neither of these two methods can refocus the light field in real-time without any pre-processing. In this paper we introduce a machine learning based refocusing technique that is capable of extracting 16 refocused images… ▽ More

    Submitted 18 April, 2022; v1 submitted 29 March, 2021; originally announced March 2021.

  11. arXiv:2012.14854  [pdf, other

    cs.CL

    The Parallel Meaning Bank: A Framework for Semantically Annotating Multiple Languages

    Authors: Lasha Abzianidze, Rik van Noord, Chunliu Wang, Johan Bos

    Abstract: This paper gives a general description of the ideas behind the Parallel Meaning Bank, a framework with the aim to provide an easy way to annotate compositional semantics for texts written in languages other than English. The annotation procedure is semi-automatic, and comprises seven layers of linguistic information: segmentation, symbolisation, semantic tagging, word sense disambiguation, syntact… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

    Comments: 13 pages, 5 figures, 1 table

    MSC Class: 68T50 ACM Class: I.2.7

  12. arXiv:2012.14837  [pdf, other

    cs.CL

    DRS at MRP 2020: Dressing up Discourse Representation Structures as Graphs

    Authors: Lasha Abzianidze, Johan Bos, Stephan Oepen

    Abstract: Discourse Representation Theory (DRT) is a formal account for representing the meaning of natural language discourse. Meaning in DRT is modeled via a Discourse Representation Structure (DRS), a meaning representation with a model-theoretic interpretation, which is usually depicted as nested boxes. In contrast, a directed labeled graph is a common data structure used to encode semantics of natural… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

    Comments: 10 pages, 4 figures, 4 tables, CoNLL 2020 Shared Task

    MSC Class: 68T50 ACM Class: I.2.7

  13. arXiv:2011.04308  [pdf, other

    cs.CL

    Character-level Representations Improve DRS-based Semantic Parsing Even in the Age of BERT

    Authors: Rik van Noord, Antonio Toral, Johan Bos

    Abstract: We combine character-level and contextual language model representations to improve performance on Discourse Representation Structure parsing. Character representations can easily be added in a sequence-to-sequence model in either one encoder or as a fully separate encoder, with improvements that are robust to different language models, languages and data sets. For English, these improvements are… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: EMNLP 2020 (long)

  14. Light Field Compression by Residual CNN Assisted JPEG

    Authors: Eisa Hedayati, Timothy C. Havens, Jeremy P. Bos

    Abstract: Light field (LF) imaging has gained significant attention due to its recent success in 3-dimensional (3D) displaying and rendering as well as augmented and virtual reality usage. Nonetheless, because of the two extra dimensions, LFs are much larger than conventional images. We develop a JPEG-assisted learning-based technique to reconstruct an LF from a JPEG bitstream with a bit per pixel ratio of… ▽ More

    Submitted 18 March, 2021; v1 submitted 30 September, 2020; originally announced October 2020.

    Journal ref: 2021 International Joint Conference on Neural Networks (IJCNN), 2021, pp. 1-9

  15. Thirty Musts for Meaning Banking

    Authors: Johan Bos, Lasha Abzianidze

    Abstract: Meaning banking--creating a semantically annotated corpus for the purpose of semantic parsing or generation--is a challenging task. It is quite simple to come up with a complex meaning representation, but it is hard to design a simple meaning representation that captures many nuances of meaning. This paper lists some lessons learned in nearly ten years of meaning annotation during the development… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

    Comments: https://www.aclweb.org/anthology/W19-3302/

    Journal ref: Proceedings of the First International Workshop on Designing Meaning Representations, 2019, Association for Computational Linguistics

  16. arXiv:2005.13399  [pdf, other

    cs.CL cs.AI cs.LG

    The First Shared Task on Discourse Representation Structure Parsing

    Authors: Lasha Abzianidze, Rik van Noord, Hessel Haagsma, Johan Bos

    Abstract: The paper presents the IWCS 2019 shared task on semantic parsing where the goal is to produce Discourse Representation Structures (DRSs) for English sentences. DRSs originate from Discourse Representation Theory and represent scoped meaning representations that capture the semantics of negation, modals, quantification, and presupposition triggers. Additionally, concepts and event-participants in D… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

    Comments: International Conference on Computational Semantics (IWCS)

    ACM Class: I.2.7

    Journal ref: Proceedings of the IWCS Shared Task on Semantic Parsing, IWCS, SIGSEM, 2019, Association for Computational Linguistics

  17. arXiv:2001.08133  [pdf, ps, other

    cs.PL cs.LO

    Drawing Prolog Search Trees: A Manual for Teachers and Students of Logic Programming

    Authors: Johan Bos

    Abstract: Programming in Prolog is hard for programmers that are used to procedural coding. In this manual the method of drawing search trees is introduced with the aim to get a better understanding of how Prolog works. After giving a first example of a Prolog database, query and search tree, the art of drawing search trees is systematically introduced giving guidelines for queries with variables, conjuncti… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

    Comments: 20 pages, 8 listings, 7 figures

  18. arXiv:1911.08829  [pdf, other

    cs.CL

    Casting a Wide Net: Robust Extraction of Potentially Idiomatic Expressions

    Authors: Hessel Haagsma, Malvina Nissim, Johan Bos

    Abstract: Idiomatic expressions like `out of the woods' and `up the ante' present a range of difficulties for natural language processing applications. We present work on the annotation and extraction of what we term potentially idiomatic expressions (PIEs), a subclass of multiword expressions covering both literal and non-literal uses of idiomatic expressions. Existing corpora of PIEs are small and have li… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

  19. arXiv:1908.01355  [pdf, other

    cs.CL

    Separating Argument Structure from Logical Structure in AMR

    Authors: Johan Bos

    Abstract: The AMR (Abstract Meaning Representation) formalism for representing meaning of natural language sentences was not designed to deal with scope and quantifiers. By extending AMR with indices for contexts and formulating constraints on these contexts, a formalism is derived that makes correct prediction for inferences involving negation and bound variables. The attractive core predicate-argument str… ▽ More

    Submitted 27 October, 2020; v1 submitted 4 August, 2019; originally announced August 2019.

  20. arXiv:1906.06448  [pdf, other

    cs.CL

    Can neural networks understand monotonicity reasoning?

    Authors: Hitomi Yanaka, Koji Mineshima, Daisuke Bekki, Kentaro Inui, Satoshi Sekine, Lasha Abzianidze, Johan Bos

    Abstract: Monotonicity reasoning is one of the important reasoning skills for any intelligent natural language inference (NLI) model in that it requires the ability to capture the interaction between lexical and syntactic structures. Since no test set has been developed for monotonicity reasoning with wide coverage, it is still unclear whether neural models can perform monotonicity reasoning in a proper way… ▽ More

    Submitted 27 June, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

    Comments: accepted by ACL2019 BlackboxNLP (long paper)

  21. arXiv:1904.12166  [pdf, ps, other

    cs.CL

    HELP: A Dataset for Identifying Shortcomings of Neural Models in Monotonicity Reasoning

    Authors: Hitomi Yanaka, Koji Mineshima, Daisuke Bekki, Kentaro Inui, Satoshi Sekine, Lasha Abzianidze, Johan Bos

    Abstract: Large crowdsourced datasets are widely used for training and evaluating neural models on natural language inference (NLI). Despite these efforts, neural models have a hard time capturing logical inferences, including those licensed by phrase replacements, so-called monotonicity reasoning. Since no large dataset has been developed for monotonicity reasoning, it is still unclear whether the main obs… ▽ More

    Submitted 27 April, 2019; originally announced April 2019.

    Comments: 6 pages, 1 figure, accepted as *SEM 2019

  22. arXiv:1904.00904  [pdf, other

    physics.chem-ph cs.LG physics.comp-ph physics.data-an

    An Atomistic Machine Learning Package for Surface Science and Catalysis

    Authors: Martin Hangaard Hansen, José A. Garrido Torres, Paul C. Jennings, Ziyun Wang, Jacob R. Boes, Osman G. Mamun, Thomas Bligaard

    Abstract: We present work flows and a software module for machine learning model building in surface science and heterogeneous catalysis. This includes fingerprinting atomic structures from 3D structure and/or connectivity information, it includes descriptor selection methods and benchmarks, and it includes active learning frameworks for atomic structure optimization, acceleration of screening studies and f… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

  23. arXiv:1810.12579  [pdf, other

    cs.CL

    Exploring Neural Methods for Parsing Discourse Representation Structures

    Authors: Rik van Noord, Lasha Abzianidze, Antonio Toral, Johan Bos

    Abstract: Neural methods have had several recent successes in semantic parsing, though they have yet to face the challenge of producing meaning representations based on formal semantics. We present a sequence-to-sequence neural semantic parser that is able to produce Discourse Representation Structures (DRSs) for English sentences with high accuracy, outperforming traditional DRS parsers. To facilitate the… ▽ More

    Submitted 30 October, 2018; originally announced October 2018.

    Comments: to appear in TACL 2018

  24. arXiv:1808.09716  [pdf, other

    cs.CL

    What can we learn from Semantic Tagging?

    Authors: Mostafa Abdou, Artur Kulmizev, Vinit Ravishankar, Lasha Abzianidze, Johan Bos

    Abstract: We investigate the effects of multi-task learning using the recently introduced task of semantic tagging. We employ semantic tagging as an auxiliary task for three different NLP tasks: part-of-speech tagging, Universal Dependency parsing, and Natural Language Inference. We compare full neural network sharing, partial neural network sharing, and what we term the learning what to share setting where… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: 9 pages with references and appendixes. EMNLP 2018 camera ready

  25. arXiv:1802.08599  [pdf, other

    cs.CL

    Evaluating Scoped Meaning Representations

    Authors: Rik van Noord, Lasha Abzianidze, Hessel Haagsma, Johan Bos

    Abstract: Semantic parsing offers many opportunities to improve natural language understanding. We present a semantically annotated parallel corpus for English, German, Italian, and Dutch where sentences are aligned with scoped meaning representations in order to capture the semantics of negation, modals, quantification, and presupposition triggers. The semantic formalism is based on Discourse Representatio… ▽ More

    Submitted 10 April, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: Camera-ready for LREC 2018

  26. arXiv:1709.10381  [pdf, ps, other

    cs.CL

    Towards Universal Semantic Tagging

    Authors: Lasha Abzianidze, Johan Bos

    Abstract: The paper proposes the task of universal semantic tagging---tagging word tokens with language-neutral, semantically informative tags. We argue that the task, with its independent nature, contributes to better semantic analysis for wide-coverage multilingual text. We present the initial version of the semantic tagset and show that (a) the tags provide semantically fine-grained information, and (b)… ▽ More

    Submitted 29 September, 2017; originally announced September 2017.

    Comments: 9 pages, International Conference on Computational Semantics (IWCS)

  27. arXiv:1705.09980  [pdf, ps, other

    cs.CL

    Neural Semantic Parsing by Character-based Translation: Experiments with Abstract Meaning Representations

    Authors: Rik van Noord, Johan Bos

    Abstract: We evaluate the character-level translation method for neural semantic parsing on a large corpus of sentences annotated with Abstract Meaning Representations (AMRs). Using a sequence-to-sequence model, and some trivial preprocessing and postprocessing of AMRs, we obtain a baseline accuracy of 53.1 (F-score on AMR-triples). We examine five different approaches to improve this baseline result: (i) r… ▽ More

    Submitted 9 October, 2017; v1 submitted 28 May, 2017; originally announced May 2017.

    Comments: Camera ready for CLIN 2017 journal

  28. arXiv:1704.02156  [pdf, other

    cs.CL

    The Meaning Factory at SemEval-2017 Task 9: Producing AMRs with Neural Semantic Parsing

    Authors: Rik van Noord, Johan Bos

    Abstract: We evaluate a semantic parser based on a character-based sequence-to-sequence model in the context of the SemEval-2017 shared task on semantic parsing for AMRs. With data augmentation, super characters, and POS-tagging we gain major improvements in performance compared to a baseline character-level model. Although we improve on previous character-based neural semantic parsing models, the overall a… ▽ More

    Submitted 19 April, 2017; v1 submitted 7 April, 2017; originally announced April 2017.

    Comments: To appear in Proceedings of SemEval, 2017 (camera-ready)

  29. arXiv:1702.03964  [pdf, other

    cs.CL

    The Parallel Meaning Bank: Towards a Multilingual Corpus of Translations Annotated with Compositional Meaning Representations

    Authors: Lasha Abzianidze, Johannes Bjerva, Kilian Evang, Hessel Haagsma, Rik van Noord, Pierre Ludmann, Duc-Duy Nguyen, Johan Bos

    Abstract: The Parallel Meaning Bank is a corpus of translations annotated with shared, formal meaning representations comprising over 11 million words divided over four languages (English, German, Italian, and Dutch). Our approach is based on cross-lingual projection: automatically produced (and manually corrected) semantic annotations for English sentences are mapped onto their word-aligned translations, a… ▽ More

    Submitted 13 February, 2017; originally announced February 2017.

    Comments: To appear at EACL 2017

  30. arXiv:1609.07053  [pdf, other

    cs.CL

    Semantic Tagging with Deep Residual Networks

    Authors: Johannes Bjerva, Barbara Plank, Johan Bos

    Abstract: We propose a novel semantic tagging task, sem-tagging, tailored for the purpose of multilingual semantic parsing, and present the first tagger using deep residual networks (ResNets). Our tagger uses both word and character representations and includes a novel residual bypass architecture. We evaluate the tagset both intrinsically on the new task of semantic tagging, as well as on Part-of-Speech (P… ▽ More

    Submitted 31 October, 2016; v1 submitted 22 September, 2016; originally announced September 2016.

    Comments: COLING 2016, camera ready version

  31. arXiv:1202.4285  [pdf, other

    math.NT cs.CR

    Finding ECM-friendly curves through a study of Galois properties

    Authors: Razvan Barbulescu, Joppe W. Bos, Cyril Bouvier, Thorsten Kleinjung, Peter L. Montgomery

    Abstract: In this paper we prove some divisibility properties of the cardinality of elliptic curves modulo primes. These proofs explain the good behavior of certain parameters when using Montgomery or Edwards curves in the setting of the elliptic curve method (ECM) for integer factorization. The ideas of the proofs help us to find new families of elliptic curves with good division properties which increase… ▽ More

    Submitted 4 September, 2012; v1 submitted 20 February, 2012; originally announced February 2012.

    Journal ref: Algorithmic Number Theory Symposium (2012)

  32. Rascal: From Algebraic Specification to Meta-Programming

    Authors: Jeroen van den Bos, Mark Hills, Paul Klint, Tijs van der Storm, Jurgen J. Vinju

    Abstract: Algebraic specification has a long tradition in bridging the gap between specification and programming by making specifications executable. Building on extensive experience in designing, implementing and using specification formalisms that are based on algebraic specification and term rewriting (namely Asf and Asf+Sdf), we are now focusing on using the best concepts from algebraic specification an… ▽ More

    Submitted 30 June, 2011; originally announced July 2011.

    Comments: In Proceedings AMMSE 2011, arXiv:1106.5962

    ACM Class: D.3.2

    Journal ref: EPTCS 56, 2011, pp. 15-32

  33. Compositional Semantics in Verbmobil

    Authors: Johan Bos, Björn Gambäck, Christian Lieske, Yoshiki Mori, Manfred Pinkal, Karsten Worm

    Abstract: The paper discusses how compositional semantics is implemented in the Verbmobil speech-to-speech translation system using LUD, a description language for underspecified discourse representation structures. The description language and its formal interpretation in DRT are described as well as its implementation together with the architecture of the system's entire syntactic-semantic processing mo… ▽ More

    Submitted 30 July, 1996; originally announced July 1996.

    Comments: 6 pages, LaTeX, uses colap.sty

    Journal ref: Proceedings of COLING '96

  34. Bridging as Coercive Accommodation

    Authors: Johan Bos, Paul Buitelaar, Anne-Marie Mineur

    Abstract: In this paper we discuss the notion of "bridging" in Discourse Representation Theory as a tool to account for discourse referents that have only been established implicitly, through the lexical semantics of other referents. In doing so, we use ideas from Generative Lexicon theory, to introduce antecedents for anaphoric expressions that cannot be "linked" to a proper antecedent, but that do not n… ▽ More

    Submitted 2 August, 1995; originally announced August 1995.

    Comments: LaTeX file, 16 pages, uses named.sty. Paper presented at CLNLP workshop, Edinburgh, April 3-5, 1995

    Report number: CLAUS 52 Technical Report