Showing 1–19 of 19 results for author: Garijo, D

  1. arXiv:2312.07852  [pdf, ps, other]

    cs.DL

    Recording provenance of workflow runs with RO-Crate

    Authors: Simone Leo, Michael R. Crusoe, Laura Rodríguez-Navas, Raül Sirvent, Alexander Kanitz, Paul De Geest, Rudolf Wittner, Luca Pireddu, Daniel Garijo, José M. Fernández, Iacopo Colonnelli, Matej Gallo, Tazro Ohta, Hirotaka Suetake, Salvador Capella-Gutierrez, Renske de Wit, Bruno P. Kinoshita, Stian Soiland-Reyes

    Abstract: Recording the provenance of scientific computation results is key to the support of traceability, reproducibility and quality assessment of data products. Several data models have been explored to address this need, providing representations of workflow plans and their executions as well as means of packaging the resulting information for archiving and sharing. However, existing approaches tend to…

    Submitted 16 July, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: 38 pages, 5 figures, 3 tables. Resubmitted to PLOS ONE following peer review

  2. arXiv:2305.03107  [pdf, other]

    math.CO cs.DM

    Homomorphisms between graphs embedded on surfaces

    Authors: Delia Garijo, Andrew Goodall, Lluís Vena

    Abstract: We extend the notion of graph homomorphism to cellularly embedded graphs (maps) by designing operations on vertices and edges that respect the surface topology; we thus obtain the first definition of map homomorphism that preserves both the combinatorial structure (as a graph homomorphism) and the topological structure of the surface (in particular, orientability and genus). Notions such as the co…

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 46 pages, 11 figures

  3. Workflows Community Summit 2022: A Roadmap Revolution

    Authors: Rafael Ferreira da Silva, Rosa M. Badia, Venkat Bala, Debbie Bard, Peer-Timo Bremer, Ian Buckley, Silvina Caino-Lores, Kyle Chard, Carole Goble, Shantenu Jha, Daniel S. Katz, Daniel Laney, Manish Parashar, Frederic Suter, Nick Tyler, Thomas Uram, Ilkay Altintas, Stefan Andersson, William Arndt, Juan Aznar, Jonathan Bader, Bartosz Balis, Chris Blanton, Kelly Rosa Braghetto, Aharon Brodutch, et al. (80 additional authors not shown)

    Abstract: Scientific workflows have become integral tools in broad scientific computing use cases. Science discovery is increasingly dependent on workflows to orchestrate large and complex scientific experiments that range from execution of a cloud-based data preprocessing pipeline to multi-facility instrument-to-edge-to-HPC computational workflows. Given the changing landscape of scientific computing and t…

    Submitted 31 March, 2023; originally announced April 2023.

    Report number: ORNL/TM-2023/2885

  4. arXiv:2201.12650  [pdf, other]

    cs.DM math.CO

    New results on the robust coloring problem

    Authors: Delia Garijo, Alberto Márquez, Rafael Robles

    Abstract: Many variations of the classical graph coloring model have been intensively studied due to their multiple applications; scheduling problems and aircraft assignments, for instance, motivate the robust coloring problem. This model gets to capture natural constraints of those optimization problems by combining the information provided by two colorings: a vertex coloring of a graph and the induced edg…

    Submitted 16 May, 2023; v1 submitted 29 January, 2022; originally announced January 2022.

    MSC Class: 05C15; 68R10

  5. A Community Roadmap for Scientific Workflows Research and Development

    Authors: Rafael Ferreira da Silva, Henri Casanova, Kyle Chard, Ilkay Altintas, Rosa M Badia, Bartosz Balis, Tainã Coleman, Frederik Coppens, Frank Di Natale, Bjoern Enders, Thomas Fahringer, Rosa Filgueira, Grigori Fursin, Daniel Garijo, Carole Goble, Dorran Howell, Shantenu Jha, Daniel S. Katz, Daniel Laney, Ulf Leser, Maciej Malawski, Kshitij Mehta, Loïc Pottier, Jonathan Ozik, J. Luc Peterson, et al. (4 additional authors not shown)

    Abstract: The landscape of workflow systems for scientific applications is notoriously convoluted with hundreds of seemingly equivalent workflow systems, many isolated research claims, and a steep learning curve. To address some of these challenges and lay the groundwork for transforming workflows research and development, the WorkflowsRI and ExaWorks projects partnered to bring the international workflows…

    Submitted 8 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2103.09181

  6. arXiv:2108.07119  [pdf, ps, other]

    cs.AI

    Creating and Querying Personalized Versions of Wikidata on a Laptop

    Authors: Hans Chalupsky, Pedro Szekely, Filip Ilievski, Daniel Garijo, Kartik Shenoy

    Abstract: Application developers today have three choices for exploiting the knowledge present in Wikidata: they can download the Wikidata dumps in JSON or RDF format, they can use the Wikidata API to get data about individual entities, or they can use the Wikidata SPARQL endpoint. None of these methods can support complex, yet common, query use cases, such as retrieval of large amounts of data or aggregati…

    Submitted 18 August, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

    ACM Class: H.3.3; I.2

  7. Packaging research artefacts with RO-Crate

    Authors: Stian Soiland-Reyes, Peter Sefton, Mercè Crosas, Leyla Jael Castro, Frederik Coppens, José M. Fernández, Daniel Garijo, Björn Grüning, Marco La Rosa, Simone Leo, Eoghan Ó Carragáin, Marc Portier, Ana Trisovic, RO-Crate Community, Paul Groth, Carole Goble

    Abstract: An increasing number of researchers support reproducibility by including pointers to and descriptions of datasets, software and methods in their publications. However, scientific articles may be ambiguous, incomplete and difficult to process by automated systems. In this paper we introduce RO-Crate, an open, community-driven, and lightweight approach to packaging research artefacts along with thei…

    Submitted 6 December, 2021; v1 submitted 14 August, 2021; originally announced August 2021.

    Comments: 44 pages. Accepted for Data Science

    ACM Class: H.1.1; H.3.2

    Journal ref: Data Science 2022

  8. arXiv:2107.00156  [pdf, other]

    cs.AI

    A Study of the Quality of Wikidata

    Authors: Kartik Shenoy, Filip Ilievski, Daniel Garijo, Daniel Schwabe, Pedro Szekely

    Abstract: Wikidata has been increasingly adopted by many communities for a wide variety of applications, which demand high-quality knowledge to deliver successful results. In this paper, we develop a framework to detect and analyze low-quality statements in Wikidata by shedding light on the current practices exercised by the community. We explore three indicators of data quality in Wikidata, based on: 1) co…

    Submitted 18 November, 2021; v1 submitted 30 June, 2021; originally announced July 2021.

    Comments: 12 pages

    Journal ref: Journal of Web Semantics, Special issue on Community-Based Knowledge Bases, 2021

  9. Workflows Community Summit: Advancing the State-of-the-art of Scientific Workflows Management Systems Research and Development

    Authors: Rafael Ferreira da Silva, Henri Casanova, Kyle Chard, Tainã Coleman, Dan Laney, Dong Ahn, Shantenu Jha, Dorran Howell, Stian Soiland-Reyes, Ilkay Altintas, Douglas Thain, Rosa Filgueira, Yadu Babuji, Rosa M. Badia, Bartosz Balis, Silvina Caino-Lores, Scott Callaghan, Frederik Coppens, Michael R. Crusoe, Kaushik De, Frank Di Natale, Tu M. A. Do, Bjoern Enders, Thomas Fahringer, Anne Fouilloux, et al. (33 additional authors not shown)

    Abstract: Scientific workflows are a cornerstone of modern scientific computing, and they have underpinned some of the most significant discoveries of the last decade. Many of these workflows have high computational, storage, and/or communication demands, and thus must execute on a wide range of large-scale platforms, from large clouds to upcoming exascale HPC platforms. Workflows will play a crucial role i…

    Submitted 9 June, 2021; originally announced June 2021.

  10. arXiv:2103.11676  [pdf, other]

    cs.CG cs.DM

    Continuous mean distance of a weighted graph

    Authors: Delia Garijo, Alberto Márquez, Rodrigo I. Silveira

    Abstract: We study the concept of the continuous mean distance of a weighted graph. For connected unweighted graphs, the mean distance can be defined as the arithmetic mean of the distances between all pairs of vertices. This parameter provides a natural measure of the compactness of the graph, and has been intensively studied, together with several variants, including its version for weighted graphs. The c…

    Submitted 13 January, 2023; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Revised version

  11. arXiv:2012.13117  [pdf, other]

    cs.DL cs.CY

    Nine Best Practices for Research Software Registries and Repositories: A Concise Guide

    Authors: Task Force on Best Practices for Software Registries: Alain Monteil, Alejandra Gonzalez-Beltran, Alexandros Ioannidis, Alice Allen, Allen Lee, Anita Bandrowski, Bruce E. Wilson, Bryce Mecum, Cai Fan Du, Carly Robinson, Daniel Garijo, Daniel S. Katz, David Long, Genevieve Milliken, Hervé Ménager, Jessica Hausman, Jurriaan H. Spaaks, Katrina Fenlon, Kristin Vanderbilt, Lorraine Hwang, Lynn Davis, Martin Fenner, Michael R. Crusoe, et al. (8 additional authors not shown)

    Abstract: Scientific software registries and repositories serve various roles in their respective disciplines. These resources improve software discoverability and research transparency, provide information for software citations, and foster preservation of computational methods that might otherwise be lost over time, thereby supporting research reproducibility and replicability. However, developing these r…

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: 18 pages

  12. arXiv:2009.10263  [pdf, other]

    cs.LG cs.CV cs.CY eess.IV

    Semantic Workflows and Machine Learning for the Assessment of Carbon Storage by Urban Trees

    Authors: Juan Carrillo, Daniel Garijo, Mark Crowley, Rober Carrillo, Yolanda Gil, Katherine Borda

    Abstract: Climate science is critical for understanding both the causes and consequences of changes in global temperatures and has become imperative for decisive policy-making. However, climate science studies commonly require addressing complex interoperability issues between data, software, and experimental approaches from multiple fields. Scientific workflow systems provide unparalleled advantages to add…

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: Previously published as part of the SciKnow 2019 Workshop, November 19th, 2019. Los Angeles, California, USA. Collocated with the tenth International Conference on Knowledge Capture (K-CAP)

    Journal ref: Proceedings of the Third International Workshop on Capturing Scientific Knowledge co-located with the 10th International Conference on Knowledge Capture (K-CAP 2019)

  13. arXiv:2007.09206  [pdf, other]

    cs.AI

    OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs

    Authors: Daniel Garijo, Maximiliano Osorio

    Abstract: In recent years, Semantic Web technologies have been increasingly adopted by researchers, industry and public institutions to describe and link data on the Web, create web annotations and consume large knowledge graphs like Wikidata and DBPedia. However, there is still a knowledge gap between ontology engineers, who design, populate and create knowledge graphs; and web developers, who need to unde…

    Submitted 17 July, 2020; originally announced July 2020.

    ACM Class: D.2.12

  14. arXiv:2006.00088  [pdf, other]

    cs.AI cs.DB

    KGTK: A Toolkit for Large Knowledge Graph Manipulation and Analysis

    Authors: Filip Ilievski, Daniel Garijo, Hans Chalupsky, Naren Teja Divvala, Yixiang Yao, Craig Rogers, Rongpeng Li, Jun Liu, Amandeep Singh, Daniel Schwabe, Pedro Szekely

    Abstract: Knowledge graphs (KGs) have become the preferred technology for representing, sharing and adding knowledge to modern AI applications. While KGs have become a mainstream technology, the RDF/SPARQL-centric toolset for operating with them at scale is heterogeneous, difficult to integrate and only covers a subset of the operations that are commonly needed in data science applications. In this paper we…

    Submitted 26 May, 2021; v1 submitted 29 May, 2020; originally announced June 2020.

    Comments: 16 pages

  15. arXiv:2003.13084  [pdf, ps, other]

    cs.DL cs.AI cs.DB

    Best Practices for Implementing FAIR Vocabularies and Ontologies on the Web

    Authors: Daniel Garijo, María Poveda-Villalón

    Abstract: With the adoption of Semantic Web technologies, an increasing number of vocabularies and ontologies have been developed in different domains, ranging from Biology to Agronomy or Geosciences. However, many of these ontologies are still difficult to find, access and understand by researchers due to a lack of documentation, URI resolving issues, versioning problems, etc. In this chapter we describe g…

    Submitted 29 March, 2020; originally announced March 2020.

    Comments: 16 pages, 4 figures

    ACM Class: I.2; I.2.4

  16. arXiv:1807.10093  [pdf, other]

    cs.CG

    Computing optimal shortcuts for networks

    Authors: Delia Garijo, Alberto Márquez, Natalia Rodríguez, Rodrigo I. Silveira

    Abstract: We study augmenting a plane Euclidean network with a segment, called a shortcut, to minimize the largest distance between any two points along the edges of the resulting network. Problems of this type have received considerable attention recently, mostly for discrete variants of the problem. We consider a fully continuous setting, where the problem of computing distances and placing a shortcut i…

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: 21 pages

  17. arXiv:1703.04329  [pdf, other]

    cs.CG

    Stabbing segments with rectilinear objects

    Authors: Mercè Claverol, Delia Garijo, Matias Korman, Carlos Seara, Rodrigo I. Silveira

    Abstract: Given a set $S$ of $n$ line segments in the plane, we say that a region $\mathcal{R}\subseteq \mathbb{R}^2$ is a stabber for $S$ if $\mathcal{R}$ contains exactly one endpoint of each segment of $S$. In this paper we provide optimal or near-optimal algorithms for reporting all combinatorially different stabbers for several shapes of stabbers. Specifically, we consider the case in which the s…

    Submitted 13 March, 2017; originally announced March 2017.

  18. arXiv:1603.06764  [pdf, other]

    cs.CG cs.DM

    On Hamiltonian alternating cycles and paths

    Authors: Mercè Claverol, Alfredo García, Delia Garijo, Carlos Seara, Javier Tejel

    Abstract: We undertake a study on computing Hamiltonian alternating cycles and paths on bicolored point sets. This has been an intensively studied problem, not always with a solution, when the paths and cycles are also required to be plane. In this paper, we relax the constraint on the cycles and paths from being plane to being 1-plane, and deal with the same type of questions as those for the plane case, o…

    Submitted 31 March, 2017; v1 submitted 22 March, 2016; originally announced March 2016.

  19. arXiv:1401.4307  [pdf, other]

    cs.DL

    The Research Object Suite of Ontologies: Sharing and Exchanging Research Data and Methods on the Open Web

    Authors: Khalid Belhajjame, Jun Zhao, Daniel Garijo, Kristina Hettne, Raul Palma, Óscar Corcho, José-Manuel Gómez-Pérez, Sean Bechhofer, Graham Klyne, Carole Goble

    Abstract: Research in life sciences is increasingly being conducted in a digital and online environment. In particular, life scientists have been pioneers in embracing new computational tools to conduct their investigations. To support the sharing of digital objects produced during such research investigations, we have witnessed in the last few years the emergence of specialized repositories, e.g., DataVers…

    Submitted 3 February, 2014; v1 submitted 17 January, 2014; originally announced January 2014.

    Comments: 20 pages