-
Link Climate: An Interoperable Knowledge Graph Platform for Climate Data
Authors:
Jiantao Wu,
Fabrizio Orlandi,
Declan O'Sullivan,
Soumyabrata Dev
Abstract:
Climate science has become more ambitious in recent years as global awareness about the environment has grown. To better understand climate, historical climate (e.g. archived meteorological variables such as temperature, wind, water, etc.) and climate-related data (e.g. geographical features and human activities) are widely used by today's climate research to derive explainable models of climate change and its effects. However, such data sources are often dispersed across a multitude of disconnected data silos on the Web. Moreover, there is a lack of advanced climate data platforms to enable multi-source heterogeneous climate data analysis; researchers therefore face a stern challenge in collecting and analyzing multi-source data. In this paper, we address this problem by proposing a climate knowledge graph for the integration of multiple climate data and other data sources into one service, leveraging Web technologies (e.g. HTTP) for multi-source climate data analysis. The proposed knowledge graph is primarily composed of data from the National Oceanic and Atmospheric Administration's daily climate summaries, OpenStreetMap, and Wikidata, and it supports joint data queries on these widely used databases. This paper shows, with a use case in Ireland and the United Kingdom, how climate researchers could benefit from this platform as it allows them to easily integrate datasets from different domains and geographical locations.
Submitted 28 October, 2022;
originally announced October 2022.
-
A semantic web approach to uplift decentralized household energy data
Authors:
Jiantao Wu,
Fabrizio Orlandi,
Tarek AlSkaif,
Declan O'Sullivan,
Soumyabrata Dev
Abstract:
In a decentralized household energy system comprised of various devices such as home appliances, electric vehicles, and solar panels, end-users are able to dig deeper into the system's details and further achieve energy sustainability if they are presented with data on the electric energy consumption and production at the granularity of the device. However, many databases in this field are siloed from other domains, including solely information pertaining to energy. This may result in the loss of information (e.g. weather) on each device's energy use. Meanwhile, a large number of these datasets have been extensively used in computational modeling techniques such as machine learning models. While such computational approaches achieve great accuracy and performance by concentrating only on a local view of datasets, model reliability cannot be guaranteed since such models are very vulnerable to data input fluctuations when information omission is taken into account. This article tackles the data isolation issue in the field of smart energy systems by examining Semantic Web methods on top of a household energy system. We offer an ontology-based approach for managing decentralized data at the device-level resolution in a system. As a consequence, the scope of the data associated with each device may easily be expanded in an interoperable manner throughout the Web, and additional information, such as weather, can be obtained from the Web, provided that the data is organized according to W3C standards.
Submitted 26 August, 2022; v1 submitted 18 August, 2022;
originally announced August 2022.
-
Automated Climate Analyses Using Knowledge Graph
Authors:
Jiantao Wu,
Huan Chen,
Fabrizio Orlandi,
Yee Hui Lee,
Declan O'Sullivan,
Soumyabrata Dev
Abstract:
The FAIR (Findable, Accessible, Interoperable, Reusable) data principles are fundamental for climate researchers and all stakeholders in the current digital ecosystem. In this paper, we demonstrate how relational climate data can be "FAIR" and modeled using RDF, in line with Semantic Web technologies and our Climate Analysis ontology. Thus, heterogeneous climate data can be stored in graph databases and offered as Linked Data on the Web. As a result, climate researchers will be able to use the standard SPARQL query language to query these sources directly on the Web. In this paper, we demonstrate the usefulness of our SPARQL endpoint for automated climate analytics. We illustrate two sample use cases that establish the advantage of representing climate data as knowledge graphs.
Submitted 21 October, 2021;
originally announced October 2021.
-
An Interoperable Open Data Portal for Climate Analysis
Authors:
Jiantao Wu,
Huan Chen,
Fabrizio Orlandi,
Yee Hui Lee,
Declan O'Sullivan,
Soumyabrata Dev
Abstract:
This work proposes an open interoperable data portal that offers access to a Web-wide climate domain knowledge graph created for Ireland and England's NOAA climate daily data. There are three main components contributing to this data portal: the first is the upper-layer schema of the knowledge graph, the climate analysis (CA) ontology; the second is an ad hoc SPARQL server that stores the graph data and provides public Web access; the last is a dereferencing engine deployed to resolve URIs for entity information. Our knowledge graph form of NOAA climate data facilitates the supply of semantic climate information to researchers and offers a variety of semantic applications that can be built on top of it.
Submitted 19 October, 2021;
originally announced October 2021.
-
Ontology Modeling for Decentralized Household Energy Systems
Authors:
Jiantao Wu,
Fabrizio Orlandi,
Tarek AlSkaif,
Declan O'Sullivan,
Soumyabrata Dev
Abstract:
In a decentralized household energy system consisting of various devices such as washing machines, heat pumps, and solar panels, understanding the electric energy consumption and production data at the granularity of the device helps end-users be closer to the system and further achieve the sustainability of energy use. However, many datasets in this area are isolated from other domains with records of only energy-related data. This may cause a loss of information (e.g. weather) that is relevant to the energy use of each device. A noticeable disadvantage is that many of those datasets have to be used in computational modeling approaches such as machine learning models, which are vulnerable to the data feed, to advance the understanding of energy consumption and production. Although such computational methods have achieved a high benchmark merely through a local view of datasets, the reusability cannot be firmly guaranteed when the information omission is taken into account. This paper addresses the data isolation problem in the smart energy systems area by exploring Semantic Web techniques on top of a household energy system. We propose an ontology modeling solution for the management of decentralized data at the resolution of a device in the system. As a result, the scope of the data concerning each device can be easily extended across the Web, and more information that may be of interest, such as weather, can be retrieved from the Web if the data are structured by the ontology.
Submitted 3 August, 2021;
originally announced August 2021.
-
An Ontology Model for Climatic Data Analysis
Authors:
Jiantao Wu,
Fabrizio Orlandi,
Declan O'Sullivan,
Soumyabrata Dev
Abstract:
Recently, ontologies have been exploited in a wide range of research areas for data modeling and data management. They greatly assist in defining the semantic model of the underlying data combined with domain knowledge. In this paper, we propose the Climate Analysis (CA) Ontology to model climate datasets used by remote sensing analysts. We use the data published by the National Oceanic and Atmospheric Administration (NOAA) to further explore how ontology modeling can be used to facilitate the field of climatic data processing. The idea of this work is to convert relational climate data to the Resource Description Framework (RDF) data model, so that it can be stored in a graph database and easily accessed through the Web as Linked Data. Typically, this provides climate researchers, who are interested in datasets such as NOAA, with the potential of enriching and interlinking with other databases. As a result, our approach facilitates data integration and analysis of diverse climatic data sources and allows researchers to interrogate these sources directly on the Web using the standard SPARQL query language.
Submitted 6 June, 2021;
originally announced June 2021.
-
Using Mapping Languages for Building Legal Knowledge Graphs from XML Files
Authors:
Ademar Crotti Junior,
Fabrizio Orlandi,
Declan O'Sullivan,
Christian Dirschl,
Quentin Reul
Abstract:
This paper presents our experience in building RDF knowledge graphs for an industrial use case in the legal domain. The information contained in legal information systems is often accessed through simple keyword interfaces and presented as a simple list of hits. In order to improve search accuracy one may avail of knowledge graphs, where the semantics of the data can be made explicit. Significant research effort has been invested in the area of building knowledge graphs from semi-structured text documents, such as XML, with the prevailing approach being the use of mapping languages. In this paper, we present a semantic model for representing legal documents together with an industrial use case. We also present a set of use case requirements based on the proposed semantic model, which are used to compare and discuss the use of state-of-the-art mapping languages for building knowledge graphs for legal data.
Submitted 18 November, 2019;
originally announced November 2019.
-
Interlinking Heterogeneous Data for Smart Energy Systems
Authors:
Fabrizio Orlandi,
Alan Meehan,
Murhaf Hossari,
Soumyabrata Dev,
Declan O'Sullivan,
Tarek AlSkaif
Abstract:
Smart energy systems in general, and solar energy analysis in particular, have recently gained increasing interest. This is mainly due to a stronger focus on smart energy saving solutions and recent developments in photovoltaic (PV) cells. Various data-driven and machine-learning frameworks are being proposed by the research community. However, these frameworks perform their analysis on, and are designed for, specific, heterogeneous and isolated datasets, distributed across different sites and sources, making it hard to compare results and reproduce the analysis on similar data. We propose an approach based on Web (W3C) standards and Linked Data technologies for representing and converting PV and weather records into a Resource Description Framework (RDF) graph-based data format. This format, together with the presented approach, is ideal in a data integration scenario where data needs to be converted into a homogeneous form and different datasets could be interlinked for distributed analysis.
Submitted 5 July, 2019;
originally announced July 2019.
-
Disintermediation of Inter-Blockchain Transactions
Authors:
S. Matthew English,
Fabrizio Orlandi,
Soeren Auer
Abstract:
Different versions of peer-to-peer electronic cash exist as data represented by separate blockchains. Payments between such systems cannot be sent directly from one party to another without going through a financial institution. Bitcoin provided part of the solution but its utility is limited to intra-blockchain transactions. The benefits are lost if a trusted third party is required to execute inter-blockchain transactions. We propose a solution to the inter-blockchain transaction problem using the same fundamental principles of Bitcoin. The protocol is described by the Uberledger framework, a hierarchical meta-blockchain layer that encapsulates information regarding the fidelity of peer-to-peer transaction facilitators.
Submitted 8 September, 2016;
originally announced September 2016.
-
Towards Cleaning-up Open Data Portals: A Metadata Reconciliation Approach
Authors:
Alan Tygel,
Sören Auer,
Jeremy Debattista,
Fabrizio Orlandi,
Maria Luiza Machado Campos
Abstract:
This paper presents an approach for metadata reconciliation, curation and linking for Open Governmental Data Portals (ODPs). ODPs have lately been the standard solution for governments willing to make their public data available to society. Portal managers use several types of metadata to organize the datasets, one of the most important being tags. However, the tagging process is subject to many problems, such as synonyms, ambiguity or incoherence, among others. As our empirical analysis of ODPs shows, these issues are currently prevalent in most ODPs and effectively hinder the reuse of Open Data. In order to address these problems, we develop and implement an approach for tag reconciliation in Open Data Portals, encompassing local actions related to individual portals, and global actions for adding a semantic metadata layer above individual portals. The local part aims to enhance the quality of tags in a single portal, and the global part is meant to interlink ODPs by establishing relations between tags.
Submitted 15 October, 2015;
originally announced October 2015.
-
Interest-based RDF Update Propagation
Authors:
Kemele M. Endris,
Sidra Faisal,
Fabrizio Orlandi,
Sören Auer,
Simon Scerri
Abstract:
Many LOD datasets, such as DBpedia and LinkedGeoData, are voluminous and process large amounts of requests from diverse applications. Many data products and services rely on full or partial local LOD replications to ensure faster querying and processing. While such replicas enhance the flexibility of information sharing and integration infrastructures, they also introduce data duplication with all the associated undesirable consequences. Given the evolving nature of the original and authoritative datasets, frequent replacements are required at a great cost to ensure consistent and up-to-date replicas. In this paper, we introduce an approach for interest-based RDF update propagation, which propagates only interesting parts of updates from the source to the target dataset. Effectively, this enables remote applications to 'subscribe' to relevant datasets and consistently reflect the necessary changes locally without the need to frequently replace the entire dataset (or a relevant subset). Our approach is based on a formal definition for graph-pattern-based interest expressions that is used to filter interesting parts of updates from the source. We implement the approach in the iRap framework and perform a comprehensive evaluation based on DBpedia Live updates, to confirm the validity and value of our approach.
Submitted 26 May, 2015;
originally announced May 2015.
-
"How much?" Is Not Enough - An Analysis of Open Budget Initiatives
Authors:
Alan Freihof Tygel,
Judie Attard,
Fabrizio Orlandi,
Maria Luiza Machado Campos,
Sören Auer
Abstract:
A worldwide movement towards the publication of Open Government Data is taking place, and budget data is one of the key elements pushing this trend. Its importance is mostly related to transparency, but publishing budget data, combined with other actions, can also improve democratic participation, allow comparative analysis of governments and boost data-driven business. However, the lack of standards and common evaluation criteria still hinders the development of appropriate tools and the realization of these benefits. In this paper, we present a model to analyse government initiatives to publish budget data. We identify the main features of these initiatives with a double objective: (i) to drive a structured analysis, relating some dimensions to their possible impacts, and (ii) to derive characterization attributes to compare initiatives based on each dimension. We define use perspectives and analyse some initiatives using this model. We conclude that, in order to favour use perspectives, special attention must be given to user feedback, semantic standards and linking possibilities.
Submitted 7 April, 2015;
originally announced April 2015.