Showing 1–32 of 32 results for author: Bonifati, A

  1. arXiv:2407.04823  [pdf, other]

    cs.DB

    Path-based Algebraic Foundations of Graph Query Languages

    Authors: Renzo Angles, Angela Bonifati, Roberto García, Domagoj Vrgoč

    Abstract: Graph databases are gaining momentum thanks to the flexibility and expressiveness of their data model and query languages. A standardization activity driven by the ISO/IEC standardization body is also ongoing and has already led to the specification of the first versions of two standard graph query languages, namely SQL/PGQ and GQL, respectively in 2023 and 2024. Apart from the standards, th…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Under review

  2. arXiv:2406.13062  [pdf, other]

    cs.DB

    Transforming Property Graphs

    Authors: Angela Bonifati, Filip Murlak, Yann Ramusat

    Abstract: In this paper, we study a declarative framework for specifying transformations of property graphs. In order to express such transformations, we leverage queries formulated in the Graph Pattern Calculus (GPC), which is an abstraction of the common core of recent standard graph query languages, GQL and SQL/PGQ. In contrast to previous frameworks targeting graph topology only, we focus on the impact…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: To appear in VLDB 2024

  3. arXiv:2406.06754  [pdf, other]

    cs.DB

    Incremental Sliding Window Connectivity over Streaming Graphs

    Authors: Chao Zhang, Angela Bonifati, M. Tamer Özsu

    Abstract: We study index-based processing for connectivity queries within sliding windows on streaming graphs. These queries, which determine whether two vertices belong to the same connected component, are fundamental operations in real-time graph data processing and demand high throughput and low latency. While indexing methods that leverage data structures for fully dynamic connectivity can facilitate ef…

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: To appear in VLDB 2024

  4. arXiv:2405.13015  [pdf, other]

    cs.CL cs.AI

    Assisted Debate Builder with Large Language Models

    Authors: Elliot Faugier, Frédéric Armetta, Angela Bonifati, Bruno Yun

    Abstract: We introduce ADBL2, an assisted debate builder tool. It is based on the capability of large language models to generalise and perform relation-based argument mining in a wide variety of domains. It is the first open-source tool that leverages relation-based mining for (1) the verification of pre-established relations in a debate and (2) the assisted creation of new arguments by means of large lang…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 7 pages, 2 figures

  5. arXiv:2311.03542  [pdf, other]

    cs.DB

    Indexing Techniques for Graph Reachability Queries

    Authors: Chao Zhang, Angela Bonifati, M. Tamer Özsu

    Abstract: We survey graph reachability indexing techniques for efficient processing of graph reachability queries in two types of popular graph models: plain graphs and edge-labeled graphs. Reachability queries are fundamental in graph processing, and reachability indexes are specialized data structures tailored for speeding up such queries. Work on this topic goes back four decades -- we include 33 of the…

    Submitted 6 November, 2023; originally announced November 2023.

  6. arXiv:2308.06374  [pdf, other]

    cs.AI cs.CL

    Large Language Models and Knowledge Graphs: Opportunities and Challenges

    Authors: Jeff Z. Pan, Simon Razniewski, Jan-Christoph Kalo, Sneha Singhania, Jiaoyan Chen, Stefan Dietze, Hajira Jabeen, Janna Omeliyanenko, Wen Zhang, Matteo Lissandrini, Russa Biswas, Gerard de Melo, Angela Bonifati, Edlira Vakaj, Mauro Dragoni, Damien Graux

    Abstract: Large Language Models (LLMs) have taken Knowledge Representation -- and the world -- by storm. This inflection point marks a shift from explicit knowledge representation to a renewed focus on the hybrid representation of both explicit knowledge and parametric knowledge. In this position paper, we will discuss some of the common debate points within the community on LLMs (parametric knowledge) and…

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 30 pages

  7. arXiv:2306.01388  [pdf, other]

    cs.DB

    From Large Language Models to Databases and Back: A discussion on research and education

    Authors: Sihem Amer-Yahia, Angela Bonifati, Lei Chen, Guoliang Li, Kyuseok Shim, Jianliang Xu, Xiaochun Yang

    Abstract: This discussion was conducted at a recent panel at the 28th International Conference on Database Systems for Advanced Applications (DASFAA 2023), held April 17-20, 2023 in Tianjin, China. The title of the panel was "What does LLM (ChatGPT) Bring to Data Science Research and Education? Pros and Cons". It was moderated by Lei Chen and Xiaochun Yang. The discussion raised several questions on how lar…

    Submitted 7 July, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 7 pages, 2 figures, the Panel at the 28th International Conference on Database Systems for Advanced Applications (DASFAA 2023)

  8. PG-Schema: Schemas for Property Graphs

    Authors: Renzo Angles, Angela Bonifati, Stefania Dumbrava, George Fletcher, Alastair Green, Jan Hidders, Bei Li, Leonid Libkin, Victor Marsault, Wim Martens, Filip Murlak, Stefan Plantikow, Ognjen Savković, Michael Schmidt, Juan Sequeda, Sławek Staworko, Dominik Tomaszuk, Hannes Voigt, Domagoj Vrgoč, Mingxi Wu, Dušan Živković

    Abstract: Property graphs have reached a high level of maturity, witnessed by multiple robust graph database systems as well as the ongoing ISO standardization effort aiming at creating a new standard Graph Query Language (GQL). Yet, despite documented demand, schema support is limited both in existing systems and in the first version of the GQL Standard. It is anticipated that the second version of the GQL…

    Submitted 8 July, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: 26 pages

    Journal ref: Proc. ACM Manag. Data (2023)

  9. arXiv:2203.08606  [pdf, other]

    cs.DB

    A Reachability Index for Recursive Label-Concatenated Graph Queries

    Authors: Chao Zhang, Angela Bonifati, Hugo Kapp, Vlad Ioan Haprian, Jean-Pierre Lozi

    Abstract: Reachability queries checking the existence of a path from a source node to a target node are fundamental operators for querying and processing graph data. Current approaches for index-based evaluation of reachability queries either focus on plain reachability or constraint-based reachability with alternation only. In this paper, for the first time we study the problem of index-based processing fo…

    Submitted 20 July, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

  10. arXiv:2112.09011  [pdf, other]

    cs.DB

    Provenance-aware Discovery of Functional Dependencies on Integrated Views

    Authors: Ugo Comignani, Laure Berti-Équille, Noël Novelli, Angela Bonifati

    Abstract: The automatic discovery of functional dependencies (FDs) has been widely studied as one of the hardest problems in data profiling. Existing approaches have focused on making the FD computation efficient while inspecting single relations at a time. In this paper, for the first time we address the problem of inferring FDs for multiple relations as they occur in integrated views by solely using the fu…

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 12 pages + biblio and appendices. arXiv admin note: text overlap with arXiv:2012.06237

  11. arXiv:2106.15703  [pdf, other]

    cs.DB

    Threshold Queries in Theory and in the Wild

    Authors: Angela Bonifati, Stefania Dumbrava, George Fletcher, Jan Hidders, Matthias Hofer, Wim Martens, Filip Murlak, Joshua Shinavier, Sławek Staworko, Dominik Tomaszuk

    Abstract: Threshold queries are an important class of queries that only require computing or counting answers up to a specified threshold value. To the best of our knowledge, threshold queries have been largely disregarded in the research literature, which is surprising considering how common they are in practice. In this paper, we present a deep theoretical analysis of threshold query evaluation and show t…

    Submitted 17 November, 2021; v1 submitted 29 June, 2021; originally announced June 2021.

  12. arXiv:2101.12305  [pdf, other]

    cs.DB

    Evaluating Complex Queries on Streaming Graphs

    Authors: Anil Pacaci, Angela Bonifati, M. Tamer Özsu

    Abstract: We study the problem of evaluating persistent queries over streaming graphs in a principled fashion. These queries need to be evaluated over unbounded and very high speed graph streams. We define a streaming graph data model and query model incorporating navigational queries, subgraph queries and paths as first-class citizens. To support this full-fledged query model we develop a streaming graph a…

    Submitted 1 August, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 18 pages; typos fixed; examples, experimental setup and analysis updated

  13. arXiv:2012.06171  [pdf, other]

    cs.DC cs.DB

    The Future is Big Graphs! A Community View on Graph Processing Systems

    Authors: Sherif Sakr, Angela Bonifati, Hannes Voigt, Alexandru Iosup, Khaled Ammar, Renzo Angles, Walid Aref, Marcelo Arenas, Maciej Besta, Peter A. Boncz, Khuzaima Daudjee, Emanuele Della Valle, Stefania Dumbrava, Olaf Hartig, Bernhard Haslhofer, Tim Hegeman, Jan Hidders, Katja Hose, Adriana Iamnitchi, Vasiliki Kalavri, Hugo Kapp, Wim Martens, M. Tamer Özsu, Eric Peukert, Stefan Plantikow, et al. (16 additional authors not shown)

    Abstract: Graphs are by nature unifying abstractions that can leverage interconnectedness to represent, explore, predict, and explain real- and digital-world phenomena. Although real users and consumers of graph instances and graph workloads understand these abstractions, future problems will require new abstractions and systems. What needs to happen in the next decade for big graph processing to continue t…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 12 pages, 3 figures, collaboration between the large-scale systems and data management communities, work started at the Dagstuhl Seminar 19491 on Big Graph Processing Systems, to be published in the Communications of the ACM

    ACM Class: C.3; E.0; H.2; J.0

  14. arXiv:2010.07386  [pdf, other]

    cs.DB

    Valentine: Evaluating Matching Techniques for Dataset Discovery

    Authors: Christos Koutras, George Siachamis, Andra Ionescu, Kyriakos Psarakis, Jerry Brons, Marios Fragkoulis, Christoph Lofi, Angela Bonifati, Asterios Katsifodimos

    Abstract: Data scientists today search large data lakes to discover and integrate datasets. In order to bring together disparate data sources, dataset discovery methods rely on some form of schema matching: the process of establishing correspondences between datasets. Traditionally, schema matching has been used to find matching pairs of columns between a source and a target schema. However, the use of sche…

    Submitted 13 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

  15. arXiv:2004.14794  [pdf, other]

    cs.DB

    Graph Summarization

    Authors: Angela Bonifati, Stefania Dumbrava, Haridimos Kondylakis

    Abstract: The continuous and rapid growth of highly interconnected datasets, which are both voluminous and complex, calls for the development of adequate processing and analytical techniques. One method for condensing and simplifying such datasets is graph summarization. It denotes a series of application-specific algorithms designed to transform graphs into more compact representations while preserving str…

    Submitted 12 May, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: To appear in the Encyclopedia of Big Data Technologies

  16. arXiv:2004.07668  [pdf, ps, other]

    cs.DB cs.HC

    Holding a Conference Online and Live due to COVID-19

    Authors: Angela Bonifati, Giovanna Guerrini, Carsten Lutz, Wim Martens, Lara Mazilu, Norman Paton, Marcos Antonio Vaz Salles, Marc H. Scholl, Yongluan Zhou

    Abstract: The joint EDBT/ICDT conference (International Conference on Extending Database Technology/International Conference on Database Theory) is a well established conference series on data management, with annual meetings in the second half of March that attract 250 to 300 delegates. Three weeks before EDBT/ICDT 2020 was planned to take place in Copenhagen, the rapidly developing Covid-19 pandemic led t…

    Submitted 20 April, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

  17. arXiv:2004.02012  [pdf, other]

    cs.DB

    Regular Path Query Evaluation on Streaming Graphs

    Authors: Anil Pacaci, Angela Bonifati, M. Tamer Özsu

    Abstract: We study persistent query evaluation over streaming graphs, which is becoming increasingly important. We focus on navigational queries that determine if there exists a path between two entities that satisfies a user-specified constraint. We adopt the Regular Path Query (RPQ) model that specifies navigational patterns with labeled constraints. We propose deterministic algorithms to efficiently eval…

    Submitted 4 April, 2020; originally announced April 2020.

    Comments: A shorter version of this paper has been accepted for publication in 2020 International Conference on Management of Data (SIGMOD 2020)

  18. arXiv:2001.07906  [pdf, ps, other]

    cs.DB cs.IR cs.SI

    Graph Generators: State of the Art and Open Challenges

    Authors: Angela Bonifati, Irena Holubová, Arnau Prat-Pérez, Sherif Sakr

    Abstract: The abundance of interconnected data has fueled the design and implementation of graph generators reproducing real-world linking properties, or gauging the effectiveness of graph algorithms, techniques and applications manipulating these data. We consider graph generation across multiple subfields, such as Semantic Web, graph databases, social networks, and community detection, along with general…

    Submitted 22 January, 2020; originally announced January 2020.

    Comments: ACM Computing Surveys, 32 pages

  19. arXiv:1903.09242  [pdf, other]

    cs.DB

    Repairing mappings under policy views

    Authors: Angela Bonifati, Ugo Comignani, Efthymia Tsamoura

    Abstract: The problem of data exchange involves a source schema, a target schema and a set of mappings for transforming the data between the two schemas. We study the problem of data exchange in the presence of privacy restrictions on the source. The privacy restrictions are expressed as a set of policy views representing the information that is safe to expose over all instances of the source. We propose a…

    Submitted 21 March, 2019; originally announced March 2019.

    Comments: 12 pages

  20. arXiv:1902.06427  [pdf, ps, other]

    cs.DB

    Schema Validation and Evolution for Graph Databases

    Authors: Angela Bonifati, Peter Furniss, Alastair Green, Russ Harmer, Eugenia Oshurko, Hannes Voigt

    Abstract: Despite the maturity of commercial graph databases, little consensus has been reached so far on the standardization of data definition languages (DDLs) for property graphs (PG). The discussion on the characteristics of PG schemas is ongoing in many standardization and community groups. Although some basic aspects of a schema are already present in Neo4j 3.5, like in most commercial graph databases…

    Submitted 18 February, 2019; originally announced February 2019.

    Comments: 36 pages, 9 figures

  21. arXiv:1811.11561  [pdf, other]

    cs.DB

    Approximate Evaluation of Label-Constrained Reachability Queries

    Authors: Stefania Dumbrava, Angela Bonifati, Amaia Nazabal Ruiz Diaz, Romain Vuillemot

    Abstract: The current surge of interest in graph-based data models mirrors the usage of increasingly complex reachability queries, as witnessed by recent analytical studies on real-world graph query logs. Despite the maturity of graph DBMS capabilities, complex label-constrained reachability queries, along with their corresponding aggregate versions, remain difficult to evaluate. In this paper, we focus on…

    Submitted 28 November, 2018; originally announced November 2018.

  22. arXiv:1804.10565  [pdf, other]

    cs.DB cs.LO cs.PL

    Certified Graph View Maintenance with Regular Datalog

    Authors: Angela Bonifati, Stefania Dumbrava, Emilio Jesus Gallego Arias

    Abstract: We employ the Coq proof assistant to develop a mechanically-certified framework for evaluating graph queries and incrementally maintaining materialized graph instances, also called views. The language we use for defining queries and views is Regular Datalog (RD) -- a notable fragment of non-recursive Datalog that can express complex navigational queries, with transitive closure as native operator.…

    Submitted 27 April, 2018; originally announced April 2018.

    Comments: Paper presented at the 34th International Conference on Logic Programming (ICLP 2018), Oxford, UK, July 14 to July 17, 2018. 18 pages, LaTeX, (arXiv:YYMM.NNNNN)

  23. arXiv:1708.00363  [pdf, ps, other]

    cs.DB

    An Analytical Study of Large SPARQL Query Logs

    Authors: Angela Bonifati, Wim Martens, Thomas Timm

    Abstract: With the adoption of RDF as the data model for Linked Data and the Semantic Web, query specification from end-users has become more and more common in SPARQL endpoints. In this paper, we conduct an in-depth analytical study of the queries formulated by end-users and harvested from large and up-to-date query logs from a wide variety of RDF data sources. As opposed to previous studies, ours is th…

    Submitted 1 August, 2017; originally announced August 2017.

  24. arXiv:1602.00563  [pdf, other]

    cs.DB

    Functional Dependencies Unleashed for Scalable Data Exchange

    Authors: Angela Bonifati, Ioana Ileana, Michele Linardi

    Abstract: We address the problem of efficiently evaluating target functional dependencies (fds) in the Data Exchange (DE) process. Target fds naturally occur in many DE scenarios, including the ones in Life Sciences in which multiple source relations need to be structured under a constrained target schema. However, despite their wide use, target fds' evaluation is still a bottleneck in the state-of-the-art…

    Submitted 16 April, 2016; v1 submitted 1 February, 2016; originally announced February 2016.

  25. gMark: Schema-Driven Generation of Graphs and Queries

    Authors: Guillaume Bagan, Angela Bonifati, Radu Ciucanu, George H. L. Fletcher, Aurélien Lemay, Nicky Advokaat

    Abstract: Massive graph data sets are pervasive in contemporary application domains. Hence, graph database systems are becoming increasingly important. In the experimental study of these systems, it is vital that the research community has shared solutions for the generation of database instances and query workloads having predictable and controllable properties. In this paper, we present the design and eng…

    Submitted 6 December, 2016; v1 submitted 26 November, 2015; originally announced November 2015.

    Comments: Accepted in November 2016 in IEEE Transactions on Knowledge and Data Engineering 2017. URL: http://ieeexplore.ieee.org/document/7762945/

  26. arXiv:1503.01707  [pdf, ps, other]

    cs.DB cs.AI cs.LO

    Mapping-equivalence and oid-equivalence of single-function object-creating conjunctive queries

    Authors: Angela Bonifati, Werner Nutt, Riccardo Torlone, Jan Van den Bussche

    Abstract: Conjunctive database queries have been extended with a mechanism for object creation to capture important applications such as data exchange, data integration, and ontology-based data access. Object creation generates new object identifiers in the result that do not belong to the set of constants in the source database. The new object identifiers can also be seen as Skolem terms. Hence, object-cr…

    Submitted 12 January, 2016; v1 submitted 5 March, 2015; originally announced March 2015.

    Comments: This revised version has been accepted on 11 January 2016 for publication in The VLDB Journal

  27. arXiv:1212.6857  [pdf, other]

    cs.DB cs.DM

    A Trichotomy for Regular Simple Path Queries on Graphs

    Authors: Guillaume Bagan, Angela Bonifati, Benoit Groz

    Abstract: Regular path queries (RPQs) select nodes connected by some path in a graph. The edge labels of such a path have to form a word that matches a given regular expression. We investigate the evaluation of RPQs with an additional constraint that prevents multiple traversals of the same nodes. Those regular simple path queries (RSPQs) find several applications in practice, yet they quickly become intrac…

    Submitted 31 December, 2012; originally announced December 2012.

    Comments: 15 pages, conference submission

    MSC Class: 05CXX

    ACM Class: E.2; F.2.2; G.2.2; H.2.3

  28. arXiv:1111.6084  [pdf, other]

    cs.DB cs.SI

    Semantic Query Reformulation in Social PDMS

    Authors: Angela Bonifati, Gianvito Summa, Esther Pacitti, Fady Draidi

    Abstract: We consider social peer-to-peer data management systems (PDMS), where each peer maintains both semantic mappings between its schema and some acquaintances, and social links with peer friends. In this context, reformulating a query from a peer's schema into other peers' schemas is a hard problem, as it may generate as many rewritings as the set of mappings from that peer to the outside and transiti…

    Submitted 25 November, 2011; originally announced November 2011.

    Comments: 29 pages, 8 figures, query rewriting in PDMS

  29. arXiv:1010.2148  [pdf, other]

    cs.DB

    Ontological Matchmaking in Recommender Systems

    Authors: Angela Bonifati, Giansalvatore Mecca, Domenica Sileo, Gianvito Summa

    Abstract: The electronic marketplace offers great potential for the recommendation of supplies. In the so-called recommender systems, it is crucial to apply matchmaking strategies that faithfully satisfy the predicates specified in the demand, and take into account as much as possible the user preferences. We focus on real-life ontology-driven matchmaking scenarios and identify a number of challenges, being…

    Submitted 11 October, 2010; originally announced October 2010.

    Comments: 28 pages, 8 figures

  30. arXiv:cs/0602039  [pdf, ps, other]

    cs.DB

    Path Summaries and Path Partitioning in Modern XML Databases

    Authors: Andrei Arion, Angela Bonifati, Ioana Manolescu, Andrea Pugliese

    Abstract: We study the applicability of XML path summaries in the context of current-day XML databases. We find that summaries provide an excellent basis for optimizing data access methods, which furthermore mixes very well with path-partitioned stores. We provide practical algorithms for building and exploiting summaries, and prove its benefits through extensive experiments.

    Submitted 10 February, 2006; originally announced February 2006.

  31. arXiv:cs/0506002  [pdf, ps, other]

    cs.DB

    HepToX: Heterogeneous Peer to Peer XML Databases

    Authors: Angela Bonifati, Elaine Qing Chang, Terence Ho, Laks V. S. Lakshmanan

    Abstract: We study a collection of heterogeneous XML databases maintaining similar and related information, exchanging data via a peer to peer overlay network. In this setting, a mediated global schema is unrealistic. Yet, users/applications wish to query the databases via one peer using its schema. We have recently developed HepToX, a P2P Heterogeneous XML database system. A key idea is that whenever a p…

    Submitted 31 May, 2005; originally announced June 2005.

    Comments: 11 pages plus cover page

    Report number: UBC TR-2005-15

    ACM Class: H.2.4; H.2.5

  32. arXiv:cs/9912015  [pdf, ps, other]

    cs.DB

    Comparative Analysis of Five XML Query Languages

    Authors: Angela Bonifati, Stefano Ceri

    Abstract: XML is becoming the most relevant new standard for data representation and exchange on the WWW. Novel languages for extracting and restructuring the XML content have been proposed, some in the tradition of database query languages (i.e. SQL, OQL), others more closely inspired by XML. No standard for XML query language has yet been decided, but the discussion is ongoing within the World Wide Web…

    Submitted 22 December, 1999; originally announced December 1999.

    Comments: TeX v3.1415, 17 pages, 6 figures, to be published in ACM Sigmod Record, March 2000

    Report number: Dipartimento di Elettronica e Informazione, Politecnico di Milano (Italy) Technical Report nr.99-76

    ACM Class: H.2; H.2.3; I.7; I.7.1; I.7.2