Skip to main content

Showing 1–50 of 56 results for author: Vidal, M

  1. iASiS: Towards Heterogeneous Big Data Analysis for Personalized Medicine

    Authors: Anastasia Krithara, Fotis Aisopos, Vassiliki Rentoumi, Anastasios Nentidis, Konstantinos Bougatiotis, Maria-Esther Vidal, Ernestina Menasalvas, Alejandro Rodriguez-Gonzalez, Eleftherios G. Samaras, Peter Garrard, Maria Torrente, Mariano Provencio Pulla, Nikos Dimakopoulos, Rui Mauricio, Jordi Rambla De Argila, Gian Gaetano Tartaglia, George Paliouras

    Abstract: The vision of IASIS project is to turn the wave of big biomedical data heading our way into actionable knowledge for decision makers. This is achieved by integrating data from disparate sources, including genomics, electronic health records and bibliography, and applying advanced analytics methods to discover useful patterns. The goal is to turn large amounts of available data into actionable info… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 6 pages, 2 figures, accepted at 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)

    Journal ref: 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS), Cordoba, Spain, 2019, pp. 106-111

  2. arXiv:2407.00509  [pdf, other

    cs.AI

    Leveraging Ontologies to Document Bias in Data

    Authors: Mayra Russo, Maria-Esther Vidal

    Abstract: Machine Learning (ML) systems are capable of reproducing and often amplifying undesired biases. This puts emphasis on the importance of operating under practices that enable the study and understanding of the intrinsic characteristics of ML pipelines, prompting the emergence of documentation frameworks with the idea that ``any remedy for bias starts with awareness of its existence''. However, a re… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  3. Adaptive Artificial Immune Networks for Mitigating DoS flooding Attacks

    Authors: Jorge Maestre Vidal, Ana Lucila Sandoval Orozco, Luis Javier García Villalba

    Abstract: Denial of service attacks pose a threat in constant growth. This is mainly due to their tendency to gain in sophistication, ease of implementation, obfuscation and the recent improvements in occultation of fingerprints. On the other hand, progress towards self-organizing networks, and the different techniques involved in their development, such as software-defined networking, network-function virt… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Journal ref: J. Maestre Vidal, A. L. Sandoval Orozco, L. J. García Villalba: Adaptive Artificial Immune Networks for Mitigating DoS Flooding Attacks. Swarm and Evolutionary Computation. Vol. 38, pp. 3894-108, February 2018

  4. arXiv:2402.05571  [pdf

    cs.CL cs.LG

    Traditional Machine Learning Models and Bidirectional Encoder Representations From Transformer (BERT)-Based Automatic Classification of Tweets About Eating Disorders: Algorithm Development and Validation Study

    Authors: José Alberto Benítez-Andrades, José-Manuel Alija-Pérez, Maria-Esther Vidal, Rafael Pastor-Vargas, María Teresa García-Ordás

    Abstract: Background: Eating disorders are increasingly prevalent, and social networks offer valuable information. Objective: Our goal was to identify efficient machine learning models for categorizing tweets related to eating disorders. Methods: Over three months, we collected tweets about eating disorders. A 2,000-tweet subset was labeled for: (1) being written by individuals with eating disorders, (2… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Journal ref: JMIR Medical Informatics, Volume 10, Issue 2, 2022, ID e34492

  5. arXiv:2402.05536  [pdf

    cs.LG cs.CL

    Empowering machine learning models with contextual knowledge for enhancing the detection of eating disorders in social media posts

    Authors: José Alberto Benítez-Andrades, María Teresa García-Ordás, Mayra Russo, Ahmad Sakor, Luis Daniel Fernandes Rotger, Maria-Esther Vidal

    Abstract: Social networks are vital for information sharing, especially in the health sector for discussing diseases and treatments. These platforms, however, often feature posts as brief texts, posing challenges for Artificial Intelligence (AI) in understanding context. We introduce a novel hybrid approach combining community-maintained knowledge graphs (like Wikidata) with deep learning to enhance the cat… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Journal ref: Semantic Web, Volume 4, Issue 5, pp. 873-892, 2023

  6. A novel pattern recognition system for detecting Android malware by analyzing suspicious boot sequences

    Authors: Jorge Maestre Vidal, Marco Antonio Sotelo Monge, Luis Javier García Villalba

    Abstract: This paper introduces a malware detection system for smartphones based on studying the dynamic behavior of suspicious applications. The main goal is to prevent the installation of the malicious software on the victim systems. The approach focuses on identifying malware addressed against the Android platform. For that purpose, only the system calls performed during the boot process of the recently… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Journal ref: Knowledge-Based Systems. Vol. 150, pp. 198-217, June 2018

  7. arXiv:2402.03369  [pdf

    eess.AS cs.CL cs.LG cs.SD

    Evaluation of Google's Voice Recognition and Sentence Classification for Health Care Applications

    Authors: Majbah Uddin, Nathan Huynh, Jose M Vidal, Kevin M Taaffe, Lawrence D Fredendall, Joel S Greenstein

    Abstract: This study examined the use of voice recognition technology in perioperative services (Periop) to enable Periop staff to record workflow milestones using mobile technology. The use of mobile technology to improve patient flow and quality of care could be facilitated if such voice recognition technology could be made robust. The goal of this experiment was to allow the Periop staff to provide care… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Journal ref: Engineering Management Journal, 27:3, 152-162, 2015

  8. arXiv:2310.19503  [pdf, other

    cs.AI cs.CY cs.MA

    Trust, Accountability, and Autonomy in Knowledge Graph-based AI for Self-determination

    Authors: Luis-Daniel Ibáñez, John Domingue, Sabrina Kirrane, Oshani Seneviratne, Aisling Third, Maria-Esther Vidal

    Abstract: Knowledge Graphs (KGs) have emerged as fundamental platforms for powering intelligent decision-making and a wide range of Artificial Intelligence (AI) services across major corporations such as Google, Walmart, and AirBnb. KGs complement Machine Learning (ML) algorithms by providing data context and semantics, thereby enabling further inference and question-answering capabilities. The integration… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  9. arXiv:2211.08190  [pdf

    cs.RO

    Reconocimiento de Objetos a partir de Nube de Puntos en un Veículo Aéreo no Tripulado

    Authors: Agustina Marion de Freitas Vidal, Anthony Rodriguez, Richard Suarez, André Kelbouscas, Ricardo Grando

    Abstract: Currently, research in robotics, artificial intelligence and drones are advancing exponentially, they are directly or indirectly related to various areas of the economy, from agriculture to industry. With this context, this project covers these topics guiding them, seeking to provide a framework that is capable of helping to develop new future researchers. For this, we use an aerial vehicle that w… ▽ More

    Submitted 23 October, 2022; originally announced November 2022.

    Comments: in Spanish language. Articulo aceptado en la FEBITEC 2022

  10. arXiv:2210.15645  [pdf, other

    cs.DB cs.OH cs.PL

    Dragoman: Efficiently Evaluating Declarative Mapping Languages over Frameworks for Knowledge Graph Creation

    Authors: Samaneh Jozashoori, Enrique Iglesias, Maria-Esther Vidal

    Abstract: In recent years, there have been valuable efforts and contributions to make the process of RDF knowledge graph creation traceable and transparent; extending and applying declarative mapping languages is an example. One challenging step is the traceability of procedures that aim to overcome interoperability issues, a.k.a. data-level integration. In most pipelines, data integration is performed by a… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  11. arXiv:2206.07375  [pdf, other

    cs.DB

    Knowledge4COVID-19: A Semantic-based Approach for Constructing a COVID-19 related Knowledge Graph from Various Sources and Analysing Treatments' Toxicities

    Authors: Ahmad Sakor, Samaneh Jozashoori, Emetis Niazmand, Ariam Rivas, Kostantinos Bougiatiotis, Fotis Aisopos, Enrique Iglesias, Philipp D. Rohde, Trupti Padiya, Anastasia Krithara, Georgios Paliouras, Maria-Esther Vidal

    Abstract: In this paper, we present Knowledge4COVID-19, a framework that aims to showcase the power of integrating disparate sources of knowledge to discover adverse drug effects caused by drug-drug interactions among COVID-19 treatments and pre-existing condition drugs. Initially, we focus on constructing the Knowledge4COVID-19 knowledge graph (KG) from the declarative definition of mapping rules using the… ▽ More

    Submitted 7 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

  12. arXiv:2205.13883  [pdf, other

    cs.DB

    Efficient Semantic Summary Graphs for Querying Large Knowledge Graphs

    Authors: Emetis Niazmand, Gezim Sejdiu, Damien Graux, Maria-Esther Vidal

    Abstract: Knowledge Graphs (KGs) integrate heterogeneous data, but one challenge is the development of efficient tools for allowing end users to extract useful insights from these sources of knowledge. In such a context, reducing the size of a Resource Description Framework (RDF) graph while preserving all information can speed up query engines by limiting data shuffle, especially in a distributed setting.… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  13. arXiv:2203.07436  [pdf, other

    cs.CV cs.AI q-bio.QM

    SuperAnimal pretrained pose estimation models for behavioral analysis

    Authors: Shaokai Ye, Anastasiia Filippova, Jessy Lauer, Steffen Schneider, Maxime Vidal, Tian Qiu, Alexander Mathis, Mackenzie Weygandt Mathis

    Abstract: Quantification of behavior is critical in applications ranging from neuroscience, veterinary medicine and animal conservation efforts. A common key step for behavioral analysis is first extracting relevant keypoints on animals, known as pose estimation. However, reliable inference of poses currently requires domain knowledge and manual labeling effort to build supervised models. We present a serie… ▽ More

    Submitted 30 December, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Models and demos available at http://modelzoo.deeplabcut.org

  14. arXiv:2201.09694  [pdf, other

    cs.AI cs.DB

    Scaling Up Knowledge Graph Creation to Large and Heterogeneous Data Sources

    Authors: Enrique Iglesias, Samaneh Jozashoori, Maria-Esther Vidal

    Abstract: RDF knowledge graphs (KG) are powerful data structures to represent factual statements created from heterogeneous data sources. KG creation is laborious and demands data management techniques to be executed efficiently. This paper tackles the problem of the automatic generation of KG creation processes declaratively specified; it proposes techniques for planning and transforming heterogeneous data… ▽ More

    Submitted 26 October, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

  15. EABlock: A Declarative Entity Alignment Block for Knowledge Graph Creation Pipelines

    Authors: Samaneh Jozashoori, Ahmad Sakor, Enrique Iglesias, Maria-Esther Vidal

    Abstract: Despite encoding enormous amount of rich and valuable data, existing data sources are mostly created independently, being a significant challenge to their integration. Mapping languages, e.g., RML and R2RML, facilitate declarative specification of the process of applying meta-data and integrating data into a knowledge graph. Mapping rules can also include knowledge extraction functions in addition… ▽ More

    Submitted 15 December, 2021; v1 submitted 14 December, 2021; originally announced December 2021.

  16. arXiv:2111.07005  [pdf, other

    cs.CR

    Understanding and Assessment of Mission-Centric Key Cyber Terrains for joint Military Operations

    Authors: Álvaro Luis Martínez, Jorge Maestre Vidal, Victor A. Villagrá González

    Abstract: Since the cyberspace consolidated as fifth warfare dimension, the different actors of the defense sector began an arms race toward achieving cyber superiority, on which research, academic and industrial stakeholders contribute from a dual vision, mostly linked to a large and heterogeneous heritage of developments and adoption of civilian cybersecurity capabilities. In this context, augmenting the… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: Preprint of an extended version of the conference "A novel automatic discovery system of critical assets in cyberspace-oriented military missions", in Proc. of the First Workshop on Recent Advances in Cyber Situational Awareness on Military Operations (CSA 2020) held by the 15th ARES International Conference in August 2020. https://doi.org/10.1145/3407023.3409225

  17. arXiv:2107.06999  [pdf

    cs.DC

    Reuse of Semantic Models for Emerging Smart Grids Applications

    Authors: Valentina Janev, Dušan Popadić, Dea Pujić, Maria Esther Vidal, Kemele Endris

    Abstract: Data in the energy domain grows at unprecedented rates. Despite the great potential that IoT platforms and other big data-driven technologies have brought in the energy sector, data exchange and data integration are still not wholly achieved. As a result, fragmented applications are developed against energy data silos, and data exchange is limited to few applications. Therefore, this paper identif… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: Paper presented at the ICIST Conference 2021

  18. Managing Knowledge in Energy Data Spaces

    Authors: Valentina Janev, Maria-Esther Vidal, Kemele Endris, Dea Pujic

    Abstract: Data in the energy domain grows at unprecedented rates and is usually generated by heterogeneous energy systems. Despite the great potential that big data-driven technologies can bring to the energy sector, general adoption is still lagging. Several challenges related to controlled data exchange and data integration are still not wholly achieved. As a result, fragmented applications are developed… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: Based on the article Valentina Janev, Maria-Esther Vidal, Kemele M. Endris, Dea Pujic: Managing Knowledge in Energy Data Spaces. WWW (Companion Volume) 2021: 7-15

    ACM Class: H.2.5; H.2.8

  19. Analyzing a Knowledge Graph of Industry4.0 Standards

    Authors: Irlan Grangel-Gonzalez, Maria-Esther Vidal

    Abstract: In this article, we tackle the problem of standard interoperability across different standardization frameworks, and devise a knowledge-driven approach that allows for the description of standards and standardization frameworks into an Industry 4.0 knowledge graph (I40KG). The STO ontology represents properties of standards and standardization frameworks, as well as relationships among them. The I… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: Based on the paper Irlan Grangel-Gonzalez, Maria-Esther Vidal: Analyzing a Knowledge Graph of Industry 4.0 Standards. WWW (Companion Volume) 2021: 16-25

    ACM Class: H.2.5; H.2.8

  20. arXiv:2105.09312  [pdf, other

    cs.DB

    Knowledge-driven Data Ecosystems Towards Data Transparency

    Authors: Sandra Geisler, Maria-Esther Vidal, Cinzia Cappiello, Bernadette Farias Lóscio, Avigdor Gal, Matthias Jarke, Maurizio Lenzerini, Paolo Missier, Boris Otto, Elda Paja, Barbara Pernici, Jakob Rehof

    Abstract: A Data Ecosystem offers a keystone-player or alliance-driven infrastructure that enables the interaction of different stakeholders and the resolution of interoperability issues among shared data. However, despite years of research in data governance and management, trustability is still affected by the absence of transparent and traceable data-driven pipelines. In this work, we focus on requiremen… ▽ More

    Submitted 21 May, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

  21. arXiv:2103.12115  [pdf, other

    cs.CV

    End-to-End Trainable Multi-Instance Pose Estimation with Transformers

    Authors: Lucas Stoffl, Maxime Vidal, Alexander Mathis

    Abstract: We propose an end-to-end trainable approach for multi-instance pose estimation, called POET (POse Estimation Transformer). Combining a convolutional neural network with a transformer encoder-decoder architecture, we formulate multiinstance pose estimation from images as a direct set prediction problem. Our model is able to directly regress the pose of all individuals, utilizing a bipartite matchin… ▽ More

    Submitted 21 December, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

  22. arXiv:2103.00560  [pdf, other

    cs.CV q-bio.QM

    Perspectives on individual animal identification from biology and computer vision

    Authors: Maxime Vidal, Nathan Wolf, Beth Rosenberg, Bradley P. Harris, Alexander Mathis

    Abstract: Identifying individual animals is crucial for many biological investigations. In response to some of the limitations of current identification methods, new automated computer vision approaches have emerged with strong performance. Here, we review current advances of computer vision identification techniques to provide both computer scientists and biologists with an overview of the available tools… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: 12 pages, 1 figure, 2 boxes and 1 table

    Journal ref: Integr Comp Biol . 2021 Oct 4;61(3):900-916

  23. arXiv:2101.08676  [pdf, other

    cs.NI

    Conceptualization and cases of study on cyber operations against the sustainability of the tactical edge

    Authors: Marco Antonio Sotelo Monge, Jorge Maestre Vidal

    Abstract: The last decade consolidated the cyberspace as fifth domain of operations, which extends its preliminarily intelligence and information exchange purposes towards enabling complex offensive and defensive operations supported/supportively of parallel kinetic domain actuations. Although there is a plethora of well documented cases on strategic and operational interventions of cyber commands, the cybe… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

  24. arXiv:2101.07136  [pdf, other

    cs.DB

    Trav-SHACL: Efficiently Validating Networks of SHACL Constraints

    Authors: Mónica Figuera, Philipp D. Rohde, Maria-Esther Vidal

    Abstract: Knowledge graphs have emerged as expressive data structures for Web data. Knowledge graph potential and the demand for ecosystems to facilitate their creation, curation, and understanding, is testified in diverse domains, e.g., biomedicine. The Shapes Constraint Language (SHACL) is the W3C recommendation language for integrity constraints over RDF knowledge graphs. Enabling quality assements of kn… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

  25. arXiv:2011.09748  [pdf, other

    cs.DB

    Compact Representations for Efficient Storage of Semantic Sensor Data

    Authors: Farah Karim, Maria-Esther Vidal, Sören Auer

    Abstract: Nowadays, there is a rapid increase in the number of sensor data generated by a wide variety of sensors and devices. Data semantics facilitate information exchange, adaptability, and interoperability among several sensors and devices. Sensor data and their meaning can be described using ontologies, e.g., the Semantic Sensor Network (SSN) Ontology. Notwithstanding, semantically enriched, the size o… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

  26. FunMap: Efficient Execution of Functional Mappings for Knowledge Graph Creation

    Authors: Samaneh Jozashoori, David Chaves-Fraga, Enrique Iglesias, Maria-Esther Vidal, Oscar Corcho

    Abstract: Data has exponentially grown in the last years, and knowledge graphs constitute powerful formalisms to integrate a myriad of existing data sources. Transformation functions -- specified with function-based mapping languages like FunUL and RML+FnO -- can be applied to overcome interoperability issues across heterogeneous data sources. However, the absence of engines to efficiently execute these map… ▽ More

    Submitted 5 October, 2020; v1 submitted 31 August, 2020; originally announced August 2020.

  27. SDM-RDFizer: An RML Interpreter for the Efficient Creation of RDF Knowledge Graphs

    Authors: Enrique Iglesias, Samaneh Jozashoori, David Chaves-Fraga, Diego Collarana, Maria-Esther Vidal

    Abstract: In recent years, the amount of data has increased exponentially, and knowledge graphs have gained attention as data structures to integrate data and knowledge harvested from myriad data sources. However, data complexity issues like large volume, high-duplicate rate, and heterogeneity usually characterize these data sources, being required data management tools able to address the impact negatively… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

  28. arXiv:2006.04556  [pdf, other

    cs.AI cs.DB cs.LG

    Unveiling Relations in the Industry 4.0 Standards Landscape based on Knowledge Graph Embeddings

    Authors: Ariam Rivas, Irlán Grangel-González, Diego Collarana, Jens Lehmann, Maria-Esther Vidal

    Abstract: Industry~4.0 (I4.0) standards and standardization frameworks have been proposed with the goal of \emph{empowering interoperability} in smart factories. These standards enable the description and interaction of the main components, systems, and processes inside of a smart factory. Due to the growing number of frameworks and standards, there is an increasing need for approaches that automatically an… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

    Comments: 15 pages, 7 figures, DEXA2020 Conference

  29. Compacting Frequent Star Patterns in RDF Graphs

    Authors: Farah Karim, Maria-Esther Vidal, Sören Auer

    Abstract: Knowledge graphs have become a popular formalism for representing entities and their properties using a graph data model, e.g., the Resource Description Framework (RDF). An RDF graph comprises entities of the same type connected to objects or other entities using labeled edges annotated with properties. RDF graphs usually contain entities that share the same objects in a certain group of propertie… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

  30. arXiv:2002.08102  [pdf, other

    cs.DB

    Optimizing Federated Queries Based on the Physical Design of a Data Lake

    Authors: Philipp D. Rohde, Maria-Esther Vidal

    Abstract: The optimization of query execution plans is known to be crucial for reducing the query execution time. In particular, query optimization has been studied thoroughly for relational databases over the past decades. Recently, the Resource Description Framework (RDF) became popular for publishing data on the Web. As a consequence, federations composed of different data models like RDF and relational… ▽ More

    Submitted 23 March, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: work-in-progress paper

  31. arXiv:2002.06071  [pdf, other

    cs.CL cs.AI cs.LG

    FQuAD: French Question Answering Dataset

    Authors: Martin d'Hoffschmidt, Wacim Belblidia, Tom Brendlé, Quentin Heinrich, Maxime Vidal

    Abstract: Recent advances in the field of language modeling have improved state-of-the-art results on many Natural Language Processing tasks. Among them, Reading Comprehension has made significant progress over the past few years. However, most results are reported in English since labeled resources available in other languages, such as French, remain scarce. In the present work, we introduce the French Que… ▽ More

    Submitted 25 May, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: 15 pages, 5 figures

  32. arXiv:2001.09762  [pdf, other

    cs.CY

    Bias in Data-driven AI Systems -- An Introductory Survey

    Authors: Eirini Ntoutsi, Pavlos Fafalios, Ujwal Gadiraju, Vasileios Iosifidis, Wolfgang Nejdl, Maria-Esther Vidal, Salvatore Ruggieri, Franco Turini, Symeon Papadopoulos, Emmanouil Krasanakis, Ioannis Kompatsiaris, Katharina Kinder-Kurlanda, Claudia Wagner, Fariba Karimi, Miriam Fernandez, Harith Alani, Bettina Berendt, Tina Kruegel, Christian Heinze, Klaus Broelemann, Gjergji Kasneci, Thanassis Tiropanis, Steffen Staab

    Abstract: AI-based systems are widely employed nowadays to make decisions that have far-reaching impacts on individuals and society. Their decisions might affect everyone, everywhere and anytime, entailing concerns about potential human rights issues. Therefore, it is necessary to move beyond traditional AI algorithms optimized for predictive performance and embed ethical and legal principles in their desig… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 19 pages, 1 figure

  33. Enhancing Virtual Ontology Based Access over Tabular Data with Morph-CSV

    Authors: David Chaves-Fraga, Edna Ruckhaus, Freddy Priyatna, Maria-Esther Vidal, Oscar Corcho

    Abstract: Ontology-Based Data Access (OBDA) has traditionally focused on providing a unified view of heterogeneous datasets, either by materializing integrated data into RDF or by performing on-the fly querying via SPARQL query translation. In the specific case of tabular datasets represented as several CSV or Excel files, query translation approaches have been applied by considering each source as a single… ▽ More

    Submitted 21 February, 2021; v1 submitted 24 January, 2020; originally announced January 2020.

  34. Falcon 2.0: An Entity and Relation Linking Tool over Wikidata

    Authors: Ahmad Sakor, Kuldeep Singh, Anery Patel, Maria-Esther Vidal

    Abstract: The Natural Language Processing (NLP) community has significantly contributed to the solutions for entity and relation recognition from the text, and possibly linking them to proper matches in Knowledge Graphs (KGs). Considering Wikidata as the background KG, still, there are limited tools to link knowledge within the text to Wikidata. In this paper, we present Falcon 2.0, first joint entity, and… ▽ More

    Submitted 31 August, 2020; v1 submitted 24 December, 2019; originally announced December 2019.

    Comments: CIKM 2020 Paper 8 pages

  35. arXiv:1912.06214  [pdf, other

    cs.CL

    Encoding Knowledge Graph Entity Aliases in Attentive Neural Network for Wikidata Entity Linking

    Authors: Isaiah Onando Mulang, Kuldeep Singh, Akhilesh Vyas, Saeedeh Shekarpour, Maria Esther Vidal, Jens Lehmann, Soren Auer

    Abstract: The collaborative knowledge graphs such as Wikidata excessively rely on the crowd to author the information. Since the crowd is not bound to a standard protocol for assigning entity titles, the knowledge graph is populated by non-standard, noisy, long or even sometimes awkward titles. The issue of long, implicit, and nonstandard entity representations is a challenge in Entity Linking (EL) approach… ▽ More

    Submitted 26 September, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: 15 pages

    Journal ref: WISE 2020 (21st International Conference on Web Information Systems Engineering)

  36. arXiv:1911.02679  [pdf, other

    cs.SE

    A Domain-Specific Language for Verifying Software Requirement Constraints

    Authors: Marzina Vidal, Tiago Massoni, Franklin Ramalho

    Abstract: Software requirement analysis can certainly benefit from prevention and early detection of failures, in particular by some kind of automatic analysis. Formal methods offer means to represent and analyze requirements with rigorous tools, avoiding ambiguities and allowing automatic verification of requirement consistency. However, formalisms often clash in the culture or lack of skills of software a… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: Preprint for the 2019 Brazilian Symposium on Formal Methods

  37. MapSDI: A Scaled-up Semantic Data Integration Framework for Knowledge Graph Creation

    Authors: Samaneh Jozashoori, Maria-Esther Vidal

    Abstract: Semantic web technologies have significantly contributed with effective solutions for the problems of data integration and knowledge graph creation. However, with the rapid growth of big data in diverse domains, different interoperability issues still demand to be addressed, being scalability one of the main challenges. In this paper, we address the problem of knowledge graph creation at scale and… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

  38. arXiv:1908.06265  [pdf, other

    cs.DB

    Towards an Integrated Graph Algebra for Graph Pattern Matching with Gremlin (Extended Version)

    Authors: Harsh Thakkar, Dharmen Punjani, Soeren Auer, Maria-Esther Vidal

    Abstract: Graph data management (also called NoSQL) has revealed beneficial characteristics in terms of flexibility and scalability by differently balancing between query expressivity and schema flexibility. This peculiar advantage has resulted into an unforeseen race of developing new task-specific graph systems, query languages and data models, such as property graphs, key-value, wide column, resource des… ▽ More

    Submitted 7 September, 2019; v1 submitted 17 August, 2019; originally announced August 2019.

    Comments: This is an extended version of an article formally published at DEXA 2017

  39. arXiv:1908.05098  [pdf, other

    cs.CL cs.IR

    Towards Optimisation of Collaborative Question Answering over Knowledge Graphs

    Authors: Kuldeep Singh, Mohamad Yaser Jaradeh, Saeedeh Shekarpour, Akash Kulkarni, Arun Sethupat Radhakrishna, Ioanna Lytra, Maria-Esther Vidal, Jens Lehmann

    Abstract: Collaborative Question Answering (CQA) frameworks for knowledge graphs aim at integrating existing question answering (QA) components for implementing sequences of QA tasks (i.e. QA pipelines). The research community has paid substantial attention to CQAs since they support reusability and scalability of the available components in addition to the flexibility of pipelines. CQA frameworks attempt t… ▽ More

    Submitted 14 August, 2019; originally announced August 2019.

  40. arXiv:1903.12554  [pdf, other

    cs.DB cs.CY

    Linked Open Data Validity -- A Technical Report from ISWS 2018

    Authors: Tayeb Abderrahmani Ghor, Esha Agrawal, Mehwish Alam, Omar Alqawasmeh, Claudia D'amato, Amina Annane, Amr Azzam, Andrew Berezovskyi, Russa Biswas, Mathias Bonduel, Quentin Brabant, Cristina-iulia Bucur, Elena Camossi, Valentina Anita Carriero, Shruthi Chari, David Chaves Fraga, Fiorela Ciroku, Michael Cochez, Hubert Curien, Vincenzo Cutrona, Rahma Dandan, Danilo Dess, Valerio Di Carlo, Ahmed El Amine Djebri, Marieke Van Erp , et al. (46 additional authors not shown)

    Abstract: Linked Open Data (LOD) is the publicly available RDF data in the Web. Each LOD entity is identfied by a URI and accessible via HTTP. LOD encodes globalscale knowledge potentially available to any human as well as artificial intelligence that may want to benefit from it as background knowledge for supporting their tasks. LOD has emerged as the backbone of applications in diverse fields such as Natu… ▽ More

    Submitted 26 March, 2019; originally announced March 2019.

  41. arXiv:1811.01660  [pdf, other

    cs.DB

    Data Integration for Supporting Biomedical Knowledge Graph Creation at Large-Scale

    Authors: Samaneh Jozashoori, Tatiana Novikova, Maria-Esther Vidal

    Abstract: In recent years, following FAIR and open data principles, the number of available big data including biomedical data has been increased exponentially. In order to extract knowledge, these data should be curated, integrated, and semantically described. Accordingly, several semantic integration techniques have been developed; albeit effective, they may suffer from scalability in terms of different p… ▽ More

    Submitted 5 November, 2018; originally announced November 2018.

  42. arXiv:1809.10044  [pdf, other

    cs.IR cs.CL

    No One is Perfect: Analysing the Performance of Question Answering Components over the DBpedia Knowledge Graph

    Authors: Kuldeep Singh, Ioanna Lytra, Arun Sethupat Radhakrishna, Saeedeh Shekarpour, Maria-Esther Vidal, Jens Lehmann

    Abstract: Question answering (QA) over knowledge graphs has gained significant momentum over the past five years due to the increasing availability of large knowledge graphs and the rising importance of question answering for user interaction. DBpedia has been the most prominently used knowledge graph in this setting and most approaches currently use a pipeline of processing steps connecting a sequence of c… ▽ More

    Submitted 27 July, 2020; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: Evaluation of State of the art Question Answering components performing entity linking, relation linking etc

    Journal ref: Journal of Web Semantics (JWS 2020)

  43. arXiv:1807.06816  [pdf, other

    cs.DL physics.soc-ph

    Unveiling Scholarly Communities over Knowledge Graphs

    Authors: Sahar Vahdati, Guillermo Palma, Rahul Jyoti Nath, Christoph Lange, Sören Auer, Maria-Esther Vidal

    Abstract: Knowledge graphs represent the meaning of properties of real-world entities and relationships among them in a natural way. Exploiting semantics encoded in knowledge graphs enables the implementation of knowledge-driven tasks such as semantic retrieval, query processing, and question answering, as well as solutions to knowledge discovery tasks including pattern discovery and link prediction. In thi… ▽ More

    Submitted 18 July, 2018; originally announced July 2018.

    Comments: 12 pages. Paper accepted in the 22nd International Conference on Theory and Practice of Digital Libraries, 2018

  44. arXiv:1705.08018  [pdf, other

    cs.CL

    Use of Knowledge Graph in Rescoring the N-Best List in Automatic Speech Recognition

    Authors: Ashwini Jaya Kumar, Camilo Morales, Maria-Esther Vidal, Christoph Schmidt, Sören Auer

    Abstract: With the evolution of neural network based methods, automatic speech recognition (ASR) field has been advanced to a level where building an application with speech interface is a reality. In spite of these advances, building a real-time speech recogniser faces several problems such as low recognition accuracy, domain constraint, and out-of-vocabulary words. The low recognition accuracy problem is… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

  45. Comparing MapReduce and Pipeline Implementations for Counting Triangles

    Authors: Edelmira Pasarella, Maria-Esther Vidal, Cristina Zoltan

    Abstract: A common method to define a parallel solution for a computational problem consists in finding a way to use the Divide and Conquer paradigm in order to have processors acting on its own data and scheduled in a parallel fashion. MapReduce is a programming model that follows this paradigm, and allows for the definition of efficient solutions by both decomposing a problem into steps on subsets of the… ▽ More

    Submitted 12 January, 2017; originally announced January 2017.

    Comments: In Proceedings PROLE 2016, arXiv:1701.03069

    ACM Class: D.1.3; F.1.2

    Journal ref: EPTCS 237, 2017, pp. 20-33

  46. arXiv:1608.02800  [pdf, other

    cs.PF cs.DB

    LITMUS: An Open Extensible Framework for Benchmarking RDF Data Management Solutions

    Authors: Harsh Thakkar, Mohnish Dubey, Gezim Sejdiu, Axel-Cyrille Ngonga Ngomo, Jeremy Debattista, Christoph Lange, Jens Lehmann, Sören Auer, Maria-Esther Vidal

    Abstract: Developments in the context of Open, Big, and Linked Data have led to an enormous growth of structured data on the Web. To keep up with the pace of efficient consumption and management of the data at this rate, many data Management solutions have been developed for specific tasks and applications. We present LITMUS, a framework for benchmarking data management solutions. LITMUS goes beyond classic… ▽ More

    Submitted 9 August, 2016; originally announced August 2016.

    Comments: 8 pages, 1 figure, position paper

  47. arXiv:1503.02940  [pdf, other

    cs.DB

    Efficient Query Processing for SPARQL Federations with Replicated Fragments

    Authors: Gabriela Montoya, Hala Skaf-Molli, Pascal Molli, Maria-Esther Vidal

    Abstract: Low reliability and availability of public SPARQL endpoints prevent real-world applications from exploiting all the potential of these querying infras-tructures. Fragmenting data on servers can improve data availability but degrades performance. Replicating fragments can offer new tradeoff between performance and availability. We propose FEDRA, a framework for querying Linked Data that takes advan… ▽ More

    Submitted 10 March, 2015; originally announced March 2015.

  48. arXiv:1503.02911  [pdf, other

    cs.DB

    RDF-Hunter: Automatically Crowdsourcing the Execution of Queries Against RDF Data Sets

    Authors: Maribel Acosta, Elena Simperl, Fabian Flöck, Maria-Esther Vidal, Rudi Studer

    Abstract: In the last years, a large number of RDF data sets has become available on the Web. However, due to the semi-structured nature of RDF data, missing values affect answer completeness of queries that are posed against this data. To overcome this limitation, we propose RDF-Hunter, a novel hybrid query processing approach that brings together machine and human computation to execute queries against RD… ▽ More

    Submitted 10 March, 2015; originally announced March 2015.

  49. arXiv:1407.2899  [pdf, other

    cs.DB

    Fedra: Query Processing for SPARQL Federations with Divergence

    Authors: Gabriela Montoya, Hala Skaf-Molli, Pascal Molli, Maria-Esther Vidal

    Abstract: Data replication and deployment of local SPARQL endpoints improve scalability and availability of public SPARQL endpoints, making the consumption of Linked Data a reality. This solution requires synchronization and specific query processing strategies to take advantage of replication. However, existing replication aware techniques in federations of SPARQL endpoints do not consider data dynamicity.… ▽ More

    Submitted 10 July, 2014; originally announced July 2014.

  50. arXiv:0711.2087  [pdf, other

    cs.DB cs.LO

    Query Evaluation and Optimization in the Semantic Web

    Authors: Edna Ruckhaus, Eduardo Ruiz, Maria-Esther Vidal

    Abstract: We address the problem of answering Web ontology queries efficiently. An ontology is formalized as a Deductive Ontology Base (DOB), a deductive database that comprises the ontology's inference axioms and facts. A cost-based query optimization technique for DOB is presented. A hybrid cost model is proposed to estimate the cost and cardinality of basic and inferred facts. Cardinality and cost of i… ▽ More

    Submitted 13 November, 2007; originally announced November 2007.

    Comments: 18 pages, 8 figures, 7 tables. Presented at the ALPSWS2006 First International Workshop on Applications of Logic Programming in the Semantic Web and Semantic Web Services where it got a "Best Paper Award". To appear in Theory and Practice of Logic Programming (TPLP)

    ACM Class: F.4.1; H.2.3; I.2.4