Showing 1–44 of 44 results for author: Nejdl, W

  1. arXiv:2405.15442  [pdf, other]

    eess.IV cs.CV cs.LG

    Towards Precision Healthcare: Robust Fusion of Time Series and Image Data

    Authors: Ali Rasekh, Reza Heidari, Amir Hosein Haji Mohammad Rezaie, Parsa Sharifi Sedeh, Zahra Ahmadi, Prasenjit Mitra, Wolfgang Nejdl

    Abstract: With the increasing availability of diverse data types, particularly images and time series data from medical experiments, there is a growing demand for techniques designed to combine various modalities of data effectively. Our motivation comes from the important areas of predicting mortality and phenotyping where using different modalities of data could significantly improve our ability to predic…

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2404.17194  [pdf]

    cs.CL

    TIGQA: An Expert Annotated Question Answering Dataset in Tigrinya

    Authors: Hailay Teklehaymanot, Dren Fazlija, Niloy Ganguly, Gourab K. Patro, Wolfgang Nejdl

    Abstract: The absence of explicitly tailored, accessible annotated datasets for educational purposes presents a notable obstacle for NLP tasks in languages with limited resources. This study initially explores the feasibility of using machine translation (MT) to convert an existing dataset into a Tigrinya dataset in SQuAD format. As a result, we present TIGQA, an expert annotated educational dataset consisti…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 9 pages, 3 figures, 7 tables, 2 listings

    MSC Class: cs.CL

    Journal ref: LREC-COLING 2024

  3. Beyond Accuracy: Investigating Error Types in GPT-4 Responses to USMLE Questions

    Authors: Soumyadeep Roy, Aparup Khatua, Fatemeh Ghoochani, Uwe Hadler, Wolfgang Nejdl, Niloy Ganguly

    Abstract: GPT-4 demonstrates high accuracy in medical QA tasks, leading with an accuracy of 86.70%, followed by Med-PaLM 2 at 86.50%. However, around 14% of errors remain. Additionally, current works use GPT-4 to only predict the correct option without providing any explanation and thus do not provide any insight into the thinking process and reasoning used by GPT-4 or other LLMs. Therefore, we introduce a…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 10 pages, 4 figures. Accepted for publication at the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024)

  4. arXiv:2404.12839  [pdf, other]

    cs.CV cs.AI cs.LG

    ECOR: Explainable CLIP for Object Recognition

    Authors: Ali Rasekh, Sepehr Kazemi Ranjbar, Milad Heidari, Wolfgang Nejdl

    Abstract: Large Vision Language Models (VLMs), such as CLIP, have significantly contributed to various computer vision tasks, including object recognition and object detection. Their open vocabulary feature enhances their value. However, their black-box nature and lack of explainability in predictions make them less trustworthy in critical domains. Recently, some work has been done to force VLMs to provide…

    Submitted 19 April, 2024; originally announced April 2024.

  5. arXiv:2309.04503  [pdf, other]

    quant-ph cs.DB cs.DS

    Quantum Algorithm for Maximum Biclique Problem

    Authors: Xiaofan Li, Prasenjit Mitra, Rui Zhou, Wolfgang Nejdl

    Abstract: Identifying a biclique with the maximum number of edges bears considerable implications for numerous fields of application, such as detecting anomalies in E-commerce transactions, discerning protein-protein interactions in biology, and refining the efficacy of social network recommendation algorithms. However, the inherent NP-hardness of this problem significantly complicates the matter. The prohi…

    Submitted 8 September, 2023; originally announced September 2023.

  6. GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning

    Authors: Soumyadeep Roy, Jonas Wallat, Sowmya S Sundaram, Wolfgang Nejdl, Niloy Ganguly

    Abstract: Large-scale language models such as DNABert and LOGO aim to learn optimal gene representations and are trained on the entire Human Reference Genome. However, standard tokenization schemes involve a simple sliding window of tokens like k-mers that do not leverage any gene-based semantics and thus may lead to (trivial) masking of easily predictable sequences and subsequently inefficient Masked Langu…

    Submitted 29 July, 2023; originally announced July 2023.

    Comments: 12 pages including appendix. Accepted for publication at 26th European Conference on Artificial Intelligence ECAI 2023

    Journal ref: Frontiers in Artificial Intelligence and Applications, Volume 372: ECAI 2023

  7. arXiv:2305.06741  [pdf, other]

    cs.LG cs.AI

    IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers

    Authors: Jingge Xiao, Leonie Basso, Wolfgang Nejdl, Niloy Ganguly, Sandipan Sikdar

    Abstract: Continuous-time models such as Neural ODEs and Neural Flows have shown promising results in analyzing irregularly sampled time series frequently encountered in electronic health records. Based on these models, time series are typically processed with a hybrid of an initial value problem (IVP) solver and a recurrent neural network within the variational autoencoder architecture. Sequentially solvin…

    Submitted 12 February, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: AAAI 2024 Camera-Ready Version

  8. arXiv:2302.06975  [pdf, other]

    cs.AI

    A Review of the Role of Causality in Developing Trustworthy AI Systems

    Authors: Niloy Ganguly, Dren Fazlija, Maryam Badar, Marco Fisichella, Sandipan Sikdar, Johanna Schrader, Jonas Wallat, Koustav Rudra, Manolis Koubarakis, Gourab K. Patro, Wadhah Zai El Amri, Wolfgang Nejdl

    Abstract: State-of-the-art AI models largely lack an understanding of the cause-effect relationship that governs human understanding of the real world. Consequently, these models do not generalize to unseen data, often produce unfair results, and are difficult to interpret. This has led to efforts to improve the trustworthiness aspects of AI models. Recently, causal modeling and inference methods have emerg…

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 55 pages, 8 figures. Under review

  9. arXiv:2211.04812  [pdf, other]

    cs.LG cs.CY

    Discrimination and Class Imbalance Aware Online Naive Bayes

    Authors: Maryam Badar, Marco Fisichella, Vasileios Iosifidis, Wolfgang Nejdl

    Abstract: Fairness-aware mining of massive data streams is a growing and challenging concern in the contemporary domain of machine learning. Many stream learning algorithms are used to replace humans at critical decision-making points e.g., hiring staff, assessing credit risk, etc. This calls for handling massive incoming information with minimum response delay while ensuring fair and high quality decisions…

    Submitted 9 November, 2022; originally announced November 2022.

  10. arXiv:2211.03533  [pdf, other]

    cs.CL

    A Multi-task Model for Sentiment Aided Stance Detection of Climate Change Tweets

    Authors: Apoorva Upadhyaya, Marco Fisichella, Wolfgang Nejdl

    Abstract: Climate change has become one of the biggest challenges of our time. Social media platforms such as Twitter play an important role in raising public awareness and spreading knowledge about the dangers of the current climate crisis. With the increasing number of campaigns and communication about climate change through social media, the information could create more awareness and reach the general p…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Accepted at the AAAI Conference on Web and Social Media (ICWSM 2023)

  11. arXiv:2211.02678  [pdf, ps, other]

    eess.SP cs.LG

    Efficient ECG-based Atrial Fibrillation Detection via Parameterised Hypercomplex Neural Networks

    Authors: Leonie Basso, Zhao Ren, Wolfgang Nejdl

    Abstract: Atrial fibrillation (AF) is the most common cardiac arrhythmia and associated with a high risk for serious conditions like stroke. The use of wearable devices embedded with automatic and timely AF assessment from electrocardiograms (ECGs) has shown to be promising in preventing life-threatening situations. Although deep neural networks have demonstrated superiority in model performance, their use…

    Submitted 11 September, 2023; v1 submitted 27 October, 2022; originally announced November 2022.

    Comments: Published at EUSIPCO 2023

  12. arXiv:2210.08500  [pdf, other]

    cs.CL

    This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Clinical Text

    Authors: Betty van Aken, Jens-Michalis Papaioannou, Marcel G. Naik, Georgios Eleftheriadis, Wolfgang Nejdl, Felix A. Gers, Alexander Löser

    Abstract: The use of deep neural models for diagnosis prediction from clinical text has shown promising results. However, in clinical practice such models must not only be accurate, but provide doctors with interpretable and helpful results. We introduce ProtoPatient, a novel method based on prototypical networks and label-wise attention with both of these abilities. ProtoPatient makes predictions based on…

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: AACL-IJCNLP 2022 Main Conference (Long Paper)

  13. arXiv:2206.03248  [pdf]

    cs.CY cs.CL cs.LG

    Rites de Passage: Elucidating Displacement to Emplacement of Refugees on Twitter

    Authors: Aparup Khatua, Wolfgang Nejdl

    Abstract: Social media deliberations allow to explore refugee-related issues. AI-based studies have investigated refugee issues mostly around a specific event and considered unimodal approaches. Contrarily, we have employed a multimodal architecture for probing the refugee journeys from their home to host nations. We draw insights from Arnold van Gennep's anthropological work 'Les Rites de Passage', which…

    Submitted 24 June, 2022; v1 submitted 30 May, 2022; originally announced June 2022.

    Comments: This work has been accepted to appear at HT '22, the 33rd ACM Conference on Hypertext and Social Media

  14. arXiv:2203.16141  [pdf, other]

    cs.SD cs.LG eess.AS

    Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis

    Authors: Yi Chang, Zhao Ren, Thanh Tam Nguyen, Wolfgang Nejdl, Björn W. Schuller

    Abstract: Respiratory sound classification is an important tool for remote screening of respiratory-related diseases such as pneumonia, asthma, and COVID-19. To facilitate the interpretability of classification results, especially ones based on deep learning, many explanation methods have been proposed using prototypes. However, existing explanation techniques often assume that the data is non-biased and th…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Submitted to INTERSPEECH 2022

  15. arXiv:2202.12521  [pdf, other]

    cs.DB

    How to reduce the search space of Entity Resolution: with Blocking or Nearest Neighbor search?

    Authors: George Papadakis, Marco Fisichella, Franziska Schoger, George Mandilaras, Nikolaus Augsten, Wolfgang Nejdl

    Abstract: Entity Resolution suffers from quadratic time complexity. To increase its time efficiency, three kinds of filtering techniques are typically used for restricting its search space: (i) blocking workflows, which group together entity profiles with identical or similar signatures, (ii) string similarity join algorithms, which quickly detect entities more similar than a threshold, and (iii) nearest-ne…

    Submitted 6 October, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

  16. arXiv:2112.06642  [pdf]

    cs.LG cs.CL cs.CY cs.SI

    Unraveling Social Perceptions & Behaviors towards Migrants on Twitter

    Authors: Aparup Khatua, Wolfgang Nejdl

    Abstract: We draw insights from the social psychology literature to identify two facets of Twitter deliberations about migrants, i.e., perceptions about migrants and behaviors towards migrants. Our theoretical anchoring helped us in identifying two prevailing perceptions (i.e., sympathy and antipathy) and two dominant behaviors (i.e., solidarity and animosity) of social media users towards migrants. We hav…

    Submitted 4 December, 2021; originally announced December 2021.

    Comments: This work has been accepted to appear at International Conference on Web and Social Media ICWSM-2022

  17. arXiv:2110.03536  [pdf, other]

    cs.SD

    Prototype Learning for Interpretable Respiratory Sound Analysis

    Authors: Zhao Ren, Thanh Tam Nguyen, Wolfgang Nejdl

    Abstract: Remote screening of respiratory diseases has been widely studied as a non-invasive and early instrument for diagnosis purposes, especially in the pandemic. The respiratory sound classification task has been realized with numerous deep neural network (DNN) models due to their superior performance. However, in the high-stake medical domain where decisions can have significant consequences, it is des…

    Submitted 7 February, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: Technical report of the paper accepted by IEEE ICASSP 2022

  18. arXiv:2108.07403  [pdf, other]

    cs.LG cs.AI

    FARF: A Fair and Adaptive Random Forests Classifier

    Authors: Wenbin Zhang, Albert Bifet, Xiangliang Zhang, Jeremy C. Weiss, Wolfgang Nejdl

    Abstract: As Artificial Intelligence (AI) is used in more applications, the need to consider and mitigate biases from the learned models has followed. Most works in developing fair learning algorithms focus on the offline setting. However, in many real-world applications data comes in an online fashion and needs to be processed on the fly. Moreover, in practical application, there is a trade-off between acc…

    Submitted 21 August, 2021; v1 submitted 16 August, 2021; originally announced August 2021.

  19. EduCOR: An Educational and Career-Oriented Recommendation Ontology

    Authors: Eleni Ilkou, Hasan Abu-Rasheed, Mohammadreza Tavakoli, Sherzod Hakimov, Gábor Kismihók, Sören Auer, Wolfgang Nejdl

    Abstract: With the increased dependence on online learning platforms and educational resource repositories, a unified representation of digital learning resources becomes essential to support a dynamic and multi-source learning experience. We introduce the EduCOR ontology, an educational, career-oriented ontology that provides a foundation for representing online learning resources for personalised learning…

    Submitted 13 July, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Accepted at the 20th International Semantic Web Conference (ISWC 2021)

    ACM Class: E.2; I.2.4

  20. Hashing-Accelerated Graph Neural Networks for Link Prediction

    Authors: Wei Wu, Bin Li, Chuan Luo, Wolfgang Nejdl

    Abstract: Networks are ubiquitous in the real world. Link prediction, as one of the key problems for network-structured data, aims to predict whether there exists a link between two nodes. The traditional approaches are based on the explicit similarity computation between the compact node representation by embedding each node into a low-dimensional space. In order to efficiently handle the intensive similar…

    Submitted 29 May, 2021; originally announced May 2021.

    Journal ref: The Web Conference 2021

  21. arXiv:2104.02055  [pdf, other]

    eess.SP cs.LG

    Data augmentation for dealing with low sampling rates in NILM

    Authors: Tai Le Quy, Sergej Zerr, Eirini Ntoutsi, Wolfgang Nejdl

    Abstract: Data have an important role in evaluating the performance of NILM algorithms. The best performance of NILM algorithms is achieved with high-quality evaluation data. However, many existing real-world data sets come with a low sampling quality, and often with gaps, lacking data for some recording periods. As a result, in such data, NILM algorithms can hardly recognize devices and estimate their powe…

    Submitted 30 March, 2021; originally announced April 2021.

    Comments: 10 pages, 3 figures, 6 tables

  22. arXiv:2101.06570  [pdf, other]

    cs.LG cs.CR

    Membership Inference Attack on Graph Neural Networks

    Authors: Iyiola E. Olatunji, Wolfgang Nejdl, Megha Khosla

    Abstract: Graph Neural Networks (GNNs), which generalize traditional deep neural networks on graph data, have achieved state-of-the-art performance on several graph analytical tasks. We focus on how trained GNN models could leak information about the member nodes that they were trained on. We introduce two realistic settings for performing a membership inference (MI) attack on GNNs. While choosing th…

    Submitted 18 December, 2021; v1 submitted 16 January, 2021; originally announced January 2021.

    Comments: Best student paper award, IEEE TPS 21

  23. arXiv:2001.09762  [pdf, other]

    cs.CY

    Bias in Data-driven AI Systems -- An Introductory Survey

    Authors: Eirini Ntoutsi, Pavlos Fafalios, Ujwal Gadiraju, Vasileios Iosifidis, Wolfgang Nejdl, Maria-Esther Vidal, Salvatore Ruggieri, Franco Turini, Symeon Papadopoulos, Emmanouil Krasanakis, Ioannis Kompatsiaris, Katharina Kinder-Kurlanda, Claudia Wagner, Fariba Karimi, Miriam Fernandez, Harith Alani, Bettina Berendt, Tina Kruegel, Christian Heinze, Klaus Broelemann, Gjergji Kasneci, Thanassis Tiropanis, Steffen Staab

    Abstract: AI-based systems are widely employed nowadays to make decisions that have far-reaching impacts on individuals and society. Their decisions might affect everyone, everywhere and anytime, entailing concerns about potential human rights issues. Therefore, it is necessary to move beyond traditional AI algorithms optimized for predictive performance and embed ethical and legal principles in their desig…

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 19 pages, 1 figure

  24. Towards a Ranking Model for Semantic Layers over Digital Archives

    Authors: Pavlos Fafalios, Vaibhav Kasturia, Wolfgang Nejdl

    Abstract: Archived collections of documents (like newspaper archives) serve as important information sources for historians, journalists, sociologists and other interested parties. Semantic Layers over such digital archives allow describing and publishing metadata and semantic information about the archived documents in a standard format (RDF), which in turn can be queried through a structured query languag…

    Submitted 23 October, 2018; originally announced October 2018.

  25. Ranking Archived Documents for Structured Queries on Semantic Layers

    Authors: Pavlos Fafalios, Vaibhav Kasturia, Wolfgang Nejdl

    Abstract: Archived collections of documents (like newspaper and web archives) serve as important information sources in a variety of disciplines, including Digital Humanities, Historical Science, and Journalism. However, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into usable sources of information. A semantic layer is an RDF graph that…

    Submitted 23 October, 2018; originally announced October 2018.

  26. Expedition: A Time-Aware Exploratory Search System Designed for Scholars

    Authors: Jaspreet Singh, Wolfgang Nejdl, Avishek Anand

    Abstract: Archives are an important source of study for various scholars. Digitization and the web have made archives more accessible and led to the development of several time-aware exploratory search systems. However these systems have been designed for more general users rather than scholars. Scholars have more complex information needs in comparison to general users. They also require support for corpus…

    Submitted 25 October, 2018; originally announced October 2018.

  27. Building and Querying Semantic Layers for Web Archives (Extended Version)

    Authors: Pavlos Fafalios, Helge Holzmann, Vaibhav Kasturia, Wolfgang Nejdl

    Abstract: Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem a…

    Submitted 24 October, 2018; originally announced October 2018.

    Comments: This is a preprint of an article accepted for publication in the International Journal on Digital Libraries (2018)

    Journal ref: International Journal on Digital Libraries, ISSN: 1432-5012 (Print) 1432-1300 (Online), 2018

  28. History by Diversity: Helping Historians search News Archives

    Authors: Jaspreet Singh, Wolfgang Nejdl, Avishek Anand

    Abstract: Longitudinal corpora like newspaper archives are of immense value to historical research, and time as an important factor for historians strongly influences their search behaviour in these archives. While searching for articles published over time, a key preference is to retrieve documents which cover the important aspects from important points in time which is different from standard search behav…

    Submitted 24 October, 2018; originally announced October 2018.

  29. arXiv:1810.09176  [pdf, other]

    cs.SI cs.LG stat.ML

    Node Representation Learning for Directed Graphs

    Authors: Megha Khosla, Jurek Leonhardt, Wolfgang Nejdl, Avishek Anand

    Abstract: We propose a novel approach for learning node representations in directed graphs, which maintains separate views or embedding spaces for the two distinct node roles induced by the directionality of the edges. We argue that the previous approaches either fail to encode the edge directionality or their encodings cannot be generalized across tasks. With our simple alternating random walk strat…

    Submitted 28 June, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: Accepted in ECML-PKDD 2019

  30. arXiv:1808.08316  [pdf, other]

    cs.IR cs.CL cs.LG stat.ML

    A Trio Neural Model for Dynamic Entity Relatedness Ranking

    Authors: Tu Nguyen, Tuan Tran, Wolfgang Nejdl

    Abstract: Measuring entity relatedness is a fundamental task for many natural language processing and information retrieval applications. Prior work often studies entity relatedness in static settings and an unsupervised manner. However, entities in real-world are often involved in many different relationships, consequently entity-relations are very dynamic over time. In this work, we propose a neural netwo…

    Submitted 12 June, 2023; v1 submitted 24 August, 2018; originally announced August 2018.

    Comments: In Proceedings of CoNLL 2018

  31. LogCanvas: Visualizing Search History Using Knowledge Graphs

    Authors: Luyan Xu, Zeon Trevor Fernando, Xuan Zhou, Wolfgang Nejdl

    Abstract: In this demo paper, we introduce LogCanvas, a platform for user search history visualisation. Different from the existing visualisation tools, LogCanvas focuses on helping users re-construct the semantic relationship among their search activities. LogCanvas segments a user's search history into different sessions and generates a knowledge graph to represent the information exploration process in e…

    Submitted 15 August, 2018; originally announced August 2018.

    Comments: Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, SIGIR '18 (demo), July 2018

  32. arXiv:1803.07890  [pdf, other]

    cs.IR cs.LG

    Multiple Models for Recommending Temporal Aspects of Entities

    Authors: Tu Nguyen, Nattiya Kanhabua, Wolfgang Nejdl

    Abstract: Entity aspect recommendation is an emerging task in semantic search that helps users discover serendipitous and prominent information with respect to an entity, of which salience (e.g., popularity) is the most important factor in previous work. However, entity aspects are temporally dynamic and often driven by events happening over time. For such cases, aspect suggestion based solely on salience f…

    Submitted 9 April, 2024; v1 submitted 21 March, 2018; originally announced March 2018.

    Comments: In proceedings of the 15th Extended Semantic Web Conference (ESWC 2018)

  33. arXiv:1703.10339  [pdf, other]

    cs.IR cs.CL cs.SI

    Finding News Citations for Wikipedia

    Authors: Besnik Fetahu, Katja Markert, Wolfgang Nejdl, Avishek Anand

    Abstract: An important editing policy in Wikipedia is to provide citations for added statements in Wikipedia pages, where statements can be arbitrary pieces of text, ranging from a sentence to a paragraph. In many cases citations are either outdated or missing altogether. In this work we address the problem of finding and updating news citations for statements in entity pages. We propose a two-stage super…

    Submitted 24 April, 2017; v1 submitted 30 March, 2017; originally announced March 2017.

  34. On the Applicability of Delicious for Temporal Search on Web Archives

    Authors: Helge Holzmann, Wolfgang Nejdl, Avishek Anand

    Abstract: Web archives are large longitudinal collections that store webpages from the past, which might be missing on the current live Web. Consequently, temporal search over such collections is essential for finding prominent missing webpages and tasks like historical analysis. However, this has been challenging due to the lack of popularity information and proper ground truth to evaluate temporal retriev…

    Submitted 3 February, 2017; originally announced February 2017.

    Comments: SIGIR 2016, Pisa, Italy

  35. The Dawn of Today's Popular Domains: A Study of the Archived German Web over 18 Years

    Authors: Helge Holzmann, Wolfgang Nejdl, Avishek Anand

    Abstract: The Web has been around and maturing for 25 years. The popular websites of today have undergone vast changes during this period, with a few being there almost since the beginning and many new ones becoming popular over the years. This makes it worthwhile to take a look at how these sites have evolved and what they might tell us about the future of the Web. We therefore embarked on a longitudinal s…

    Submitted 3 February, 2017; originally announced February 2017.

    Comments: JCDL 2016, Newark, NJ, USA

  36. ArchiveWeb: collaboratively extending and exploring web archive collections - How would you like to work with your collections?

    Authors: Zeon Trevor Fernando, Ivana Marenzi, Wolfgang Nejdl

    Abstract: Curated web archive collections contain focused digital content which is collected by archiving organizations, groups, and individuals to provide a representative sample covering specific topics and events to preserve them for future exploration and analysis. In this paper, we discuss how to best support collaborative construction and exploration of these collections through the ArchiveWeb system.…

    Submitted 1 February, 2017; originally announced February 2017.

    Comments: Published via Springer in International Journal on Digital Libraries

  37. ArchiveWeb: Collaboratively Extending and Exploring Web Archive Collections

    Authors: Zeon Trevor Fernando, Ivana Marenzi, Wolfgang Nejdl, Rishita Kalyani

    Abstract: Curated web archive collections contain focused digital contents which are collected by archiving organizations to provide a representative sample covering specific topics and events to preserve them for future exploration and analysis. In this paper, we discuss how to best support collaborative construction and exploration of these collections through the ArchiveWeb system. ArchiveWeb has been de…

    Submitted 1 February, 2017; originally announced February 2017.

    Comments: Published via Springer in International Conference on Theory and Practice of Digital Libraries (TPDL 2016)

  38. How to Search the Internet Archive Without Indexing It

    Authors: Nattiya Kanhabua, Philipp Kemkes, Wolfgang Nejdl, Tu Ngoc Nguyen, Felipe Reis, Nam Khanh Tran

    Abstract: Significant parts of cultural heritage are produced on the web during the last decades. While easy accessibility to the current web is a good baseline, optimal access to the past web faces several challenges. This includes dealing with large-scale web archive collections and lacking of usage logs that contain implicit human feedback most relevant for today's web search. In this paper, we propose a…

    Submitted 28 January, 2017; originally announced January 2017.

    Journal ref: 20th International Conference on Theory and Practice of Digital Libraries, TPDL 2016, Proceedings, pp 147-160

  39. Can We Find Documents in Web Archives without Knowing their Contents?

    Authors: Khoi Duy Vo, Tuan Tran, Tu Ngoc Nguyen, Xiaofei Zhu, Wolfgang Nejdl

    Abstract: Recent advances of preservation technologies have led to an increasing number of Web archive systems and collections. These collections are valuable to explore the past of the Web, but their value can only be uncovered with effective access and exploration mechanisms. Ideal search and ranking methods must be robust to the high redundancy and the temporal noise of contents, as well as scalable to…

    Submitted 14 January, 2017; originally announced January 2017.

    Comments: Published via ACM to Websci 2015

    ACM Class: H.3.1

  40. arXiv:1611.03426  [pdf, other]

    cs.CY cs.IR cs.SI stat.ML

    Why is it Difficult to Detect Sudden and Unexpected Epidemic Outbreaks in Twitter?

    Authors: Avaré Stewart, Sara Romano, Nattiya Kanhabua, Sergio Di Martino, Wolf Siberski, Antonino Mazzeo, Wolfgang Nejdl, Ernesto Diaz-Aviles

    Abstract: Social media services such as Twitter are a valuable source of information for decision support systems. Many studies have shown that this also holds for the medical domain, where Twitter is considered a viable tool for public health officials to sift through relevant information for the early detection, management, and control of epidemic outbreaks. This is possible due to the inherent capability…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: ACM CCS Concepts: Applied computing - Health informatics; Information systems - Web mining; Document filtering; Novelty in information retrieval; Recommender systems; Human-centered computing - Social media

  41. arXiv:1407.4832  [pdf, other]

    cs.IR cs.AI cs.LG

    Collaborative Filtering Ensemble for Personalized Name Recommendation

    Authors: Bernat Coma-Puig, Ernesto Diaz-Aviles, Wolfgang Nejdl

    Abstract: Out of thousands of names to choose from, picking the right one for your child is a daunting task. In this work, our objective is to help parents making an informed decision while choosing a name for their baby. We follow a recommender system approach and combine, in an ensemble, the individual rankings produced by simple collaborative filtering algorithms in order to produce a personalized list o…

    Submitted 16 July, 2014; originally announced July 2014.

    Comments: Top-N recommendation; personalized ranking; given name recommendation

    ACM Class: H.3.3; I.2.6

    Journal ref: Proceedings of the ECML PKDD Discovery Challenge - Recommending Given Names. Co-located with ECML PKDD 2013. Prague, Czech Republic, September 27, 2013

  42. arXiv:1302.6832  [pdf]

    cs.AI

    Model-Based Diagnosis with Qualitative Temporal Uncertainty

    Authors: Wolfgang Nejdl, Johann Gamper

    Abstract: In this paper we describe a framework for model-based diagnosis of dynamic systems, which extends previous work in this field by using and expressing temporal uncertainty in the form of qualitative interval relations a la Allen. Based on a logical framework extended by qualitative and quantitative temporal constraints we show how to describe behavioral models (both consistency- and abductive-based…

    Submitted 27 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

    Report number: UAI-P-1994-PG-432-439

  43. arXiv:1203.1378  [pdf, other]

    cs.SI cs.CY physics.soc-ph

    Epidemic Intelligence for the Crowd, by the Crowd (Full Version)

    Authors: Ernesto Diaz-Aviles, Avaré Stewart, Edward Velasco, Kerstin Denecke, Wolfgang Nejdl

    Abstract: Tracking Twitter for public health has shown great potential. However, most recent work has been focused on correlating Twitter messages to influenza rates, a disease that exhibits a marked seasonal pattern. In the presence of sudden outbreaks, how can social media streams be used to strengthen surveillance capacity? In May 2011, Germany reported an outbreak of Enterohemorrhagic Escherichia coli (…

    Submitted 5 March, 2012; originally announced March 2012.

    Comments: A short version of this work has been accepted for publication at the International AAAI Conference on Weblogs and Social Media (ICWSM 2012)

  44. arXiv:0812.4461  [pdf, ps, other]

    cs.IR

    Mining User Profiles to Support Structure and Explanation in Open Social Networking

    Authors: Avare Stewart, Ernesto Diaz-Aviles, Wolfgang Nejdl

    Abstract: The proliferation of media sharing and social networking websites has brought with it vast collections of site-specific user generated content. The result is a Social Networking Divide in which the concepts and structure common across different sites are hidden. The knowledge and structures from one social site are not adequately exploited to provide new information and resources to the same or…

    Submitted 23 December, 2008; originally announced December 2008.

    Comments: International Workshop on Interacting with Multimedia Content in the Social Semantic Web (IMC-SSW 2008). Collocated with the 3rd International Conference on Semantic and Digital Media Technologies (SAMT 2008), Koblenz, Germany, Dec. 03 2008

    ACM Class: H.3.3; H.3.5

    Journal ref: In Proceedings of the International Workshop on Interacting with Multimedia Content in the Social Semantic Web (IMC-SSW'08), pages 21-30. Koblenz, Germany, Dec. 3, 2008