Skip to main content

Showing 1–50 of 52 results for author: Kasneci, G

  1. arXiv:2406.11391  [pdf, other

    cs.LG

    P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models

    Authors: Shuo Yang, Chenchen Yuan, Yao Rong, Felix Steinbauer, Gjergji Kasneci

    Abstract: A multitude of industries depend on accurate and reasonable tabular data augmentation for their business processes. Contemporary methodologies in generating tabular data revolve around utilizing Generative Adversarial Networks (GAN) or fine-tuning Large Language Models (LLM). However, GAN-based approaches are documented to produce samples with common-sense errors attributed to the absence of exter… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: The paper was accepted by findings of ACL 2024

  2. arXiv:2405.13536  [pdf, other

    cs.LG cs.AI cs.CL

    Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers

    Authors: Tobias Leemann, Alina Fastowski, Felix Pfeiffer, Gjergji Kasneci

    Abstract: We address the critical challenge of applying feature attribution methods to the transformer architecture, which dominates current applications in natural language processing and beyond. Traditional attribution methods to explainable AI (XAI) explicitly or implicitly rely on linear or additive surrogate models to quantify the impact of input features on a model's output. In this work, we formally… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2404.15435  [pdf, other

    cs.HC

    Introduction to Eye Tracking: A Hands-On Tutorial for Students and Practitioners

    Authors: Enkelejda Kasneci, Hong Gao, Suleyman Ozdel, Virmarie Maquiling, Enkeleda Thaqi, Carrie Lau, Yao Rong, Gjergji Kasneci, Efe Bozkir

    Abstract: Eye-tracking technology is widely used in various application areas such as psychology, neuroscience, marketing, and human-computer interaction, as it is a valuable tool for understanding how people process information and interact with their environment. This tutorial provides a comprehensive introduction to eye tracking, from the basics of eye anatomy and physiology to the principles and applica… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  4. arXiv:2403.10330  [pdf, other

    cs.LG

    Towards Non-Adversarial Algorithmic Recourse

    Authors: Tobias Leemann, Martin Pawelczyk, Bardh Prenkaj, Gjergji Kasneci

    Abstract: The streams of research on adversarial examples and counterfactual explanations have largely been growing independently. This has led to several recent works trying to elucidate their similarities and differences. Most prominently, it has been argued that adversarial examples, as opposed to counterfactual explanations, have a unique characteristic in that they lead to a misclassification compared… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  5. arXiv:2402.18284  [pdf, other

    cs.CL cs.AI

    Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization

    Authors: Shuo Yang, Gjergji Kasneci

    Abstract: Wide usage of ChatGPT has highlighted the potential of reinforcement learning from human feedback. However, its training pipeline relies on manual ranking, a resource-intensive process. To reduce labor costs, we propose a self-supervised text ranking approach for applying Proximal-Policy-Optimization to fine-tune language models while eliminating the need for human annotators. Our method begins wi… ▽ More

    Submitted 2 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures

  6. arXiv:2402.02136  [pdf, other

    cs.HC

    User Intent Recognition and Satisfaction with Large Language Models: A User Study with ChatGPT

    Authors: Anna Bodonhelyi, Efe Bozkir, Shuo Yang, Enkelejda Kasneci, Gjergji Kasneci

    Abstract: The rapid evolution of large language models such as GPT-4 Turbo represents an impactful paradigm shift in digital interaction and content engagement. While these models encode vast amounts of human-generated knowledge and excel in processing diverse data types, recent research shows that they often face the challenge of accurately responding to specific user intents, leading to increased user dis… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  7. arXiv:2401.00832  [pdf, other

    cs.AI cs.CY

    Taking the Next Step with Generative Artificial Intelligence: The Transformative Role of Multimodal Large Language Models in Science Education

    Authors: Arne Bewersdorff, Christian Hartmann, Marie Hornberger, Kathrin Seßler, Maria Bannert, Enkelejda Kasneci, Gjergji Kasneci, Xiaoming Zhai, Claudia Nerdel

    Abstract: The integration of Artificial Intelligence (AI), particularly Large Language Model (LLM)-based systems, in education has shown promise in enhancing teaching and learning experiences. However, the advent of Multimodal Large Language Models (MLLMs) like GPT-4 with vision (GPT-4V), capable of processing multimodal data including text, sound, and visual inputs, opens a new era of enriched, personalize… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  8. arXiv:2311.12684  [pdf, other

    cs.LG

    Adversarial Reweighting Guided by Wasserstein Distance for Bias Mitigation

    Authors: Xuan Zhao, Simone Fabbrizzi, Paula Reyero Lobo, Siamak Ghodsi, Klaus Broelemann, Steffen Staab, Gjergji Kasneci

    Abstract: The unequal representation of different groups in a sample population can lead to discrimination of minority groups when machine learning models make automated decisions. To address these issues, fairness-aware machine learning jointly optimizes two (or more) metrics aiming at predictive effectiveness and low unfairness. However, the inherent under-representation of minorities in the data makes th… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  9. arXiv:2311.10512  [pdf, other

    cs.LG

    Causal Fairness-Guided Dataset Reweighting using Neural Networks

    Authors: Xuan Zhao, Klaus Broelemann, Salvatore Ruggieri, Gjergji Kasneci

    Abstract: The importance of achieving fairness in machine learning models cannot be overstated. Recent research has pointed out that fairness should be examined from a causal perspective, and several fairness notions based on the on Pearl's causal framework have been proposed. In this paper, we construct a reweighting scheme of datasets to address causal fairness. Our approach aims at mitigating bias by con… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: To be published in the proceedings of 2023 IEEE International Conference on Big Data (IEEE BigData 2023)

  10. arXiv:2311.08228  [pdf, other

    cs.LG

    Counterfactual Explanation for Regression via Disentanglement in Latent Space

    Authors: Xuan Zhao, Klaus Broelemann, Gjergji Kasneci

    Abstract: Counterfactual Explanations (CEs) help address the question: How can the factors that influence the prediction of a predictive model be changed to achieve a more favorable outcome from a user's perspective? Thus, they bear the potential to guide the user's interaction with AI systems since they represent easy-to-understand explanations. To be applicable, CEs need to be realistic and actionable. In… ▽ More

    Submitted 23 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: CXAI workshop @ ICDM 2023. arXiv admin note: text overlap with arXiv:2307.13390

  11. arXiv:2308.02353  [pdf, other

    cs.LG cs.AI

    Adapting to Change: Robust Counterfactual Explanations in Dynamic Data Landscapes

    Authors: Bardh Prenkaj, Mario Villaizan-Vallelado, Tobias Leemann, Gjergji Kasneci

    Abstract: We introduce a novel semi-supervised Graph Counterfactual Explainer (GCE) methodology, Dynamic GRAph Counterfactual Explainer (DyGRACE). It leverages initial knowledge about the data distribution to search for valid counterfactuals while avoiding using information from potentially outdated decision functions in subsequent time steps. Employing two graph autoencoders (GAEs), DyGRACE learns the repr… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  12. arXiv:2307.13390  [pdf, other

    cs.LG

    Counterfactual Explanation via Search in Gaussian Mixture Distributed Latent Space

    Authors: Xuan Zhao, Klaus Broelemann, Gjergji Kasneci

    Abstract: Counterfactual Explanations (CEs) are an important tool in Algorithmic Recourse for addressing two questions: 1. What are the crucial factors that led to an automated prediction/decision? 2. How can these factors be changed to achieve a more favorable outcome from a user's perspective? Thus, guiding the user's interaction with AI systems by proposing easy-to-understand explanations and easy-to-att… ▽ More

    Submitted 21 November, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: XAI workshop of IJCAI 2023

  13. arXiv:2306.07273  [pdf, other

    cs.LG cs.CR stat.ML

    Gaussian Membership Inference Privacy

    Authors: Tobias Leemann, Martin Pawelczyk, Gjergji Kasneci

    Abstract: We propose a novel and practical privacy notion called $f$-Membership Inference Privacy ($f$-MIP), which explicitly considers the capabilities of realistic adversaries under the membership inference attack threat model. Consequently, $f$-MIP offers interpretable privacy guarantees and improved utility (e.g., better classification accuracy). In particular, we derive a parametric family of $f$-MIP g… ▽ More

    Submitted 26 October, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera-ready. The first two authors contributed equally

  14. arXiv:2303.08081  [pdf, other

    cs.LG stat.ML

    Explanation Shift: How Did the Distribution Shift Impact the Model?

    Authors: Carlos Mougan, Klaus Broelemann, David Masip, Gjergji Kasneci, Thanassis Thiropanis, Steffen Staab

    Abstract: As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions or model prediction distributions and try to understand issues regarding the interactions between learned models and shifting distributions. We suggest a novel… ▽ More

    Submitted 7 September, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.12369

  15. arXiv:2212.12374  [pdf, other

    cs.LG

    Relational Local Explanations

    Authors: Vadim Borisov, Gjergji Kasneci

    Abstract: The majority of existing post-hoc explanation approaches for machine learning models produce independent, per-variable feature attribution scores, ignoring a critical inherent characteristics of homogeneously structured data, such as visual or text data: there exist latent inter-variable relationships between features. In response, we develop a novel model-agnostic and permutation-based feature at… ▽ More

    Submitted 11 February, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

  16. arXiv:2211.09940  [pdf, other

    cs.LG cs.DC

    Entry Dependent Expert Selection in Distributed Gaussian Processes Using Multilabel Classification

    Authors: Hamed Jalali, Gjergji Kasneci

    Abstract: By distributing the training process, local approximation reduces the cost of the standard Gaussian Process. An ensemble technique combines local predictions from Gaussian experts trained on different partitions of the data. Ensemble methods aggregate models' predictions by assuming a perfect diversity of local predictors. Although it keeps the aggregation tractable, this assumption is often viola… ▽ More

    Submitted 8 January, 2024; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: A condensed version of this work has been accepted at the Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems workshop during NeurIPS 2022

  17. arXiv:2211.02151  [pdf, other

    cs.LG cs.AI cs.CY

    Decomposing Counterfactual Explanations for Consequential Decision Making

    Authors: Martin Pawelczyk, Lea Tiyavorabun, Gjergji Kasneci

    Abstract: The goal of algorithmic recourse is to reverse unfavorable decisions (e.g., from loan denial to approval) under automated decision making by suggesting actionable feature changes (e.g., reduce the number of credit cards). To generate low-cost recourse the majority of methods work under the assumption that the features are independently manipulable (IMF). To address the feature dependency issue the… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  18. arXiv:2210.13954  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    I Prefer not to Say: Protecting User Consent in Models with Optional Personal Data

    Authors: Tobias Leemann, Martin Pawelczyk, Christian Thomas Eberle, Gjergji Kasneci

    Abstract: We examine machine learning models in a setup where individuals have the choice to share optional personal information with a decision-making system, as seen in modern insurance pricing models. Some users consent to their data being used whereas others object and keep their data undisclosed. In this work, we show that the decision not to share data can be considered as information in itself that s… ▽ More

    Submitted 2 February, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: v5: AAAI-24 Camera-Ready Version Including Appendices. v1: NeurIPS 2022 Workshop on Algorithmic Fairness through the Lens of Causality and Privacy (AFCP)

  19. arXiv:2210.12369  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Explanation Shift: Detecting distribution shifts on tabular data via the explanation space

    Authors: Carlos Mougan, Klaus Broelemann, Gjergji Kasneci, Thanassis Tiropanis, Steffen Staab

    Abstract: As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In the past, predictive performance was considered the key indicator to monitor. However, explanation aspects have come to attention within the last years. In this work, we investigate how model predictive performance and model explanation characteristics are affected under distribution… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: Neural Information Processing Systems (NeurIPS 2022). Workshop on Distribution Shifts: Connecting Methods and Applications

  20. arXiv:2210.11584  [pdf, other

    cs.AI cs.HC

    Towards Human-centered Explainable AI: A Survey of User Studies for Model Explanations

    Authors: Yao Rong, Tobias Leemann, Thai-trang Nguyen, Lisa Fiedler, Peizhu Qian, Vaibhav Unhelkar, Tina Seidel, Gjergji Kasneci, Enkelejda Kasneci

    Abstract: Explainable AI (XAI) is widely viewed as a sine qua non for ever-expanding AI research. A better understanding of the needs of XAI users, as well as human-centered evaluations of explainable models are both a necessity and a challenge. In this paper, we explore how HCI and AI researchers conduct user studies in XAI applications based on a systematic literature review. After identifying and thoroug… ▽ More

    Submitted 19 December, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

  21. arXiv:2210.06280  [pdf, other

    cs.LG

    Language Models are Realistic Tabular Data Generators

    Authors: Vadim Borisov, Kathrin Seßler, Tobias Leemann, Martin Pawelczyk, Gjergji Kasneci

    Abstract: Tabular data is among the oldest and most ubiquitous forms of data. However, the generation of synthetic samples with the original data's characteristics remains a significant challenge for tabular data. While many generative models from the computer vision domain, such as variational autoencoders or generative adversarial networks, have been adapted for tabular data generation, less research has… ▽ More

    Submitted 22 April, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  22. arXiv:2209.02764  [pdf, other

    cs.LG stat.ML

    Change Detection for Local Explainability in Evolving Data Streams

    Authors: Johannes Haug, Alexander Braun, Stefan Zürn, Gjergji Kasneci

    Abstract: As complex machine learning models are increasingly used in sensitive applications like banking, trading or credit scoring, there is a growing demand for reliable explanation mechanisms. Local feature attribution methods have become a popular technique for post-hoc and model-agnostic explanations. However, attribution methods typically assume a stationary environment in which the predictive model… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: To be published in the proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM 2022)

  23. arXiv:2208.14137  [pdf, other

    cs.LG cs.AI cs.CY

    On the Trade-Off between Actionable Explanations and the Right to be Forgotten

    Authors: Martin Pawelczyk, Tobias Leemann, Asia Biega, Gjergji Kasneci

    Abstract: As machine learning (ML) models are increasingly being deployed in high-stakes applications, policymakers have suggested tighter data protection regulations (e.g., GDPR, CCPA). One key principle is the "right to be forgotten" which gives users the right to have their data deleted. Another key principle is the right to an actionable explanation, also known as algorithmic recourse, allowing users to… ▽ More

    Submitted 11 October, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: ICLR 2023 camera ready version

    Journal ref: 11th International Conference on Learning Representations (ICLR) 2023

  24. arXiv:2208.03142  [pdf, other

    cs.CV cs.LG

    BoxShrink: From Bounding Boxes to Segmentation Masks

    Authors: Michael Gröger, Vadim Borisov, Gjergji Kasneci

    Abstract: One of the core challenges facing the medical image computing community is fast and efficient data sample labeling. Obtaining fine-grained labels for segmentation is particularly demanding since it is expensive, time-consuming, and requires sophisticated tools. On the contrary, applying bounding boxes is fast and takes significantly less time than fine-grained labeling, but does not produce detail… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

  25. Fairness in Agreement With European Values: An Interdisciplinary Perspective on AI Regulation

    Authors: Alejandra Bringas Colmenarejo, Luca Nannini, Alisa Rieger, Kristen M. Scott, Xuan Zhao, Gourab K. Patro, Gjergji Kasneci, Katharina Kinder-Kurlanda

    Abstract: With increasing digitalization, Artificial Intelligence (AI) is becoming ubiquitous. AI-based systems to identify, optimize, automate, and scale solutions to complex economic and societal problems are being proposed and implemented. This has motivated regulation efforts, including the Proposal of an EU AI Act. This interdisciplinary position paper considers various concerns surrounding fairness an… ▽ More

    Submitted 8 June, 2022; originally announced July 2022.

    Comments: In proceedings of AAAI/ACM Conference AIES 2022 (https://doi.org/10.1145/3514094.3534158)

  26. arXiv:2206.13872  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    When are Post-hoc Conceptual Explanations Identifiable?

    Authors: Tobias Leemann, Michael Kirchhof, Yao Rong, Enkelejda Kasneci, Gjergji Kasneci

    Abstract: Interest in understanding and factorizing learned embedding spaces through conceptual explanations is steadily growing. When no human concept labels are available, concept discovery methods search trained embedding spaces for interpretable concepts like object shape or color that can provide post-hoc explanations for decisions. Unlike previous work, we argue that concept discovery should be identi… ▽ More

    Submitted 6 June, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: v5: UAI2023 camera-ready including supplementary material. The first two authors contributed equally

  27. arXiv:2204.13625  [pdf, other

    cs.LG stat.ML

    Standardized Evaluation of Machine Learning Methods for Evolving Data Streams

    Authors: Johannes Haug, Effi Tramountani, Gjergji Kasneci

    Abstract: Due to the unspecified and dynamic nature of data streams, online machine learning requires powerful and flexible solutions. However, evaluating online machine learning methods under realistic conditions is difficult. Existing work therefore often draws on different heuristics and simulations that do not necessarily produce meaningful and reliable results. Indeed, in the absence of common evaluati… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

  28. Dynamic Model Tree for Interpretable Data Stream Learning

    Authors: Johannes Haug, Klaus Broelemann, Gjergji Kasneci

    Abstract: Data streams are ubiquitous in modern business and society. In practice, data streams may evolve over time and cannot be stored indefinitely. Effective and transparent machine learning on data streams is thus often challenging. Hoeffding Trees have emerged as a state-of-the art for online predictive modelling. They are easy to train and provide meaningful convergence guarantees under a stationary… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: To be published in the proceedings of the 38th IEEE International Conference on Data Engineering (ICDE 2022)

  29. arXiv:2203.06768  [pdf, other

    cs.LG cs.CY

    Probabilistically Robust Recourse: Navigating the Trade-offs between Costs and Robustness in Algorithmic Recourse

    Authors: Martin Pawelczyk, Teresa Datta, Johannes van-den-Heuvel, Gjergji Kasneci, Himabindu Lakkaraju

    Abstract: As machine learning models are increasingly being employed to make consequential decisions in real-world settings, it becomes critical to ensure that individuals who are adversely impacted (e.g., loan denied) by the predictions of these models are provided with a means for recourse. While several approaches have been proposed to construct recourses for affected individuals, the recourses output by… ▽ More

    Submitted 11 October, 2023; v1 submitted 13 March, 2022; originally announced March 2022.

    Comments: ICLR 2023, camera ready version

    Journal ref: 11th International Conference on Learning Representations (ICLR) 2023

  30. arXiv:2202.03287  [pdf, other

    cs.LG cs.DC stat.ML

    Gaussian Graphical Models as an Ensemble Method for Distributed Gaussian Processes

    Authors: Hamed Jalali, Gjergji Kasneci

    Abstract: Distributed Gaussian process (DGP) is a popular approach to scale GP to big data which divides the training data into some subsets, performs local inference for each partition, and aggregates the results to acquire global prediction. To combine the local predictions, the conditional independence assumption is used which basically means there is a perfect diversity between the subsets. Although it… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: OPT2021: 13th Annual Workshop on Optimization for Machine Learning

  31. arXiv:2202.00449  [pdf, other

    cs.CV cs.LG

    A Consistent and Efficient Evaluation Strategy for Attribution Methods

    Authors: Yao Rong, Tobias Leemann, Vadim Borisov, Gjergji Kasneci, Enkelejda Kasneci

    Abstract: With a variety of local feature attribution methods being proposed in recent years, follow-up work suggested several evaluation strategies. To assess the attribution quality across different attribution techniques, the most popular among these evaluation strategies in the image domain use pixel perturbations. However, recent advances discovered that different evaluation strategies produce conflict… ▽ More

    Submitted 14 June, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: 26 pages. Accepted at ICML 2022

  32. arXiv:2111.07379  [pdf, other

    cs.LG

    A Robust Unsupervised Ensemble of Feature-Based Explanations using Restricted Boltzmann Machines

    Authors: Vadim Borisov, Johannes Meier, Johan van den Heuvel, Hamed Jalali, Gjergji Kasneci

    Abstract: Understanding the results of deep neural networks is an essential step towards wider acceptance of deep learning algorithms. Many approaches address the issue of interpreting artificial neural networks, but often provide divergent explanations. Moreover, different hyperparameters of an explanatory method can lead to conflicting interpretations. In this paper, we propose a technique for aggregating… ▽ More

    Submitted 14 November, 2021; originally announced November 2021.

  33. Deep Neural Networks and Tabular Data: A Survey

    Authors: Vadim Borisov, Tobias Leemann, Kathrin Seßler, Johannes Haug, Martin Pawelczyk, Gjergji Kasneci

    Abstract: Heterogeneous tabular data are the most commonly used form of data and are essential for numerous critical and computationally demanding applications. On homogeneous data sets, deep neural networks have repeatedly shown excellent performance and have therefore been widely adopted. However, their adaptation to tabular data for inference or data generation tasks remains challenging. To facilitate fu… ▽ More

    Submitted 29 June, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

  34. arXiv:2108.00783  [pdf, other

    cs.LG cs.AI

    CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

    Authors: Martin Pawelczyk, Sascha Bielawski, Johannes van den Heuvel, Tobias Richter, Gjergji Kasneci

    Abstract: Counterfactual explanations provide means for prescriptive model explanations by suggesting actionable feature changes (e.g., increase income) that allow individuals to achieve favorable outcomes in the future (e.g., insurance approval). Choosing an appropriate method is a crucial aspect for meaningful counterfactual explanations. As documented in recent reviews, there exists a quickly growing lit… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: Accepted to NeurIPS Benchmark & Data Set Track

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  35. TEyeD: Over 20 million real-world eye images with Pupil, Eyelid, and Iris 2D and 3D Segmentations, 2D and 3D Landmarks, 3D Eyeball, Gaze Vector, and Eye Movement Types

    Authors: Wolfgang Fuhl, Gjergji Kasneci, Enkelejda Kasneci

    Abstract: We present TEyeD, the world's largest unified public data set of eye images taken with head-mounted devices. TEyeD was acquired with seven different head-mounted eye trackers. Among them, two eye trackers were integrated into virtual reality (VR) or augmented reality (AR) devices. The images in TEyeD were obtained from various tasks, including car rides, simulator rides, outdoor sports activities,… ▽ More

    Submitted 6 June, 2023; v1 submitted 3 February, 2021; originally announced February 2021.

    Comments: Download: Just connect via FTP as user TEyeDUser and without password to nephrit.cs.uni-tuebingen.de (ftp://nephrit.cs.uni-tuebingen.de)

  36. Gaussian Experts Selection using Graphical Models

    Authors: Hamed Jalali, Martin Pawelczyk, Gjergji Kasneci

    Abstract: Local approximations are popular methods to scale Gaussian processes (GPs) to big data. Local approximations reduce time complexity by dividing the original dataset into subsets and training a local expert on each subset. Aggregating the experts' prediction is done assuming either conditional dependence or independence between the experts. Imposing the \emph{conditional independence assumption} (C… ▽ More

    Submitted 31 August, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

  37. arXiv:2101.00905  [pdf, other

    cs.LG

    On Baselines for Local Feature Attributions

    Authors: Johannes Haug, Stefan Zürn, Peter El-Jiz, Gjergji Kasneci

    Abstract: High-performing predictive models, such as neural nets, usually operate as black boxes, which raises serious concerns about their interpretability. Local feature attribution methods help to explain black box models and are therefore a powerful tool for assessing the reliability and fairness of predictions. To this end, most attribution models compare the importance of input features with a referen… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: Accepted at the AAAI-21 Workshop on Explainable Agency in AI

  38. arXiv:2010.15996  [pdf, other

    astro-ph.IM cs.LG

    Lessons Learned from the 1st ARIEL Machine Learning Challenge: Correcting Transiting Exoplanet Light Curves for Stellar Spots

    Authors: Nikolaos Nikolaou, Ingo P. Waldmann, Angelos Tsiaras, Mario Morvan, Billy Edwards, Kai Hou Yip, Giovanna Tinetti, Subhajit Sarkar, James M. Dawson, Vadim Borisov, Gjergji Kasneci, Matej Petkovic, Tomaz Stepisnik, Tarek Al-Ubaidi, Rachel Louise Bailey, Michael Granitzer, Sahib Julka, Roman Kern, Patrick Ofner, Stefan Wagner, Lukas Heppe, Mirko Bunse, Katharina Morik

    Abstract: The last decade has witnessed a rapid growth of the field of exoplanet discovery and characterisation. However, several big challenges remain, many of which could be addressed using machine learning methodology. For instance, the most prolific method for detecting exoplanets and inferring several of their characteristics, transit photometry, is very sensitive to the presence of stellar spots. The… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: 20 pages, 7 figures, 2 tables, Submitted to The Astrophysics Journal (ApJ)

  39. Learning Parameter Distributions to Detect Concept Drift in Data Streams

    Authors: Johannes Haug, Gjergji Kasneci

    Abstract: Data distributions in streaming environments are usually not stationary. In order to maintain a high predictive quality at all times, online learning models need to adapt to distributional changes, which are known as concept drift. The timely and robust identification of concept drift can be difficult, as we never have access to the true distribution of streaming data. In this work, we propose a n… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: To be published in the proceedings of the 25th International Conference on Pattern Recognition (ICPR 2020)

  40. Aggregating Dependent Gaussian Experts in Local Approximation

    Authors: Hamed Jalali, Gjergji Kasneci

    Abstract: Distributed Gaussian processes (DGPs) are prominent local approximation methods to scale Gaussian processes (GPs) to large datasets. Instead of a global estimation, they train local experts by dividing the training set into subsets, thus reducing the time complexity. This strategy is based on the conditional independence assumption, which basically means that there is a perfect diversity between t… ▽ More

    Submitted 17 October, 2020; originally announced October 2020.

    Comments: 8 pages

    Journal ref: 25th International Conference on Pattern Recognition, ICPR 2020, Virtual Event / Milan, Italy, January 10-15, 2021

  41. arXiv:2006.13132  [pdf, other

    cs.LG cs.CY stat.ML

    On Counterfactual Explanations under Predictive Multiplicity

    Authors: Martin Pawelczyk, Klaus Broelemann, Gjergji Kasneci

    Abstract: Counterfactual explanations are usually obtained by identifying the smallest change made to an input to change a prediction made by a fixed model (hereafter called sparse methods). Recent work, however, has revitalized an old insight: there often does not exist one superior solution to a prediction problem with respect to commonly used measures of interest (e.g. error rate). In fact, often multipl… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

    Journal ref: Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI), PMLR volume 124, 2020

  42. Leveraging Model Inherent Variable Importance for Stable Online Feature Selection

    Authors: Johannes Haug, Martin Pawelczyk, Klaus Broelemann, Gjergji Kasneci

    Abstract: Feature selection can be a crucial factor in obtaining robust and accurate predictions. Online feature selection models, however, operate under considerable restrictions; they need to efficiently extract salient input features based on a bounded set of observations, while enabling robust and accurate predictions. In this work, we introduce FIRES, a novel framework for online feature selection. The… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: To be published in the Proceedings of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2020)

  43. arXiv:2001.09762  [pdf, other

    cs.CY

    Bias in Data-driven AI Systems -- An Introductory Survey

    Authors: Eirini Ntoutsi, Pavlos Fafalios, Ujwal Gadiraju, Vasileios Iosifidis, Wolfgang Nejdl, Maria-Esther Vidal, Salvatore Ruggieri, Franco Turini, Symeon Papadopoulos, Emmanouil Krasanakis, Ioannis Kompatsiaris, Katharina Kinder-Kurlanda, Claudia Wagner, Fariba Karimi, Miriam Fernandez, Harith Alani, Bettina Berendt, Tina Kruegel, Christian Heinze, Klaus Broelemann, Gjergji Kasneci, Thanassis Tiropanis, Steffen Staab

    Abstract: AI-based systems are widely employed nowadays to make decisions that have far-reaching impacts on individuals and society. Their decisions might affect everyone, everywhere and anytime, entailing concerns about potential human rights issues. Therefore, it is necessary to move beyond traditional AI algorithms optimized for predictive performance and embed ethical and legal principles in their desig… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 19 pages, 1 figure

  44. Learning Model-Agnostic Counterfactual Explanations for Tabular Data

    Authors: Martin Pawelczyk, Johannes Haug, Klaus Broelemann, Gjergji Kasneci

    Abstract: Counterfactual explanations can be obtained by identifying the smallest change made to a feature vector to qualitatively influence a prediction; for example, from 'loan rejected' to 'awarded' or from 'high risk of cardiovascular disease' to 'low risk'. Previous approaches often emphasized that counterfactuals should be easily interpretable to humans, motivating sparse solutions with few changes to… ▽ More

    Submitted 3 May, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Update version: from Neurips Workshop to WWW publication. In Proceedings of The Web Conference 2020 (WWW 20), April 20-24, 2020, Taipei, Taiwan

  45. arXiv:1905.10073  [pdf, other

    cs.LG stat.ML

    Training Decision Trees as Replacement for Convolution Layers

    Authors: Wolfgang Fuhl, Gjergji Kasneci, Wolfgang Rosenstiel, Enkelejda Kasneci

    Abstract: We present an alternative layer to convolution layers in convolutional neural networks (CNNs). Our approach reduces the complexity of convolutions by replacing it with binary decisions. Those binary decisions are used as indexes to conditional distributions where each weight represents a leaf in a decision tree. This means that only the indices to the weights need to be determined once, thus reduc… ▽ More

    Submitted 11 February, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: Will be published in the proceedings oft he AAAI 2020 conference

  46. A Gradient-Based Split Criterion for Highly Accurate and Transparent Model Trees

    Authors: Klaus Broelemann, Gjergji Kasneci

    Abstract: Machine learning algorithms aim at minimizing the number of false decisions and increasing the accuracy of predictions. However, the high predictive power of advanced algorithms comes at the costs of transparency. State-of-the-art methods, such as neural networks and ensemble methods, often result in highly complex models that offer little transparency. We propose shallow model trees as a way to… ▽ More

    Submitted 29 August, 2019; v1 submitted 25 September, 2018; originally announced September 2018.

    Journal ref: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, {IJCAI} 2019

  47. arXiv:1807.10680  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Combining Restricted Boltzmann Machines with Neural Networks for Latent Truth Discovery

    Authors: Klaus Broelemann, Gjergji Kasneci

    Abstract: Latent truth discovery, LTD for short, refers to the problem of aggregating ltiple claims from various sources in order to estimate the plausibility of atements about entities. In the absence of a ground truth, this problem is highly challenging, when some sources provide conflicting claims and others no claims at all. In this work we provide an unsupervised stochastic inference procedure on top o… ▽ More

    Submitted 27 July, 2018; originally announced July 2018.

  48. arXiv:1801.00283  [pdf, ps, other

    cs.LG stat.ML

    Restricted Boltzmann Machines for Robust and Fast Latent Truth Discovery

    Authors: Klaus Broelemann, Thomas Gottron, Gjergji Kasneci

    Abstract: We address the problem of latent truth discovery, LTD for short, where the goal is to discover the underlying true values of entity attributes in the presence of noisy, conflicting or incomplete information. Despite a multitude of algorithms to address the LTD problem that can be found in literature, only little is known about their overall performance with respect to effectiveness (in terms of tr… ▽ More

    Submitted 31 December, 2017; originally announced January 2018.

  49. arXiv:1711.00112  [pdf, other

    cs.CV

    PupilNet v2.0: Convolutional Neural Networks for CPU based real time Robust Pupil Detection

    Authors: Wolfgang Fuhl, Thiago Santini, Gjergji Kasneci, Wolfgang Rosenstiel, Enkelejda Kasneci

    Abstract: Real-time, accurate, and robust pupil detection is an essential prerequisite for pervasive video-based eye-tracking. However, automated pupil detection in realworld scenarios has proven to be an intricate challenge due to fast illumination changes, pupil occlusion, non-centered and off-axis eye recording, as well as physiological eye characteristics. In this paper, we approach this challenge throu… ▽ More

    Submitted 30 October, 2017; originally announced November 2017.

    Comments: Pupil detection, pupil center estimation, image processing, CNN. arXiv admin note: substantial text overlap with arXiv:1601.04902

  50. arXiv:1601.04902  [pdf, other

    cs.CV

    PupilNet: Convolutional Neural Networks for Robust Pupil Detection

    Authors: Wolfgang Fuhl, Thiago Santini, Gjergji Kasneci, Enkelejda Kasneci

    Abstract: Real-time, accurate, and robust pupil detection is an essential prerequisite for pervasive video-based eye-tracking. However, automated pupil detection in real-world scenarios has proven to be an intricate challenge due to fast illumination changes, pupil occlusion, non centered and off-axis eye recording, and physiological eye characteristics. In this paper, we propose and evaluate a method based… ▽ More

    Submitted 19 January, 2016; originally announced January 2016.

    Comments: 9 pages, 11 figures