-
Towards Personalised Patient Risk Prediction Using Temporal Hospital Data Trajectories
Authors:
Thea Barnes,
Enrico Werner,
Jeffrey N. Clark,
Raul Santos-Rodriguez
Abstract:
Quantifying a patient's health status provides clinicians with insight into patient risk, and the ability to better triage and manage resources. Early Warning Scores (EWS) are widely deployed to measure overall health status, and risk of adverse outcomes, in hospital patients. However, current EWS are limited both by their lack of personalisation and use of static observations. We propose a pipeline that groups intensive care unit patients by the trajectories of observations data throughout their stay as a basis for the development of personalised risk predictions. Feature importance is considered to provide model explainability. Using the MIMIC-IV dataset, six clusters were identified, capturing differences in disease codes, observations, lengths of admissions and outcomes. Applying the pipeline to data from just the first four hours of each ICU stay assigns the majority of patients to the same cluster as when the entire stay duration is considered. In-hospital mortality prediction models trained on individual clusters had higher F1 score performance in five of the six clusters when compared against the unclustered patient cohort. The pipeline could form the basis of a clinical decision support tool, working to improve the clinical characterisation of risk groups and the early detection of patient deterioration.
Submitted 12 July, 2024;
originally announced July 2024.
-
TraCE: Trajectory Counterfactual Explanation Scores
Authors:
Jeffrey N. Clark,
Edward A. Small,
Nawid Keshtmand,
Michelle W. L. Wan,
Elena Fillola Mayoral,
Enrico Werner,
Christopher P. Bourdeaux,
Raul Santos-Rodriguez
Abstract:
Counterfactual explanations, and their associated algorithmic recourse, are typically leveraged to understand, explain, and potentially alter a prediction coming from a black-box classifier. In this paper, we propose to extend the use of counterfactuals to evaluate progress in sequential decision making tasks. To this end, we introduce a model-agnostic modular framework, TraCE (Trajectory Counterfactual Explanation) scores, which is able to distill and condense progress in highly complex scenarios into a single value. We demonstrate TraCE's utility across domains by showcasing its main properties in two case studies spanning healthcare and climate change.
Submitted 26 January, 2024; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Towards Computational Performance Engineering for Unsupervised Concept Drift Detection -- Complexities, Benchmarking, Performance Analysis
Authors:
Elias Werner,
Nishant Kumar,
Matthias Lieber,
Sunna Torge,
Stefan Gumhold,
Wolfgang E. Nagel
Abstract:
Concept drift detection is crucial for many AI systems to ensure the system's reliability. These systems often have to deal with large amounts of data or react in real-time. Thus, drift detectors must meet computational requirements or constraints with a comprehensive performance evaluation. However, so far, the focus of developing drift detectors is on inference quality, e.g. accuracy, but not on computational performance, such as runtime. Many of the previous works consider computational performance only as a secondary objective and do not have a benchmark for such evaluation. Hence, we propose and explain performance engineering for unsupervised concept drift detection that reflects on computational complexities, benchmarking, and performance analysis. We provide the computational complexities of existing unsupervised drift detectors and discuss why further computational performance investigations are required. Hence, we state and substantiate the aspects of a benchmark for unsupervised drift detection reflecting on inference quality and computational performance. Furthermore, we demonstrate performance analysis practices that have proven their effectiveness in High-Performance Computing, by tracing two drift detectors and displaying their performance data.
Submitted 10 June, 2024; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Identification, explanation and clinical evaluation of hospital patient subtypes
Authors:
Enrico Werner,
Jeffrey N. Clark,
Ranjeet S. Bhamber,
Michael Ambler,
Christopher P. Bourdeaux,
Alexander Hepburn,
Christopher J. McWilliams,
Raul Santos-Rodriguez
Abstract:
We present a pipeline in which unsupervised machine learning techniques are used to automatically identify subtypes of hospital patients admitted between 2017 and 2021 in a large UK teaching hospital. With the use of state-of-the-art explainability techniques, the identified subtypes are interpreted and assigned clinical meaning. In parallel, clinicians assessed intra-cluster similarities and inter-cluster differences of the identified patient subtypes within the context of their clinical knowledge. By confronting the outputs of both automatic and clinician-based explanations, we aim to highlight the mutual benefit of combining machine learning techniques with clinical expertise.
Submitted 19 January, 2023;
originally announced January 2023.
-
MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations of Behavior
Authors:
Jennifer J. Sun,
Markus Marks,
Andrew Ulmer,
Dipam Chakraborty,
Brian Geuther,
Edward Hayes,
Heng Jia,
Vivek Kumar,
Sebastian Oleszko,
Zachary Partridge,
Milan Peelman,
Alice Robie,
Catherine E. Schretter,
Keith Sheppard,
Chao Sun,
Param Uttarwar,
Julian M. Wagner,
Eric Werner,
Joseph Parker,
Pietro Perona,
Yisong Yue,
Kristin Branson,
Ann Kennedy
Abstract:
We introduce MABe22, a large-scale, multi-agent video and trajectory benchmark to assess the quality of learned behavior representations. This dataset is collected from a variety of biology experiments, and includes triplets of interacting mice (4.7 million frames video+pose tracking data, 10 million frames pose only), symbiotic beetle-ant interactions (10 million frames video data), and groups of interacting flies (4.4 million frames of pose tracking data). Accompanying these data, we introduce a panel of real-life downstream analysis tasks to assess the quality of learned representations by evaluating how well they preserve information about the experimental conditions (e.g. strain, time of day, optogenetic stimulation) and animal behavior. We test multiple state-of-the-art self-supervised video and trajectory representation learning methods to demonstrate the use of our benchmark, revealing that methods developed using human action datasets do not fully translate to animal datasets. We hope that our benchmark and dataset encourage a broader exploration of behavior representation learning methods across species and settings.
Submitted 30 June, 2023; v1 submitted 21 July, 2022;
originally announced July 2022.
-
Improving traffic sign recognition by active search
Authors:
S. Jaghouar,
H. Gustafsson,
B. Mehlig,
E. Werner,
N. Gustafsson
Abstract:
We describe an iterative active-learning algorithm to recognise rare traffic signs. A standard ResNet is trained on a training set containing only a single sample of the rare class. We demonstrate that by sorting the samples of a large, unlabeled set by the estimated probability of belonging to the rare class, we can efficiently identify samples from the rare class. This works despite the fact that this estimated probability is usually quite low. A reliable active-learning loop is obtained by labeling these candidate samples, including them in the training set, and iterating the procedure. Further, we show that we get similar results starting from a single synthetic sample. Our results are important as they indicate a straightforward way of improving traffic-sign recognition for automated driving systems. In addition, they show that we can make use of the information hidden in low confidence outputs, which is usually ignored.
Submitted 29 November, 2021;
originally announced November 2021.
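The loop described in the abstract above (rank the unlabeled pool by estimated rare-class probability, label the top candidates, add them to the training set, repeat) can be sketched on toy data. This is only an illustration under stated assumptions: the paper trains a ResNet on images, whereas the sketch below swaps in a hypothetical nearest-centroid scorer on synthetic 2-D points, and the names `rare_probability`, `active_search`, `rounds`, and `k` are invented for the example.

```python
import math
import random


def centroid(points):
    """Component-wise mean of a list of equal-length tuples."""
    return [sum(c) / len(points) for c in zip(*points)]


def rare_probability(x, rare_centroid, common_centroid):
    """Toy stand-in for a classifier's rare-class probability (assumption)."""
    d_rare = math.dist(x, rare_centroid)
    d_common = math.dist(x, common_centroid)
    return 1.0 / (1.0 + math.exp(d_rare - d_common))


def active_search(pool, oracle, rare_seed, common_seed, rounds=5, k=10):
    """Iteratively label the k highest-scoring pool samples and retrain."""
    rare, common = list(rare_seed), list(common_seed)
    remaining = list(pool)
    for _ in range(rounds):
        # "Retrain": recompute the centroids from everything labeled so far.
        rc, cc = centroid(rare), centroid(common)
        # Sort the unlabeled pool by estimated rare-class probability.
        remaining.sort(key=lambda x: rare_probability(x, rc, cc), reverse=True)
        candidates, remaining = remaining[:k], remaining[k:]
        # "Label" the candidates with the oracle and extend the training set.
        for x in candidates:
            (rare if oracle(x) else common).append(x)
    return rare, common


# Synthetic data: 980 common points, 20 rare points, one rare seed sample.
rng = random.Random(0)
common_pts = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(980)]
rare_pts = [(rng.gauss(4, 0.5), rng.gauss(4, 0.5)) for _ in range(20)]
rare_set = set(rare_pts)
pool = common_pts[5:] + rare_pts[1:]
rng.shuffle(pool)

rare, common = active_search(
    pool, lambda x: x in rare_set, rare_pts[:1], common_pts[:5]
)
found = len(rare) - 1  # rare samples discovered beyond the single seed
print(f"rare samples found after labeling 50 candidates: {found}")
```

The toy clusters are far better separated than real traffic-sign classes, so the sketch does not reproduce the paper's observation that the estimated probability is usually low; it only shows the shape of the sort, label, retrain loop.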
-
A Category Theory of Communication Theory
Authors:
Eric Werner
Abstract:
A theory of how agents can come to understand a language is presented. If understanding a sentence $α$ is to associate an operator with $α$ that transforms the representational state of the agent as intended by the sender, then coming to know a language involves coming to know the operators that correspond to the meaning of any sentence. This involves a higher order operator that operates on the possible transformations that operate on the representational capacity of the agent. We formalize these constructs using concepts and diagrams analogous to category theory.
Submitted 28 May, 2015;
originally announced May 2015.
-
On the mixed $f$-divergence for multiple pairs of measures
Authors:
Elisabeth M. Werner,
Deping Ye
Abstract:
In this paper, the concept of the classical $f$-divergence (for a pair of measures) is extended to the mixed $f$-divergence (for multiple pairs of measures). The mixed $f$-divergence provides a way to measure the difference between multiple pairs of (probability) measures. Properties for the mixed $f$-divergence are established, such as permutation invariance and symmetry in distributions. An Alexandrov-Fenchel type inequality and an isoperimetric type inequality for the mixed $f$-divergence will be proved and applications in the theory of convex bodies are given.
Submitted 24 April, 2013;
originally announced April 2013.
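For orientation, the classical $f$-divergence that the abstract above extends can be written in its standard textbook form. The notation here ($P$, $Q$, densities $p$, $q$ with respect to a common measure $\mu$) is an assumption chosen for illustration, and the paper's mixed version for multiple pairs of measures is not reproduced.

```latex
% Classical f-divergence of P and Q, for a convex function f on (0, \infty),
% where p and q are densities of P and Q with respect to \mu:
D_f(P, Q) = \int f\!\left(\frac{p}{q}\right) q \, d\mu .

% Example: f(t) = t \log t recovers the Kullback-Leibler divergence,
D_f(P, Q) = \int p \log\frac{p}{q} \, d\mu .
```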
-
On the complexity of learning a language: An improvement of Block's algorithm
Authors:
Eric Werner
Abstract:
Language learning is thought to be a highly complex process. One of the hurdles in learning a language is to learn the rules of syntax of the language. Rules of syntax are often ordered in that before one rule can be applied one must apply another. It has been thought that to learn the order of n rules one must go through all n! permutations. Thus to learn the order of 27 rules would require 27! steps or 1.08889x10^{28} steps. This number is much greater than the number of seconds since the beginning of the universe! In an insightful analysis the linguist Block ([Block 86], pp. 62-63, p.238) showed that with the assumption of transitivity this vast number of learning steps reduces to a mere 377 steps. We present a mathematical analysis of the complexity of Block's algorithm. The algorithm has a complexity of order n^2 given n rules. In addition, we improve Block's results exponentially, by introducing an algorithm that has complexity of order less than n log n.
Submitted 11 December, 2012;
originally announced December 2012.
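The gap between n!, order n^2, and order n log n in the abstract above can be made concrete by counting pairwise "does rule a apply before rule b?" queries for n = 27. The sketch below is an assumption-laden illustration, not Block's algorithm or the paper's improved algorithm: it uses ordinary insertion ordering (at most n(n-1)/2 queries) and merge ordering (fewer than n log2 n queries) as stand-ins, and it does not attempt to reproduce the figure of 377 steps. The names `insertion_order`, `merge_order`, and `precedes` are hypothetical.

```python
import math
import random


def precedes(a, b):
    """Oracle for the hidden rule order: rule a applies before rule b."""
    return a < b


def insertion_order(items, precedes):
    """Order items by linear insertion; returns (ordered, query_count)."""
    ordered, queries = [], 0
    for item in items:
        pos = len(ordered)
        # Walk left while the new item must come before its neighbour.
        while pos > 0:
            queries += 1
            if precedes(item, ordered[pos - 1]):
                pos -= 1
            else:
                break
        ordered.insert(pos, item)
    return ordered, queries


def merge_order(items, precedes):
    """Order items by top-down merging; returns (ordered, query_count)."""
    if len(items) <= 1:
        return list(items), 0
    mid = len(items) // 2
    left, q_left = merge_order(items[:mid], precedes)
    right, q_right = merge_order(items[mid:], precedes)
    merged, queries, i, j = [], q_left + q_right, 0, 0
    while i < len(left) and j < len(right):
        queries += 1
        if precedes(right[j], left[i]):
            merged.append(right[j])
            j += 1
        else:
            merged.append(left[i])
            i += 1
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged, queries


n = 27
rules = list(range(n))
random.Random(0).shuffle(rules)  # the learner sees the rules in random order

ins_sorted, ins_q = insertion_order(rules, precedes)
mrg_sorted, mrg_q = merge_order(rules, precedes)

print(f"permutations to try without transitivity: {math.factorial(n):.5e}")
print(f"insertion ordering queries (bound {n * (n - 1) // 2}): {ins_q}")
print(f"merge ordering queries (n log2 n = {n * math.log2(n):.1f}): {mrg_q}")
```

Both routines rely on transitivity: once a rule is known to precede one neighbour, its position relative to everything beyond that neighbour is inferred rather than queried.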
-
The Origin, Evolution and Development of Bilateral Symmetry in Multicellular Organisms
Authors:
Eric Werner
Abstract:
A computational theory and model of the ontogeny and development of bilateral symmetry in multicellular organisms is presented. Understanding the origin and evolution of bilateral organisms requires an understanding of how bilateral symmetry develops, starting from a single cell. Bilateral symmetric growth of a multicellular organism from a single starter cell is explained as resulting from the opposite handedness and orientation along one axis in two daughter founder cells that are in equivalent developmental control network states. Several methods of establishing the initial orientation of the daughter cells (including oriented cell division and cell signaling) are discussed. The orientation states of the daughter cells are epigenetically inherited by their progeny. This results in mirror development with the two founding daughter cells generating complementary mirror image multicellular morphologies. The end product is a bilateral symmetric organism. The theory gives a unified explanation of diverse phenomena including symmetry breaking, situs inversus, gynandromorphs, inside-out growth, bilaterally symmetric cancers, and the rapid, punctuated evolution of bilaterally symmetric organisms in the Cambrian Explosion. The theory is supported by experimental results on early embryonic development. The theory makes precise testable predictions.
Submitted 13 July, 2012;
originally announced July 2012.
-
f-Divergence for convex bodies
Authors:
Elisabeth M. Werner
Abstract:
We introduce f-divergence, a concept from information theory and statistics, for convex bodies in R^n. We prove that f-divergences are SL(n) invariant valuations and we establish an affine isoperimetric inequality for these quantities. We show that generalized affine surface area and in particular the L_p affine surface area from the L_p Brunn Minkowski theory are special cases of f-divergences.
Submitted 15 May, 2012;
originally announced May 2012.
-
Cancer Networks: A general theoretical and computational framework for understanding cancer
Authors:
Eric Werner
Abstract:
We present a general computational theory of cancer and its developmental dynamics. The theory is based on a theory of the architecture and function of developmental control networks which guide the formation of multicellular organisms. Cancer networks are special cases of developmental control networks. Cancer results from transformations of normal developmental networks. Our theory generates a natural classification of all possible cancers based on their network architecture. Each cancer network has a unique topology and semantics and developmental dynamics that result in distinct clinical tumor phenotypes. We apply this new theory with a series of proof of concept cases for all the basic cancer types. These cases have been computationally modeled, their behavior simulated and mathematically described using a multicellular systems biology approach. There are fascinating correspondences between the dynamic developmental phenotype of computationally modeled in silico cancers and natural in vivo cancers. The theory lays the foundation for a new research paradigm for understanding and investigating cancer. The theory of cancer networks implies that new diagnostic methods and new treatments to cure cancer will become possible.
Submitted 26 October, 2011;
originally announced October 2011.
-
On Programs and Genomes
Authors:
Eric Werner
Abstract:
We outline the global control architecture of genomes. A theory of genomic control information is presented. The concept of a developmental control network called a cene (for control gene) is introduced. We distinguish parts-genes from control genes or cenes. Cenes are interpreted and executed by the cell and, thereby, direct cell actions including communication, growth, division, differentiation and multi-cellular development. The cenome is the global developmental control network in the genome. The cenome is also a cene that consists of interlinked sub-cenes that guide the ontogeny of the organism. The complexity of organisms is linked to the complexity of the cenome. The relevance to ontogeny and evolution is mentioned. We introduce the concept of a universal cell and a universal genome.
Submitted 24 October, 2011;
originally announced October 2011.