subscribe to arXiv mailings

Towards Personalised Patient Risk Prediction Using Temporal Hospital Data Trajectories

Authors: Thea Barnes, Enrico Werner, Jeffrey N. Clark, Raul Santos-Rodriguez

Abstract: Quantifying a patient's health status provides clinicians with insight into patient risk, and the ability to better triage and manage resources. Early Warning Scores (EWS) are widely deployed to measure overall health status, and risk of adverse outcomes, in hospital patients. However, current EWS are limited both by their lack of personalisation and use of static observations. We propose a pipeli… ▽ More Quantifying a patient's health status provides clinicians with insight into patient risk, and the ability to better triage and manage resources. Early Warning Scores (EWS) are widely deployed to measure overall health status, and risk of adverse outcomes, in hospital patients. However, current EWS are limited both by their lack of personalisation and use of static observations. We propose a pipeline that groups intensive care unit patients by the trajectories of observations data throughout their stay as a basis for the development of personalised risk predictions. Feature importance is considered to provide model explainability. Using the MIMIC-IV dataset, six clusters were identified, capturing differences in disease codes, observations, lengths of admissions and outcomes. Applying the pipeline to data from just the first four hours of each ICU stay assigns the majority of patients to the same cluster as when the entire stay duration is considered. In-hospital mortality prediction models trained on individual clusters had higher F1 score performance in five of the six clusters when compared against the unclustered patient cohort. The pipeline could form the basis of a clinical decision support tool, working to improve the clinical characterisation of risk groups and the early detection of patient deterioration. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2309.15965 [pdf, other]

TraCE: Trajectory Counterfactual Explanation Scores

Authors: Jeffrey N. Clark, Edward A. Small, Nawid Keshtmand, Michelle W. L. Wan, Elena Fillola Mayoral, Enrico Werner, Christopher P. Bourdeaux, Raul Santos-Rodriguez

Abstract: Counterfactual explanations, and their associated algorithmic recourse, are typically leveraged to understand, explain, and potentially alter a prediction coming from a black-box classifier. In this paper, we propose to extend the use of counterfactuals to evaluate progress in sequential decision making tasks. To this end, we introduce a model-agnostic modular framework, TraCE (Trajectory Counterf… ▽ More Counterfactual explanations, and their associated algorithmic recourse, are typically leveraged to understand, explain, and potentially alter a prediction coming from a black-box classifier. In this paper, we propose to extend the use of counterfactuals to evaluate progress in sequential decision making tasks. To this end, we introduce a model-agnostic modular framework, TraCE (Trajectory Counterfactual Explanation) scores, which is able to distill and condense progress in highly complex scenarios into a single value. We demonstrate TraCE's utility across domains by showcasing its main properties in two case studies spanning healthcare and climate change. △ Less

Submitted 26 January, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

Comments: 10 pages, 4 figures, appendix

arXiv:2304.08319 [pdf, other]

Towards Computational Performance Engineering for Unsupervised Concept Drift Detection -- Complexities, Benchmarking, Performance Analysis

Authors: Elias Werner, Nishant Kumar, Matthias Lieber, Sunna Torge, Stefan Gumhold, Wolfgang E. Nagel

Abstract: Concept drift detection is crucial for many AI systems to ensure the system's reliability. These systems often have to deal with large amounts of data or react in real-time. Thus, drift detectors must meet computational requirements or constraints with a comprehensive performance evaluation. However, so far, the focus of developing drift detectors is on inference quality, e.g. accuracy, but not on… ▽ More Concept drift detection is crucial for many AI systems to ensure the system's reliability. These systems often have to deal with large amounts of data or react in real-time. Thus, drift detectors must meet computational requirements or constraints with a comprehensive performance evaluation. However, so far, the focus of developing drift detectors is on inference quality, e.g. accuracy, but not on computational performance, such as runtime. Many of the previous works consider computational performance only as a secondary objective and do not have a benchmark for such evaluation. Hence, we propose and explain performance engineering for unsupervised concept drift detection that reflects on computational complexities, benchmarking, and performance analysis. We provide the computational complexities of existing unsupervised drift detectors and discuss why further computational performance investigations are required. Hence, we state and substantiate the aspects of a benchmark for unsupervised drift detection reflecting on inference quality and computational performance. Furthermore, we demonstrate performance analysis practices that have proven their effectiveness in High-Performance Computing, by tracing two drift detectors and displaying their performance data. △ Less

Submitted 10 June, 2024; v1 submitted 17 April, 2023; originally announced April 2023.

Comments: Accepted at 13th International Conference on Data Science, Technology and Applications (DATA). Source code: https://github.com/elwer/Perf_DD

arXiv:2301.08019 [pdf, other]

Identification, explanation and clinical evaluation of hospital patient subtypes

Authors: Enrico Werner, Jeffrey N. Clark, Ranjeet S. Bhamber, Michael Ambler, Christopher P. Bourdeaux, Alexander Hepburn, Christopher J. McWilliams, Raul Santos-Rodriguez

Abstract: We present a pipeline in which unsupervised machine learning techniques are used to automatically identify subtypes of hospital patients admitted between 2017 and 2021 in a large UK teaching hospital. With the use of state-of-the-art explainability techniques, the identified subtypes are interpreted and assigned clinical meaning. In parallel, clinicians assessed intra-cluster similarities and inte… ▽ More We present a pipeline in which unsupervised machine learning techniques are used to automatically identify subtypes of hospital patients admitted between 2017 and 2021 in a large UK teaching hospital. With the use of state-of-the-art explainability techniques, the identified subtypes are interpreted and assigned clinical meaning. In parallel, clinicians assessed intra-cluster similarities and inter-cluster differences of the identified patient subtypes within the context of their clinical knowledge. By confronting the outputs of both automatic and clinician-based explanations, we aim to highlight the mutual benefit of combining machine learning techniques with clinical expertise. △ Less

Submitted 19 January, 2023; originally announced January 2023.

arXiv:2207.10553 [pdf, other]

MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations of Behavior

Authors: Jennifer J. Sun, Markus Marks, Andrew Ulmer, Dipam Chakraborty, Brian Geuther, Edward Hayes, Heng Jia, Vivek Kumar, Sebastian Oleszko, Zachary Partridge, Milan Peelman, Alice Robie, Catherine E. Schretter, Keith Sheppard, Chao Sun, Param Uttarwar, Julian M. Wagner, Eric Werner, Joseph Parker, Pietro Perona, Yisong Yue, Kristin Branson, Ann Kennedy

Abstract: We introduce MABe22, a large-scale, multi-agent video and trajectory benchmark to assess the quality of learned behavior representations. This dataset is collected from a variety of biology experiments, and includes triplets of interacting mice (4.7 million frames video+pose tracking data, 10 million frames pose only), symbiotic beetle-ant interactions (10 million frames video data), and groups of… ▽ More We introduce MABe22, a large-scale, multi-agent video and trajectory benchmark to assess the quality of learned behavior representations. This dataset is collected from a variety of biology experiments, and includes triplets of interacting mice (4.7 million frames video+pose tracking data, 10 million frames pose only), symbiotic beetle-ant interactions (10 million frames video data), and groups of interacting flies (4.4 million frames of pose tracking data). Accompanying these data, we introduce a panel of real-life downstream analysis tasks to assess the quality of learned representations by evaluating how well they preserve information about the experimental conditions (e.g. strain, time of day, optogenetic stimulation) and animal behavior. We test multiple state-of-the-art self-supervised video and trajectory representation learning methods to demonstrate the use of our benchmark, revealing that methods developed using human action datasets do not fully translate to animal datasets. We hope that our benchmark and dataset encourage a broader exploration of behavior representation learning methods across species and settings. △ Less

Submitted 30 June, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

Comments: To appear in ICML 2023, Project website: https://sites.google.com/view/computational-behavior/our-datasets/mabe2022-dataset

arXiv:2111.14426 [pdf, other]

doi 10.1007/978-3-031-16788-1_36

Improving traffic sign recognition by active search

Authors: S. Jaghouar, H. Gustafsson, B. Mehlig, E. Werner, N. Gustafsson

Abstract: We describe an iterative active-learning algorithm to recognise rare traffic signs. A standard ResNet is trained on a training set containing only a single sample of the rare class. We demonstrate that by sorting the samples of a large, unlabeled set by the estimated probability of belonging to the rare class, we can efficiently identify samples from the rare class. This works despite the fact tha… ▽ More We describe an iterative active-learning algorithm to recognise rare traffic signs. A standard ResNet is trained on a training set containing only a single sample of the rare class. We demonstrate that by sorting the samples of a large, unlabeled set by the estimated probability of belonging to the rare class, we can efficiently identify samples from the rare class. This works despite the fact that this estimated probability is usually quite low. A reliable active-learning loop is obtained by labeling these candidate samples, including them in the training set, and iterating the procedure. Further, we show that we get similar results starting from a single synthetic sample. Our results are important as they indicate a straightforward way of improving traffic-sign recognition for automated driving systems. In addition, they show that we can make use of the information hidden in low confidence outputs, which is usually ignored. △ Less

Submitted 29 November, 2021; originally announced November 2021.

Comments: 6 pages, 7 Figures

Journal ref: DAGM GCPR 2022 Pattern Recognition pp. 594--606 (2022)

arXiv:1505.07712 [pdf, ps, other]

A Category Theory of Communication Theory

Authors: Eric Werner

Abstract: A theory of how agents can come to understand a language is presented. If understanding a sentence $α$ is to associate an operator with $α$ that transforms the representational state of the agent as intended by the sender, then coming to know a language involves coming to know the operators that correspond to the meaning of any sentence. This involves a higher order operator that operates on the p… ▽ More A theory of how agents can come to understand a language is presented. If understanding a sentence $α$ is to associate an operator with $α$ that transforms the representational state of the agent as intended by the sender, then coming to know a language involves coming to know the operators that correspond to the meaning of any sentence. This involves a higher order operator that operates on the possible transformations that operate on the representational capacity of the agent. We formalize these constructs using concepts and diagrams analogous to category theory. △ Less

Submitted 28 May, 2015; originally announced May 2015.

Comments: 5 pages

arXiv:1304.6792 [pdf, ps, other]

On the mixed $f$-divergence for multiple pairs of measures

Authors: Elisabeth M. Werner, Deping Ye

Abstract: In this paper, the concept of the classical $f$-divergence (for a pair of measures) is extended to the mixed $f$-divergence (for multiple pairs of measures). The mixed $f$-divergence provides a way to measure the difference between multiple pairs of (probability) measures. Properties for the mixed $f$-divergence are established, such as permutation invariance and symmetry in distributions. An Alex… ▽ More In this paper, the concept of the classical $f$-divergence (for a pair of measures) is extended to the mixed $f$-divergence (for multiple pairs of measures). The mixed $f$-divergence provides a way to measure the difference between multiple pairs of (probability) measures. Properties for the mixed $f$-divergence are established, such as permutation invariance and symmetry in distributions. An Alexandrov-Fenchel type inequality and an isoperimetric type inequality for the mixed $f$-divergence will be proved and applications in the theory of convex bodies are given. △ Less

Submitted 24 April, 2013; originally announced April 2013.

MSC Class: 94A15; 94A17

arXiv:1212.2390 [pdf, ps, other]

On the complexity of learning a language: An improvement of Block's algorithm

Authors: Eric Werner

Abstract: Language learning is thought to be a highly complex process. One of the hurdles in learning a language is to learn the rules of syntax of the language. Rules of syntax are often ordered in that before one rule can applied one must apply another. It has been thought that to learn the order of n rules one must go through all n! permutations. Thus to learn the order of 27 rules would require 27! step… ▽ More Language learning is thought to be a highly complex process. One of the hurdles in learning a language is to learn the rules of syntax of the language. Rules of syntax are often ordered in that before one rule can applied one must apply another. It has been thought that to learn the order of n rules one must go through all n! permutations. Thus to learn the order of 27 rules would require 27! steps or 1.08889x10^{28} steps. This number is much greater than the number of seconds since the beginning of the universe! In an insightful analysis the linguist Block ([Block 86], pp. 62-63, p.238) showed that with the assumption of transitivity this vast number of learning steps reduces to a mere 377 steps. We present a mathematical analysis of the complexity of Block's algorithm. The algorithm has a complexity of order n^2 given n rules. In addition, we improve Block's results exponentially, by introducing an algorithm that has complexity of order less than n log n. △ Less

Submitted 11 December, 2012; originally announced December 2012.

Comments: 7 pages. Key Words: Language learning, rules of language, complexity, learning algorithms, evolution of language

arXiv:1207.3289 [pdf, other]

The Origin, Evolution and Development of Bilateral Symmetry in Multicellular Organisms

Authors: Eric Werner

Abstract: A computational theory and model of the ontogeny and development of bilateral symmetry in multicellular organisms is presented. Understanding the origin and evolution of bilateral organisms requires an understanding of how bilateral symmetry develops, starting from a single cell. Bilateral symmetric growth of a multicellular organism from a single starter cell is explained as resulting from the op… ▽ More A computational theory and model of the ontogeny and development of bilateral symmetry in multicellular organisms is presented. Understanding the origin and evolution of bilateral organisms requires an understanding of how bilateral symmetry develops, starting from a single cell. Bilateral symmetric growth of a multicellular organism from a single starter cell is explained as resulting from the opposite handedness and orientation along one axis in two daughter founder cells that are in equivalent developmental control network states. Several methods of establishing the initial orientation of the daughter cells (including oriented cell division and cell signaling) are discussed. The orientation states of the daughter cells are epigenetically inherited by their progeny. This results in mirror development with the two founding daughter cells generating complementary mirror image multicellular morphologies. The end product is a bilateral symmetric organism. The theory gives a unified explanation of diverse phenomena including symmetry breaking, situs inversus, gynandromorphs, inside-out growth, bilaterally symmetric cancers, and the rapid, punctuated evolution of bilaterally symmetric organisms in the Cambrian Explosion. The theory is supported by experimental results on early embryonic development. The theory makes precise testable predications. △ Less

Submitted 13 July, 2012; originally announced July 2012.

Comments: 29 pages

arXiv:1205.3423 [pdf, ps, other]

f-Divergence for convex bodies

Authors: Elisabeth M. Werner

Abstract: We introduce f-divergence, a concept from information theory and statistics, for convex bodies in R^n. We prove that f-divergences are SL(n) invariant valuations and we establish an affine isoperimetric inequality for these quantities. We show that generalized affine surface area and in particular the L_p affine surface area from the L_p Brunn Minkowski theory are special cases of f-divergences. We introduce f-divergence, a concept from information theory and statistics, for convex bodies in R^n. We prove that f-divergences are SL(n) invariant valuations and we establish an affine isoperimetric inequality for these quantities. We show that generalized affine surface area and in particular the L_p affine surface area from the L_p Brunn Minkowski theory are special cases of f-divergences. △ Less

Submitted 15 May, 2012; originally announced May 2012.

MSC Class: 52A20; 53A15

arXiv:1110.5865 [pdf, other]

Cancer Networks: A general theoretical and computational framework for understanding cancer

Authors: Eric Werner

Abstract: We present a general computational theory of cancer and its developmental dynamics. The theory is based on a theory of the architecture and function of developmental control networks which guide the formation of multicellular organisms. Cancer networks are special cases of developmental control networks. Cancer results from transformations of normal developmental networks. Our theory generates a n… ▽ More We present a general computational theory of cancer and its developmental dynamics. The theory is based on a theory of the architecture and function of developmental control networks which guide the formation of multicellular organisms. Cancer networks are special cases of developmental control networks. Cancer results from transformations of normal developmental networks. Our theory generates a natural classification of all possible cancers based on their network architecture. Each cancer network has a unique topology and semantics and developmental dynamics that result in distinct clinical tumor phenotypes. We apply this new theory with a series of proof of concept cases for all the basic cancer types. These cases have been computationally modeled, their behavior simulated and mathematically described using a multicellular systems biology approach. There are fascinating correspondences between the dynamic developmental phenotype of computationally modeled {\em in silico} cancers and natural {\em in vivo} cancers. The theory lays the foundation for a new research paradigm for understanding and investigating cancer. The theory of cancer networks implies that new diagnostic methods and new treatments to cure cancer will become possible. △ Less

Submitted 26 October, 2011; originally announced October 2011.

Comments: Key words: Cancer networks, cene, cenome, developmental control networks, stem cells, stem cell networks, cancer stem cells, stochastic stem cell networks, metastases hierarchy, linear networks, exponential networks, geometric cancer networks, cell signaling, cancer cell communication networks, systems biology, computational biology, multiagent systems, muticellular modeling, cancer modeling

arXiv:1110.5265 [pdf]

On Programs and Genomes

Authors: Eric Werner

Abstract: We outline the global control architecture of genomes. A theory of genomic control information is presented. The concept of a developmental control network called a cene (for control gene) is introduced. We distinguish parts-genes from control genes or cenes. Cenes are interpreted and executed by the cell and, thereby, direct cell actions including communication, growth, division, differentiation… ▽ More We outline the global control architecture of genomes. A theory of genomic control information is presented. The concept of a developmental control network called a cene (for control gene) is introduced. We distinguish parts-genes from control genes or cenes. Cenes are interpreted and executed by the cell and, thereby, direct cell actions including communication, growth, division, differentiation and multi-cellular development. The cenome is the global developmental control network in the genome. The cenome is also a cene that consists of interlinked sub-cenes that guide the ontogeny of the organism. The complexity of organisms is linked to the complexity of the cenome. The relevance to ontogeny and evolution is mentioned. We introduce the concept of a universal cell and a universal genome. △ Less

Submitted 24 October, 2011; originally announced October 2011.

Comments: This a slightly extended version of Part I of a position paper distributed on November 18, 2007 to the participants of our Balliol Seminar on the Conceptual Foundations of Systems Biology. It presented my ideas on the global control architecture of genomes. Denis Noble and myself started the seminar in the Michaelmas term in the autumn of 2006 at Balliol College, University of Oxford

Showing 1–13 of 13 results for author: Werner, E