-
Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision
Authors:
Hao Dong,
Eleni Chatzi,
Olga Fink
Abstract:
The task of open-set domain generalization (OSDG) involves recognizing novel classes within unseen domains, which becomes more challenging with multiple modalities as input. Existing works have only addressed unimodal OSDG within the meta-learning framework, without considering multimodal scenarios. In this work, we introduce a novel approach to address Multimodal Open-Set Domain Generalization (M…
▽ More
The task of open-set domain generalization (OSDG) involves recognizing novel classes within unseen domains, which becomes more challenging with multiple modalities as input. Existing works have only addressed unimodal OSDG within the meta-learning framework, without considering multimodal scenarios. In this work, we introduce a novel approach to address Multimodal Open-Set Domain Generalization (MM-OSDG) for the first time, utilizing self-supervision. To this end, we introduce two innovative multimodal self-supervised pretext tasks: Masked Cross-modal Translation and Multimodal Jigsaw Puzzles. These tasks facilitate the learning of multimodal representative features, thereby enhancing generalization and open-class detection capabilities. Additionally, we propose a novel entropy weighting mechanism to balance the loss across different modalities. Furthermore, we extend our approach to tackle also the Multimodal Open-Set Domain Adaptation (MM-OSDA) problem, especially in scenarios where unlabeled data from the target domain is available. Extensive experiments conducted under MM-OSDG, MM-OSDA, and Multimodal Closed-Set DG settings on the EPIC-Kitchens and HAC datasets demonstrate the efficacy and versatility of the proposed approach. Our source code is available at https://github.com/donghao51/MOOSA.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities
Authors:
Hao Dong,
Yue Zhao,
Eleni Chatzi,
Olga Fink
Abstract:
Detecting out-of-distribution (OOD) samples is important for deploying machine learning models in safety-critical applications such as autonomous driving and robot-assisted surgery. Existing research has mainly focused on unimodal scenarios on image data. However, real-world applications are inherently multimodal, which makes it essential to leverage information from multiple modalities to enhance…
▽ More
Detecting out-of-distribution (OOD) samples is important for deploying machine learning models in safety-critical applications such as autonomous driving and robot-assisted surgery. Existing research has mainly focused on unimodal scenarios on image data. However, real-world applications are inherently multimodal, which makes it essential to leverage information from multiple modalities to enhance the efficacy of OOD detection. To establish a foundation for more realistic Multimodal OOD Detection, we introduce the first-of-its-kind benchmark, MultiOOD, characterized by diverse dataset sizes and varying modality combinations. We first evaluate existing unimodal OOD detection algorithms on MultiOOD, observing that the mere inclusion of additional modalities yields substantial improvements. This underscores the importance of utilizing multiple modalities for OOD detection. Based on the observation of Modality Prediction Discrepancy between in-distribution (ID) and OOD data, and its strong correlation with OOD performance, we propose the Agree-to-Disagree (A2D) algorithm to encourage such discrepancy during training. Moreover, we introduce a novel outlier synthesis method, NP-Mix, which explores broader feature spaces by leveraging the information from nearest neighbor classes and complements A2D to strengthen OOD detection performance. Extensive experiments on MultiOOD demonstrate that training with A2D and NP-Mix improves existing OOD detection algorithms by a large margin. Our source code and MultiOOD benchmark are available at https://github.com/donghao51/MultiOOD.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Monitoring-Supported Value Generation for Managing Structures and Infrastructure Systems
Authors:
Antonios Kamariotis,
Eleni Chatzi,
Daniel Straub,
Nikolaos Dervilis,
Kai Goebel,
Aidan J. Hughes,
Geert Lombaert,
Costas Papadimitriou,
Konstantinos G. Papakonstantinou,
Matteo Pozzi,
Michael Todd,
Keith Worden
Abstract:
To maximize its value, the design, development and implementation of Structural Health Monitoring (SHM) should focus on its role in facilitating decision support. In this position paper, we offer perspectives on the synergy between SHM and decision-making. We propose a classification of SHM use cases aligning with various dimensions that are closely linked to the respective decision contexts. The…
▽ More
To maximize its value, the design, development and implementation of Structural Health Monitoring (SHM) should focus on its role in facilitating decision support. In this position paper, we offer perspectives on the synergy between SHM and decision-making. We propose a classification of SHM use cases aligning with various dimensions that are closely linked to the respective decision contexts. The types of decisions that have to be supported by the SHM system within these settings are discussed along with the corresponding challenges. We provide an overview of different classes of models that are required for integrating SHM in the decision-making process to support management and operation and maintenance of structures and infrastructure systems. Fundamental decision-theoretic principles and state-of-the-art methods for optimizing maintenance and operational decision-making under uncertainty are briefly discussed. Finally, we offer a viewpoint on the appropriate course of action for quantifying, validating and maximizing the added value generated by SHM. This work aspires to synthesize the different perspectives of the SHM, Prognostic Health Management (PHM), and reliability communities, and deliver a roadmap towards monitoring-based decision support.
△ Less
Submitted 4 January, 2024;
originally announced February 2024.
-
NNG-Mix: Improving Semi-supervised Anomaly Detection with Pseudo-anomaly Generation
Authors:
Hao Dong,
Gaëtan Frusque,
Yue Zhao,
Eleni Chatzi,
Olga Fink
Abstract:
Anomaly detection (AD) is essential in identifying rare and often critical events in complex systems, finding applications in fields such as network intrusion detection, financial fraud detection, and fault detection in infrastructure and industrial systems. While AD is typically treated as an unsupervised learning task due to the high cost of label annotation, it is more practical to assume acces…
▽ More
Anomaly detection (AD) is essential in identifying rare and often critical events in complex systems, finding applications in fields such as network intrusion detection, financial fraud detection, and fault detection in infrastructure and industrial systems. While AD is typically treated as an unsupervised learning task due to the high cost of label annotation, it is more practical to assume access to a small set of labeled anomaly samples from domain experts, as is the case for semi-supervised anomaly detection. Semi-supervised and supervised approaches can leverage such labeled data, resulting in improved performance. In this paper, rather than proposing a new semi-supervised or supervised approach for AD, we introduce a novel algorithm for generating additional pseudo-anomalies on the basis of the limited labeled anomalies and a large volume of unlabeled data. This serves as an augmentation to facilitate the detection of new anomalies. Our proposed algorithm, named Nearest Neighbor Gaussian Mixup (NNG-Mix), efficiently integrates information from both labeled and unlabeled data to generate pseudo-anomalies. We compare the performance of this novel algorithm with commonly applied augmentation techniques, such as Mixup and Cutout. We evaluate NNG-Mix by training various existing semi-supervised and supervised anomaly detection algorithms on the original training data along with the generated pseudo-anomalies. Through extensive experiments on 57 benchmark datasets in ADBench, reflecting different data types, we demonstrate that NNG-Mix outperforms other data augmentation methods. It yields significant performance improvements compared to the baselines trained exclusively on the original training data. Notably, NNG-Mix yields up to 16.4%, 8.8%, and 8.0% improvements on Classical, CV, and NLP datasets in ADBench. Our source code is available at https://github.com/donghao51/NNG-Mix.
△ Less
Submitted 11 June, 2024; v1 submitted 20 November, 2023;
originally announced November 2023.
-
Discussing the Spectrum of Physics-Enhanced Machine Learning; a Survey on Structural Mechanics Applications
Authors:
Marcus Haywood-Alexander,
Wei Liu,
Kiran Bacsa,
Zhilu Lai,
Eleni Chatzi
Abstract:
The intersection of physics and machine learning has given rise to the physics-enhanced machine learning (PEML) paradigm, aiming to improve the capabilities and reduce the individual shortcomings of data- or physics-only methods. In this paper, the spectrum of physics-enhanced machine learning methods, expressed across the defining axes of physics and data, is discussed by engaging in a comprehens…
▽ More
The intersection of physics and machine learning has given rise to the physics-enhanced machine learning (PEML) paradigm, aiming to improve the capabilities and reduce the individual shortcomings of data- or physics-only methods. In this paper, the spectrum of physics-enhanced machine learning methods, expressed across the defining axes of physics and data, is discussed by engaging in a comprehensive exploration of its characteristics, usage, and motivations. In doing so, we present a survey of recent applications and developments of PEML techniques, revealing the potency of PEML in addressing complex challenges. We further demonstrate application of select such schemes on the simple working example of a single degree-of-freedom Duffing oscillator, which allows to highlight the individual characteristics and motivations of different `genres' of PEML approaches. To promote collaboration and transparency, and to provide practical examples for the reader, the code generating these working examples is provided alongside this paper. As a foundational contribution, this paper underscores the significance of PEML in pushing the boundaries of scientific and engineering research, underpinned by the synergy of physical insights and machine learning capabilities.
△ Less
Submitted 22 April, 2024; v1 submitted 31 October, 2023;
originally announced October 2023.
-
SimMMDG: A Simple and Effective Framework for Multi-modal Domain Generalization
Authors:
Hao Dong,
Ismail Nejjar,
Han Sun,
Eleni Chatzi,
Olga Fink
Abstract:
In real-world scenarios, achieving domain generalization (DG) presents significant challenges as models are required to generalize to unknown target distributions. Generalizing to unseen multi-modal distributions poses even greater difficulties due to the distinct properties exhibited by different modalities. To overcome the challenges of achieving domain generalization in multi-modal scenarios, w…
▽ More
In real-world scenarios, achieving domain generalization (DG) presents significant challenges as models are required to generalize to unknown target distributions. Generalizing to unseen multi-modal distributions poses even greater difficulties due to the distinct properties exhibited by different modalities. To overcome the challenges of achieving domain generalization in multi-modal scenarios, we propose SimMMDG, a simple yet effective multi-modal DG framework. We argue that mapping features from different modalities into the same embedding space impedes model generalization. To address this, we propose splitting the features within each modality into modality-specific and modality-shared components. We employ supervised contrastive learning on the modality-shared features to ensure they possess joint properties and impose distance constraints on modality-specific features to promote diversity. In addition, we introduce a cross-modal translation module to regularize the learned features, which can also be used for missing-modality generalization. We demonstrate that our framework is theoretically well-supported and achieves strong performance in multi-modal DG on the EPIC-Kitchens dataset and the novel Human-Animal-Cartoon (HAC) dataset introduced in this paper. Our source code and HAC dataset are available at https://github.com/donghao51/SimMMDG.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Knowledge Engineering for Wind Energy
Authors:
Yuriy Marykovskiy,
Thomas Clark,
Justin Day,
Marcus Wiens,
Charles Henderson,
Julian Quick,
Imad Abdallah,
Anna Maria Sempreviva,
Jean-Paul Calbimonte,
Eleni Chatzi,
Sarah Barber
Abstract:
With the rapid evolution of the wind energy sector, there is an ever-increasing need to create value from the vast amounts of data made available both from within the domain, as well as from other sectors. This article addresses the challenges faced by wind energy domain experts in converting data into domain knowledge, connecting and integrating it with other sources of knowledge, and making it a…
▽ More
With the rapid evolution of the wind energy sector, there is an ever-increasing need to create value from the vast amounts of data made available both from within the domain, as well as from other sectors. This article addresses the challenges faced by wind energy domain experts in converting data into domain knowledge, connecting and integrating it with other sources of knowledge, and making it available for use in next generation artificially intelligent systems. To this end, this article highlights the role that knowledge engineering can play in the process of digital transformation of the wind energy sector. It presents the main concepts underpinning Knowledge-Based Systems and summarises previous work in the areas of knowledge engineering and knowledge representation in a manner that is relevant and accessible to domain experts. A systematic analysis of the current state-of-the-art on knowledge engineering in the wind energy domain is performed, with available tools put into perspective by establishing the main domain actors and their needs and identifying key problematic areas. Finally, guidelines for further development and improvement are provided.
△ Less
Submitted 1 October, 2023;
originally announced October 2023.
-
POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance
Authors:
Giacomo Arcieri,
Cyprien Hoelzl,
Oliver Schwery,
Daniel Straub,
Konstantinos G. Papakonstantinou,
Eleni Chatzi
Abstract:
Partially Observable Markov Decision Processes (POMDPs) can model complex sequential decision-making problems under stochastic and uncertain environments. A main reason hindering their broad adoption in real-world applications is the lack of availability of a suitable POMDP model or a simulator thereof. Available solution algorithms, such as Reinforcement Learning (RL), require the knowledge of th…
▽ More
Partially Observable Markov Decision Processes (POMDPs) can model complex sequential decision-making problems under stochastic and uncertain environments. A main reason hindering their broad adoption in real-world applications is the lack of availability of a suitable POMDP model or a simulator thereof. Available solution algorithms, such as Reinforcement Learning (RL), require the knowledge of the transition dynamics and the observation generating process, which are often unknown and non-trivial to infer. In this work, we propose a combined framework for inference and robust solution of POMDPs via deep RL. First, all transition and observation model parameters are jointly inferred via Markov Chain Monte Carlo sampling of a hidden Markov model, which is conditioned on actions, in order to recover full posterior distributions from the available data. The POMDP with uncertain parameters is then solved via deep RL techniques with the parameter distributions incorporated into the solution via domain randomization, in order to develop solutions that are robust to model uncertainty. As a further contribution, we compare the use of transformers and long short-term memory networks, which constitute model-free RL solutions, with a model-based/model-free hybrid approach. We apply these methods to the real-world problem of optimal maintenance planning for railway assets.
△ Less
Submitted 16 July, 2023;
originally announced July 2023.
-
VpROM: A novel Variational AutoEncoder-boosted Reduced Order Model for the treatment of parametric dependencies in nonlinear systems
Authors:
Thomas Simpson,
Konstantinos Vlachas,
Anthony Garland,
Nikolaos Dervilis,
Eleni Chatzi
Abstract:
Reduced Order Models (ROMs) are of considerable importance in many areas of engineering in which computational time presents difficulties. Established approaches employ projection-based reduction such as Proper Orthogonal Decomposition, however, such methods can become inefficient or fail in the case of parameteric or strongly nonlinear models. Such limitations are usually tackled via a library of…
▽ More
Reduced Order Models (ROMs) are of considerable importance in many areas of engineering in which computational time presents difficulties. Established approaches employ projection-based reduction such as Proper Orthogonal Decomposition, however, such methods can become inefficient or fail in the case of parameteric or strongly nonlinear models. Such limitations are usually tackled via a library of local reduction bases each of which being valid for a given parameter vector. The success of such methods, however, is strongly reliant upon the method used to relate the parameter vectors to the local bases, this is typically achieved using clustering or interpolation methods. We propose the replacement of these methods with a Variational Autoencoder (VAE) to be used as a generative model which can infer the local basis corresponding to a given parameter vector in a probabilistic manner. The resulting VAE-boosted parametric ROM \emph{VpROM} still retains the physical insights of a projection-based method but also allows for better treatment of problems where model dependencies or excitation traits cause the dynamic behavior to span multiple response regimes. Moreover, the probabilistic treatment of the VAE representation allows for uncertainty quantification on the reduction bases which may then be propagated to the ROM response. The performance of the proposed approach is validated on an open-source simulation benchmark featuring hysteresis and multi-parametric dependencies, and on a large-scale wind turbine tower characterised by nonlinear material behavior and model uncertainty.
△ Less
Submitted 11 April, 2023;
originally announced April 2023.
-
Graph Neural Networks for Aerodynamic Flow Reconstruction from Sparse Sensing
Authors:
Gregory Duth��,
Imad Abdallah,
Sarah Barber,
Eleni Chatzi
Abstract:
Sensing the fluid flow around an arbitrary geometry entails extrapolating from the physical quantities perceived at its surface in order to reconstruct the features of the surrounding fluid. This is a challenging inverse problem, yet one that if solved could have a significant impact on many engineering applications. The exploitation of such an inverse logic has gained interest in recent years wit…
▽ More
Sensing the fluid flow around an arbitrary geometry entails extrapolating from the physical quantities perceived at its surface in order to reconstruct the features of the surrounding fluid. This is a challenging inverse problem, yet one that if solved could have a significant impact on many engineering applications. The exploitation of such an inverse logic has gained interest in recent years with the advent of widely available cheap but capable MEMS-based sensors. When combined with novel data-driven methods, these sensors may allow for flow reconstruction around immersed structures, benefiting applications such as unmanned airborne/underwater vehicle path planning or control and structural health monitoring of wind turbine blades. In this work, we train deep reversible Graph Neural Networks (GNNs) to perform flow sensing (flow reconstruction) around two-dimensional aerodynamic shapes: airfoils. Motivated by recent work, which has shown that GNNs can be powerful alternatives to mesh-based forward physics simulators, we implement a Message-Passing Neural Network to simultaneously reconstruct both the pressure and velocity fields surrounding simulated airfoils based on their surface pressure distributions, whilst additionally gathering useful farfield properties in the form of context vectors. We generate a unique dataset of Computational Fluid Dynamics simulations by simulating random, yet meaningful combinations of input boundary conditions and airfoil shapes. We show that despite the challenges associated with reconstructing the flow around arbitrary airfoil geometries in high Reynolds turbulent inflow conditions, our framework is able to generalize well to unseen cases.
△ Less
Submitted 9 January, 2023;
originally announced January 2023.
-
Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems
Authors:
Giacomo Arcieri,
Cyprien Hoelzl,
Oliver Schwery,
Daniel Straub,
Konstantinos G. Papakonstantinou,
Eleni Chatzi
Abstract:
Structural Health Monitoring (SHM) describes a process for inferring quantifiable metrics of structural condition, which can serve as input to support decisions on the operation and maintenance of infrastructure assets. Given the long lifespan of critical structures, this problem can be cast as a sequential decision making problem over prescribed horizons. Partially Observable Markov Decision Proc…
▽ More
Structural Health Monitoring (SHM) describes a process for inferring quantifiable metrics of structural condition, which can serve as input to support decisions on the operation and maintenance of infrastructure assets. Given the long lifespan of critical structures, this problem can be cast as a sequential decision making problem over prescribed horizons. Partially Observable Markov Decision Processes (POMDPs) offer a formal framework to solve the underlying optimal planning task. However, two issues can undermine the POMDP solutions. Firstly, the need for a model that can adequately describe the evolution of the structural condition under deterioration or corrective actions and, secondly, the non-trivial task of recovery of the observation process parameters from available monitoring data. Despite these potential challenges, the adopted POMDP models do not typically account for uncertainty on model parameters, leading to solutions which can be unrealistically confident. In this work, we address both key issues. We present a framework to estimate POMDP transition and observation model parameters directly from available data, via Markov Chain Monte Carlo (MCMC) sampling of a Hidden Markov Model (HMM) conditioned on actions. The MCMC inference estimates distributions of the involved model parameters. We then form and solve the POMDP problem by exploiting the inferred distributions, to derive solutions that are robust to model uncertainty. We successfully apply our approach on maintenance planning for railway track assets on the basis of a "fractal value" indicator, which is computed from actual railway monitoring data.
△ Less
Submitted 15 December, 2022;
originally announced December 2022.
-
Neural Extended Kalman Filters for Learning and Predicting Dynamics of Structural Systems
Authors:
Wei Liu,
Zhilu Lai,
Kiran Bacsa,
Eleni Chatzi
Abstract:
Accurate structural response prediction forms a main driver for structural health monitoring and control applications. This often requires the proposed model to adequately capture the underlying dynamics of complex structural systems. In this work, we utilize a learnable Extended Kalman Filter (EKF), named the Neural Extended Kalman Filter (Neural EKF) throughout this paper, for learning the laten…
▽ More
Accurate structural response prediction forms a main driver for structural health monitoring and control applications. This often requires the proposed model to adequately capture the underlying dynamics of complex structural systems. In this work, we utilize a learnable Extended Kalman Filter (EKF), named the Neural Extended Kalman Filter (Neural EKF) throughout this paper, for learning the latent evolution dynamics of complex physical systems. The Neural EKF is a generalized version of the conventional EKF, where the modeling of process dynamics and sensory observations can be parameterized by neural networks, therefore learned by end-to-end training. The method is implemented under the variational inference framework with the EKF conducting inference from sensing measurements. Typically, conventional variational inference models are parameterized by neural networks independent of the latent dynamics models. This characteristic makes the inference and reconstruction accuracy weakly based on the dynamics models and renders the associated training inadequate. In this work, we show that the structure imposed by the Neural EKF is beneficial to the learning process. We demonstrate the efficacy of the framework on both simulated and real-world structural monitoring datasets, with the results indicating significant predictive capabilities of the proposed scheme.
△ Less
Submitted 3 July, 2023; v1 submitted 9 October, 2022;
originally announced October 2022.
-
Neural modal ordinary differential equations: Integrating physics-based modeling with neural ordinary differential equations for modeling high-dimensional monitored structures
Authors:
Zhilu Lai,
Wei Liu,
Xudong Jian,
Kiran Bacsa,
Limin Sun,
Eleni Chatzi
Abstract:
The order/dimension of models derived on the basis of data is commonly restricted by the number of observations, or in the context of monitored systems, sensing nodes. This is particularly true for structural systems (e.g., civil or mechanical structures), which are typically high-dimensional in nature. In the scope of physics-informed machine learning, this paper proposes a framework -- termed Ne…
▽ More
The order/dimension of models derived on the basis of data is commonly restricted by the number of observations, or in the context of monitored systems, sensing nodes. This is particularly true for structural systems (e.g., civil or mechanical structures), which are typically high-dimensional in nature. In the scope of physics-informed machine learning, this paper proposes a framework -- termed Neural Modal ODEs -- to integrate physics-based modeling with deep learning for modeling the dynamics of monitored and high-dimensional engineered systems. Neural Ordinary Differential Equations -- Neural ODEs are exploited as the deep learning operator. In this initiating exploration, we restrict ourselves to linear or mildly nonlinear systems. We propose an architecture that couples a dynamic version of variational autoencoders with physics-informed Neural ODEs (Pi-Neural ODEs). An encoder, as a part of the autoencoder, learns the abstract mappings from the first few items of observational data to the initial values of the latent variables, which drive the learning of embedded dynamics via physics-informed Neural ODEs, imposing a modal model structure on that latent space. The decoder of the proposed model adopts the eigenmodes derived from an eigen-analysis applied to the linearized portion of a physics-based model: a process implicitly carrying the spatial relationship between degrees-of-freedom (DOFs). The framework is validated on a numerical example, and an experimental dataset of a scaled cable-stayed bridge, where the learned hybrid model is shown to outperform a purely physics-based approach to modeling. We further show the functionality of the proposed scheme within the context of virtual sensing, i.e., the recovery of generalized response quantities in unmeasured DOFs from spatially sparse data.
△ Less
Submitted 30 November, 2022; v1 submitted 16 July, 2022;
originally announced July 2022.
-
Nonlinear Reduced Order Modelling of Soil Structure Interaction Effects via LSTM and Autoencoder Neural Networks
Authors:
Thomas Simpson,
Nikolaos Dervilis,
Philippe Couturier,
Nico Maljaars,
Eleni Chatzi
Abstract:
In the field of structural health monitoring (SHM), inverse problems which require repeated analyses are common. With the increase in the use of nonlinear models, the development of nonlinear reduced order modelling techniques is of paramount interest. Of considerable research interest, is the use of flexible and scalable machine learning methods which can learn to approximate the behaviour of non…
▽ More
In the field of structural health monitoring (SHM), inverse problems which require repeated analyses are common. With the increase in the use of nonlinear models, the development of nonlinear reduced order modelling techniques is of paramount interest. Of considerable research interest, is the use of flexible and scalable machine learning methods which can learn to approximate the behaviour of nonlinear dynamic systems using input and output data. One such nonlinear system of interest, in the context of wind turbine structures, is the soil structure interaction (SSI) problem. Soil demonstrates strongly nonlinear behaviour with regards to its restoring force and has been shown to considerably influence the dynamic response of wind turbine structures. In this work, we demonstrate the application of a recently developed nonlinear reduced order modelling method, which leverages Autoencoder and LSTM neural networks, to a nonlinear soil structure interaction problem of a wind turbine monopile subject to realistic loading at the seabed level. The accuracy and efficiency of the methodology is compared to full order simulations carried out using Abaqus. The ROM was shown to have good fidelity and a considerable reduction in computational time for the system considered.
△ Less
Submitted 3 March, 2022;
originally announced March 2022.
-
On an application of graph neural networks in population based SHM
Authors:
G. Tsialiamanis,
C. Mylonas,
E. Chatzi,
D. J. Wagg,
N. Dervilis,
K. Worden
Abstract:
Attempts have been made recently in the field of population-based structural health monitoring (PBSHM), to transfer knowledge between SHM models of different structures. The attempts have been focussed on homogeneous and heterogeneous populations. A more general approach to transferring knowledge between structures, is by considering all plausible structures as points on a multidimensional base ma…
▽ More
Attempts have been made recently in the field of population-based structural health monitoring (PBSHM), to transfer knowledge between SHM models of different structures. The attempts have been focussed on homogeneous and heterogeneous populations. A more general approach to transferring knowledge between structures, is by considering all plausible structures as points on a multidimensional base manifold and building a fibre bundle. The idea is quite powerful, since, a mapping between points in the base manifold and their fibres, the potential states of any arbitrary structure, can be learnt. A smaller scale problem, but still useful, is that of learning a specific point of every fibre, i.e. that corresponding to the undamaged state of structures within a population. Under the framework of PBSHM, a data-driven approach to the aforementioned problem is developed. Structures are converted into graphs and inference is attempted within a population, using a graph neural network (GNN) algorithm. The algorithm solves a major problem existing in such applications. Structures comprise different sizes and are defined as abstract objects, thus attempting to perform inference within a heterogeneous population is not trivial. The proposed approach is tested in a simulated population of trusses. The goal of the application is to predict the first natural frequency of trusses of different sizes, across different environmental temperatures and having different bar member types. After training the GNN using part of the total population, it was tested on trusses that were not included in the training dataset. Results show that the accuracy of the regression is satisfactory even in structures with higher number of nodes and members than those used to train it.
△ Less
Submitted 3 March, 2022;
originally announced March 2022.
-
An adapted deflated conjugate gradient solver for robust extended/generalised finite element solutions of large scale, 3D crack propagation problems
Authors:
Konstantinos Agathos,
Tim Dodwell,
Eleni Chatzi,
Stephane P. A. Bordas
Abstract:
An adapted deflation preconditioner is employed to accelerate the solution of linear systems resulting from the discretization of fracture mechanics problems with well-conditioned extended/generalized finite elements. The deflation space typically used for linear elasticity problems is enriched with additional vectors, accounting for the enrichment functions used, thus effectively removing low fre…
▽ More
An adapted deflation preconditioner is employed to accelerate the solution of linear systems resulting from the discretization of fracture mechanics problems with well-conditioned extended/generalized finite elements. The deflation space typically used for linear elasticity problems is enriched with additional vectors, accounting for the enrichment functions used, thus effectively removing low frequency components of the error. To further improve performance, deflation is combined, in a multiplicative way, with a block-Jacobi preconditioner, which removes high frequency components of the error as well as linear dependencies introduced by enrichment. The resulting scheme is tested on a series of non-planar crack propagation problems and compared to alternative linear solvers in terms of performance.
△ Less
Submitted 17 November, 2021;
originally announced November 2021.
-
Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks
Authors:
Giacomo Arcieri,
David Wölfle,
Eleni Chatzi
Abstract:
The need for algorithms able to solve Reinforcement Learning (RL) problems with few trials has motivated the advent of model-based RL methods. The reported performance of model-based algorithms has dramatically increased within recent years. However, it is not clear how much of the recent progress is due to improved algorithms or due to improved models. While different modeling options are availab…
▽ More
The need for algorithms able to solve Reinforcement Learning (RL) problems with few trials has motivated the advent of model-based RL methods. The reported performance of model-based algorithms has dramatically increased within recent years. However, it is not clear how much of the recent progress is due to improved algorithms or due to improved models. While different modeling options are available to choose from when applying a model-based approach, the distinguishing traits and particular strengths of different models are not clear. The main contribution of this work lies precisely in assessing the model influence on the performance of RL algorithms. A set of commonly adopted models is established for the purpose of model comparison. These include Neural Networks (NNs), ensembles of NNs, two different approximations of Bayesian NNs (BNNs), that is, the Concrete Dropout NN and the Anchored Ensembling, and Gaussian Processes (GPs). The model comparison is evaluated on a suite of continuous control benchmarking tasks. Our results reveal that significant differences in model performance do exist. The Concrete Dropout NN reports persistently superior performance. We summarize these differences for the benefit of the modeler and suggest that the model choice is tailored to the standards required by each specific application.
△ Less
Submitted 21 March, 2022; v1 submitted 25 October, 2021;
originally announced October 2021.
-
Physics-guided Deep Markov Models for Learning Nonlinear Dynamical Systems with Uncertainty
Authors:
Wei Liu,
Zhilu Lai,
Kiran Bacsa,
Eleni Chatzi
Abstract:
In this paper, we propose a probabilistic physics-guided framework, termed Physics-guided Deep Markov Model (PgDMM). The framework targets the inference of the characteristics and latent structure of nonlinear dynamical systems from measurement data, where exact inference of latent variables is typically intractable. A recently surfaced option pertains to leveraging variational inference to perfor…
▽ More
In this paper, we propose a probabilistic physics-guided framework, termed Physics-guided Deep Markov Model (PgDMM). The framework targets the inference of the characteristics and latent structure of nonlinear dynamical systems from measurement data, where exact inference of latent variables is typically intractable. A recently surfaced option pertains to leveraging variational inference to perform approximate inference. In such a scheme, transition and emission functions of the system are parameterized via feed-forward neural networks (deep generative models). However, due to the generalized and highly versatile formulation of neural network functions, the learned latent space often lacks physical interpretation and structured representation. To address this, we bridge physics-based state space models with Deep Markov Models, thus delivering a hybrid modeling framework for unsupervised learning and identification of nonlinear dynamical systems. The proposed framework takes advantage of the expressive power of deep learning, while retaining the driving physics of the dynamical system by imposing physics-driven restrictions on the side of the latent space. We demonstrate the benefits of such a fusion in terms of achieving improved performance on illustrative simulation examples and experimental case studies of nonlinear systems. Our results indicate that the physics-based models involved in the employed transition and emission functions essentially enforce a more structured and physically interpretable latent space, which is essential for enhancing and generalizing the predictive capabilities of deep learning-based models.
△ Less
Submitted 25 May, 2022; v1 submitted 16 October, 2021;
originally announced October 2021.
-
Machine Learning Approach to Model Order Reduction of Nonlinear Systems via Autoencoder and LSTM Networks
Authors:
Thomas Simpson,
Nikolaos Dervilis,
Eleni Chatzi
Abstract:
In analyzing and assessing the condition of dynamical systems, it is necessary to account for nonlinearity. Recent advances in computation have rendered previously computationally infeasible analyses readily executable on common computer hardware. However, in certain use cases, such as uncertainty quantification or high precision real-time simulation, the computational cost remains a challenge. Th…
▽ More
In analyzing and assessing the condition of dynamical systems, it is necessary to account for nonlinearity. Recent advances in computation have rendered previously computationally infeasible analyses readily executable on common computer hardware. However, in certain use cases, such as uncertainty quantification or high precision real-time simulation, the computational cost remains a challenge. This necessitates the adoption of reduced-order modelling methods, which can reduce the computational toll of such nonlinear analyses. In this work, we propose a reduction scheme relying on the exploitation of an autoencoder as means to infer a latent space from output-only response data. This latent space, which in essence approximates the system's nonlinear normal modes (NNMs), serves as an invertible reduction basis for the nonlinear system. The proposed machine learning framework is then complemented via the use of long short term memory (LSTM) networks in the reduced space. These are used for creating an nonlinear reduced-order model (ROM) of the system, able to recreate the full system's dynamic response under a known driving input.
△ Less
Submitted 23 September, 2021;
originally announced September 2021.
-
Moment fitted cut spectral elements for explicit analysis of guided wave propagation
Authors:
Sergio Nicoli,
Konstantinos Agathos,
Eleni Chatzi
Abstract:
In this work, a method for the simulation of guided wave propagation in solids defined by implicit surfaces is presented. The method employs structured grids of spectral elements in combination to a fictitious domain approach to represent complex geometrical features through singed distance functions. A novel approach, based on moment fitting, is introduced to restore the diagonal mass matrix prop…
▽ More
In this work, a method for the simulation of guided wave propagation in solids defined by implicit surfaces is presented. The method employs structured grids of spectral elements in combination to a fictitious domain approach to represent complex geometrical features through singed distance functions. A novel approach, based on moment fitting, is introduced to restore the diagonal mass matrix property in elements intersected by interfaces, thus enabling the use of explicit time integrators. Since this approach can lead to significantly decreased critical time steps for intersected elements, a "leap-frog" algorithm is employed to locally comply with this condition, thus introducing only a small computational overhead. The resulting method is tested through a series of numerical examples of increasing complexity, where it is shown that it offers increased accuracy compared to other similar approaches. Due to these improvements, components of interest for SHM-related tasks can be effectively discretized, while maintaining a performance comparable or only slightly worse than the standard spectral element method.
△ Less
Submitted 9 August, 2021;
originally announced August 2021.
-
Relational VAE: A Continuous Latent Variable Model for Graph Structured Data
Authors:
Charilaos Mylonas,
Imad Abdallah,
Eleni Chatzi
Abstract:
Graph Networks (GNs) enable the fusion of prior knowledge and relational reasoning with flexible function approximations. In this work, a general GN-based model is proposed which takes full advantage of the relational modeling capabilities of GNs and extends these to probabilistic modeling with Variational Bayes (VB). To that end, we combine complementary pre-existing approaches on VB for graph da…
▽ More
Graph Networks (GNs) enable the fusion of prior knowledge and relational reasoning with flexible function approximations. In this work, a general GN-based model is proposed which takes full advantage of the relational modeling capabilities of GNs and extends these to probabilistic modeling with Variational Bayes (VB). To that end, we combine complementary pre-existing approaches on VB for graph data and propose an approach that relies on graph-structured latent and conditioning variables. It is demonstrated that Neural Processes can also be viewed through the lens of the proposed model. We show applications on the problem of structured probability density modeling for simulated and real wind farm monitoring data, as well as on the meta-learning of simulated Gaussian Process data. We release the source code, along with the simulated datasets.
△ Less
Submitted 30 June, 2021;
originally announced June 2021.
-
Foundations of Population-Based SHM, Part IV: The Geometry of Spaces of Structures and their Feature Spaces
Authors:
George Tsialiamanis,
Charilaos Mylonas,
Eleni Chatzi,
Nikolaos Dervilis,
David J. Wagg,
Keith Worden
Abstract:
One of the requirements of the population-based approach to Structural Health Monitoring (SHM) proposed in the earlier papers in this sequence, is that structures be represented by points in an abstract space. Furthermore, these spaces should be metric spaces in a loose sense; i.e. there should be some measure of distance applicable to pairs of points; similar structures should then be close in th…
▽ More
One of the requirements of the population-based approach to Structural Health Monitoring (SHM) proposed in the earlier papers in this sequence, is that structures be represented by points in an abstract space. Furthermore, these spaces should be metric spaces in a loose sense; i.e. there should be some measure of distance applicable to pairs of points; similar structures should then be close in the metric. However, this geometrical construction is not enough for the framing of problems in data-based SHM, as it leaves undefined the notion of feature spaces. Interpreting the feature values on a structure-by-structure basis as a type of field over the space of structures, it seems sensible to borrow an idea from modern theoretical physics, and define feature assignments as sections in a vector bundle over the structure space. With this idea in place, one can interpret the effect of environmental and operational variations as gauge degrees of freedom, as in modern gauge field theories. This paper will discuss the various geometrical structures required for an abstract theory of feature spaces in SHM, and will draw analogies with how these structures have shown their power in modern physics. In the second part of the paper, the problem of determining the normal condition cross section of a feature bundle is addressed. The solution is provided by the application of Graph Neural Networks (GNN), a versatile non-Euclidean machine learning algorithm which is not restricted to inputs and outputs from vector spaces. In particular, the algorithm is well suited to operating directly on the sort of graph structures which are an important part of the proposed framework for PBSHM. The solution of the normal section problem is demonstrated for a heterogeneous population of truss structures for which the feature of interest is the first natural frequency.
△ Less
Submitted 5 March, 2021;
originally announced March 2021.
-
Bayesian graph neural networks for strain-based crack localization
Authors:
Charilaos Mylonas,
George Tsialiamanis,
Keith Worden,
Eleni N. Chatzi
Abstract:
A common shortcoming of vibration-based damage localization techniques is that localized damages, i.e. small cracks, have a limited influence on the spectral characteristics of a structure. In contrast, even the smallest of defects, under particular loading conditions, cause localized strain concentrations with predictable spatial configuration. However, the effect of a small defect on strain deca…
▽ More
A common shortcoming of vibration-based damage localization techniques is that localized damages, i.e. small cracks, have a limited influence on the spectral characteristics of a structure. In contrast, even the smallest of defects, under particular loading conditions, cause localized strain concentrations with predictable spatial configuration. However, the effect of a small defect on strain decays quickly with distance from the defect, making strain-based localization rather challenging. In this work, an attempt is made to approximate, in a fully data-driven manner, the posterior distribution of a crack location, given arbitrary dynamic strain measurements at arbitrary discrete locations on a structure. The proposed technique leverages Graph Neural Networks (GNNs) and recent developments in scalable learning for Bayesian neural networks. The technique is demonstrated on the problem of inferring the position of an unknown crack via patterns of dynamic strain field measurements at discrete locations. The dataset consists of simulations of a hollow tube under random time-dependent excitations with randomly sampled crack geometry and orientation.
△ Less
Submitted 19 May, 2023; v1 submitted 12 December, 2020;
originally announced December 2020.
-
Remaining Useful Life Estimation Under Uncertainty with Causal GraphNets
Authors:
Charilaos Mylonas,
Eleni Chatzi
Abstract:
In this work, a novel approach for the construction and training of time series models is presented that deals with the problem of learning on large time series with non-equispaced observations, which at the same time may possess features of interest that span multiple scales. The proposed method is appropriate for constructing predictive models for non-stationary stochastic time series.The effica…
▽ More
In this work, a novel approach for the construction and training of time series models is presented that deals with the problem of learning on large time series with non-equispaced observations, which at the same time may possess features of interest that span multiple scales. The proposed method is appropriate for constructing predictive models for non-stationary stochastic time series.The efficacy of the method is demonstrated on a simulated stochastic degradation dataset and on a real-world accelerated life testing dataset for ball-bearings. The proposed method, which is based on GraphNets, implicitly learns a model that describes the evolution of the system at the level of a state-vector rather than of a raw observation. The proposed approach is compared to a recurrent network with a temporal convolutional feature extractor head (RNN-tCNN) which forms a known viable alternative for the problem context considered. Finally, by taking advantage of recent advances in the computation of reparametrization gradients for learning probability distributions, a simple yet effective technique for representing prediction uncertainty as a Gamma distribution over remaining useful life predictions is employed.
△ Less
Submitted 23 November, 2020;
originally announced November 2020.
-
On Dynamic Substructuring of Systems with Localised Nonlinearities
Authors:
Thomas Simpson,
Dimitrios Giagopoulos,
Vasilis Dertimanis,
Eleni Chatzi
Abstract:
Dynamic substructuring (DS) methods encompass a range of techniques to decompose large structural systems into multiple coupled subsystems. This decomposition has the principle benefit of reducing computational time for dynamic simulation of the system. In this context, DS methods may form an essential component of hybrid simulation, wherein they can be used to couple physical and numerical substr…
▽ More
Dynamic substructuring (DS) methods encompass a range of techniques to decompose large structural systems into multiple coupled subsystems. This decomposition has the principle benefit of reducing computational time for dynamic simulation of the system. In this context, DS methods may form an essential component of hybrid simulation, wherein they can be used to couple physical and numerical substructures at reduced computational cost. Since most engineered systems are inherently nonlinear, particular potential lies in incorporating nonlinear methods in existing substructuring schemes which are largely linear methods.
The most widely used and studied DS methods are classical linear techniques such as the Craig-Bampton (CB) method. However, as linear methods they naturally break down in the presence of nonlinearities. Recent advancements in substructuring have involved the development of enrichments to linear methods, which allow for some degree of nonlinearity to be captured. The use of mode shape derivatives has been shown to be able to capture geometrically non-linear effects as an extension to the CBmethod. Other candidates include the method of Finite Element Tearing and Interconnecting.
In this work, a virtual hybrid simulation is presented in which a linear elastic vehicle frame supported on four nonlinear spring damper isolators is decomposed into separate domains. One domain consisting of the finite element model of the vehicle frame, which is reduced using the CB method. The second domain consists of the nonlinear isolators whose restoring forces are characterised by nonlinear spring and damper forces. Coupling between the models is carried out using a Lagrange multiplier method and time series simulations of the system are conducted and compared to the full global system with regards to simulation time and accuracy.
△ Less
Submitted 30 June, 2020;
originally announced June 2020.
-
On the Potential of Dynamic Substructuring Methods for Model Updating
Authors:
Thomas Simpson,
Vasilis Dertimanis,
Costas Papadimitriou,
Eleni Chatzi
Abstract:
While purely data-driven assessment is feasible for the first levels of the Structural Health Monitoring (SHM) process, namely damage detection and arguably damage localization, this does not hold true for more advanced processes. The tasks of damage quantification and eventually residual life prognosis are invariably linked to availability of a representation of the system, which bears physical c…
▽ More
While purely data-driven assessment is feasible for the first levels of the Structural Health Monitoring (SHM) process, namely damage detection and arguably damage localization, this does not hold true for more advanced processes. The tasks of damage quantification and eventually residual life prognosis are invariably linked to availability of a representation of the system, which bears physical connotation. In this context, it is often desirable to assimilate data and models, into what is often termed a digital twin of the monitored system.
One common take to such an end lies in exploitation of structural mechanics models, relying on use of Finite Element approximations. proper updating of these models, and their incorporation in an inverse problem setting may allow for damage quantification and localization, as well as more advanced tasks, including reliability analysis and fatigue assessment. However, this may only be achieved by means of repetitive analyses of the forward model, which implies considerable computational toll, when the model used is a detailed FE representation. In tackling this issue, reduced order models can be adopted, which retain the parameterisation and link to the parameters regulating the physical properties, albeit greatly reducing the computational burden.
In this work a detailed FE model of a wind turbine tower is considered, reduced forms of this model are found using both the Craig Bampton and Dual Craig Bampton methods. These reduced order models are then used and compared in a Transitional Markov Chain Monte Carlo procedure to localise and quantify damage which is introduced to the system.
△ Less
Submitted 30 April, 2021; v1 submitted 30 June, 2020;
originally announced June 2020.
-
A local basis approximation approach for nonlinear parametric model order reduction
Authors:
Konstantinos Vlachas,
Konstantinos Tatsis,
Konstantinos Agathos,
Adam R. Brink,
Eleni Chatzi
Abstract:
The efficient condition assessment of engineered systems requires the coupling of high fidelity models with data extracted from the state of the system `as-is'. In enabling this task, this paper implements a parametric Model Order Reduction (pMOR) scheme for nonlinear structural dynamics, and the particular case of material nonlinearity. A physics-based parametric representation is developed, inco…
▽ More
The efficient condition assessment of engineered systems requires the coupling of high fidelity models with data extracted from the state of the system `as-is'. In enabling this task, this paper implements a parametric Model Order Reduction (pMOR) scheme for nonlinear structural dynamics, and the particular case of material nonlinearity. A physics-based parametric representation is developed, incorporating dependencies on system properties and/or excitation characteristics. The pMOR formulation relies on use of a Proper Orthogonal Decomposition applied to a series of snapshots of the nonlinear dynamic response. A new approach to manifold interpolation is proposed, with interpolation taking place on the reduced coefficient matrix mapping local bases to a global one. We demonstrate the performance of this approach firstly on the simple example of a shear-frame structure, and secondly on the more complex 3D numerical case study of an earthquake-excited wind turbine tower. Parametric dependence pertains to structural properties, as well as the temporal and spectral characteristics of the applied excitation. The developed parametric Reduced Order Model (pROM) can be exploited for a number of tasks including monitoring and diagnostics, control of vibrating structures, and residual life estimation of critical components.
△ Less
Submitted 16 March, 2020;
originally announced March 2020.
-
Value of structural health information in partially observable stochastic environments
Authors:
C. P. Andriotis,
K. G. Papakonstantinou,
E. N. Chatzi
Abstract:
Efficient integration of uncertain observations with decision-making optimization is key for prescribing informed intervention actions, able to preserve structural safety of deteriorating engineering systems. To this end, it is necessary that scheduling of inspection and monitoring strategies be objectively performed on the basis of their expected value-based gains that, among others, reflect quan…
▽ More
Efficient integration of uncertain observations with decision-making optimization is key for prescribing informed intervention actions, able to preserve structural safety of deteriorating engineering systems. To this end, it is necessary that scheduling of inspection and monitoring strategies be objectively performed on the basis of their expected value-based gains that, among others, reflect quantitative metrics such as the Value of Information (VoI) and the Value of Structural Health Monitoring (VoSHM). In this work, we introduce and study the theoretical and computational foundations of the above metrics within the context of Partially Observable Markov Decision Processes (POMDPs), thus alluding to a broad class of decision-making problems of partially observable stochastic deteriorating environments that can be modeled as POMDPs. Step-wise and life-cycle VoI and VoSHM definitions are devised and their bounds are analyzed as per the properties stemming from the Bellman equation and the resulting optimal value function. It is shown that a POMDP policy inherently leverages the notion of VoI to guide observational actions in an optimal way at every decision step, and that the permanent or intermittent information provided by SHM or inspection visits, respectively, can only improve the cost of this policy in the long-term, something that is not necessarily true under locally optimal policies, typically adopted in decision-making of structures and infrastructure. POMDP solutions are derived based on point-based value iteration methods, and the various definitions are quantified in stationary and non-stationary deteriorating environments, with both infinite and finite planning horizons, featuring single- or multi-component engineering systems.
△ Less
Submitted 20 July, 2020; v1 submitted 28 December, 2019;
originally announced December 2019.
-
Multiscale Surrogate Modeling and Uncertainty Quantification for Periodic Composite Structures
Authors:
Charilaos Mylonas,
Valentin Bemetz,
Eleni Chatzi
Abstract:
Computational modeling of the structural behavior of continuous fiber composite materials often takes into account the periodicity of the underlying micro-structure. A well established method dealing with the structural behavior of periodic micro-structures is the so- called Asymptotic Expansion Homogenization (AEH). By considering a periodic perturbation of the material displacement, scale bridgi…
▽ More
Computational modeling of the structural behavior of continuous fiber composite materials often takes into account the periodicity of the underlying micro-structure. A well established method dealing with the structural behavior of periodic micro-structures is the so- called Asymptotic Expansion Homogenization (AEH). By considering a periodic perturbation of the material displacement, scale bridging functions, also referred to as elastic correctors, can be derived in order to connect the strains at the level of the macro-structure with micro- structural strains. For complicated inhomogeneous micro-structures, the derivation of such functions is usually performed by the numerical solution of a PDE problem - typically with the Finite Element Method. Moreover, when dealing with uncertain micro-structural geometry and material parameters, there is considerable uncertainty introduced in the actual stresses experienced by the materials. Due to the high computational cost of computing the elastic correctors, the choice of a pure Monte-Carlo approach for dealing with the inevitable material and geometric uncertainties is clearly computationally intractable. This problem is even more pronounced when the effect of damage in the micro-scale is considered, where re-evaluation of the micro-structural representative volume element is necessary for every occurring damage. The novelty in this paper is that a non-intrusive surrogate modeling approach is employed with the purpose of directly bridging the macro-scale behavior of the structure with the material behavior in the micro-scale, therefore reducing the number of costly evaluations of corrector functions, allowing for future developments on the incorporation of fatigue or static damage in the analysis of composite structural components.
△ Less
Submitted 11 July, 2017;
originally announced July 2017.