
Data-driven Prior Learning for Bayesian Optimisation

Authors: Sigrid Passano Hellan, Christopher G. Lucas, Nigel H. Goddard

Abstract: Transfer learning for Bayesian optimisation has generally assumed a strong similarity between optimisation tasks, with at least a subset having similar optimal inputs. This assumption can reduce computational costs, but it is violated in a wide range of optimisation problems where transfer learning may nonetheless be useful. We replace this assumption with a weaker one only requiring the shape of the optimisation landscape to be similar, and analyse the recent method Prior Learning for Bayesian Optimisation - PLeBO - in this setting. By learning priors for the hyperparameters of the Gaussian process surrogate model we can better approximate the underlying function, especially for few function evaluations. We validate the learned priors and compare to a breadth of transfer learning approaches, using synthetic data and a recent air pollution optimisation problem as benchmarks. We show that PLeBO and prior transfer find good inputs in fewer evaluations.

Submitted 19 April, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

Comments: Presented at the NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World

arXiv:2306.04343

Bayesian Optimisation Against Climate Change: Applications and Benchmarks

Authors: Sigrid Passano Hellan, Christopher G. Lucas, Nigel H. Goddard

Abstract: Bayesian optimisation is a powerful method for optimising black-box functions, popular in settings where the true function is expensive to evaluate and no gradient information is available. Bayesian optimisation can improve responses to many optimisation problems within climate change for which simulator models are unavailable or expensive to sample from. While there have been several feasibility demonstrations of Bayesian optimisation in climate-related applications, there has been no unifying review of applications and benchmarks. We provide such a review here, to encourage the use of Bayesian optimisation in important and well-suited application domains. We identify four main application domains: material discovery, wind farm layout, optimal renewable control and environmental monitoring. For each domain we identify a public benchmark or data set that is easy to use and evaluate systems against, while being representative of real-world problems. Due to the lack of a suitable benchmark for environmental monitoring, we propose LAQN-BO, based on air pollution data. Our contributions are: a) identifying a representative range of benchmarks, providing example code where necessary; b) introducing a new benchmark, LAQN-BO; and c) promoting a wider use of climate change applications among Bayesian optimisation practitioners.

Submitted 7 June, 2023; originally announced June 2023.

arXiv:2202.07595

doi: 10.1609/aaai.v36i11.21448

Bayesian Optimisation for Active Monitoring of Air Pollution

Authors: Sigrid Passano Hellan, Christopher G. Lucas, Nigel H. Goddard

Abstract: Air pollution is one of the leading causes of mortality globally, resulting in millions of deaths each year. Efficient monitoring is important to measure exposure and enforce legal limits. New low-cost sensors can be deployed in greater numbers and in more varied locations, motivating the problem of efficient automated placement. Previous work suggests Bayesian optimisation is an appropriate method, but only considered a satellite data set, with data aggregated over all altitudes. It is ground-level pollution, that humans breathe, which matters most. We improve on those results using hierarchical models and evaluate our models on urban pollution data in London to show that Bayesian optimisation can be successfully applied to the problem.

Submitted 19 April, 2024; v1 submitted 15 February, 2022; originally announced February 2022.

Comments: Presented at AAAI 2022 in the Special Track on AI for Social Impact. Updates: small corrections to references; correction that baselines use gradient-based optimisation, not gradient descent; correction to data preprocessing for LAQN data; correction that the kernel signal variances were modelled internally, not their square roots; correction to iteration for Table 3 (31, not 30).

arXiv:2012.10770

Optimising Placement of Pollution Sensors in Windy Environments

Authors: Sigrid Passano Hellan, Christopher G. Lucas, Nigel H. Goddard

Abstract: Air pollution is one of the most important causes of mortality in the world. Monitoring air pollution is useful to learn more about the link between health and pollutants, and to identify areas for intervention. Such monitoring is expensive, so it is important to place sensors as efficiently as possible. Bayesian optimisation has proven useful in choosing sensor locations, but typically relies on kernel functions that neglect the statistical structure of air pollution, such as the tendency of pollution to propagate in the prevailing wind direction. We describe two new wind-informed kernels and investigate their advantage for the task of actively learning locations of maximum pollution using Bayesian optimisation.

Submitted 28 August, 2022; v1 submitted 19 December, 2020; originally announced December 2020.

Comments: Presented at the AI for Earth Sciences Workshop at Advances in Neural Information Processing Systems (NeurIPS) 2020. Updated August 2022 to correct the scale of the y-axis on the distance plots.

arXiv:1812.03915

Non-Intrusive Load Monitoring with Fully Convolutional Networks

Authors: Cillian Brewitt, Nigel Goddard

Abstract: Non-intrusive load monitoring or energy disaggregation involves estimating the power consumption of individual appliances from measurements of the total power consumption of a home. Deep neural networks have been shown to be effective for energy disaggregation. In this work, we present a deep neural network architecture which achieves state of the art disaggregation performance with substantially improved computational efficiency, reducing model training time by a factor of 32 and prediction time by a factor of 43. This improvement in efficiency could be especially useful for applications where disaggregation must be performed in home on lower power devices, or for research experiments which involve training a large number of models.

Submitted 10 December, 2018; originally announced December 2018.

arXiv:1612.09106

Sequence-to-point learning with neural networks for nonintrusive load monitoring

Authors: Chaoyun Zhang, Mingjun Zhong, Zongzuo Wang, Nigel Goddard, Charles Sutton

Abstract: Energy disaggregation (a.k.a nonintrusive load monitoring, NILM), a single-channel blind source separation problem, aims to decompose the mains which records the whole house electricity consumption into appliance-wise readings. This problem is difficult because it is inherently unidentifiable. Recent approaches have shown that the identifiability problem could be reduced by introducing domain knowledge into the model. Deep neural networks have been shown to be a promising approach for these problems, but sliding windows are necessary to handle the long sequences which arise in signal processing problems, which raises issues about how to combine predictions from different sliding windows. In this paper, we propose sequence-to-point learning, where the input is a window of the mains and the output is a single point of the target appliance. We use convolutional neural networks to train the model. Interestingly, we systematically show that the convolutional neural networks can inherently learn the signatures of the target appliances, which are automatically added into the model to reduce the identifiability problem. We applied the proposed neural network approaches to real-world household energy data, and show that the methods achieve state-of-the-art performance, improving two standard error measures by 84% and 92%.

Submitted 18 September, 2017; v1 submitted 29 December, 2016; originally announced December 2016.

Comments: 8 pages, 3 figures

Journal ref: The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), 2018

arXiv:1510.09130

Latent Bayesian melding for integrating individual and population models

Authors: Mingjun Zhong, Nigel Goddard, Charles Sutton

Abstract: In many statistical problems, a more coarse-grained model may be suitable for population-level behaviour, whereas a more detailed model is appropriate for accurate modelling of individual behaviour. This raises the question of how to integrate both types of models. Methods such as posterior regularization follow the idea of generalized moment matching, in that they allow matching expectations between two models, but sometimes both models are most conveniently expressed as latent variable models. We propose latent Bayesian melding, which is motivated by averaging the distributions over populations statistics of both the individual-level and the population-level models under a logarithmic opinion pool framework. In a case study on electricity disaggregation, which is a type of single-channel blind source separation problem, we show that latent Bayesian melding leads to significantly more accurate predictions than an approach based solely on generalized moment matching.

Submitted 30 October, 2015; originally announced October 2015.

Comments: 11 pages, Advances in Neural Information Processing Systems (NIPS), 2015. (Spotlight Presentation)

Showing 1–7 of 7 results for author: Goddard, N