-
Enhancing reliability in prediction intervals using point forecasters: Heteroscedastic Quantile Regression and Width-Adaptive Conformal Inference
Authors:
Carlos Sebastián,
Carlos E. González-Guillén,
Jesús Juan
Abstract:
Building prediction intervals for time series forecasting problems presents a complex challenge, particularly when relying solely on point predictors, a common scenario for practitioners in the industry. While research has primarily focused on achieving increasingly efficient valid intervals, we argue that, when evaluating a set of intervals, traditional measures alone are insufficient. There are…
▽ More
Building prediction intervals for time series forecasting problems presents a complex challenge, particularly when relying solely on point predictors, a common scenario for practitioners in the industry. While research has primarily focused on achieving increasingly efficient valid intervals, we argue that, when evaluating a set of intervals, traditional measures alone are insufficient. There are additional crucial characteristics: the intervals must vary in length, with this variation directly linked to the difficulty of the prediction, and the coverage of the interval must remain independent of the difficulty of the prediction for practical utility. We propose the Heteroscedastic Quantile Regression (HQR) model and the Width-Adaptive Conformal Inference (WACI) method, providing theoretical coverage guarantees, to overcome those issues, respectively. The methodologies are evaluated in the context of Electricity Price Forecasting and Wind Power Forecasting, representing complex scenarios in time series forecasting. The results demonstrate that HQR and WACI not only improve or achieve typical measures of validity and efficiency but also successfully fulfil the commonly ignored mentioned characteristics.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation
Authors:
Xiaoqi Wang,
Wenbin He,
Xiwei Xuan,
Clint Sebastian,
Jorge Piazentin Ono,
Xin Li,
Sima Behpour,
Thang Doan,
Liang Gou,
Han Wei Shen,
Liu Ren
Abstract:
The open-vocabulary image segmentation task involves partitioning images into semantically meaningful segments and classifying them with flexible text-defined categories. The recent vision-based foundation models such as the Segment Anything Model (SAM) have shown superior performance in generating class-agnostic image segments. The main challenge in open-vocabulary image segmentation now lies in…
▽ More
The open-vocabulary image segmentation task involves partitioning images into semantically meaningful segments and classifying them with flexible text-defined categories. The recent vision-based foundation models such as the Segment Anything Model (SAM) have shown superior performance in generating class-agnostic image segments. The main challenge in open-vocabulary image segmentation now lies in accurately classifying these segments into text-defined categories. In this paper, we introduce the Universal Segment Embedding (USE) framework to address this challenge. This framework is comprised of two key components: 1) a data pipeline designed to efficiently curate a large amount of segment-text pairs at various granularities, and 2) a universal segment embedding model that enables precise segment classification into a vast range of text-defined categories. The USE model can not only help open-vocabulary image segmentation but also facilitate other downstream tasks (e.g., querying and ranking). Through comprehensive experimental studies on semantic segmentation and part segmentation benchmarks, we demonstrate that the USE framework outperforms state-of-the-art open-vocabulary segmentation methods.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
An adaptive standardisation methodology for Day-Ahead electricity price forecasting
Authors:
Carlos Sebastián,
Carlos E. González-Guillén,
Jesús Juan
Abstract:
The study of Day-Ahead prices in the electricity market is one of the most popular problems in time series forecasting. Previous research has focused on employing increasingly complex learning algorithms to capture the sophisticated dynamics of the market. However, there is a threshold where increased complexity fails to yield substantial improvements. In this work, we propose an alternative appro…
▽ More
The study of Day-Ahead prices in the electricity market is one of the most popular problems in time series forecasting. Previous research has focused on employing increasingly complex learning algorithms to capture the sophisticated dynamics of the market. However, there is a threshold where increased complexity fails to yield substantial improvements. In this work, we propose an alternative approach by introducing an adaptive standardisation to mitigate the effects of dataset shifts that commonly occur in the market. By doing so, learning algorithms can prioritize uncovering the true relationship between the target variable and the explanatory variables. We investigate five distinct markets, including two novel datasets, previously unexplored in the literature. These datasets provide a more realistic representation of the current market context, that conventional datasets do not show. The results demonstrate a significant improvement across all five markets using the widely accepted learning algorithms in the literature (LEAR and DNN). In particular, the combination of the proposed methodology with the methodology previously presented in the literature obtains the best results. This significant advancement unveils new lines of research in this field, highlighting the potential of adaptive transformations in enhancing the performance of forecasting models.
△ Less
Submitted 26 April, 2024; v1 submitted 5 November, 2023;
originally announced November 2023.
-
A feature selection method based on Shapley values robust to concept shift in regression
Authors:
Carlos Sebastián,
Carlos E. González-Guillén
Abstract:
Feature selection is one of the most relevant processes in any methodology for creating a statistical learning model. Usually, existing algorithms establish some criterion to select the most influential variables, discarding those that do not contribute to the model with any relevant information. This methodology makes sense in a static situation where the joint distribution of the data does not v…
▽ More
Feature selection is one of the most relevant processes in any methodology for creating a statistical learning model. Usually, existing algorithms establish some criterion to select the most influential variables, discarding those that do not contribute to the model with any relevant information. This methodology makes sense in a static situation where the joint distribution of the data does not vary over time. However, when dealing with real data, it is common to encounter the problem of the dataset shift and, specifically, changes in the relationships between variables (concept shift). In this case, the influence of a variable cannot be the only indicator of its quality as a regressor of the model, since the relationship learned in the training phase may not correspond to the current situation. In tackling this problem, our approach establishes a direct relationship between the Shapley values and prediction errors, operating at a more local level to effectively detect the individual biases introduced by each variable. The proposed methodology is evaluated through various examples, including synthetic scenarios mimicking sudden and incremental shift situations, as well as two real-world cases characterized by concept shifts. Additionally, we perform three analyses of standard situations to assess the algorithm's robustness in the absence of shifts. The results demonstrate that our proposed algorithm significantly outperforms state-of-the-art feature selection methods in concept shift scenarios, while matching the performance of existing methodologies in static situations.
△ Less
Submitted 25 September, 2023; v1 submitted 28 April, 2023;
originally announced April 2023.
-
Dual Embedding Expansion for Vehicle Re-identification
Authors:
Clint Sebastian,
Raffaele Imbriaco,
Egor Bondarev,
Peter H. N. de With
Abstract:
Vehicle re-identification plays a crucial role in the management of transportation infrastructure and traffic flow. However, this is a challenging task due to the large view-point variations in appearance, environmental and instance-related factors. Modern systems deploy CNNs to produce unique representations from the images of each vehicle instance. Most work focuses on leveraging new losses and…
▽ More
Vehicle re-identification plays a crucial role in the management of transportation infrastructure and traffic flow. However, this is a challenging task due to the large view-point variations in appearance, environmental and instance-related factors. Modern systems deploy CNNs to produce unique representations from the images of each vehicle instance. Most work focuses on leveraging new losses and network architectures to improve the descriptiveness of these representations. In contrast, our work concentrates on re-ranking and embedding expansion techniques. We propose an efficient approach for combining the outputs of multiple models at various scales while exploiting tracklet and neighbor information, called dual embedding expansion (DEx). Additionally, a comparative study of several common image retrieval techniques is presented in the context of vehicle re-ID. Our system yields competitive performance in the 2020 NVIDIA AI City Challenge with promising results. We demonstrate that DEx when combined with other re-ranking techniques, can produce an even larger gain without any additional attribute labels or manual supervision.
△ Less
Submitted 18 April, 2020;
originally announced April 2020.
-
Contextual Pyramid Attention Network for Building Segmentation in Aerial Imagery
Authors:
Clint Sebastian,
Raffaele Imbriaco,
Egor Bondarev,
Peter H. N. de With
Abstract:
Building extraction from aerial images has several applications in problems such as urban planning, change detection, and disaster management. With the increasing availability of data, Convolutional Neural Networks (CNNs) for semantic segmentation of remote sensing imagery has improved significantly in recent years. However, convolutions operate in local neighborhoods and fail to capture non-local…
▽ More
Building extraction from aerial images has several applications in problems such as urban planning, change detection, and disaster management. With the increasing availability of data, Convolutional Neural Networks (CNNs) for semantic segmentation of remote sensing imagery has improved significantly in recent years. However, convolutions operate in local neighborhoods and fail to capture non-local features that are essential in semantic understanding of aerial images. In this work, we propose to improve building segmentation of different sizes by capturing long-range dependencies using contextual pyramid attention (CPA). The pathways process the input at multiple scales efficiently and combine them in a weighted manner, similar to an ensemble model. The proposed method obtains state-of-the-art performance on the Inria Aerial Image Labelling Dataset with minimal computation costs. Our method improves 1.8 points over current state-of-the-art methods and 12.6 points higher than existing baselines on the Intersection over Union (IoU) metric without any post-processing. Code and models will be made publicly available.
△ Less
Submitted 15 April, 2020;
originally announced April 2020.
-
Adversarial Loss for Semantic Segmentation of Aerial Imagery
Authors:
Clint Sebastian,
Raffaele Imbriaco,
Egor Bondarev,
Peter H. N. de With
Abstract:
Automatic building extraction from aerial imagery has several applications in urban planning, disaster management, and change detection. In recent years, several works have adopted deep convolutional neural networks (CNNs) for building extraction, since they produce rich features that are invariant against lighting conditions, shadows, etc. Although several advances have been made, building extrac…
▽ More
Automatic building extraction from aerial imagery has several applications in urban planning, disaster management, and change detection. In recent years, several works have adopted deep convolutional neural networks (CNNs) for building extraction, since they produce rich features that are invariant against lighting conditions, shadows, etc. Although several advances have been made, building extraction from aerial imagery still presents multiple challenges. Most of the deep learning segmentation methods optimize the per-pixel loss with respect to the ground truth without knowledge of the context. This often leads to imperfect outputs that may lead to missing or unrefined regions. In this work, we propose a novel loss function combining both adversarial and cross-entropy losses that learn to understand both local and global contexts for semantic segmentation. The newly proposed loss function deployed on the DeepLab v3+ network obtains state-of-the-art results on the Massachusetts buildings dataset. The loss function improves the structure and refines the edges of buildings without requiring any of the commonly used post-processing methods, such as Conditional Random Fields. We also perform ablation studies to understand the impact of the adversarial loss. Finally, the proposed method achieves a relaxed F1 score of 95.59% on the Massachusetts buildings dataset compared to the previous best F1 of 94.88%.
△ Less
Submitted 18 January, 2020; v1 submitted 13 January, 2020;
originally announced January 2020.
-
Privacy Protection in Street-View Panoramas using Depth and Multi-View Imagery
Authors:
Ries Uittenbogaard,
Clint Sebastian,
Julien Vijverberg,
Bas Boom,
Dariu M. Gavrila,
Peter H. N. de With
Abstract:
The current paradigm in privacy protection in street-view images is to detect and blur sensitive information. In this paper, we propose a framework that is an alternative to blurring, which automatically removes and inpaints moving objects (e.g. pedestrians, vehicles) in street-view imagery. We propose a novel moving object segmentation algorithm exploiting consistencies in depth across multiple s…
▽ More
The current paradigm in privacy protection in street-view images is to detect and blur sensitive information. In this paper, we propose a framework that is an alternative to blurring, which automatically removes and inpaints moving objects (e.g. pedestrians, vehicles) in street-view imagery. We propose a novel moving object segmentation algorithm exploiting consistencies in depth across multiple street-view images that are later combined with the results of a segmentation network. The detected moving objects are removed and inpainted with information from other views, to obtain a realistic output image such that the moving object is not visible anymore. We evaluate our results on a dataset of 1000 images to obtain a peak noise-to-signal ratio (PSNR) and L1 loss of 27.2 dB and 2.5%, respectively. To ensure the subjective quality, To assess overall quality, we also report the results of a survey conducted on 35 professionals, asked to visually inspect the images whether object removal and inpainting had taken place. The inpainting dataset will be made publicly available for scientific benchmarking purposes at https://research.cyclomedia.com
△ Less
Submitted 27 March, 2019;
originally announced March 2019.
-
Aggregated Deep Local Features for Remote Sensing Image Retrieval
Authors:
Raffaele Imbriaco,
Clint Sebastian,
Egor Bondarev,
Peter H. N. de With
Abstract:
Remote Sensing Image Retrieval remains a challenging topic due to the special nature of Remote Sensing Imagery. Such images contain various different semantic objects, which clearly complicates the retrieval task. In this paper, we present an image retrieval pipeline that uses attentive, local convolutional features and aggregates them using the Vector of Locally Aggregated Descriptors (VLAD) to p…
▽ More
Remote Sensing Image Retrieval remains a challenging topic due to the special nature of Remote Sensing Imagery. Such images contain various different semantic objects, which clearly complicates the retrieval task. In this paper, we present an image retrieval pipeline that uses attentive, local convolutional features and aggregates them using the Vector of Locally Aggregated Descriptors (VLAD) to produce a global descriptor. We study various system parameters such as the multiplicative and additive attention mechanisms and descriptor dimensionality. We propose a query expansion method that requires no external inputs. Experiments demonstrate that even without training, the local convolutional features and global representation outperform other systems. After system tuning, we can achieve state-of-the-art or competitive results. Furthermore, we observe that our query expansion method increases overall system performance by about 3%, using only the top-three retrieved images. Finally, we show how dimensionality reduction produces compact descriptors with increased retrieval performance and fast retrieval computation times, e.g. 50% faster than the current systems.
△ Less
Submitted 22 March, 2019;
originally announced March 2019.
-
LiDAR-assisted Large-scale Privacy Protection in Street-view Cycloramas
Authors:
Clint Sebastian,
Bas Boom,
Egor Bondarev,
Peter H. N. de With
Abstract:
Recently, privacy has a growing importance in several domains, especially in street-view images. The conventional way to achieve this is to automatically detect and blur sensitive information from these images. However, the processing cost of blurring increases with the ever-growing resolution of images. We propose a system that is cost-effective even after increasing the resolution by a factor of…
▽ More
Recently, privacy has a growing importance in several domains, especially in street-view images. The conventional way to achieve this is to automatically detect and blur sensitive information from these images. However, the processing cost of blurring increases with the ever-growing resolution of images. We propose a system that is cost-effective even after increasing the resolution by a factor of 2.5. The new system utilizes depth data obtained from LiDAR to significantly reduce the search space for detection, thereby reducing the processing cost. Besides this, we test several detectors after reducing the detection space and provide an alternative solution based on state-of-the-art deep learning detectors to the existing HoG-SVM-Deep system that is faster and has a higher performance.
△ Less
Submitted 13 March, 2019;
originally announced March 2019.
-
Towards Accurate Camera Geopositioning by Image Matching
Authors:
Raffaele Imbriaco,
Clint Sebastian,
Egor Bondarev,
Peter de With
Abstract:
In this work, we present a camera geopositioning system based on matching a query image against a database with panoramic images. For matching, our system uses memory vectors aggregated from global image descriptors based on convolutional features to facilitate fast searching in the database. To speed up searching, a clustering algorithm is used to balance geographical positioning and computation…
▽ More
In this work, we present a camera geopositioning system based on matching a query image against a database with panoramic images. For matching, our system uses memory vectors aggregated from global image descriptors based on convolutional features to facilitate fast searching in the database. To speed up searching, a clustering algorithm is used to balance geographical positioning and computation time. We refine the obtained position from the query image using a new outlier removal algorithm. The matching of the query image is obtained with a recall@5 larger than 90% for panorama-to-panorama matching. We cluster available panoramas from geographically adjacent locations into a single compact representation and observe computational gains of approximately 50% at the cost of only a small (approximately 3%) recall loss. Finally, we present a coordinate estimation algorithm that reduces the median geopositioning error by up to 20%.
△ Less
Submitted 13 March, 2019;
originally announced March 2019.
-
Time- and frequency-resolved covariance analysis for detection and characterization of seizures from intracraneal EEG recordings
Authors:
Melisa Maidana Capitán,
Nuria Cámpora,
Claudio Sebastián,
Sigvard Silvia Kochen,
Inés Samengo
Abstract:
The amount of power in different frequency bands of the electroencephalogram (EEG) carries information about the behavioral state of a subject. Hence, neurologists treating epileptic patients monitor the temporal evolution of the different bands. We propose a covariance-based method to detect and characterize epileptic seizures operating on the band-filtered EEG signal. The algorithm is unsupervis…
▽ More
The amount of power in different frequency bands of the electroencephalogram (EEG) carries information about the behavioral state of a subject. Hence, neurologists treating epileptic patients monitor the temporal evolution of the different bands. We propose a covariance-based method to detect and characterize epileptic seizures operating on the band-filtered EEG signal. The algorithm is unsupervised, and performs a principal component analysis of intra-cranial EEG recordings, detecting transient fluctuations of the power in each frequency band. Its simplicity makes it suitable for online implementation. Good sampling of the non-ictal periods is required, while no demands are imposed on the amount of data during ictal activity. We tested the method with 32 seizures registered in 5 patients. The area below the resulting receiver-operating characteristic curves was 87\% for the detection of seizures and 91\% for the detection of recruited electrodes. To identify the behaviorally relevant correlates of the physiological signal, we identified transient changes in the variance of each band that were correlated with the degree of loss of consciousness, the latter assessed by the so-called Consciousness Seizure Scale, summarizing the performance of the subject in a number of behavioral tests requested during seizures. We concluded that those crisis with maximal impairment of consciousness tended to exhibit an increase of variance approximately 40 seconds after seizure onset, with predominant power in the theta and alpha bands, and reduced delta and beta activity.
△ Less
Submitted 15 June, 2020; v1 submitted 28 February, 2019;
originally announced February 2019.
-
Bootstrapped CNNs for Building Segmentation on RGB-D Aerial Imagery
Authors:
Clint Sebastian,
Bas Boom,
Thijs van Lankveld,
Egor Bondarev,
Peter H. N. De With
Abstract:
Detection of buildings and other objects from aerial images has various applications in urban planning and map making. Automated building detection from aerial imagery is a challenging task, as it is prone to varying lighting conditions, shadows and occlusions. Convolutional Neural Networks (CNNs) are robust against some of these variations, although they fail to distinguish easy and difficult exa…
▽ More
Detection of buildings and other objects from aerial images has various applications in urban planning and map making. Automated building detection from aerial imagery is a challenging task, as it is prone to varying lighting conditions, shadows and occlusions. Convolutional Neural Networks (CNNs) are robust against some of these variations, although they fail to distinguish easy and difficult examples. We train a detection algorithm from RGB-D images to obtain a segmented mask by using the CNN architecture DenseNet.First, we improve the performance of the model by applying a statistical re-sampling technique called Bootstrapping and demonstrate that more informative examples are retained. Second, the proposed method outperforms the non-bootstrapped version by utilizing only one-sixth of the original training data and it obtains a precision-recall break-even of 95.10% on our aerial imagery dataset.
△ Less
Submitted 8 October, 2018;
originally announced October 2018.
-
Conditional Transfer with Dense Residual Attention: Synthesizing traffic signs from street-view imagery
Authors:
Clint Sebastian,
Ries Uittenbogaard,
Julien Vijverberg,
Bas Boom,
Peter H. N. de With
Abstract:
Object detection and classification of traffic signs in street-view imagery is an essential element for asset management, map making and autonomous driving. However, some traffic signs occur rarely and consequently, they are difficult to recognize automatically. To improve the detection and classification rates, we propose to generate images of traffic signs, which are then used to train a detecto…
▽ More
Object detection and classification of traffic signs in street-view imagery is an essential element for asset management, map making and autonomous driving. However, some traffic signs occur rarely and consequently, they are difficult to recognize automatically. To improve the detection and classification rates, we propose to generate images of traffic signs, which are then used to train a detector/classifier. In this research, we present an end-to-end framework that generates a realistic image of a traffic sign from a given image of a traffic sign and a pictogram of the target class. We propose a residual attention mechanism with dense concatenation called Dense Residual Attention, that preserves the background information while transferring the object information. We also propose to utilize multi-scale discriminators, so that the smaller scales of the output guide the higher resolution output. We have performed detection and classification tests across a large number of traffic sign classes, by training the detector using the combination of real and generated data. The newly trained model reduces the number of false positives by 1.2 - 1.5% at 99% recall in the detection tests and an absolute improvement of 4.65% (top-1 accuracy) in the classification tests.
△ Less
Submitted 5 September, 2018;
originally announced September 2018.
-
Searching for hexagonal analogues of the half-metallic half-Heusler XYZ compounds
Authors:
Frederick Casper,
Claudia Felser,
Ram Seshadri,
C. Peter Sebastian,
Rainer Poettgen
Abstract:
The XYZ half-Heusler crystal structure can conveniently be described as a tetrahedral zinc blende YZ structure which is stuffed by a slightly ionic X species. This description is well suited to understand the electronic structure of semiconducting 8-electron compounds such as LiAlSi (formulated Li$^+$[AlSi]$^-$) or semiconducting 18-electron compounds such as TiCoSb (formulated Ti$^{4+}$[CoSb]…
▽ More
The XYZ half-Heusler crystal structure can conveniently be described as a tetrahedral zinc blende YZ structure which is stuffed by a slightly ionic X species. This description is well suited to understand the electronic structure of semiconducting 8-electron compounds such as LiAlSi (formulated Li$^+$[AlSi]$^-$) or semiconducting 18-electron compounds such as TiCoSb (formulated Ti$^{4+}$[CoSb]$^{4-}$). The basis for this is that [AlSi]$^-$ (with the same electron count as Si$_2$) and [CoSb]$^{4-}$ (the same electron count as GaSb), are both structurally and electronically, zinc-blende semiconductors. The electronic structure of half-metallic ferromagnets in this structure type can then be described as semiconductors with stuffing magnetic ions which have a local moment: For example, 22 electron MnNiSb can be written Mn$^{3+}$[NiSb]$^{3-}$. The tendency in the 18 electron compound for a semiconducting gap -- believed to arise from strong covalency -- is carried over in MnNiSb to a tendency for a gap in one spin direction. Here we similarly propose the systematic examination of 18-electron hexagonal compounds for semiconducting gaps; these would be the "stuffed wurtzite" analogues of the "stuffed zinc blende" half-Heusler compounds. These semiconductors could then serve as the basis for possibly new families of half-metallic compounds, attained through appropriate replacement of non-magnetic ions by magnetic ones. These semiconductors and semimetals with tunable charge carrier concentrations could also be interesting in the context of magnetoresistive and thermoelectric materials.
△ Less
Submitted 30 October, 2007;
originally announced October 2007.
-
Structure and properties of alpha- and beta- CeCuSn: A single-crystal and Mossbauer spectroscopic investigation
Authors:
C. Peter Sebastian,
Sudhindra Rayaprol,
Rolf-Dieter Hoffmann,
Ute Ch. Rodewald,
Tania Pape,
Rainer Pottgen
Abstract:
Two modifications of CeCuSn were prepared from the elements: the high-temperature (beta) modification crystallizes directly from the quenched sample, while the low-temperature (alpha) modification forms after annealing at 700 deg C for one month. Both modifications were investigated by X-ray powder and single crystal diffraction. We find for alpha-CeCuSn a structure of ZrBeSi type, space group P…
▽ More
Two modifications of CeCuSn were prepared from the elements: the high-temperature (beta) modification crystallizes directly from the quenched sample, while the low-temperature (alpha) modification forms after annealing at 700 deg C for one month. Both modifications were investigated by X-ray powder and single crystal diffraction. We find for alpha-CeCuSn a structure of ZrBeSi type, space group P63/mmc, a = 458.2(1), c = 793.7(2) pm, wR2 = 0.0727, 148 F2 values, 8 variable parameters. In the case of beta-CeCuSn we find the NdPtSb type structure, space group P63mc, a = 458.4(1), c = 785.8(2) pm, wR2 = 0.0764, 233 F2 values, 11 variable parameters. The copper and tin atoms build up layers of ordered [Cu3Sn3] hexagons. The layers are planar in beta-CeCuSn, however, with highly anisotropic displacements of the copper and tin atoms. In alpha-CeCuSn a puckering effect is observed resulting in a decrease of the c lattice parameter. Both modifications of CeCuSn exhibit antiferromagnetic ordering, however, there is a considerable difference in their magnetic behaviour. We show the anomalies in the physical properties of the alpha- and beta- modifications of CeCuSn by Mossbauer spectroscopy,magnetic and specific heat measurements and explain their structure-property relations.
△ Less
Submitted 9 December, 2006;
originally announced December 2006.