-
Boosting, Voting Classifiers and Randomized Sample Compression Schemes
Authors:
Arthur da Cunha,
Kasper Green Larsen,
Martin Ritzert
Abstract:
In boosting, we aim to leverage multiple weak learners to produce a strong learner. At the center of this paradigm lies the concept of building the strong learner as a voting classifier, which outputs a weighted majority vote of the weak learners. While many successful boosting algorithms, such as the iconic AdaBoost, produce voting classifiers, their theoretical performance has long remained sub-optimal: the best known bounds on the number of training examples necessary for a voting classifier to obtain a given accuracy have so far always contained at least two logarithmic factors above what is known to be achievable by general weak-to-strong learners. In this work, we break this barrier by proposing a randomized boosting algorithm that outputs voting classifiers whose generalization error contains a single logarithmic dependency on the sample size. We obtain this result by building a general framework that extends sample compression methods to support randomized learning algorithms based on sub-sampling.
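To make the notion of a voting classifier concrete, here is a minimal sketch of a weighted majority vote over weak learners with outputs in {-1, +1}. The threshold "stump" learners, the weights, and all function names are hypothetical choices for illustration; this is not the randomized boosting algorithm proposed in the paper.

```python
from typing import Callable, Sequence


def weighted_majority_vote(
    weak_learners: Sequence[Callable[[float], int]],
    weights: Sequence[float],
    x: float,
) -> int:
    """Return the sign of the weighted sum of weak-learner predictions in {-1, +1}."""
    score = sum(w * h(x) for h, w in zip(weak_learners, weights))
    # Ties are broken towards +1; this is an arbitrary choice for the sketch.
    return 1 if score >= 0 else -1


def make_stump(threshold: float) -> Callable[[float], int]:
    """Hypothetical weak learner: predicts +1 when x exceeds the threshold."""
    return lambda x: 1 if x > threshold else -1


if __name__ == "__main__":
    stumps = [make_stump(0.0), make_stump(1.0), make_stump(2.0)]
    weights = [0.5, 0.3, 0.1]  # assumed weights, e.g. as a booster might assign
    for x in (-1.0, 0.5, 1.5, 3.0):
        print(x, weighted_majority_vote(stumps, weights, x))
```

A boosting algorithm such as AdaBoost would choose both the weak learners and the weights from training data; the sketch only shows how the final vote is formed once they are fixed.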
Submitted 5 February, 2024;
originally announced February 2024.
-
Polynomially Over-Parameterized Convolutional Neural Networks Contain Structured Strong Winning Lottery Tickets
Authors:
Arthur da Cunha,
Francesco d'Amore,
Emanuele Natale
Abstract:
The Strong Lottery Ticket Hypothesis (SLTH) states that randomly-initialised neural networks likely contain subnetworks that perform well without any training. Although unstructured pruning has been extensively studied in this context, its structured counterpart, which can deliver significant computational and memory efficiency gains, has been largely unexplored. One of the main reasons for this gap is the limitations of the underlying mathematical tools used in formal analyses of the SLTH. In this paper, we overcome these limitations: we leverage recent advances in the multidimensional generalisation of the Random Subset-Sum Problem and obtain a variant that admits the stochastic dependencies that arise when addressing structured pruning in the SLTH. We apply this result to prove, for a wide class of random Convolutional Neural Networks, the existence of structured subnetworks that can approximate any sufficiently smaller network.
This result provides the first sub-exponential bound around the SLTH for structured pruning, opening up new avenues for further research on the hypothesis and contributing to the understanding of the role of over-parameterization in deep learning.
Submitted 16 November, 2023;
originally announced November 2023.
-
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets
Authors:
Dominique Beaini,
Shenyang Huang,
Joao Alex Cunha,
Zhiyi Li,
Gabriela Moisescu-Pareja,
Oleksandr Dymov,
Samuel Maddrell-Mander,
Callum McLean,
Frederik Wenkel,
Luis Müller,
Jama Hussein Mohamud,
Ali Parviz,
Michael Craig,
Michał Koziarski,
Jiarui Lu,
Zhaocheng Zhu,
Cristian Gabellini,
Kerstin Klaser,
Josef Dean,
Cas Wognum,
Maciej Sypetkowski,
Guillaume Rabusseau,
Reihaneh Rabbany,
Jian Tang,
Christopher Morris
, et al. (10 additional authors not shown)
Abstract:
Recently, pre-trained foundation models have enabled significant advancements in multiple fields. In molecular machine learning, however, where datasets are often hand-curated, and hence typically small, the lack of datasets with labeled features, and codebases to manage those datasets, has hindered the development of foundation models. In this work, we present seven novel datasets categorized by size into three distinct categories: ToyMix, LargeMix and UltraLarge. These datasets push the boundaries in both the scale and the diversity of supervised labels for molecular learning. They cover nearly 100 million molecules and over 3000 sparsely defined tasks, totaling more than 13 billion individual labels of both quantum and biological nature. In comparison, our datasets contain 300 times more data points than the widely used OGB-LSC PCQM4Mv2 dataset, and 13 times more than the quantum-only QM1B dataset. In addition, to support the development of foundational models based on our proposed datasets, we present the Graphium graph machine learning library which simplifies the process of building and training molecular machine learning models for multi-task and multi-level molecular datasets. Finally, we present a range of baseline results as a starting point of multi-task and multi-level training on these datasets. Empirically, we observe that performance on low-resource biological datasets shows improvement by also training on large amounts of quantum data. This indicates that there may be potential in multi-task and multi-level training of a foundation model and fine-tuning it to resource-constrained downstream tasks.
Submitted 18 October, 2023; v1 submitted 6 October, 2023;
originally announced October 2023.
-
MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision
Authors:
Jianning Li,
Zongwei Zhou,
Jiancheng Yang,
Antonio Pepe,
Christina Gsaxner,
Gijs Luijten,
Chongyu Qu,
Tiezheng Zhang,
Xiaoxi Chen,
Wenxuan Li,
Marek Wodzinski,
Paul Friedrich,
Kangxian Xie,
Yuan Jin,
Narmada Ambigapathy,
Enrico Nasca,
Naida Solak,
Gian Marco Melito,
Viet Duc Vu,
Afaque R. Memon,
Christopher Schlachta,
Sandrine De Ribaupierre,
Rajnikant Patel,
Roy Eagleson,
Xiaojun Chen
, et al. (132 additional authors not shown)
Abstract:
Prior to the deep learning era, shape was commonly used to describe objects. Nowadays, state-of-the-art (SOTA) algorithms in medical imaging are predominantly diverging from computer vision, where voxel grids, meshes, point clouds, and implicit surface models are used. This is seen from numerous shape-related publications in premier vision conferences as well as the growing popularity of ShapeNet (about 51,300 models) and Princeton ModelNet (127,915 models). For the medical domain, we present a large collection of anatomical shapes (e.g., bones, organs, vessels) and 3D models of surgical instruments, called MedShapeNet, created to facilitate the translation of data-driven vision algorithms to medical applications and to adapt SOTA vision algorithms to medical problems. As a unique feature, we directly model the majority of shapes on the imaging data of real patients. As of today, MedShapeNet includes 23 datasets with more than 100,000 shapes that are paired with annotations (ground truth). Our data is freely accessible via a web interface and a Python application programming interface (API) and can be used for discriminative, reconstructive, and variational benchmarks as well as various applications in virtual, augmented, or mixed reality, and 3D printing. As examples, we present use cases in the fields of classification of brain tumors, facial and skull reconstructions, multi-class anatomy completion, education, and 3D printing. In the future, we will extend the data and improve the interfaces. The project pages are: https://medshapenet.ikim.nrw/ and https://github.com/Jianningli/medshapenet-feedback
Submitted 12 December, 2023; v1 submitted 30 August, 2023;
originally announced August 2023.
-
A database of georeferenced formal jobs in Brazilian cities
Authors:
Andre Borgato Morelli,
André de Carvalho Fiedler,
André Luiz Cunha
Abstract:
Currently, transport planning has changed its paradigm from projects oriented to guarantee service levels to projects oriented to guarantee accessibility to opportunities. In this context, a number of studies and tools aimed at calculating accessibility are being made available; however, these tools depend on job location data that are not always easily accessible. Thus, this work proposes the creation of a database with the locations of formal jobs in Brazilian cities. The method uses the RAIS jobs database and the CNEFE street faces database to infer the location of jobs in urban regions from the zip code and the number of non-residential addresses on street faces. As a result, jobs can be located more accurately in large and medium-sized cities and approximately in single zip code cities. Finally, the databases are made available openly so that researchers and planning professionals can easily apply accessibility analyses throughout the national territory.
Submitted 16 March, 2023;
originally announced March 2023.
-
Probabilistic maps on bistable vibration energy harvesters
Authors:
João Pedro Norenberg,
Americo Cunha Jr,
Samuel da Silva,
Paulo Sergio Varoto
Abstract:
This paper analyzes the impact of parametric uncertainties on the dynamics of bistable energy harvesters, focusing on obtaining statistical information about how each parameter's variability affects the energy harvesting process. To model the parametric uncertainties, we use a probability distribution derived from the maximum entropy principle, while polynomial chaos is employed to propagate uncertainty. Conditional probabilities and probability maps are obtained to investigate the effect of uncertainty on harvesting energy. We consider different models of bistable energy harvesters that account for nonlinear piezoelectric coupling and asymmetries. Our findings suggest a higher probability of increasing harvested power in the intrawell motion regime as the excitation frequency increases. In contrast, increasing the excitation amplitude and piezoelectric coupling are more likely to increase power in the chaotic and interwell motion regimes, respectively. An illustrative example is presented to emphasize the importance of investigating the influence when all parameters vary simultaneously.
Submitted 16 October, 2023; v1 submitted 19 February, 2023;
originally announced February 2023.
-
Nonlinear dynamics of asymmetric bistable energy harvesters
Authors:
João Pedro Norenberg,
Roberto Luo,
Vinicius Gonçalves Lopes,
João Victor L. L. Peterson,
Americo Cunha Jr
Abstract:
The paper investigates the effects of asymmetries on the dynamics of a nonlinear vibration energy harvester. The performance of the asymmetric system is compared with that of symmetric ones. Different asymmetry levels in the restoring force and in the action of gravity are investigated by varying the system's sloping angle. Bifurcation diagrams and basins of attraction are used to examine the local and global characteristics underlying the dynamical systems under different excitation energies. The results show the adverse effects of asymmetries on system dynamics. They also reveal ways to overcome them by canceling the asymmetric influence through optimal sloping angle values, improving the asymmetric system's performance over symmetrical ones. This comprehensive numerical study provides novel and valuable insights into asymmetrical energy harvester dynamics, a broad and still little-explored topic.
Submitted 9 June, 2023; v1 submitted 20 August, 2022;
originally announced September 2022.
-
On the Multidimensional Random Subset Sum Problem
Authors:
Luca Becchetti,
Arthur Carvalho Walraven da Cunha,
Andrea Clementi,
Francesco d'Amore,
Hicham Lesfari,
Emanuele Natale,
Luca Trevisan
Abstract:
In the Random Subset Sum Problem, given $n$ i.i.d. random variables $X_1, ..., X_n$, we wish to approximate any point $z \in [-1,1]$ as the sum of a suitable subset $X_{i_1(z)}, ..., X_{i_s(z)}$ of them, up to error $\varepsilon$. Despite its simple statement, this problem is of fundamental interest to both theoretical computer science and statistical mechanics. More recently, it gained renewed attention for its implications in the theory of Artificial Neural Networks. An obvious multidimensional generalisation of the problem is to consider $n$ i.i.d. $d$-dimensional random vectors, with the objective of approximating every point $\mathbf{z} \in [-1,1]^d$. In 1998, G. S. Lueker showed that, in the one-dimensional setting, $n=\mathcal{O}(\log \frac 1\varepsilon)$ samples guarantee the approximation property with high probability. In this work, we prove that, in $d$ dimensions, $n = \mathcal{O}(d^3\log \frac 1\varepsilon \cdot (\log \frac 1\varepsilon + \log d))$ samples suffice for the approximation property to hold with high probability. As an application highlighting the potential interest of this result, we prove that a recently proposed neural network model exhibits universality: with high probability, the model can approximate any neural network within a polynomial overhead in the number of parameters.
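As a rough illustration of the one-dimensional statement, the following sketch draws $n$ samples and measures by brute force how closely subset sums cover a grid of targets in $[-1,1]$. The uniform distribution on $[-1,1]$, the sample sizes, the grid resolution, and the function name are assumptions made for this example; the abstract does not specify them, and the enumeration is only feasible for small $n$.

```python
import bisect
import itertools
import random


def max_subset_sum_gap(samples, grid_points=201):
    """Largest distance from any grid point in [-1, 1] to the nearest subset sum."""
    # Enumerate all 2^n subset sums, including the empty subset (sum 0).
    sums = sorted(
        sum(subset)
        for r in range(len(samples) + 1)
        for subset in itertools.combinations(samples, r)
    )
    worst = 0.0
    for k in range(grid_points):
        z = -1.0 + 2.0 * k / (grid_points - 1)
        # Only the neighbours of z in the sorted list can be nearest.
        i = bisect.bisect_left(sums, z)
        candidates = sums[max(i - 1, 0): i + 1]
        nearest = min(abs(z - s) for s in candidates)
        worst = max(worst, nearest)
    return worst


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    for n in (4, 8, 12):
        samples = [rng.uniform(-1.0, 1.0) for _ in range(n)]
        print(n, max_subset_sum_gap(samples))
```

The printed gap is an empirical stand-in for the error $\varepsilon$ on the chosen grid; it says nothing about the constants in the stated bounds.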
Submitted 17 November, 2022; v1 submitted 28 July, 2022;
originally announced July 2022.
-
Uncertainty quantification in mechanistic epidemic models via cross-entropy approximate Bayesian computation
Authors:
Americo Cunha Jr,
David A. W. Barton,
Thiago G. Ritto
Abstract:
This paper proposes a data-driven approximate Bayesian computation framework for parameter estimation and uncertainty quantification of epidemic models, which incorporates two novelties: (i) the identification of the initial conditions by using plausible dynamic states that are compatible with observational data; (ii) learning of an informative prior distribution for the model parameters via the cross-entropy method. The new methodology's effectiveness is illustrated with the aid of actual data from the COVID-19 epidemic in Rio de Janeiro city in Brazil, employing an ordinary differential equation-based model with a generalized SEIR mechanistic structure that includes time-dependent transmission rate, asymptomatics, and hospitalizations. A minimization problem with two cost terms (number of hospitalizations and deaths) is formulated, and twelve parameters are identified. The calibrated model provides a consistent description of the available data, able to extrapolate forecasts over a few weeks, making the proposed methodology very appealing for real-time epidemic modeling.
Submitted 2 February, 2023; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Multilingual Disinformation Detection for Digital Advertising
Authors:
Zofia Trstanova,
Nadir El Manouzi,
Maryline Chen,
Andre L. V. da Cunha,
Sergei Ivanov
Abstract:
In today's world, the presence of online disinformation and propaganda is more widespread than ever. Independent publishers are funded mostly via digital advertising, which is unfortunately also the case for those publishing disinformation content. The question of how to remove such publishers from advertising inventory has long been ignored, despite the negative impact on the open internet. In this work, we make the first step towards quickly detecting and red-flagging websites that potentially manipulate the public with disinformation. We build a machine learning model based on multilingual text embeddings that first determines whether the page mentions a topic of interest, then estimates the likelihood of the content being malicious, creating a shortlist of publishers that will be reviewed by human experts. Our system empowers internal teams to proactively, rather than defensively, blacklist unsafe content, thus protecting the reputation of the advertisement provider.
Submitted 4 July, 2022;
originally announced July 2022.
-
On the reduction of nonlinear electromechanical systems
Authors:
Americo Cunha Jr,
Marcelo Pereira,
Rafael Avanço,
Angelo Marcelo Tusset,
José Manoel Balthazar
Abstract:
The present work revisits the reduction of the nonlinear dynamics of an electromechanical system through a quasi-steady state hypothesis, discussing the fundamental aspects of this type of approach and clarifying some confusing points found in the literature. Expressions for the characteristic time scales of dynamics are deduced from a physical analysis that establishes an analogy between electromechanical dynamics and the kinetics of a chemical reaction. It provides a physical justification, supplemented by non-dimensionalization and scaling of the equations, to reduce the dynamics of interest by assuming a quasi-steady state for the electrical subsystem, eliminating the inductive term from the electrical equation. Numerical experiments help to illustrate the typical behavior of the electromechanical system, a boundary layer phenomenon near the initial dynamic state, and the validity limits of the electromechanical quasi-steady-state assumption discussed here.
Submitted 27 May, 2022;
originally announced May 2022.
-
Truck Axle Detection with Convolutional Neural Networks
Authors:
Leandro Arab Marcomini,
André Luiz Cunha
Abstract:
Axle count in trucks is important to the classification of vehicles and to the operation of road systems. It is used in the determination of service fees and in the impact on the pavement. Although axle count can be achieved with traditional methods, such as manual labor, it is increasingly possible to count axles using deep learning and computer vision methods. This paper aims to compare three deep-learning object detection algorithms, YOLO, Faster R-CNN, and SSD, for the detection of truck axles. A dataset was built to provide training and testing examples for the neural networks. The training was done on different base models, to increase training time efficiency and to compare results. We evaluated results based on five metrics: precision, recall, mAP, F1-score, and FPS count. Results indicate that YOLO and SSD have similar accuracy and performance, with more than 96% mAP for both models. Datasets and codes are publicly available for download.
Submitted 3 March, 2023; v1 submitted 4 April, 2022;
originally announced April 2022.
-
Non-intrusive polynomial chaos expansion for topology optimization using polygonal meshes
Authors:
Nilton Cuellar,
Anderson Pereira,
Ivan F. M. Menezes,
Americo Cunha Jr
Abstract:
This paper deals with the applications of stochastic spectral methods for structural topology optimization in the presence of uncertainties. A non-intrusive polynomial chaos expansion is integrated into a topology optimization algorithm to calculate low-order statistical moments of the mechanical-mathematical model response. This procedure, known as robust topology optimization, can optimize the mean of the compliance while simultaneously minimizing its standard deviation. In order to address possible variabilities in the loads applied to the mechanical system of interest, the magnitude and direction of the external forces are assumed to be uncertain. In this probabilistic framework, forces are described as a random field or a set of random variables. Representation of the random objects and propagation of load uncertainties through the model are efficiently done through Karhunen-Loève and polynomial chaos expansions. We take advantage of using polygonal elements, which have been shown to be effective in suppressing checkerboard patterns and reducing mesh dependency in the solution of topology optimization problems. Accuracy and applicability of the proposed methodology are demonstrated by means of several topology optimization examples. The obtained results, which are in excellent agreement with reference solutions computed via the Monte Carlo method, show that load uncertainties play an important role in the optimal design of structural systems, so that they must be taken into account to ensure a reliable optimization process.
Submitted 14 July, 2021;
originally announced August 2021.
-
The nonlinear dynamics of a bistable energy harvesting system with colored noise disturbances
Authors:
Vinicius Gonçalves Lopes,
João Victor L. L. Peterson,
Americo Cunha Jr
Abstract:
This paper deals with the nonlinear stochastic dynamics of a piezoelectric energy harvesting system subjected to a harmonic external excitation disturbed by Gaussian colored noise. A parametric analysis is conducted, where the effects of the standard deviation and the correlation time of colored noise on the system response are investigated. The numerical results suggest a strong influence of noise on the system response for higher values of correlation time and standard deviation, and a low (noise level independent) influence for low values of correlation time.
Submitted 14 July, 2021;
originally announced July 2021.
-
Identification of parameters in the torsional dynamics of a drilling process through Bayesian statistics
Authors:
Mario Germán Sandoval,
Americo Cunha Jr,
Rubens Sampaio
Abstract:
This work presents the estimation of the parameters of an experimental setup, which is modeled as a system with three degrees of freedom, composed of a shaft, two rotors, and a DC motor, that emulates a drilling process. A Bayesian technique is used in the estimation process to take into account the uncertainties and variabilities intrinsic to the measurements taken, which are modeled as Gaussian noise. This procedure is expected to check the reliability of the nominal values of the physical parameters of the test rig. An estimation process assuming that nine parameters of the experimental apparatus are unknown is conducted, and the results show that for some quantities the relative deviation with respect to the nominal values is very high. This deviation reveals a strong deficiency in the mathematical model used to describe the dynamic behavior of the experimental apparatus.
Submitted 26 July, 2021;
originally announced July 2021.
-
Effect of an attached end mass in the dynamics of uncertainty nonlinear continuous random system
Authors:
Americo Cunha Jr,
Rubens Sampaio
Abstract:
This work studies the dynamics of a one-dimensional elastic bar with random elastic modulus and prescribed boundary conditions, say, fixed at one end, and attached to a lumped mass and two springs (one linear and another nonlinear) on the other extreme. The system analysis assumes that the elastic modulus has a gamma probability distribution and uses Monte Carlo simulations to compute the propagation of uncertainty in this continuous-discrete system. After describing the deterministic and the stochastic modeling of the system, some configurations of the model are analyzed in order to characterize the effect of the lumped mass in the overall behavior of this dynamical system.
Submitted 26 July, 2021;
originally announced July 2021.
-
Quantification of parametric uncertainties induced by irregular soil loading in orchard tower sprayer nonlinear dynamics
Authors:
Americo Cunha Jr,
Jorge Luis Palacios Felix,
José Manoel Balthazar
Abstract:
This paper deals with the nonlinear stochastic dynamics of an orchard tower sprayer subjected to random excitations due to soil irregularities. A consistent stochastic model of uncertainties is constructed to describe random loadings and to predict variabilities in mechanical system response. The dynamics is addressed in time and frequency domains. The Monte Carlo method is employed to compute the propagation of uncertainties through the stochastic model. Numerical simulations reveal a very rich dynamics, which is able to produce chaos. This numerical study also indicates that lateral vibrations follow a direct energy cascade law. A probabilistic analysis reveals the possibility of large lateral vibrations during the equipment operation.
Submitted 14 July, 2021;
originally announced July 2021.
-
Global sensitivity analysis of asymmetric energy harvesters
Authors:
João Pedro Norenberg,
Americo Cunha Jr,
Samuel da Silva,
Paulo Sérgio Varoto
Abstract:
Parametric variability is inevitable in actual energy harvesters. It can significantly affect crucial aspects of the system performance, especially in harvesting systems that present geometric parameters, material properties, or excitation conditions that are susceptible to small perturbations. This work aims to identify the most critical parameters in the dynamic behavior of asymmetric bistable energy harvesters with nonlinear piezoelectric coupling, considering the variability of their physical and excitation properties. For this purpose, a global sensitivity analysis based on orthogonal variance decomposition, employing Sobol indices, is performed to quantify the effect of the harvester parameters on the variance of the recovered power. This technique quantifies the variance concerning each parameter individually and collectively regarding the total variation of the model. The results indicate that the frequency and amplitude of excitation, asymmetric terms and electrical properties of the piezoelectric coupling are the most critical parameters that affect the mean power harvested. It is also shown that the order of importance of the parameters can change according to the stability of the harvester's dynamic response. In this way, a better understanding of the system under analysis is obtained since the study allows the identification of vital parameters that rule the change of dynamic behavior and therefore constitutes a powerful tool in the robust design, optimization, and response prediction of nonlinear harvesters.
Submitted 25 May, 2022; v1 submitted 9 July, 2021;
originally announced July 2021.
-
Assessment of a transient homogeneous reactor through in situ adaptive tabulation
Authors:
Americo Cunha Jr,
Luis Fernando Figueira da Silva
Abstract:
The development of computational models for the numerical simulation of chemically reacting flows operating in the turbulent regime requires the solution of partial differential equations that represent the balance of mass, linear momentum, chemical species, and energy. The chemical reactions of the model may involve detailed reaction mechanisms for the description of the physicochemical phenomena. One of the biggest challenges is the stiffness of the numerical simulation of these models and the nonlinear nature of the species reaction rates. This work presents a study of the in situ adaptive tabulation (ISAT) technique, focusing on the accuracy, efficiency, and memory usage in the simulation of homogeneous stirred reactor models using simple and complex reaction mechanisms. The combustion of carbon monoxide with oxygen and methane with air mixtures are considered, using detailed reaction mechanisms with 4 and 53 species, 3 and 325 reactions, respectively. The results of these simulations indicate that the developed implementation of the ISAT technique has an absolute global error smaller than 1%. Moreover, the ISAT technique provides gains, in terms of computational time, of up to 80% when compared with the direct integration of the full chemical kinetics. However, in terms of memory usage the present implementation of the ISAT technique is found to be excessively demanding.
Submitted 27 May, 2021;
originally announced June 2021.
-
Enhancing the performance of a bistable energy harvesting device via the cross-entropy method
Authors:
Americo Cunha Jr
Abstract:
This work deals with the solution of a non-convex optimization problem to enhance the performance of an energy harvesting device, which involves a nonlinear objective function and a discontinuous constraint. This optimization problem, which seeks to find a suitable configuration of parameters that maximizes the electrical power recovered by a bistable energy harvesting system, is formulated in terms of the dynamical system response and a binary classifier obtained from the 0-1 test for chaos. A stochastic solution strategy that combines penalization and the cross-entropy method is proposed and numerically tested. Computational experiments are conducted to assess the performance of the proposed optimization approach by comparison with a reference solution, obtained via an exhaustive search in a refined numerical mesh. The obtained results illustrate the effectiveness and robustness of the cross-entropy optimization strategy (even in the presence of noise or in moderately higher dimensions), showing that the proposed framework may be a very useful and powerful tool to solve optimization problems involving nonlinear energy harvesting dynamical systems.
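As a rough illustration of the strategy named in this abstract (penalization combined with the cross-entropy method), the sketch below maximizes a toy objective under a discontinuous feasibility test. It is not the authors' implementation: the function `cross_entropy_maximize`, the Gaussian sampling family, the penalty value, and the toy objective and constraint are all assumptions made for this example, and the feasibility test merely stands in for a binary classifier such as the one the abstract derives from the 0-1 test for chaos.

```python
import random
import statistics


def cross_entropy_maximize(objective, feasible, mean, std,
                           n_samples=200, n_elite=20, n_iters=40,
                           penalty=1e6, seed=0):
    """Maximize `objective` with a Gaussian cross-entropy method.

    Infeasible samples are not rejected; a large constant penalty is
    subtracted from their score so they rank below feasible samples.
    All names and default values here are illustrative assumptions.
    """
    rng = random.Random(seed)
    mean, std = list(mean), list(std)
    dim = len(mean)

    def penalized(x):
        return objective(x) - (0.0 if feasible(x) else penalty)

    for _ in range(n_iters):
        # 1. Sample candidates from the current independent Gaussians.
        samples = [[rng.gauss(mean[d], std[d]) for d in range(dim)]
                   for _ in range(n_samples)]
        # 2. Keep the best-scoring candidates (the elite set).
        elite = sorted(samples, key=penalized, reverse=True)[:n_elite]
        # 3. Refit the sampling distribution to the elite set.
        for d in range(dim):
            column = [x[d] for x in elite]
            mean[d] = statistics.fmean(column)
            std[d] = statistics.pstdev(column) + 1e-12  # avoid zero spread
    return mean


# Toy problem (assumed): a smooth objective with its optimum at (1, -2),
# restricted to the half-plane x >= 0 by a discontinuous constraint.
def toy_objective(x):
    return -((x[0] - 1.0) ** 2) - (x[1] + 2.0) ** 2


def toy_feasible(x):
    return x[0] >= 0.0


best = cross_entropy_maximize(toy_objective, toy_feasible,
                              mean=[0.0, 0.0], std=[5.0, 5.0])
print(best)
```

The penalty keeps the objective evaluable everywhere, which is convenient when feasibility is only known through a yes/no test; the sample size, elite fraction, and iteration count would need tuning for a real harvester model.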
Submitted 27 May, 2021;
originally announced May 2021.
-
Computational modeling of the nonlinear stochastic dynamics of horizontal drillstrings
Authors:
Americo Cunha Jr,
Christian Soize,
Rubens Sampaio
Abstract:
This work intends to analyze the nonlinear stochastic dynamics of drillstrings in horizontal configuration. For this purpose, it considers a beam theory, with effects of rotatory inertia and shear deformation, which is capable of reproducing the large displacements that the beam undergoes. The friction and shock effects, due to beam/borehole wall transversal impacts, as well as the force and torque induced by bit-rock interaction, are also considered in the model. Uncertainties in the bit-rock interaction model are taken into account using a parametric probabilistic approach. Numerical simulations have shown that the mechanical system of interest has very rich nonlinear stochastic dynamics, which generate phenomena such as bit-bounce, stick-slip, and transverse impacts. A study aiming to maximize the drilling process efficiency by varying the drillstring velocities of translation and rotation is presented. Also, the work presents the definition and solution of two optimization problems, one deterministic and one robust, where the objective is to maximize the drillstring rate of penetration into the soil while respecting its structural limits.
Submitted 27 May, 2021;
originally announced May 2021.
-
On the nonlinear stochastic dynamics of a continuous system with discrete attached elements
Authors:
Americo Cunha Jr,
Rubens Sampaio
Abstract:
This paper presents a theoretical study on the influence of a discrete element in the nonlinear dynamics of a continuous mechanical system subject to randomness in the model parameters. This system is composed of an elastic bar, attached to springs and a lumped mass, with a random elastic modulus and subjected to a Gaussian white-noise distributed external force. One can note that the dynamic behavior of the bar is significantly altered when the lumped mass is varied, becoming, on the right extreme and for large values of the concentrated mass, similar to a mass-spring system. It is also observed that the system response is more influenced by the randomness for small values of the lumped mass. The study also shows an irregular distribution of energy through the spectrum of frequencies, as well as asymmetries and multimodal behavior in the probability distributions of the lumped mass velocity.
Submitted 20 May, 2021;
originally announced May 2021.
-
Uncertainty quantification through Monte Carlo method in a cloud computing setting
Authors:
A. Cunha Jr,
R. Nasser,
R. Sampaio,
H. Lopes,
K. Breitman
Abstract:
The Monte Carlo (MC) method is the most common technique used for uncertainty quantification, due to its simplicity and good statistical results. However, its computational cost is extremely high, and, in many cases, prohibitive. Fortunately, the MC algorithm is easily parallelizable, which allows its use in simulations where the computation of a single realization is very costly. This work presents a methodology for the parallelization of the MC method, in the context of cloud computing. This strategy is based on the MapReduce paradigm, and allows an efficient distribution of tasks in the cloud. This methodology is illustrated on a problem of structural dynamics that is subject to uncertainties. The results show that the technique is capable of producing good results concerning statistical moments of low order. It is shown that even a simple problem may require many realizations for convergence of histograms, which makes the cloud computing strategy very attractive (due to its high scalability capacity and low-cost). Additionally, the results regarding the time of processing and storage space usage allow one to qualify this new methodology as a solution for simulations that require a number of MC realizations beyond the standard.
Submitted 20 May, 2021;
originally announced May 2021.
-
The High-Assurance ROS Framework
Authors:
André Santos,
Alcino Cunha,
Nuno Macedo
Abstract:
This tool paper presents the High-Assurance ROS (HAROS) framework. HAROS is a framework for the analysis and quality improvement of robotics software developed using the popular Robot Operating System (ROS). It builds on a static analysis foundation to automatically extract models from the source code. Such models are later used to enable other sorts of analyses, such as Model Checking, Runtime Verification, and Property-based Testing. It has been applied to multiple real-world examples, helping developers find and correct various issues.
Submitted 2 March, 2021;
originally announced March 2021.
-
alurity, a toolbox for robot cybersecurity
Authors:
Víctor Mayoral-Vilches,
Irati Abad-Fernández,
Martin Pinzger,
Stefan Rass,
Bernhard Dieber,
Alcino Cunha,
Francisco J. Rodríguez-Lera,
Giovanni Lacava,
Angelica Marotta,
Fabio Martinelli,
Endika Gil-Uriarte
Abstract:
The reuse of technologies and inherent complexity of most robotic systems are increasingly leading to robots with wide attack surfaces and a variety of potential vulnerabilities. Given their growing presence in public environments, security research is increasingly becoming more important than in any other area, especially due to the safety implications that robot vulnerabilities could have for humans. We argue that security triage in robotics is still immature and that new tools must be developed to accelerate the testing-triage-exploitation cycle, necessary for prioritizing and accelerating the mitigation of flaws.
The present work tackles the current lack of offensive cybersecurity research in robotics by presenting a toolbox and the results obtained with it through several use cases conducted over a one-year period. We propose a modular and composable toolbox for robot cybersecurity: alurity. By ensuring that both roboticists and security researchers working on a project have a common, consistent and easily reproducible development environment, alurity aims to facilitate cybersecurity research and collaboration across teams.
Submitted 16 October, 2020; v1 submitted 15 October, 2020;
originally announced October 2020.
-
Hybrid Session-based News Recommendation using Recurrent Neural Networks
Authors:
Gabriel de Souza P. Moreira,
Dietmar Jannach,
Adilson Marques da Cunha
Abstract:
We describe a hybrid meta-architecture -- the CHAMELEON -- for session-based news recommendation that is able to leverage a variety of information types using Recurrent Neural Networks. We evaluated our approach on two public datasets, using a temporal evaluation protocol that simulates the dynamics of a news portal in a realistic way. Our results confirm the benefits of modeling the sequence of session clicks with RNNs and leveraging side information about users and articles, resulting in significantly higher recommendation accuracy and catalog coverage than other session-based algorithms.
Submitted 22 June, 2020;
originally announced June 2020.
-
Improved Formulations and Branch-and-cut Algorithms for the Angular Constrained Minimum Spanning Tree Problem
Authors:
Alexandre Salles da Cunha
Abstract:
The Angular Constrained Minimum Spanning Tree Problem ($α$-MSTP) is defined in terms of a complete undirected graph $G=(V,E)$ and an angle $α \in (0,2π]$. Vertices of $G$ define points in the Euclidean plane while edges, the line segments connecting them, are weighted by the Euclidean distance between their endpoints. A spanning tree is an $α$-spanning tree ($α$-ST) of $G$ if, for any $i \in V$, the smallest angle that encloses all line segments corresponding to its $i$-incident edges does not exceed $α$. $α$-MSTP consists in finding an $α$-ST with the least weight. We introduce two $α$-MSTP integer programming formulations, $\mathcal{F}_{xy}^*$ and $\mathcal{F}_x^{++}$, and their accompanying Branch-and-cut (BC) algorithms, BCFXY$^*$ and BCFX$^{++}$. Both formulations can be seen as improvements over formulations coming from the literature. The strongest of them, $\mathcal{F}_x^{++}$, was obtained by: (i) lifting an existing set of inequalities in charge of enforcing $α$ angular constraints and (ii) characterizing $α$-MSTP valid inequalities from the Stable Set polytope, a structure behind $α$-STs, that we disclosed here. These formulations and their predecessors in the literature were compared from a polyhedral perspective. From a numerical standpoint, we observed that BCFXY$^*$ and BCFX$^{++}$ compare favorably to their competitors in the literature. In fact, thanks to the quality of the bounds provided by $\mathcal{F}_x^{++}$, BCFX$^{++}$ seems to outperform the other existing $α$-MSTP algorithms. It is able to solve more instances to proven optimality and to provide sharper lower bounds, when optimality is not attested within an imposed time limit. As a by-product, BCFX$^{++}$ provided 8 new optimality certificates for instances coming from the literature.
Submitted 25 May, 2020;
originally announced May 2020.
-
Embedded model discrepancy: A case study of Zika modeling
Authors:
Rebecca E. Morrison,
Americo Cunha Jr
Abstract:
Mathematical models of epidemiological systems enable investigation of and predictions about potential disease outbreaks. However, commonly used models are often highly simplified representations of incredibly complex systems. Because of these simplifications, the model output, such as new cases of a disease over time or when an epidemic will occur, may be inconsistent with available data. In this case, we must improve the model, especially if we plan to make decisions based on it that could affect human health and safety, but direct improvements are often beyond our reach. In this work, we explore this problem through a case study of the Zika outbreak in Brazil in 2016. We propose an embedded discrepancy operator---a modification to the model equations that requires modest information about the system and is calibrated by all relevant data. We show that the new enriched model demonstrates greatly increased consistency with real data. Moreover, the method is general enough to easily apply to many other mathematical models in epidemiology.
Submitted 13 April, 2020;
originally announced April 2020.
-
Deep Metric Structured Learning For Facial Expression Recognition
Authors:
Pedro D. Marrero Fernandez,
Tsang Ing Ren,
Tsang Ing Jyh,
Fidel A. Guerrero Peña,
Alexandre Cunha
Abstract:
We propose a deep metric learning model to create embedded sub-spaces with a well-defined structure. A new loss function that imposes Gaussian structures on the output space is introduced to create these sub-spaces, thus shaping the distribution of the data. Having a mixture-of-Gaussians solution space is advantageous given its simplified and well-established structure. It allows fast discovery of classes within classes and the identification of mean representatives at the centroids of individual classes. We also propose a new semi-supervised method to create sub-classes. We illustrate our methods on the facial expression recognition problem and validate results on the FER+, AffectNet, Extended Cohn-Kanade (CK+), BU-3DFE, and JAFFE datasets. We experimentally demonstrate that the learned embedding can be successfully used for various applications including expression retrieval and emotion recognition.
Submitted 5 January, 2022; v1 submitted 18 January, 2020;
originally announced January 2020.
-
Simulation under Arbitrary Temporal Logic Constraints
Authors:
Julien Brunel,
David Chemouil,
Alcino Cunha,
Nuno Macedo
Abstract:
Most model checkers provide a useful simulation mode that allows users to explore the set of possible behaviours by interactively picking at each state which event to execute next. Traditionally this simulation mode cannot take into consideration additional temporal logic constraints, such as arbitrary fairness restrictions, substantially reducing its usability for debugging the modelled system behaviour. Similarly, when a specification is false, even if all its counter-examples combined also form a set of behaviours, most model checkers only present one of them to the user, providing little or no mechanism to explore alternatives. In this paper, we present a simple on-the-fly verification technique to allow the user to explore the behaviours that satisfy an arbitrary temporal logic specification, with an interactive process akin to simulation. This technique enables a unified interface for simulating the modelled system and exploring its counter-examples. The technique is formalised in the framework of state/event linear temporal logic and a proof of concept was implemented in an event-based variant of the Electrum framework.
Submitted 23 December, 2019;
originally announced December 2019.
-
LNDb: A Lung Nodule Database on Computed Tomography
Authors:
João Pedrosa,
Guilherme Aresta,
Carlos Ferreira,
Márcio Rodrigues,
Patrícia Leitão,
André Silva Carvalho,
João Rebelo,
Eduardo Negrão,
Isabel Ramos,
António Cunha,
Aurélio Campilho
Abstract:
Lung cancer is the deadliest type of cancer worldwide and late detection is the major factor for the low survival rate of patients. Low dose computed tomography has been suggested as a potential screening tool but manual screening is costly, time-consuming and prone to variability. This has fueled the development of automatic methods for the detection, segmentation and characterization of pulmonary nodules but their application to clinical routine is challenging. In this study, a new database for the development and testing of pulmonary nodule computer-aided strategies is presented which intends to complement current databases by giving additional focus to radiologist variability and local clinical reality. State-of-the-art nodule detection, segmentation and characterization methods are tested and compared to manual annotations as well as collaborative strategies combining multiple radiologists and radiologists and computer-aided systems. It is shown that state-of-the-art methodologies can determine a patient's follow-up recommendation as accurately as a radiologist, though the nodule detection method used shows decreased performance in this database.
Submitted 19 December, 2019; v1 submitted 19 November, 2019;
originally announced November 2019.
-
Bottom-Up Meta-Policy Search
Authors:
Luckeciano C. Melo,
Marcos R. O. A. Maximo,
Adilson Marques da Cunha
Abstract:
Despite recent progress in agents that learn through interaction, there are several challenges in terms of sample efficiency and generalization across unseen behaviors during training. To mitigate these problems, we propose and apply a first-order Meta-Learning algorithm called Bottom-Up Meta-Policy Search (BUMPS), which works with a two-phase optimization procedure: firstly, in a meta-training phase, it distills a few expert policies to create a meta-policy capable of generalizing knowledge to unseen tasks during training; secondly, it applies a fast adaptation strategy named Policy Filtering, which evaluates a few policies sampled from the meta-policy distribution and selects the one that best solves the task. We conducted all experiments in the RoboCup 3D Soccer Simulation domain, in the context of kick motion learning. We show that, given our experimental setup, BUMPS works in scenarios where simple multi-task Reinforcement Learning does not. Finally, we performed experiments to evaluate each component of the algorithm.
Submitted 9 December, 2019; v1 submitted 22 October, 2019;
originally announced October 2019.
-
J Regularization Improves Imbalanced Multiclass Segmentation
Authors:
Fidel A. Guerrero Peña,
Pedro D. Marrero Fernandez,
Paul T. Tarr,
Tsang Ing Ren,
Elliot M. Meyerowitz,
Alexandre Cunha
Abstract:
We propose a new loss formulation to further advance the multiclass segmentation of cluttered cells under weakly supervised conditions.
We improve the separation of touching and immediate cells, obtaining sharp segmentation boundaries with high adequacy, when we add a Youden's $J$ statistic regularization term to the cross entropy loss. This regularization intrinsically supports class imbalance, thus eliminating the need to explicitly use weights to balance training. Simulations demonstrate this capability and show how the regularization leads to better results by helping advance the optimization when cross entropy stalls.
We build upon our previous work on multiclass segmentation by adding yet another training class representing gaps between adjacent cells.
This addition helps the classifier identify narrow gaps as background and no longer as touching regions.
We present results of our methods for 2D and 3D images, from bright field to confocal stacks containing different types of cells, and we show that they accurately segment individual cells after training with a limited number of annotated images, some of which are poorly annotated.
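For readers unfamiliar with the statistic mentioned in this abstract, the sketch below computes the classical binary Youden's $J$, defined as sensitivity + specificity - 1, and contrasts it with plain accuracy on an imbalanced toy example. This is only the textbook statistic; the paper's regularization term for multiclass training is not reproduced here, and the helper names `youden_j` and `accuracy` and the toy labels are assumptions made for illustration.

```python
def youden_j(y_true, y_pred, positive=1):
    """Return Youden's J = sensitivity + specificity - 1 for binary labels."""
    tp = fn = tn = fp = 0
    for truth, guess in zip(y_true, y_pred):
        if truth == positive:
            if guess == positive:
                tp += 1
            else:
                fn += 1
        else:
            if guess == positive:
                fp += 1
            else:
                tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must appear in y_true")
    sensitivity = tp / (tp + fn)  # true positive rate
    specificity = tn / (tn + fp)  # true negative rate
    return sensitivity + specificity - 1.0


def accuracy(y_true, y_pred):
    """Fraction of labels predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Imbalanced toy labels (assumed): 2 positives, 8 negatives.
labels = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
all_negative = [0] * 10  # a degenerate classifier that ignores the rare class

print(accuracy(labels, all_negative))  # high despite missing every positive
print(youden_j(labels, all_negative))  # no better than chance
print(youden_j(labels, labels))        # perfect prediction
```

Because each class rate is normalized by its own class size, the statistic does not reward a classifier for simply predicting the majority class, which is the property the abstract relies on when it says the regularization supports class imbalance.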
Submitted 22 October, 2019;
originally announced October 2019.
-
Did you miss it? Automatic lung nodule detection combined with gaze information improves radiologists' screening performance
Authors:
Guilherme Aresta,
Carlos Ferreira,
João Pedrosa,
Teresa Araújo,
João Rebelo,
Eduardo Negrão,
Margarida Morgado,
Filipe Alves,
António Cunha,
Isabel Ramos,
Aurélio Campilho
Abstract:
Early diagnosis of lung cancer via computed tomography can significantly reduce the morbidity and mortality rates associated with the pathology. However, searching for lung nodules is a highly complex task, which affects the success of screening programs. Whilst computer-aided detection systems can be used as second observers, they may bias radiologists and introduce significant time overheads. With this in mind, this study assesses the potential of using gaze information for integrating automatic detection systems in clinical practice. For that purpose, 4 radiologists were asked to annotate 20 scans from a public dataset while being monitored by an eye tracker device, and an automatic lung nodule detection system was developed. Our results show that radiologists follow a similar search routine and tend to have lower fixation periods in regions where finding errors occur. The overall detection sensitivity of the specialists was 0.67$\pm$0.07, whereas the system achieved 0.69. Combining the annotations of one radiologist with the automatic system significantly improves the detection performance to levels similar to those of two annotators. Likewise, combining the findings of a radiologist with the detection algorithm only for low fixation regions still significantly improves the detection sensitivity without increasing the number of false positives. The combination of the automatic system with the gaze information makes it possible to mitigate possible errors of the radiologist without some of the issues usually associated with automatic detection systems.
Submitted 9 October, 2019;
originally announced October 2019.
-
A Multiple Source Hourglass Deep Network for Multi-Focus Image Fusion
Authors:
Fidel Alejandro Guerrero Peña,
Pedro Diamel Marrero Fernández,
Tsang Ing Ren,
Germano Crispim Vasconcelos,
Alexandre Cunha
Abstract:
Multi-Focus Image Fusion seeks to improve the quality of an acquired burst of images with different focus planes. For solving the task, an activity level measurement and a fusion rule are typically established to select and fuse the most relevant information from the sources. However, designing this kind of method by hand is hard and sometimes restricted to solution spaces that do not contain the optimal all-in-focus images. We therefore propose two fast and straightforward approaches for image fusion based on deep neural networks. Our solution uses a multiple source Hourglass architecture trained in an end-to-end fashion. Models are data-driven and can be easily generalized for other kinds of fusion problems. A segmentation approach is used for recognition of the focus map, while the weighted average rule is used for fusion. We designed a training loss function for our regression-based fusion function, which allows the network to learn both the activity level measurement and the fusion rule. Experimental results show our approach has comparable results to the state-of-the-art methods with a 60X increase in computational efficiency for 520X520 resolution images.
Submitted 28 August, 2019;
originally announced August 2019.
-
A Weakly Supervised Method for Instance Segmentation of Biological Cells
Authors:
Fidel A. Guerrero-Peña,
Pedro D. Marrero Fernandez,
Tsang Ing Ren,
Alexandre Cunha
Abstract:
We present a weakly supervised deep learning method to perform instance segmentation of cells present in microscopy images. Annotation of biomedical images in the lab can be scarce, incomplete, and inaccurate. This is of concern when supervised learning is used for image analysis as the discriminative power of a learning model might be compromised in these situations. To overcome the curse of poor labeling, our method focuses on three aspects to improve learning: i) we propose a loss function operating in three classes to facilitate separating adjacent cells and to drive the optimizer to properly classify underrepresented regions; ii) a contour-aware weight map model is introduced to strengthen contour detection while improving the network generalization capacity; and iii) we augment data by carefully modulating local intensities on edges shared by adjoining regions to account for possibly weak signals on these edges. Generated probability maps are segmented using different methods, with the watershed-based one generally offering the best solutions, especially in those regions where the prevalence of a single class is not clear. The combination of these contributions allows segmenting individual cells on challenging images. We demonstrate our methods in sparse and crowded cell images, showing improvements in the learning process for a fixed network architecture.
Submitted 26 August, 2019;
originally announced August 2019.
-
On the Importance of News Content Representation in Hybrid Neural Session-based Recommender Systems
Authors:
Gabriel de Souza P. Moreira,
Dietmar Jannach,
Adilson Marques da Cunha
Abstract:
News recommender systems are designed to surface relevant information for online readers by personalizing their user experiences. A particular problem in that context is that online readers are often anonymous, which means that this personalization can only be based on the last few recorded interactions with the user, a setting named session-based recommendation. Another particularity of the news domain is that fresh articles are constantly published, which should be immediately considered for recommendation. To deal with this item cold-start problem, it is important to consider the actual content of items when recommending. Hybrid approaches are therefore often considered as the method of choice in such settings. In this work, we analyze the importance of considering content information in a hybrid neural news recommender system. We contrast content-aware and content-agnostic techniques and also explore the effects of using different content encodings. Experiments on two public datasets confirm the importance of adopting a hybrid approach. Furthermore, we show that the choice of the content encoding can have an impact on the resulting performance.
Submitted 6 September, 2019; v1 submitted 12 July, 2019;
originally announced July 2019.
-
Sharing and Learning Alloy on the Web
Authors:
Nuno Macedo,
Alcino Cunha,
José Pereira,
Renato Carvalho,
Ricardo Silva,
Ana C. R. Paiva,
Miguel S. Ramalho,
Daniel Silva
Abstract:
We present Alloy4Fun, a web application that enables online editing and sharing of Alloy models and instances, to be used mainly in an educational context. By introducing the notion of secret paragraphs and commands in the models, it also allows the distribution and automatic evaluation of simple specification challenges, a useful mechanism that enables students to learn relational logic at their own pace. Alloy4Fun stores all versions of shared and analyzed models, as well as derivation trees that depict how those models evolved over time: this wealth of information can be mined by researchers or tutors to identify, for example, learning breakdowns in the class or typical mistakes made by students and other Alloy users. A beta version of Alloy4Fun was already used in two formal methods courses, and we present some results of this preliminary evaluation.
Submitted 4 July, 2019;
originally announced July 2019.
-
Contextual Hybrid Session-based News Recommendation with Recurrent Neural Networks
Authors:
Gabriel de Souza Pereira Moreira,
Dietmar Jannach,
Adilson Marques da Cunha
Abstract:
Recommender systems help users deal with information overload by providing tailored item suggestions to them. The recommendation of news is often considered to be challenging, since the relevance of an article for a user can depend on a variety of factors, including the user's short-term reading interests, the reader's context, or the recency or popularity of an article. Previous work has shown that the use of Recurrent Neural Networks is promising for the next-in-session prediction task, but has certain limitations when only recorded item click sequences are used as input. In this work, we present a contextual hybrid, deep learning based approach for session-based news recommendation that is able to leverage a variety of information types. We evaluated our approach on two public datasets, using a temporal evaluation protocol that simulates the dynamics of a news portal in a realistic way. Our results confirm the benefits of considering additional types of information, including article popularity and recency, in the proposed way, resulting in significantly higher recommendation accuracy and catalog coverage than other session-based algorithms. Additional experiments show that the proposed parameterizable loss function used in our method also allows us to balance two usually conflicting quality factors, accuracy and novelty.
Keywords: Artificial Neural Networks, Context-Aware Recommender Systems, Hybrid Recommender Systems, News Recommender Systems, Session-based Recommendation
Submitted 8 December, 2019; v1 submitted 15 April, 2019;
originally announced April 2019.
-
Classification of Motorcycles using Extracted Images of Traffic Monitoring Videos
Authors:
Adriano Belletti Felicio,
André Luiz Cunha
Abstract:
With the rapid growth of motorcycles in the urban fleet, and the growing study of their behavior and of how they affect traffic flow, tools and techniques different from the conventional ones are needed to identify their presence in the traffic flow and to extract information about them. This article contributes to the study of this type of vehicle by generating a motorcycle image bank and by developing and calibrating a motorcycle classifier that combines the LBP technique, to create the feature vectors, with the LinearSVC classification technique, to perform the predictions. The classifier developed in this research can assign images of vehicles extracted from monitoring videos to one of two classes, motorcycles and non-motorcycles, with precision and accuracy above 0.9.
Submitted 20 June, 2019; v1 submitted 30 March, 2019;
originally announced April 2019.
-
FERAtt: Facial Expression Recognition with Attention Net
Authors:
Pedro D. Marrero Fernandez,
Fidel A. Guerrero Peña,
Tsang Ing Ren,
Alexandre Cunha
Abstract:
We present a new end-to-end network architecture for facial expression recognition with an attention model. It focuses attention on the human face and uses a Gaussian space representation for expression recognition. We devise this architecture based on two fundamental complementary components: (1) facial image correction and attention and (2) facial expression representation and classification. The first component uses an encoder-decoder style network and a convolutional feature extractor that are pixel-wise multiplied to obtain a feature attention map. The second component is responsible for obtaining an embedded representation and classification of the facial expression. We propose a loss function that creates a Gaussian structure on the representation space. To demonstrate the proposed method, we create two larger and more comprehensive synthetic datasets using the traditional BU3DFE and CK+ facial datasets. We compared results with the PreActResNet18 baseline. Our experiments on these datasets have shown the superiority of our approach in recognizing facial expressions.
Submitted 8 February, 2019;
originally announced February 2019.
-
Learning Humanoid Robot Motions Through Deep Neural Networks
Authors:
Luckeciano Carvalho Melo,
Marcos Ricardo Omena Albuquerque Maximo,
Adilson Marques da Cunha
Abstract:
Controlling a humanoid robot with many degrees of freedom is acknowledged as one of the hardest problems in Robotics. Due to the lack of mathematical models, an approach frequently employed is to rely on human intuition to design keyframe movements by hand, usually aided by graphical tools. In this paper, we propose a learning framework based on neural networks in order to mimic humanoid robot movements. The developed technique does not make any assumption about the underlying implementation of the movement, therefore both keyframe and model-based motions may be learned. The framework was applied in the RoboCup 3D Soccer Simulation domain and promising results were obtained using the same network architecture for several motions, even when copying motions from other teams.
Submitted 2 January, 2019;
originally announced January 2019.
-
iW-Net: an automatic and minimalistic interactive lung nodule segmentation deep network
Authors:
Guilherme Aresta,
Colin Jacobs,
Teresa Araújo,
António Cunha,
Isabel Ramos,
Bram van Ginneken,
Aurélio Campilho
Abstract:
We propose iW-Net, a deep learning model that allows for both automatic and interactive segmentation of lung nodules in computed tomography images. iW-Net is composed of two blocks: the first one provides an automatic segmentation and the second one allows correcting it by analyzing 2 points introduced by the user on the nodule's boundary. For this purpose, a physics-inspired weight map that takes the user input into account is proposed, which is used both as a feature map and in the system's loss function. Our approach is extensively evaluated on the public LIDC-IDRI dataset, where we achieve a state-of-the-art performance of 0.55 intersection over union vs the 0.59 inter-observer agreement. Also, we show that iW-Net allows correcting the segmentation of small nodules, essential for proper patient referral decisions, as well as improving the segmentation of the challenging non-solid nodules, and thus may be an important tool for increasing the early diagnosis of lung cancer.
Submitted 30 November, 2018;
originally announced November 2018.
-
Improving the Visualization of Alloy Instances
Authors:
Rui Couto,
José C. Campos,
Nuno Macedo,
Alcino Cunha
Abstract:
Alloy is a lightweight formal specification language, supported by an IDE, which has proven well-suited for reasoning about software design in early development stages. The IDE provides a visualizer that produces graphical representations of analysis results, which is essential for the proper validation of the model. Alloy is a rich language but inherently static, so behavior needs to be explicitly encoded and reasoned about. Even though this is a common scenario, the visualizer presents limitations when dealing with such models. The main contribution of this paper is a principled approach to generate instance visualizations, which improves the current Alloy Visualizer, focusing on the representation of behavior.
Submitted 27 November, 2018;
originally announced November 2018.
-
Geometric Median Shapes
Authors:
Alexandre Cunha
Abstract:
We present an algorithm to compute the geometric median of shapes which is based on the extension of the median to high dimensions. The median finding problem is formulated as an optimization over distances and it is solved directly using the watershed method as an optimizer. We show that computing the geometric median of shapes is robust in the presence of outliers and it is superior to the mean shape, which can easily be affected by the presence of outliers. The geometric median shape thus faithfully represents the true central tendency of the data, contaminated or not. Our approach can be applied to manifold and non-manifold shapes, with connected or disconnected shapes. The application of distance transforms and the watershed algorithm, two well-established constructs of image processing, leads to an algorithm that can be quickly implemented to generate fast solutions with linear storage requirements. We demonstrate our methods in synthetic and natural shapes and compare median and mean results under increasing contamination by strong outliers.
Submitted 15 April, 2019; v1 submitted 29 October, 2018;
originally announced October 2018.
-
CNN-based Preprocessing to Optimize Watershed-based Cell Segmentation in 3D Confocal Microscopy Images
Authors:
Dennis Eschweiler,
Thiago V. Spina,
Rohan C. Choudhury,
Elliot Meyerowitz,
Alexandre Cunha,
Johannes Stegmaier
Abstract:
The quantitative analysis of cellular membranes helps in understanding developmental processes at the cellular level. In particular, 3D microscopic image data offers valuable insights into cell dynamics, but error-free automatic segmentation remains challenging due to the huge amount of data generated and strong variations in image intensities. In this paper, we propose a new 3D segmentation approach which uses the discriminative power of convolutional neural networks (CNNs) for preprocessing, and we investigate the performance of three watershed-based (WS) postprocessing strategies, which are well suited to segment object shapes, even when supplied with vague seed and boundary constraints. To leverage the full potential of the watershed algorithm, the multi-instance segmentation problem is initially interpreted as a three-class semantic segmentation problem, which in turn is well suited for the application of CNNs. Using manually annotated 3D confocal microscopy images of Arabidopsis thaliana, we show the superior performance of the proposed method compared to the state of the art.
Submitted 16 October, 2018;
originally announced October 2018.
-
A Comparison between Background Modelling Methods for Vehicle Segmentation in Highway Traffic Videos
Authors:
L. A. Marcomini,
A. L. Cunha
Abstract:
The objective of this paper is to compare the performance of three background-modeling algorithms in segmenting and detecting vehicles in highway traffic videos. All algorithms are available in OpenCV and were all coded in Python. We analyzed seven videos, totaling 2 hours of recording. To compare the algorithms, we created 35 ground-truth images, five from each video, and we used three different metrics: accuracy rate, precision rate, and processing time. By using accuracy and precision, we aim to identify how well the algorithms perform in detection and segmentation, while using the processing time to evaluate the impact on the computational system. Results indicate that all three algorithms had a precision rate above 90%, while obtaining an average accuracy of 80%. The algorithm with the lowest impact on processing time allowed the computation of 60 frames per second.
Submitted 5 October, 2018;
originally announced October 2018.
-
News Session-Based Recommendations using Deep Neural Networks
Authors:
Gabriel de Souza P. Moreira,
Felipe Ferreira,
Adilson Marques da Cunha
Abstract:
News recommender systems aim to personalize users' experiences and help them discover relevant articles from a large and dynamic search space. The news domain is a challenging scenario for recommendations, due to its sparse user profiling, fast-growing number of items, accelerated decay of item value, and dynamic shifts in user preferences. Some promising results have recently been achieved by applying Deep Learning techniques to Recommender Systems, especially for item feature extraction and for session-based recommendations with Recurrent Neural Networks. In this paper, we propose an instantiation of CHAMELEON, a Deep Learning Meta-Architecture for News Recommender Systems. This architecture is composed of two modules: the first is responsible for learning representations of news articles, based on their text and metadata, and the second provides session-based recommendations using Recurrent Neural Networks. The recommendation task addressed in this work is next-item prediction for user sessions: "what is the next most likely article a user might read in a session?" The context of user sessions is leveraged by the architecture to provide additional information in the extreme cold-start scenario of news recommendation. User behavior and item features are merged in a hybrid recommendation approach. A temporal offline evaluation method is also proposed as a complementary contribution, for a more realistic evaluation of this task, considering dynamic factors that affect global readership interests, such as popularity, recency, and seasonality. Experiments with an extensive number of session-based recommendation methods were performed, and the proposed instantiation of the CHAMELEON meta-architecture obtained a significant relative improvement in top-n accuracy and ranking metrics (10% on Hit Rate and 13% on MRR) over the best benchmark methods.
Submitted 16 September, 2018; v1 submitted 31 July, 2018;
originally announced August 2018.
-
Multiclass Weighted Loss for Instance Segmentation of Cluttered Cells
Authors:
Fidel A. Guerrero-Pena,
Pedro D. Marrero Fernandez,
Tsang Ing Ren,
Mary Yui,
Ellen Rothenberg,
Alexandre Cunha
Abstract:
We propose a new multiclass weighted loss function for instance segmentation of cluttered cells. We are primarily motivated by the need of developmental biologists to quantify and model the behavior of blood T-cells, which might help us in understanding their regulation mechanisms and ultimately help researchers in their quest for developing an effective immunotherapy cancer treatment. Segmenting individual touching cells in cluttered regions is challenging, as the feature distributions on shared borders and on cell foreground are similar, making it difficult to discriminate pixels into proper classes. We present two novel weight maps applied to the weighted cross entropy loss function which take into account both class imbalance and cell geometry. Binary ground truth training data is augmented so the learning model can handle not only foreground and background but also a third touching class. This framework allows training using U-Net. Experiments with our formulations have shown superior results when compared to other similar schemes, outperforming binary class models with significant improvement of boundary adequacy and instance detection. We validate our results on manually annotated microscope images of T-cells.
Submitted 21 February, 2018;
originally announced February 2018.
-
SEGMENT3D: A Web-based Application for Collaborative Segmentation of 3D images used in the Shoot Apical Meristem
Authors:
Thiago V. Spina,
Johannes Stegmaier,
Alexandre X. Falcão,
Elliot Meyerowitz,
Alexandre Cunha
Abstract:
The quantitative analysis of 3D confocal microscopy images of the shoot apical meristem helps in understanding the growth process of some plants. Cell segmentation in these images is crucial for computational plant analysis and many automated methods have been proposed. However, variations in signal intensity across the image reduce the effectiveness of those approaches, with no easy way for user correction. We propose a web-based collaborative 3D image segmentation application, SEGMENT3D, to leverage automatic segmentation results. The image is divided into 3D tiles that can be either segmented interactively from scratch or corrected from a pre-existing segmentation. Individual segmentation results per tile are then automatically merged via consensus analysis and then stitched to complete the segmentation for the entire image stack. SEGMENT3D is a comprehensive application that can be applied to other 3D imaging modalities and general objects. It also provides an easy way to create supervised data to advance segmentation using machine learning models.
Submitted 26 October, 2017;
originally announced October 2017.