-
Singular knee identification to support emergence recognition in physical swarm and cellular automata trajectories
Authors:
Imraan A. Faruque,
Ishriak Ahmed
Abstract:
After decades of attention, emergence continues to lack a centralized mathematical definition that leads to a rigorous emergence test applicable to physical flocks and swarms, particularly those containing both deterministic elements (eg, interactions) and stochastic perturbations like measurement noise. This study develops a heuristic test based on singular value curve analysis of data matrices c…
▽ More
After decades of attention, emergence continues to lack a centralized mathematical definition that leads to a rigorous emergence test applicable to physical flocks and swarms, particularly those containing both deterministic elements (eg, interactions) and stochastic perturbations like measurement noise. This study develops a heuristic test based on singular value curve analysis of data matrices containing deterministic and Gaussian noise signals. The minimum detection criteria are identified, and statistical and matrix space analysis developed to determine upper and lower bounds. This study applies the analysis to representative examples by using recorded trajectories of mixed deterministic and stochastic trajectories for multi-agent, cellular automata, and biological video. Examples include Cucker Smale and Vicsek flocking, Gaussian noise and its integration, recorded observations of bird flocking, and 1D cellular automata. Ensemble simulations including measurement noise are performed to compute statistical variation and discussed relative to random matrix theory noise bounds. The results indicate singular knee analysis of recorded trajectories can detect gradated levels on a continuum of structure and noise. Across the eight singular value decay metrics considered, the angle subtended at the singular value knee emerges with the most potential for supporting cross-embodiment emergence detection, the size of noise bounds is used as an indication of required sample size, and the presence of a large fraction of singular values inside noise bounds as an indication of noise.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
A Unified Deep Transfer Learning Model for Accurate IoT Localization in Diverse Environments
Authors:
Abdullahi Isa Ahmed,
Yaya Etiabi,
Ali Waqar Azim,
El Mehdi Amhoud
Abstract:
Internet of Things (IoT) is an ever-evolving technological paradigm that is reshaping industries and societies globally. Real-time data collection, analysis, and decision-making facilitated by localization solutions form the foundation for location-based services, enabling them to support critical functions within diverse IoT ecosystems. However, most existing works on localization focus on single…
▽ More
Internet of Things (IoT) is an ever-evolving technological paradigm that is reshaping industries and societies globally. Real-time data collection, analysis, and decision-making facilitated by localization solutions form the foundation for location-based services, enabling them to support critical functions within diverse IoT ecosystems. However, most existing works on localization focus on single environment, resulting in the development of multiple models to support multiple environments. In the context of smart cities, these raise costs and complexity due to the dynamicity of such environments. To address these challenges, this paper presents a unified indoor-outdoor localization solution that leverages transfer learning (TL) schemes to build a single deep learning model. The model accurately predicts the localization of IoT devices in diverse environments. The performance evaluation shows that by adopting an encoder-based TL scheme, we can improve the baseline model by about 17.18% in indoor environments and 9.79% in outdoor environments.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Classification of Short Segment Pediatric Heart Sounds Based on a Transformer-Based Convolutional Neural Network
Authors:
Md Hassanuzzaman,
Nurul Akhtar Hasan,
Mohammad Abdullah Al Mamun,
Khawza I Ahmed,
Ahsan H Khandoker,
Raqibul Mostafa
Abstract:
Congenital anomalies arising as a result of a defect in the structure of the heart and great vessels are known as congenital heart diseases or CHDs. A PCG can provide essential details about the mechanical conduction system of the heart and point out specific patterns linked to different kinds of CHD. This study aims to investigate the minimum signal duration required for the automatic classificat…
▽ More
Congenital anomalies arising as a result of a defect in the structure of the heart and great vessels are known as congenital heart diseases or CHDs. A PCG can provide essential details about the mechanical conduction system of the heart and point out specific patterns linked to different kinds of CHD. This study aims to investigate the minimum signal duration required for the automatic classification of heart sounds. This study also investigated the optimum signal quality assessment indicator (Root Mean Square of Successive Differences) RMSSD and (Zero Crossings Rate) ZCR value. Mel-frequency cepstral coefficients (MFCCs) based feature is used as an input to build a Transformer-Based residual one-dimensional convolutional neural network, which is then used for classifying the heart sound. The study showed that 0.4 is the ideal threshold for getting suitable signals for the RMSSD and ZCR indicators. Moreover, a minimum signal length of 5s is required for effective heart sound classification. It also shows that a shorter signal (3 s heart sound) does not have enough information to categorize heart sounds accurately, and the longer signal (15 s heart sound) may contain more noise. The best accuracy, 93.69%, is obtained for the 5s signal to distinguish the heart sound.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.
-
Enhancing UAV Security Through Zero Trust Architecture: An Advanced Deep Learning and Explainable AI Analysis
Authors:
Ekramul Haque,
Kamrul Hasan,
Imtiaz Ahmed,
Md. Sahabul Alam,
Tariqul Islam
Abstract:
In the dynamic and ever-changing domain of Unmanned Aerial Vehicles (UAVs), the utmost importance lies in guaranteeing resilient and lucid security measures. This study highlights the necessity of implementing a Zero Trust Architecture (ZTA) to enhance the security of unmanned aerial vehicles (UAVs), hence departing from conventional perimeter defences that may expose vulnerabilities. The Zero Tru…
▽ More
In the dynamic and ever-changing domain of Unmanned Aerial Vehicles (UAVs), the utmost importance lies in guaranteeing resilient and lucid security measures. This study highlights the necessity of implementing a Zero Trust Architecture (ZTA) to enhance the security of unmanned aerial vehicles (UAVs), hence departing from conventional perimeter defences that may expose vulnerabilities. The Zero Trust Architecture (ZTA) paradigm requires a rigorous and continuous process of authenticating all network entities and communications. The accuracy of our methodology in detecting and identifying unmanned aerial vehicles (UAVs) is 84.59\%. This is achieved by utilizing Radio Frequency (RF) signals within a Deep Learning framework, a unique method. Precise identification is crucial in Zero Trust Architecture (ZTA), as it determines network access. In addition, the use of eXplainable Artificial Intelligence (XAI) tools such as SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) contributes to the improvement of the model's transparency and interpretability. Adherence to Zero Trust Architecture (ZTA) standards guarantees that the classifications of unmanned aerial vehicles (UAVs) are verifiable and comprehensible, enhancing security within the UAV field.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Empowering Healthcare through Privacy-Preserving MRI Analysis
Authors:
Al Amin,
Kamrul Hasan,
Saleh Zein-Sabatto,
Deo Chimba,
Liang Hong,
Imtiaz Ahmed,
Tariqul Islam
Abstract:
In the healthcare domain, Magnetic Resonance Imaging (MRI) assumes a pivotal role, as it employs Artificial Intelligence (AI) and Machine Learning (ML) methodologies to extract invaluable insights from imaging data. Nonetheless, the imperative need for patient privacy poses significant challenges when collecting data from diverse healthcare sources. Consequently, the Deep Learning (DL) communities…
▽ More
In the healthcare domain, Magnetic Resonance Imaging (MRI) assumes a pivotal role, as it employs Artificial Intelligence (AI) and Machine Learning (ML) methodologies to extract invaluable insights from imaging data. Nonetheless, the imperative need for patient privacy poses significant challenges when collecting data from diverse healthcare sources. Consequently, the Deep Learning (DL) communities occasionally face difficulties detecting rare features. In this research endeavor, we introduce the Ensemble-Based Federated Learning (EBFL) Framework, an innovative solution tailored to address this challenge. The EBFL framework deviates from the conventional approach by emphasizing model features over sharing sensitive patient data. This unique methodology fosters a collaborative and privacy-conscious environment for healthcare institutions, empowering them to harness the capabilities of a centralized server for model refinement while upholding the utmost data privacy standards.Conversely, a robust ensemble architecture boasts potent feature extraction capabilities, distinguishing itself from a single DL model. This quality makes it remarkably dependable for MRI analysis. By harnessing our groundbreaking EBFL methodology, we have achieved remarkable precision in the classification of brain tumors, including glioma, meningioma, pituitary, and non-tumor instances, attaining a precision rate of 94% for the Global model and an impressive 96% for the Ensemble model. Our models underwent rigorous evaluation using conventional performance metrics such as Accuracy, Precision, Recall, and F1 Score. Integrating DL within the Federated Learning (FL) framework has yielded a methodology that offers precise and dependable diagnostics for detecting brain tumors.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
Over-the-Air Emulation of Electronically Adjustable Rician MIMO Channels in a Programmable-Metasurface-Stirred Reverberation Chamber
Authors:
Ismail Ahmed,
Matthieu Davy,
Hugo Prod'homme,
Philippe Besnier,
Philipp del Hougne
Abstract:
We experimentally investigate the feasibility of evaluating multiple-input multiple-output (MIMO) radio equipment under adjustable Rician fading channel conditions in a programmable-metasurface-stirred (PM-stirred) reverberation chamber (RC). Whereas within the "smart radio environment" paradigm PMs offer partial control over the channels to the wireless system, in our use case the PM emulates the…
▽ More
We experimentally investigate the feasibility of evaluating multiple-input multiple-output (MIMO) radio equipment under adjustable Rician fading channel conditions in a programmable-metasurface-stirred (PM-stirred) reverberation chamber (RC). Whereas within the "smart radio environment" paradigm PMs offer partial control over the channels to the wireless system, in our use case the PM emulates the uncontrollable fading. We implement a desired Rician K-factor by sweeping a suitably sized subset of all meta-atoms through random configurations. We discover in our setup an upper bound on the accessible K-factors for which the statistics of the channel coefficient distributions closely follow the sought-after Rician distribution. We also discover a lower bound on the accessible K-factors in our setup: there are unstirred paths that never encounter the PM, and paths that encounter the PM are not fully stirred because the average of the meta-atoms' accessible polarizability values is not zero (i.e., the meta-atoms have a non-zero "structural" cross-section). We corroborate these findings with experiments in an anechoic chamber, physics-compliant PhysFad simulations with Lorentzian vs "ideal" meta-atoms, and theoretical analysis. Our work clarifies the scope of applicability of PM-stirred RCs for MIMO Rician channel emulation, as well as electromagnetic compatibility test.
△ Less
Submitted 30 November, 2023;
originally announced December 2023.
-
An ML-assisted OTFS vs. OFDM adaptable modem
Authors:
I. Zakir Ahmed,
Hamid R. Sadjadpour
Abstract:
The Orthogonal-Time-Frequency-Space (OTFS) signaling is known to be resilient to doubly-dispersive channels, which impacts high mobility scenarios. On the other hand, the Orthogonal-Frequency-Division-Multiplexing (OFDM) waveforms enjoy the benefits of the reuse of legacy architectures, simplicity of receiver design, and low-complexity detection. Several studies that compare the performance of OFD…
▽ More
The Orthogonal-Time-Frequency-Space (OTFS) signaling is known to be resilient to doubly-dispersive channels, which impacts high mobility scenarios. On the other hand, the Orthogonal-Frequency-Division-Multiplexing (OFDM) waveforms enjoy the benefits of the reuse of legacy architectures, simplicity of receiver design, and low-complexity detection. Several studies that compare the performance of OFDM and OTFS have indicated mixed outcomes due to the plethora of system parameters at play beyond high-mobility conditions. In this work, we exemplify this observation using simulations and propose a deep neural network (DNN)-based adaptation scheme to switch between using either an OTFS or OFDM signal processing chain at the transmitter and receiver for optimal mean-squared-error (MSE) performance. The DNN classifier is trained to switch between the two schemes by observing the channel condition, received SNR, and modulation format. We compare the performance of the OTFS, OFDM, and the proposed switched-waveform scheme. The simulations indicate superior performance with the proposed scheme with a well-trained DNN, thus improving the MSE performance of the communication significantly.
△ Less
Submitted 19 October, 2023; v1 submitted 3 September, 2023;
originally announced September 2023.
-
A Reinforcement Learning Approach for Robust Supervisory Control of UAVs Under Disturbances
Authors:
Ibrahim Ahmed,
Marcos Quinones-Grueiro,
Gautam Biswas
Abstract:
In this work, we present an approach to supervisory reinforcement learning control for unmanned aerial vehicles (UAVs). UAVs are dynamic systems where control decisions in response to disturbances in the environment have to be made in the order of milliseconds. We formulate a supervisory control architecture that interleaves with extant embedded control and demonstrates robustness to environmental…
▽ More
In this work, we present an approach to supervisory reinforcement learning control for unmanned aerial vehicles (UAVs). UAVs are dynamic systems where control decisions in response to disturbances in the environment have to be made in the order of milliseconds. We formulate a supervisory control architecture that interleaves with extant embedded control and demonstrates robustness to environmental disturbances in the form of adverse wind conditions. We run case studies with a Tarot T-18 Octorotor to demonstrate the effectiveness of our approach and compare it against a classic cascade control architecture used in most vehicles. While the results show the performance difference is marginal for nominal operations, substantial performance improvement is obtained with the supervisory RL approach under unseen wind conditions.
△ Less
Submitted 21 May, 2023;
originally announced May 2023.
-
Model-based adaptation for sample efficient transfer in reinforcement learning control of parameter-varying systems
Authors:
Ibrahim Ahmed,
Marcos Quinones-Grueiro,
Gautam Biswas
Abstract:
In this paper, we leverage ideas from model-based control to address the sample efficiency problem of reinforcement learning (RL) algorithms. Accelerating learning is an active field of RL highly relevant in the context of time-varying systems. Traditional transfer learning methods propose to use prior knowledge of the system behavior to devise a gradual or immediate data-driven transformation of…
▽ More
In this paper, we leverage ideas from model-based control to address the sample efficiency problem of reinforcement learning (RL) algorithms. Accelerating learning is an active field of RL highly relevant in the context of time-varying systems. Traditional transfer learning methods propose to use prior knowledge of the system behavior to devise a gradual or immediate data-driven transformation of the control policy obtained through RL. Such transformation is usually computed by estimating the performance of previous control policies based on measurements recently collected from the system. However, such retrospective measures have debatable utility with no guarantees of positive transfer in most cases. Instead, we propose a model-based transformation, such that when actions from a control policy are applied to the target system, a positive transfer is achieved. The transformation can be used as an initialization for the reinforcement learning process to converge to a new optimum. We validate the performance of our approach through four benchmark examples. We demonstrate that our approach is more sample-efficient than fine-tuning with reinforcement learning alone and achieves comparable performance to linear-quadratic-regulators and model-predictive control when an accurate linear model is known in the three cases. If an accurate model is not known, we empirically show that the proposed approach still guarantees positive transfer with jump-start improvement.
△ Less
Submitted 20 May, 2023;
originally announced May 2023.
-
Implementation of a Sustainable Security Architecture using Radio Frequency Identification (RFID) Technology for Access Control
Authors:
Shakiru Olajide Kassim,
Aisha Samaila Idriss,
Abdullahi Isa Ahmed
Abstract:
Implementation of a sustainable security architecture has been quite a challenging task with several technology deployed to achieve the feat. Automatic IDentification (Auto-ID) procedures exist to provide information about people, animals, goods and products in transit and found several applications in purchasing and distribution logistics, industries, manufacturing companies and material flow sys…
▽ More
Implementation of a sustainable security architecture has been quite a challenging task with several technology deployed to achieve the feat. Automatic IDentification (Auto-ID) procedures exist to provide information about people, animals, goods and products in transit and found several applications in purchasing and distribution logistics, industries, manufacturing companies and material flow systems. This work focuses on the development and implementation of an access control system using Radio Frequency Identification (RFID) technology to enhance a sustainable security architecture. The system controls access into a restricted area by granting access only to authorized persons, which incorporates the RFID hardware (RFID tags and readers and their antennas) and the software. The antenna are to be configured for a read range of about 1.5 m and TMBE kit reader module was used to test the RFID tags. The encoding and decoding process for the reading and writing to the tag as well as interfacing of the hardware and software was achieved through the use of a FissaiD RFID Reader Writer. The software that controls the whole system was designed using in Java Language. The database required for saving the necessary information, staff/guest was designed using appropriate DataBase Management System (DBMS). The system designed and implemented provide records of all accesses (check-in and check-out) made into the restricted area with time records. Other than this system, Model based modeling through the MATLAB/Simulink, Arduino platform, etc. can be used for similar implementation.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
An information-theoretic branch-and-prune algorithm for discrete phase optimization of RIS in massive MIMO
Authors:
I. Zakir Ahmed,
Hamid R. Sadjadpour,
Shahram Yousefi
Abstract:
In this paper, we consider passive RIS-assisted multi-user communication between wireless nodes to improve the blocked line-of-sight (LOS) link performance. The wireless nodes are assumed to be equipped with Massive Multiple-Input Multiple-Output antennas, hybrid precoder, combiner, and low-resolution analog-to-digital converters (ADCs). We first derive the expression for the Cramer-Rao lower boun…
▽ More
In this paper, we consider passive RIS-assisted multi-user communication between wireless nodes to improve the blocked line-of-sight (LOS) link performance. The wireless nodes are assumed to be equipped with Massive Multiple-Input Multiple-Output antennas, hybrid precoder, combiner, and low-resolution analog-to-digital converters (ADCs). We first derive the expression for the Cramer-Rao lower bound (CRLB) of the Mean Squared Error (MSE) of the received and combined signal at the intended receiver under interference. By appropriate design of the hybrid precoder, combiner, and RIS phase settings, it can be shown that the MSE achieves the CRLB. We further show that minimizing the MSE w.r.t. the phase settings of the RIS is equivalent to maximizing the throughput and energy efficiency of the system. We then propose a novel Information-Directed Branch-and-Prune (IDBP) algorithm to derive the phase settings of the RIS. We, for the first time in the literature, use an information-theoretic measure to decide on the pruning rules in a tree-search algorithm to arrive at the RIS phase-setting solution, which is vastly different compared to the traditional branch-and-bound algorithm that uses bounds of the cost function to define the pruning rules. In addition, we provide the theoretical guarantees of the near-optimality of the RIS phase-setting solution thus obtained using the Asymptotic Equipartition property. This also ensures near-optimal throughput and MSE performance.
△ Less
Submitted 15 January, 2023;
originally announced January 2023.
-
An Efficient End-to-End Deep Neural Network for Interstitial Lung Disease Recognition and Classification
Authors:
Masum Shah Junayed,
Afsana Ahsan Jeny,
Md Baharul Islam,
Ikhtiar Ahmed,
A F M Shahen Shah
Abstract:
The automated Interstitial Lung Diseases (ILDs) classification technique is essential for assisting clinicians during the diagnosis process. Detecting and classifying ILDs patterns is a challenging problem. This paper introduces an end-to-end deep convolution neural network (CNN) for classifying ILDs patterns. The proposed model comprises four convolutional layers with different kernel sizes and R…
▽ More
The automated Interstitial Lung Diseases (ILDs) classification technique is essential for assisting clinicians during the diagnosis process. Detecting and classifying ILDs patterns is a challenging problem. This paper introduces an end-to-end deep convolution neural network (CNN) for classifying ILDs patterns. The proposed model comprises four convolutional layers with different kernel sizes and Rectified Linear Unit (ReLU) activation function, followed by batch normalization and max-pooling with a size equal to the final feature map size well as four dense layers. We used the ADAM optimizer to minimize categorical cross-entropy. A dataset consisting of 21328 image patches of 128 CT scans with five classes is taken to train and assess the proposed model. A comparison study showed that the presented model outperformed pre-trained CNNs and five-fold cross-validation on the same dataset. For ILDs pattern classification, the proposed approach achieved the accuracy scores of 99.09% and the average F score of 97.9%, outperforming three pre-trained CNNs. These outcomes show that the proposed model is relatively state-of-the-art in precision, recall, f score, and accuracy.
△ Less
Submitted 21 April, 2022;
originally announced April 2022.
-
Constrained Resource Allocation Problems in Communications: An Information-assisted Approach
Authors:
I. Zakir Ahmed,
Hamid Sadjadpour,
Shahram Yousefi
Abstract:
We consider a class of resource allocation problems given a set of unconditional constraints whose objective function satisfies Bellman's optimality principle. Such problems are ubiquitous in wireless communication, signal processing, and networking. These constrained combinatorial optimization problems are, in general, NP-Hard. This paper proposes two algorithms to solve this class of problems us…
▽ More
We consider a class of resource allocation problems given a set of unconditional constraints whose objective function satisfies Bellman's optimality principle. Such problems are ubiquitous in wireless communication, signal processing, and networking. These constrained combinatorial optimization problems are, in general, NP-Hard. This paper proposes two algorithms to solve this class of problems using a dynamic programming framework assisted by an information-theoretic measure. We demonstrate that the proposed algorithms ensure optimal solutions under carefully chosen conditions and use significantly reduced computational resources. We substantiate our claims by solving the power-constrained bit allocation problem in 5G massive Multiple-Input Multiple-Output receivers using the proposed approach.
△ Less
Submitted 7 December, 2021;
originally announced December 2021.
-
A Low-Complexity Multi-Survivor Dynamic Programming for Constrained Discrete Optimization
Authors:
I. Zakir Ahmed,
Hamid Sadjadpour,
Shahram Yousefi
Abstract:
Constrained discrete optimization problems are encountered in many areas of communication and machine learning. We consider the case where the objective function satisfies Bellman's optimality principle without the constraints on which we place no conditions. We first show that these problems are a generalization of optimization in constrained Markov decision processes with finite horizon used in…
▽ More
Constrained discrete optimization problems are encountered in many areas of communication and machine learning. We consider the case where the objective function satisfies Bellman's optimality principle without the constraints on which we place no conditions. We first show that these problems are a generalization of optimization in constrained Markov decision processes with finite horizon used in reinforcement learning and are NP-Hard. We then present a novel multi-survivor dynamic programming (msDP) algorithm that guarantees optimality at significant computational savings. We demonstrate this by solving 5G quantizer bit allocation and DNA fragment assembly problems. The results are very promising and suggest that msDP can be used for many applications.
△ Less
Submitted 13 May, 2021;
originally announced May 2021.
-
An Optimal Low-Complexity Energy-Efficient ADC Bit Allocation for Massive MIMO
Authors:
I. Zakir Ahmed,
Hamid Sadjadpour,
Shahram Yousefi
Abstract:
Fixed low-resolution Analog to Digital Converters (ADC) help reduce the power consumption in millimeter-wave Massive Multiple-Input Multiple-Output (Ma-MIMO) receivers operating at large bandwidths. However, they do not guarantee optimal Energy Efficiency (EE). It has been shown that adopting variable-resolution (VR) ADCs in Ma-MIMO receivers can improve performance with Mean Squared Error (MSE) a…
▽ More
Fixed low-resolution Analog to Digital Converters (ADC) help reduce the power consumption in millimeter-wave Massive Multiple-Input Multiple-Output (Ma-MIMO) receivers operating at large bandwidths. However, they do not guarantee optimal Energy Efficiency (EE). It has been shown that adopting variable-resolution (VR) ADCs in Ma-MIMO receivers can improve performance with Mean Squared Error (MSE) and throughput while providing better EE. In this paper, we present an optimal energy-efficient bit allocation (BA) algorithm for Ma-MIMO receivers equipped with VR ADCs under a power constraint. We derive an expression for EE as a function of the Cramer-Rao Lower Bound on the MSE of the received, combined, and quantized signal. An optimal BA condition is derived by maximizing EE under a power constraint. We show that the optimal BA thus obtained is exactly the same as that obtained using the brute-force BA with a significant reduction in computational complexity. We also study the EE performance and computational complexity of a heuristic algorithm that yields a near-optimal solution.
△ Less
Submitted 11 April, 2021;
originally announced April 2021.
-
Performance-Weighed Policy Sampling for Meta-Reinforcement Learning
Authors:
Ibrahim Ahmed,
Marcos Quinones-Grueiro,
Gautam Biswas
Abstract:
This paper discusses an Enhanced Model-Agnostic Meta-Learning (E-MAML) algorithm that generates fast convergence of the policy function from a small number of training examples when applied to new learning tasks. Built on top of Model-Agnostic Meta-Learning (MAML), E-MAML maintains a set of policy parameters learned in the environment for previous tasks. We apply E-MAML to developing reinforcement…
▽ More
This paper discusses an Enhanced Model-Agnostic Meta-Learning (E-MAML) algorithm that generates fast convergence of the policy function from a small number of training examples when applied to new learning tasks. Built on top of Model-Agnostic Meta-Learning (MAML), E-MAML maintains a set of policy parameters learned in the environment for previous tasks. We apply E-MAML to developing reinforcement learning (RL)-based online fault tolerant control schemes for dynamic systems. The enhancement is applied when a new fault occurs, to re-initialize the parameters of a new RL policy that achieves faster adaption with a small number of samples of system behavior with the new fault. This replaces the random task sampling step in MAML. Instead, it exploits the extant previously generated experiences of the controller. The enhancement is sampled to maximally span the parameter space to facilitate adaption to the new fault. We demonstrate the performance of our approach combining E-MAML with proximal policy optimization (PPO) on the well-known cart pole example, and then on the fuel transfer system of an aircraft.
△ Less
Submitted 10 December, 2020;
originally announced December 2020.
-
Complementary Meta-Reinforcement Learning for Fault-Adaptive Control
Authors:
Ibrahim Ahmed,
Marcos Quinones-Grueiro,
Gautam Biswas
Abstract:
Faults are endemic to all systems. Adaptive fault-tolerant control maintains degraded performance when faults occur as opposed to unsafe conditions or catastrophic events. In systems with abrupt faults and strict time constraints, it is imperative for control to adapt quickly to system changes to maintain system operations. We present a meta-reinforcement learning approach that quickly adapts its…
▽ More
Faults are endemic to all systems. Adaptive fault-tolerant control maintains degraded performance when faults occur as opposed to unsafe conditions or catastrophic events. In systems with abrupt faults and strict time constraints, it is imperative for control to adapt quickly to system changes to maintain system operations. We present a meta-reinforcement learning approach that quickly adapts its control policy to changing conditions. The approach builds upon model-agnostic meta learning (MAML). The controller maintains a complement of prior policies learned under system faults. This "library" is evaluated on a system after a new fault to initialize the new policy. This contrasts with MAML, where the controller derives intermediate policies anew, sampled from a distribution of similar systems, to initialize a new policy. Our approach improves sample efficiency of the reinforcement learning process. We evaluate our approach on an aircraft fuel transfer system under abrupt faults.
△ Less
Submitted 26 September, 2020;
originally announced September 2020.
-
Fault-Tolerant Control of Degrading Systems with On-Policy Reinforcement Learning
Authors:
Ibrahim Ahmed,
Marcos Quiñones-Grueiro,
Gautam Biswas
Abstract:
We propose a novel adaptive reinforcement learning control approach for fault tolerant control of degrading systems that is not preceded by a fault detection and diagnosis step. Therefore, \textit{a priori} knowledge of faults that may occur in the system is not required. The adaptive scheme combines online and offline learning of the on-policy control method to improve exploration and sample effi…
▽ More
We propose a novel adaptive reinforcement learning control approach for fault tolerant control of degrading systems that is not preceded by a fault detection and diagnosis step. Therefore, \textit{a priori} knowledge of faults that may occur in the system is not required. The adaptive scheme combines online and offline learning of the on-policy control method to improve exploration and sample efficiency, while guaranteeing stable learning. The offline learning phase is performed using a data-driven model of the system, which is frequently updated to track the system's operating conditions. We conduct experiments on an aircraft fuel transfer system to demonstrate the effectiveness of our approach.
△ Less
Submitted 10 August, 2020;
originally announced August 2020.
-
Comparison of Model Predictive and Reinforcement Learning Methods for Fault Tolerant Control
Authors:
Ibrahim Ahmed,
Hamed Khorasgani,
Gautam Biswas
Abstract:
A desirable property in fault-tolerant controllers is adaptability to system changes as they evolve during systems operations. An adaptive controller does not require optimal control policies to be enumerated for possible faults. Instead it can approximate one in real-time. We present two adaptive fault-tolerant control schemes for a discrete time system based on hierarchical reinforcement learnin…
▽ More
A desirable property in fault-tolerant controllers is adaptability to system changes as they evolve during systems operations. An adaptive controller does not require optimal control policies to be enumerated for possible faults. Instead it can approximate one in real-time. We present two adaptive fault-tolerant control schemes for a discrete time system based on hierarchical reinforcement learning. We compare their performance against a model predictive controller in presence of sensor noise and persistent faults. The controllers are tested on a fuel tank model of a C-130 plane. Our experiments demonstrate that reinforcement learning-based controllers perform more robustly than model predictive controllers under faults, partially observable system models, and varying sensor noise levels.
△ Less
Submitted 10 August, 2020;
originally announced August 2020.
-
A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks
Authors:
Kazi Ishfaq Ahmed,
Ekram Hossain
Abstract:
Optimal resource allocation is a fundamental challenge for dense and heterogeneous wireless networks with massive wireless connections. Because of the non-convex nature of the optimization problem, it is computationally demanding to obtain the optimal resource allocation. Recently, deep reinforcement learning (DRL) has emerged as a promising technique in solving non-convex optimization problems. U…
▽ More
Optimal resource allocation is a fundamental challenge for dense and heterogeneous wireless networks with massive wireless connections. Because of the non-convex nature of the optimization problem, it is computationally demanding to obtain the optimal resource allocation. Recently, deep reinforcement learning (DRL) has emerged as a promising technique in solving non-convex optimization problems. Unlike deep learning (DL), DRL does not require any optimal/ near-optimal training dataset which is either unavailable or computationally expensive in generating synthetic data. In this paper, we propose a novel centralized DRL based downlink power allocation scheme for a multi-cell system intending to maximize the total network throughput. Specifically, we apply a deep Q-learning (DQL) approach to achieve near-optimal power allocation policy. For benchmarking the proposed approach, we use a Genetic Algorithm (GA) to obtain near-optimal power allocation solution. Simulation results show that the proposed DRL-based power allocation scheme performs better compared to the conventional power allocation schemes in a multi-cell scenario.
△ Less
Submitted 29 April, 2019;
originally announced April 2019.
-
Optimal Bit Allocation Variable-Resolution ADC for Massive MIMO
Authors:
I. Zakir Ahmed,
Hamid Sadjadpour,
Shahram Yousefi
Abstract:
In this paper, we derive an optimal ADC bit-allocation (BA) condition for a Single-User (SU) Millimeter wave (mmWave) Massive Multiple-Input Multiple-Output (Ma-MIMO) receiver equipped with variable-resolution ADCs under power constraint with the following criteria: (i) Minimizing the Mean Squared Error (MSE) of the received, quantized and combined symbol vector and (ii) Maximizing the capacity of…
▽ More
In this paper, we derive an optimal ADC bit-allocation (BA) condition for a Single-User (SU) Millimeter wave (mmWave) Massive Multiple-Input Multiple-Output (Ma-MIMO) receiver equipped with variable-resolution ADCs under power constraint with the following criteria: (i) Minimizing the Mean Squared Error (MSE) of the received, quantized and combined symbol vector and (ii) Maximizing the capacity of the SU mmWave Ma-MIMO channel encompassing hybrid precoder and combiner. Optimal BA under both criteria results the same. We jointly design the hybrid combiner based on the SVD of the channel. We demonstrate improvement of the proposed optimal BA over the BA based on Minimization of the Mean Square Quantization Error (MSQE). Using Monte-Carlo simulations, it is shown that the MSE and capacity performance of the proposed BA is very close to that of the Exhaustive Search (ES). The computational complexity of the proposed techniques are compared with ES and MQSE BA algorithms.
△ Less
Submitted 9 February, 2019;
originally announced February 2019.
-
Capacity analysis and bit allocation design for variable-resolution ADCs in Massive MIMO
Authors:
I. Zakir Ahmed,
Hamid Sadjadpour,
Shahram Yousefi
Abstract:
We derive an expression for the capacity of massive multiple-input multiple-output Millimeter wave (mmWave) channel where the receiver is equipped with a variable-resolution Analog to Digital Converter (ADC) and a hybrid combiner. The capacity is shown to be a function of Cramer-Rao Lower Bound (CRLB) for a given bit-allocation matrix and hybrid combiner. The condition for optimal ADC bit-allocati…
▽ More
We derive an expression for the capacity of massive multiple-input multiple-output Millimeter wave (mmWave) channel where the receiver is equipped with a variable-resolution Analog to Digital Converter (ADC) and a hybrid combiner. The capacity is shown to be a function of Cramer-Rao Lower Bound (CRLB) for a given bit-allocation matrix and hybrid combiner. The condition for optimal ADC bit-allocation under a receiver power constraint is derived. This is derived based on the maximization of capacity with respect to bit-allocation matrix for a given channel, hybrid precoder, and hybrid combiner. It is shown that this condition coincides with that obtained using the CRLB minimization proposed by Ahmed et al. Monte-carlo simulations show that the capacity calculated using the proposed condition matches very closely with the capacity obtained using the Exhaustive Search bit allocation.
△ Less
Submitted 8 September, 2018;
originally announced September 2018.
-
Deep Learning for Radio Resource Allocation in Multi-Cell Networks
Authors:
K. I. Ahmed,
H. Tabassum,
E. Hossain
Abstract:
Increased complexity and heterogeneity of emerging 5G and beyond 5G (B5G) wireless networks will require a paradigm shift from traditional resource allocation mechanisms. Deep learning (DL) is a powerful tool where a multi-layer neural network can be trained to model a resource management algorithm using network data.Therefore, resource allocation decisions can be obtained without intensive online…
▽ More
Increased complexity and heterogeneity of emerging 5G and beyond 5G (B5G) wireless networks will require a paradigm shift from traditional resource allocation mechanisms. Deep learning (DL) is a powerful tool where a multi-layer neural network can be trained to model a resource management algorithm using network data.Therefore, resource allocation decisions can be obtained without intensive online computations which would be required otherwise for the solution of resource allocation problems. In this context, this article focuses on the application of DL to obtain solutions for the radio resource allocation problems in multi-cell networks. Starting with a brief overview of a deep neural network (DNN) as a DL model, relevant DNN architectures and the data training procedure, we provide an overview of existing state-of-the-art applying DL in the context of radio resource allocation. A qualitative comparison is provided in terms of their objectives, inputs/outputs, learning and data training methods. Then, we present a supervised DL model to solve the sub-band and power allocation problem in a multi-cell network. Using the data generated by a genetic algorithm, we first train the model and then test the accuracy of the proposed model in predicting the resource allocation solutions. Simulation results show that the trained DL model is able to provide the desired optimal solution 86.3% of time.
△ Less
Submitted 2 August, 2018;
originally announced August 2018.
-
Single-User mmWave Massive MIMO: SVD-based ADC Bit Allocation and Combiner Design
Authors:
I. Zakir Ahmed,
Hamid Sadjadpour,
Shahram Yousefi
Abstract:
In this paper, we propose a Singular-Value-Decomposition-based variable-resolution Analog to Digital Converter (ADC) bit allocation design for a single-user Millimeter wave massive Multiple-Input Multiple-Output receiver. We derive the optimality condition for bit allocation under a power constraint. This condition ensures optimal receiver performance in the Mean Squared Error (MSE) sense. We deri…
▽ More
In this paper, we propose a Singular-Value-Decomposition-based variable-resolution Analog to Digital Converter (ADC) bit allocation design for a single-user Millimeter wave massive Multiple-Input Multiple-Output receiver. We derive the optimality condition for bit allocation under a power constraint. This condition ensures optimal receiver performance in the Mean Squared Error (MSE) sense. We derive the MSE expression and show that it approaches the Cramer-Rao Lower Bound (CRLB). The CRLB is seen to be a function of the analog combiner, the digital combiner, and the bit allocation matrix. We attempt to minimize the CRLB with respect to the bit allocation matrix by making suitable assumptions regarding the structure of the combiners. In doing so, the bit allocation design reduces to a set of simple inequalities consisting of ADC bits, channel singular values and covariance of the quantization noise along each RF path. This results in a simple and computationally efficient bit allocation algorithm. Using simulations, we show that the MSE performance of our proposed bit allocation is very close to that of the Full Search (FS) bit allocation. We also show that the computational complexity of our proposed method has an order of magnitude improvement compared to FS and Genetic Algorithm based bit allocation of $\cite{Zakir1}$
△ Less
Submitted 23 April, 2018;
originally announced April 2018.
-
A Joint Combiner and Bit Allocation Design for Massive MIMO Using Genetic Algorithm
Authors:
I. Zakir Ahmed,
Hamid Sadjadpour,
Shahram Yousefi
Abstract:
In this paper, we derive a closed-form expression for the combiner of a multiple-input-multiple-output (MIMO) receiver equipped with a minimum-mean-square-error (MMSE) estimator. We propose using variable-bit-resolution analog-to- digital converters (ADC) across radio frequency (RF) paths. The combiner designed is a function of the quantization errors across each RF path. Using very low bit resolu…
▽ More
In this paper, we derive a closed-form expression for the combiner of a multiple-input-multiple-output (MIMO) receiver equipped with a minimum-mean-square-error (MMSE) estimator. We propose using variable-bit-resolution analog-to- digital converters (ADC) across radio frequency (RF) paths. The combiner designed is a function of the quantization errors across each RF path. Using very low bit resolution ADCs (1-2bits) is a popular approach with massive MIMO receiver architectures to mitigate large power demands. We show that for certain channel conditions, adopting unequal bit resolution ADCs (e.g., between 1 and 4 bits) on different RF chains, along with the proposed combiner, improves the performance of the MIMO receiver in the Mean Squared Error (MSE) sense. The variable-bit-resolution ADCs is still within the power constraint of using equal bit resolution ADCs on all paths (e.g., 2-bits). We propose a genetic algorithm in conjunction with the derived combiner to arrive at an optimal ADC bit allocation framework with significant reduction in computational complexity.
△ Less
Submitted 17 November, 2017;
originally announced November 2017.