-
Leveraging Task-Specific Knowledge from LLM for Semi-Supervised 3D Medical Image Segmentation
Authors:
Suruchi Kumari,
Aryan Das,
Swalpa Kumar Roy,
Indu Joshi,
Pravendra Singh
Abstract:
Traditional supervised 3D medical image segmentation models need voxel-level annotations, which require huge human effort, time, and cost. Semi-supervised learning (SSL) addresses this limitation of supervised learning by facilitating learning with a limited annotated and larger amount of unannotated training samples. However, state-of-the-art SSL models still struggle to fully exploit the potenti…
▽ More
Traditional supervised 3D medical image segmentation models need voxel-level annotations, which require huge human effort, time, and cost. Semi-supervised learning (SSL) addresses this limitation of supervised learning by facilitating learning with a limited annotated and larger amount of unannotated training samples. However, state-of-the-art SSL models still struggle to fully exploit the potential of learning from unannotated samples. To facilitate effective learning from unannotated data, we introduce LLM-SegNet, which exploits a large language model (LLM) to integrate task-specific knowledge into our co-training framework. This knowledge aids the model in comprehensively understanding the features of the region of interest (ROI), ultimately leading to more efficient segmentation. Additionally, to further reduce erroneous segmentation, we propose a Unified Segmentation loss function. This loss function reduces erroneous segmentation by not only prioritizing regions where the model is confident in predicting between foreground or background pixels but also effectively addressing areas where the model lacks high confidence in predictions. Experiments on publicly available Left Atrium, Pancreas-CT, and Brats-19 datasets demonstrate the superior performance of LLM-SegNet compared to the state-of-the-art. Furthermore, we conducted several ablation studies to demonstrate the effectiveness of various modules and loss functions leveraged by LLM-SegNet.
△ Less
Submitted 6 July, 2024;
originally announced July 2024.
-
Physics-informed Neural Networks for Heterogeneous Poroelastic Media
Authors:
Sumanta Roy,
Chandrasekhar Annavarapu,
Pratanu Roy,
Dakshina Murthy Valiveti
Abstract:
This study introduces a novel physics-informed neural networks (PINNs) framework designed to model coupled-field problems specifically tailored for heterogeneous poroelastic media. Firstly, a composite neural network is developed where distinct neural networks are dedicated to predicting displacement and pressure variables for each material, employing identical activation functions but trained sep…
▽ More
This study introduces a novel physics-informed neural networks (PINNs) framework designed to model coupled-field problems specifically tailored for heterogeneous poroelastic media. Firstly, a composite neural network is developed where distinct neural networks are dedicated to predicting displacement and pressure variables for each material, employing identical activation functions but trained separately across all other parameters. Secondly, we handle the challenges of heterogeneous material interfaces by the Interface- PINNs (I-PINNs) framework, where different activation functions across any material interface are prescribed to ensure that the discontinuities in solution fields and gradients are accurately captured. We compare the modified PINNs framework with the conventional approach on two one-dimensional benchmark examples for poroelasticity in heterogeneous media. Furthermore, we assess a single neural network architecture, comparing it against the composite neural network proposed in this work. These examples show that the proposed framework demonstrates superior approximation accuracy in both displacements and pressures, and better convergence behavior.
△ Less
Submitted 9 July, 2024; v1 submitted 1 July, 2024;
originally announced July 2024.
-
Impact of an Autonomous Shuttle Service on Urban Road Capacity: Experiments by Microscopic Traffic Simulation
Authors:
Sudipta Roy,
Bat-hen Nahmias-Biran,
Samiul Hasan
Abstract:
Autonomous vehicles are expected to transform transportation systems with rapid technological advancement. Human mobility would become more accessible and safer with the emergence of driverless vehicles. To this end, autonomous shuttle services are currently introduced in different urban conditions throughout the world. As a result, studies are needed to assess the safety and mobility performance…
▽ More
Autonomous vehicles are expected to transform transportation systems with rapid technological advancement. Human mobility would become more accessible and safer with the emergence of driverless vehicles. To this end, autonomous shuttle services are currently introduced in different urban conditions throughout the world. As a result, studies are needed to assess the safety and mobility performance of such autonomous shuttle services. However, calibrating the movement of autonomous shuttles in a simulation environment has been a difficult task due to the absence of any real-world data. This study aims to calibrate autonomous shuttles in a microscopic traffic simulation model and consequently assess the impact of the shuttle service on urban road capacity through simulation experiments. For this analysis, a prototype of an operational shuttle system at Lake Nona, Orlando, Florida is emulated in a microscopic traffic simulator during different times of the day. The movements of autonomous vehicles are calibrated using real-world trajectory data which help replicate the driving behavior of the shuttle in the simulation. The analysis reveals that with increasing frequency of the shuttle service the delay time percentage of the shared road sections increases and traveling speed decreases. It is also found that increasing the speed of shuttles up to 5 mph during off-peak hours and 10 mph during peak hours will improve traffic conditions. The findings from this study will assist policymakers and transportation agencies to revise policies for deploying autonomous shuttles and for planning road infrastructures for shared road-use of autonomous shuttles and human driven vehicles.
△ Less
Submitted 11 June, 2024;
originally announced July 2024.
-
Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation?
Authors:
Pallabi Dutta,
Soham Bose,
Swalpa Kumar Roy,
Sushmita Mitra
Abstract:
The advancement of developing efficient medical image segmentation has evolved from initial dependence on Convolutional Neural Networks (CNNs) to the present investigation of hybrid models that combine CNNs with Vision Transformers. Furthermore, there is an increasing focus on creating architectures that are both high-performing in medical image segmentation tasks and computationally efficient to…
▽ More
The advancement of developing efficient medical image segmentation has evolved from initial dependence on Convolutional Neural Networks (CNNs) to the present investigation of hybrid models that combine CNNs with Vision Transformers. Furthermore, there is an increasing focus on creating architectures that are both high-performing in medical image segmentation tasks and computationally efficient to be deployed on systems with limited resources. Although transformers have several advantages like capturing global dependencies in the input data, they face challenges such as high computational and memory complexity. This paper investigates the integration of CNNs and Vision Extended Long Short-Term Memory (Vision-xLSTM) models by introducing a novel approach called UVixLSTM. The Vision-xLSTM blocks captures temporal and global relationships within the patches extracted from the CNN feature maps. The convolutional feature reconstruction path upsamples the output volume from the Vision-xLSTM blocks to produce the segmentation output. Our primary objective is to propose that Vision-xLSTM forms a reliable backbone for medical image segmentation tasks, offering excellent segmentation performance and reduced computational complexity. UVixLSTM exhibits superior performance compared to state-of-the-art networks on the publicly-available Synapse dataset. Code is available at: https://github.com/duttapallabi2907/UVixLSTM
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Revisiting Multi-User Downlink in IEEE 802.11ax: A Designers Guide to MU-MIMO
Authors:
Liu Cao,
Lyutianyang Zhang,
Sumit Roy,
Sian Jin
Abstract:
Downlink (DL) Multi-User (MU) Multiple Input Multiple Output (MU-MIMO) is a key technology that allows multiple concurrent data transmissions from an Access Point (AP) to a selected sub-set of clients for higher network efficiency in IEEE 802.11ax. However, DL MU-MIMO feature is typically turned off as the default setting in AP vendors' products, that is, turning on the DL MU-MIMO may not help inc…
▽ More
Downlink (DL) Multi-User (MU) Multiple Input Multiple Output (MU-MIMO) is a key technology that allows multiple concurrent data transmissions from an Access Point (AP) to a selected sub-set of clients for higher network efficiency in IEEE 802.11ax. However, DL MU-MIMO feature is typically turned off as the default setting in AP vendors' products, that is, turning on the DL MU-MIMO may not help increase the network efficiency, which is counter-intuitive. In this article, we provide a sufficiently deep understanding of the interplay between the various underlying factors, i.e., CSI overhead and spatial correlation, which result in negative results when turning on the DL MU-MIMO. Furthermore, we provide a fundamental guideline as a function of operational scenarios to address the fundamental question "when the DL MU-MIMO should be turned on/off".
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Small-Signal Dynamics of Lossy Inverter-Based Microgrids for Generalized Droop Controls
Authors:
Abdullah Al Maruf,
Anamika Dubey,
Sandip Roy
Abstract:
A network-level small-signal model is developed for lossy microgrids, which considers coupled angle and voltage dynamics of inverter-based microgrids and uses a more general framework of droop controls in the inverter. It is shown that when relative resistances of the lines in the microgrid are reasonably consistent and differences of voltage angles across the lines are small at the operating poin…
▽ More
A network-level small-signal model is developed for lossy microgrids, which considers coupled angle and voltage dynamics of inverter-based microgrids and uses a more general framework of droop controls in the inverter. It is shown that when relative resistances of the lines in the microgrid are reasonably consistent and differences of voltage angles across the lines are small at the operating point, the generalized droop controls can be designed to enforce decoupling between angle dynamics and voltage dynamics. Next, structural results for the asymptotic stability of small-signal angle and voltage dynamics are given for the case when generalized droop control achieves decoupling. Simulated transient responses of a modified IEEE 9-bus system are presented to validate the theoretical findings which show the effectiveness of generalized droop controls in independently shaping the settling times of the angle and voltage responses of the lossy microgrid system.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures
Authors:
Yannick Kirchhoff,
Maximilian R. Rokuss,
Saikat Roy,
Balint Kovacs,
Constantin Ulrich,
Tassilo Wald,
Maximilian Zenk,
Philipp Vollmuth,
Jens Kleesiek,
Fabian Isensee,
Klaus Maier-Hein
Abstract:
Accurately segmenting thin tubular structures, such as vessels, nerves, roads or concrete cracks, is a crucial task in computer vision. Standard deep learning-based segmentation loss functions, such as Dice or Cross-Entropy, focus on volumetric overlap, often at the expense of preserving structural connectivity or topology. This can lead to segmentation errors that adversely affect downstream task…
▽ More
Accurately segmenting thin tubular structures, such as vessels, nerves, roads or concrete cracks, is a crucial task in computer vision. Standard deep learning-based segmentation loss functions, such as Dice or Cross-Entropy, focus on volumetric overlap, often at the expense of preserving structural connectivity or topology. This can lead to segmentation errors that adversely affect downstream tasks, including flow calculation, navigation, and structural inspection. Although current topology-focused losses mark an improvement, they introduce significant computational and memory overheads. This is particularly relevant for 3D data, rendering these losses infeasible for larger volumes as well as increasingly important multi-class segmentation problems. To mitigate this, we propose a novel Skeleton Recall Loss, which effectively addresses these challenges by circumventing intensive GPU-based calculations with inexpensive CPU operations. It demonstrates overall superior performance to current state-of-the-art approaches on five public datasets for topology-preserving segmentation, while substantially reducing computational overheads by more than 90%. In doing so, we introduce the first multi-class capable loss function for thin structure segmentation, excelling in both efficiency and efficacy for topology-preservation.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Low-cost, Lightweight Electronic Flow Regulators for Throttling Liquid Rocket Engines
Authors:
Vint Lee,
Sohom Roy
Abstract:
For small-scale liquid rockets, pressure-fed systems are commonly favoured due to their simplicity and low weight. In such systems, accurate regulation of both tank and injector pressures over a wide range of upstream pressures is critical $-$ more accurate regulation allows for higher engine efficiency and minimal tank mass, thus improving flight performance. However, existing methods such as dom…
▽ More
For small-scale liquid rockets, pressure-fed systems are commonly favoured due to their simplicity and low weight. In such systems, accurate regulation of both tank and injector pressures over a wide range of upstream pressures is critical $-$ more accurate regulation allows for higher engine efficiency and minimal tank mass, thus improving flight performance. However, existing methods such as dome-loaded pressure regulators are inflexible, or require extensive characterization to function accurately. These methods also suffer from limited orifice size, droop, and slow reaction times, making them unsuitable for throttling by adjusting pressures in flight, which are increasingly important as propulsively landing rockets become more common. To overcome these challenges, we designed an electronic pressure regulator (eReg), a multi-input multi-output system utilising closed loop feedback to accurately control downstream pressures. Our design is simple, low-cost and robust: with a single ball valve actuated by a motor, we regulate both gaseous pressurant and cryogenic liquid propellant at high flow rates (1.14 kg/s of liquid; 0.39 kg/s of gas) and upstream pressures (310 bar). Using 2 eRegs to regulate propellant tank pressures, and 2 eRegs for regulating propellant flow to the engine, we demonstrated our system's ability, in a static fire test, to regulate pressures accurately (within 0.2 bar) while simultaneously throttling our engine. To the best of our knowledge, this is the first time any undergraduate team has successfully throttled a liquid bipropellant engine.
△ Less
Submitted 14 January, 2024;
originally announced January 2024.
-
Predicting Multi-Joint Kinematics of the Upper Limb from EMG Signals Across Varied Loads with a Physics-Informed Neural Network
Authors:
Rajnish Kumar,
Suriya Prakash Muthukrishnan,
Lalan Kumar,
Sitikantha Roy
Abstract:
In this research, we present an innovative method known as a physics-informed neural network (PINN) model to predict multi-joint kinematics using electromyography (EMG) signals recorded from the muscles surrounding these joints across various loads. The primary aim is to simultaneously predict both the shoulder and elbow joint angles while executing elbow flexion-extension (FE) movements, especial…
▽ More
In this research, we present an innovative method known as a physics-informed neural network (PINN) model to predict multi-joint kinematics using electromyography (EMG) signals recorded from the muscles surrounding these joints across various loads. The primary aim is to simultaneously predict both the shoulder and elbow joint angles while executing elbow flexion-extension (FE) movements, especially under varying load conditions. The PINN model is constructed by combining a feed-forward Artificial Neural Network (ANN) with a joint torque computation model. During the training process, the model utilizes a custom loss function derived from an inverse dynamics joint torque musculoskeletal model, along with a mean square angle loss. The training dataset for the PINN model comprises EMG and time data collected from four different subjects. To assess the model's performance, we conducted a comparison between the predicted joint angles and experimental data using a testing data set. The results demonstrated strong correlations of 58% to 83% in joint angle prediction. The findings highlight the potential of incorporating physical principles into the model, not only increasing its versatility but also enhancing its accuracy. The findings could have significant implications for the precise estimation of multi-joint kinematics in dynamic scenarios, particularly concerning the advancement of human-machine interfaces (HMIs) for exoskeletons and prosthetic control systems.
△ Less
Submitted 28 November, 2023;
originally announced December 2023.
-
Geometric Tracking Control of a Multi-rotor UAV for Partially Known Trajectories
Authors:
Yogesh Kumar,
S. B. Roy,
P. B. Sujit
Abstract:
This paper presents a trajectory-tracking controller for multi-rotor unmanned aerial vehicles (UAVs) in scenarios where only the desired position and heading are known without the higher-order derivatives. The proposed solution modifies the state-of-the-art geometric controller, effectively addressing challenges related to the non-existence of the desired attitude and ensuring positive total thrus…
▽ More
This paper presents a trajectory-tracking controller for multi-rotor unmanned aerial vehicles (UAVs) in scenarios where only the desired position and heading are known without the higher-order derivatives. The proposed solution modifies the state-of-the-art geometric controller, effectively addressing challenges related to the non-existence of the desired attitude and ensuring positive total thrust input for all time. We tackle the additional challenge of the non-availability of the higher derivatives of the trajectory by introducing novel nonlinear filter structures. We formalize theoretically the effect of these filter structures on the system error dynamics. Subsequently, through a rigorous theoretical analysis, we demonstrate that the proposed controller leads to uniformly ultimately bounded system error dynamics.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Adaptive Control of Euler-Lagrange Systems under Time-varying State Constraints without a Priori Bounded Uncertainty
Authors:
Viswa Narayanan Sankaranarayanan,
Sumeet Gajanan Satpute,
Spandan Roy,
George Nikolakopoulos
Abstract:
In this article, a novel adaptive controller is designed for Euler-Lagrangian systems under predefined time-varying state constraints. The proposed controller could achieve this objective without a priori knowledge of system parameters and, crucially, of state-dependent uncertainties. The closed-loop stability is verified using the Lyapunov method, while the overall efficacy of the proposed scheme…
▽ More
In this article, a novel adaptive controller is designed for Euler-Lagrangian systems under predefined time-varying state constraints. The proposed controller could achieve this objective without a priori knowledge of system parameters and, crucially, of state-dependent uncertainties. The closed-loop stability is verified using the Lyapunov method, while the overall efficacy of the proposed scheme is verified using a simulated robotic arm compared to the state of the art.
△ Less
Submitted 31 October, 2023;
originally announced November 2023.
-
Semi-Persistent Scheduling in NR Sidelink Mode 2: MAC Packet Reception Ratio Model and Validation
Authors:
Liu Cao,
Sumit Roy,
Collin Brady
Abstract:
5G NR Sidelink (SL) has demonstrated the promising capability for infrastructure-less cellular coverage. Understanding the fundamentals of the NR SL channel access mechanism, Semi-Persistent Scheduling (SPS), which is specified by the 3rd Generation Partnership Project (3GPP), is a necessity to enhance the NR SL Packet Reception Ratio (PRR). However, most existing works fail to account for the new…
▽ More
5G NR Sidelink (SL) has demonstrated the promising capability for infrastructure-less cellular coverage. Understanding the fundamentals of the NR SL channel access mechanism, Semi-Persistent Scheduling (SPS), which is specified by the 3rd Generation Partnership Project (3GPP), is a necessity to enhance the NR SL Packet Reception Ratio (PRR). However, most existing works fail to account for the new SPS features introduced in NR SL, which might be out-of-date for comprehensively describing the NR SL PRR. The existing models ignore the relationships between SPS parameters and therefore do not provide sufficient insights into the PRR of SPS. This work proposes a novel SPS PRR model incorporating MAC collisions based on new features in NR SL. We extend our model by loosening several simplifying assumptions made in our initial modeling. The extended models illustrate how the PRR is affected by various SPS parameters. The computed results are validated via simulations using the network simulator (ns-3), which provides important guidelines for future NR SL enhancement work.
△ Less
Submitted 26 July, 2023;
originally announced September 2023.
-
Learning end-to-end inversion of circular Radon transforms in the partial radial setup
Authors:
Deep Ray,
Souvik Roy
Abstract:
We present a deep learning-based computational algorithm for inversion of circular Radon transforms in the partial radial setup, arising in photoacoustic tomography. We first demonstrate that the truncated singular value decomposition-based method, which is the only traditional algorithm available to solve this problem, leads to severe artifacts which renders the reconstructed field as unusable. W…
▽ More
We present a deep learning-based computational algorithm for inversion of circular Radon transforms in the partial radial setup, arising in photoacoustic tomography. We first demonstrate that the truncated singular value decomposition-based method, which is the only traditional algorithm available to solve this problem, leads to severe artifacts which renders the reconstructed field as unusable. With the objective of overcoming this computational bottleneck, we train a ResBlock based U-Net to recover the inferred field that directly operates on the measured data. Numerical results with augmented Shepp-Logan phantoms, in the presence of noisy full and limited view data, demonstrate the superiority of the proposed algorithm.
△ Less
Submitted 27 August, 2023;
originally announced August 2023.
-
Test Time Adaptation for Blind Image Quality Assessment
Authors:
Subhadeep Roy,
Shankhanil Mitra,
Soma Biswas,
Rajiv Soundararajan
Abstract:
While the design of blind image quality assessment (IQA) algorithms has improved significantly, the distribution shift between the training and testing scenarios often leads to a poor performance of these methods at inference time. This motivates the study of test time adaptation (TTA) techniques to improve their performance at inference time. Existing auxiliary tasks and loss functions used for T…
▽ More
While the design of blind image quality assessment (IQA) algorithms has improved significantly, the distribution shift between the training and testing scenarios often leads to a poor performance of these methods at inference time. This motivates the study of test time adaptation (TTA) techniques to improve their performance at inference time. Existing auxiliary tasks and loss functions used for TTA may not be relevant for quality-aware adaptation of the pre-trained model. In this work, we introduce two novel quality-relevant auxiliary tasks at the batch and sample levels to enable TTA for blind IQA. In particular, we introduce a group contrastive loss at the batch level and a relative rank loss at the sample level to make the model quality aware and adapt to the target data. Our experiments reveal that even using a small batch of images from the test distribution helps achieve significant improvement in performance by updating the batch normalization statistics of the source model.
△ Less
Submitted 26 September, 2023; v1 submitted 27 July, 2023;
originally announced July 2023.
-
Static Background Removal in Vehicular Radar: Filtering in Azimuth-Elevation-Doppler Domain
Authors:
Xiangyu Gao,
Sumit Roy,
Lyutianyang Zhang
Abstract:
Anti-collision assistance (as part of the current push towards increasing vehicular autonomy) critically depends on accurate detection/localization of moving targets in vicinity. An effective solution pathway involves removing background or static objects from the scene, so as to enhance the detection/localization of moving targets as a key component for improving overall system performance. In th…
▽ More
Anti-collision assistance (as part of the current push towards increasing vehicular autonomy) critically depends on accurate detection/localization of moving targets in vicinity. An effective solution pathway involves removing background or static objects from the scene, so as to enhance the detection/localization of moving targets as a key component for improving overall system performance. In this paper, we present an efficient algorithm for background removal for automotive scenarios, applicable to commodity frequency-modulated continuous wave (FMCW)-based radars. Our proposed algorithm follows a three-step approach: a) preprocessing of back-scattered received radar signal for 4-dimensional (4D) point clouds generation, b) 3-dimensional (3D) radar ego-motion estimation, and c) notch filter-based background removal in the azimuth-elevation-Doppler domain. To begin, we model the received signal corresponding to multiple-input multiple-output (MIMO) FMCW transmissions and develop a signal processing framework for extracting 4D point clouds. Subsequently, we introduce a robust 3D ego-motion estimation algorithm that accurately estimates source radar velocity, accounting for measurement errors and Doppler ambiguity, by processing the point clouds. Additionally, our algorithm leverages the relationship between Doppler velocity, azimuth angle, elevation angle, and radar ego-motion velocity to identify the background clutter spectrum and employ notch filters for its removal. The performance of our algorithm is evaluated using both simulated data and experiments with real-world data. By offering a fast and computationally efficient solution, our approach contributes to a potential pathway for challenges posed by non-homogeneous environments and real-time processing requirements.
△ Less
Submitted 29 July, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction
Authors:
Ali Jamali,
Swalpa Kumar Roy,
Jonathan Li,
Pedram Ghamisi
Abstract:
In the domain of remote sensing image interpretation, road extraction from high-resolution aerial imagery has already been a hot research topic. Although deep CNNs have presented excellent results for semantic segmentation, the efficiency and capabilities of vision transformers are yet to be fully researched. As such, for accurate road extraction, a deep semantic segmentation neural network that u…
▽ More
In the domain of remote sensing image interpretation, road extraction from high-resolution aerial imagery has already been a hot research topic. Although deep CNNs have presented excellent results for semantic segmentation, the efficiency and capabilities of vision transformers are yet to be fully researched. As such, for accurate road extraction, a deep semantic segmentation neural network that utilizes the abilities of residual learning, HetConvs, UNet, and vision transformers, which is called \texttt{ResUNetFormer}, is proposed in this letter. The developed \texttt{ResUNetFormer} is evaluated on various cutting-edge deep learning-based road extraction techniques on the public Massachusetts road dataset. Statistical and visual results demonstrate the superiority of the \texttt{ResUNetFormer} over the state-of-the-art CNNs and vision transformers for segmentation. The code will be made available publicly at \url{https://github.com/aj1365/ResUNetFormer}.
△ Less
Submitted 8 June, 2023;
originally announced June 2023.
-
atTRACTive: Semi-automatic white matter tract segmentation using active learning
Authors:
Robin Peretzke,
Klaus Maier-Hein,
Jonas Bohn,
Yannick Kirchhoff,
Saikat Roy,
Sabrina Oberli-Palma,
Daniela Becker,
Pavlina Lenga,
Peter Neher
Abstract:
Accurately identifying white matter tracts in medical images is essential for various applications, including surgery planning and tract-specific analysis. Supervised machine learning models have reached state-of-the-art solving this task automatically. However, these models are primarily trained on healthy subjects and struggle with strong anatomical aberrations, e.g. caused by brain tumors. This…
▽ More
Accurately identifying white matter tracts in medical images is essential for various applications, including surgery planning and tract-specific analysis. Supervised machine learning models have reached state-of-the-art solving this task automatically. However, these models are primarily trained on healthy subjects and struggle with strong anatomical aberrations, e.g. caused by brain tumors. This limitation makes them unsuitable for tasks such as preoperative planning, wherefore time-consuming and challenging manual delineation of the target tract is typically employed. We propose semi-automatic entropy-based active learning for quick and intuitive segmentation of white matter tracts from whole-brain tractography consisting of millions of streamlines. The method is evaluated on 21 openly available healthy subjects from the Human Connectome Project and an internal dataset of ten neurosurgical cases. With only a few annotations, the proposed approach enables segmenting tracts on tumor cases comparable to healthy subjects (dice=0.71), while the performance of automatic methods, like TractSeg dropped substantially (dice=0.34) in comparison to healthy subjects. The method is implemented as a prototype named atTRACTive in the freely available software MITK Diffusion. Manual experiments on tumor data showed higher efficiency due to lower segmentation times compared to traditional ROI-based segmentation.
△ Less
Submitted 3 August, 2023; v1 submitted 30 May, 2023;
originally announced May 2023.
-
Robust and lightweight audio fingerprint for Automatic Content Recognition
Authors:
Anoubhav Agarwaal,
Prabhat Kanaujia,
Sartaki Sinha Roy,
Susmita Ghose
Abstract:
This research paper presents a novel audio fingerprinting system for Automatic Content Recognition (ACR). By using signal processing techniques and statistical transformations, our proposed method generates compact fingerprints of audio segments that are robust to noise degradations present in real-world audio. The system is designed to be highly scalable, with the ability to identify thousands of…
▽ More
This research paper presents a novel audio fingerprinting system for Automatic Content Recognition (ACR). By using signal processing techniques and statistical transformations, our proposed method generates compact fingerprints of audio segments that are robust to noise degradations present in real-world audio. The system is designed to be highly scalable, with the ability to identify thousands of hours of content using fingerprints generated from millions of TVs. The fingerprint's high temporal correlation and utilization of existing GPU-compatible Approximate Nearest Neighbour (ANN) search algorithms make this possible. Furthermore, the fingerprint generation can run on low-power devices with limited compute, making it accessible to a wide range of applications. Experimental results show improvements in our proposed system compared to a min-hash based audio fingerprint on all evaluated metrics, including accuracy on proprietary ACR datasets, retrieval speed, memory usage, and robustness to various noises. For similar retrieval accuracy, our system is 30x faster and uses 6x fewer fingerprints than the min-hash method.
△ Less
Submitted 17 May, 2023; v1 submitted 16 May, 2023;
originally announced May 2023.
-
Adaptive Gravity Compensation Control of a Cable-Driven Upper-Arm Soft Exosuit
Authors:
Joyjit Mukherjee,
Ankit Chatterjee,
Shreeshan Jena,
Nitesh Kumar,
Suriya Prakash Muthukrishnan,
Sitikantha Roy,
Shubhendu Bhasin
Abstract:
This paper proposes an adaptive gravity compensation (AGC) control strategy for a cable-driven upper-limb exosuit intended to assist the wearer with lifting tasks. Unlike most model-based control techniques used for this human-robot interaction task, the proposed control design does not assume knowledge of the anthropometric parameters of the wearer's arm and the payload. Instead, the uncertaintie…
▽ More
This paper proposes an adaptive gravity compensation (AGC) control strategy for a cable-driven upper-limb exosuit intended to assist the wearer with lifting tasks. Unlike most model-based control techniques used for this human-robot interaction task, the proposed control design does not assume knowledge of the anthropometric parameters of the wearer's arm and the payload. Instead, the uncertainties in human arm parameters, such as mass, length, and payload, are estimated online using an indirect adaptive control law that compensates for the gravity moment about the elbow joint. Additionally, the AGC controller is agnostic to the desired joint trajectory followed by the human arm. For the purpose of controller design, the human arm is modeled using a 1-DOF manipulator model. Further, a cable-driven actuator model is proposed that maps the assistive elbow torque to the actuator torque. The performance of the proposed method is verified through a co-simulation, wherein the control input realized in MATLAB is applied to the human bio-mechanical model in OpenSim under varying payload conditions. Significant reductions in human effort in terms of human muscle torque and metabolic cost are observed with the proposed control strategy. Further, simulation results show that the performance of the AGC controller converges to that of the gravity compensation (GC) controller, demonstrating the efficacy of AGC-based online parameter learning.
△ Less
Submitted 28 April, 2023;
originally announced April 2023.
-
Using Demand Response to Improve Power System Small-Signal Stability
Authors:
Mengqi Yao,
Sandip Roy,
Johanna L. Mathieu
Abstract:
With the increase of uncertain and intermittent renewable energy supply on the grid, the power system has become more vulnerable to instability. In this paper, we develop a demand response strategy to improve power system small-signal stability. We pose the problem as an optimization problem wherein the total demand-responsive load is held constant at each time instance but shifted between differe…
▽ More
With the increase of uncertain and intermittent renewable energy supply on the grid, the power system has become more vulnerable to instability. In this paper, we develop a demand response strategy to improve power system small-signal stability. We pose the problem as an optimization problem wherein the total demand-responsive load is held constant at each time instance but shifted between different buses to improve small-signal stability, which is measured by small-signal stability metrics that are functions of subsets of the system's eigenvalues, such as the smallest damping ratio. To solve the problem, we use iterative linear programming and generalized eigenvalue sensitivities. We demonstrate the approach via a case study that uses the IEEE 14-bus system. Our results show that shifting the load between buses, can improve a small-signal stability margin. We explore the use of models of different fidelity and find that it is important to include models of the automatic voltage regulators and power system stabilizers. In addition, we show that load shifting can achieve similar improvements to generation shifting and better improvement than simply tuning power system stabilizers.
△ Less
Submitted 14 April, 2023; v1 submitted 11 April, 2023;
originally announced April 2023.
-
SAM.MD: Zero-shot medical image segmentation capabilities of the Segment Anything Model
Authors:
Saikat Roy,
Tassilo Wald,
Gregor Koehler,
Maximilian R. Rokuss,
Nico Disch,
Julius Holzschuh,
David Zimmerer,
Klaus H. Maier-Hein
Abstract:
Foundation models have taken over natural language processing and image generation domains due to the flexibility of prompting. With the recent introduction of the Segment Anything Model (SAM), this prompt-driven paradigm has entered image segmentation with a hitherto unexplored abundance of capabilities. The purpose of this paper is to conduct an initial evaluation of the out-of-the-box zero-shot…
▽ More
Foundation models have taken over natural language processing and image generation domains due to the flexibility of prompting. With the recent introduction of the Segment Anything Model (SAM), this prompt-driven paradigm has entered image segmentation with a hitherto unexplored abundance of capabilities. The purpose of this paper is to conduct an initial evaluation of the out-of-the-box zero-shot capabilities of SAM for medical image segmentation, by evaluating its performance on an abdominal CT organ segmentation task, via point or bounding box based prompting. We show that SAM generalizes well to CT data, making it a potential catalyst for the advancement of semi-automatic segmentation tools for clinicians. We believe that this foundation model, while not reaching state-of-the-art segmentation performance in our investigations, can serve as a highly potent starting point for further adaptations of such models to the intricacies of the medical domain. Keywords: medical image segmentation, SAM, foundation models, zero-shot learning
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
Mutual Interference Mitigation for MIMO-FMCW Automotive Radar
Authors:
Sian Jin,
Pu Perry Wang,
Petros Boufounos,
Philip V. Orlik,
Ryuhei Takahashi,
Sumit Roy
Abstract:
This paper considers mutual interference mitigation among automotive radars using frequency-modulated continuous wave (FMCW) signal and multiple-input multiple-output (MIMO) virtual arrays. For the first time, we derive a general interference signal model that fully accounts for not only the time-frequency incoherence, e.g., different FMCW configuration parameters and time offsets, but also the sl…
▽ More
This paper considers mutual interference mitigation among automotive radars using frequency-modulated continuous wave (FMCW) signal and multiple-input multiple-output (MIMO) virtual arrays. For the first time, we derive a general interference signal model that fully accounts for not only the time-frequency incoherence, e.g., different FMCW configuration parameters and time offsets, but also the slow-time code MIMO incoherence and array configuration differences between the victim and interfering radars. Along with a standard MIMO-FMCW object signal model, we turn the interference mitigation into a spatial-domain object detection under incoherent MIMO-FMCW interference described by the explicit interference signal model, and propose a constant false alarm rate (CFAR) detector. More specifically, the proposed detector exploits the structural property of the derived interference model at both \emph{transmit} and \emph{receive} steering vector space. We also derive analytical closed-form expressions for probabilities of detection and false alarm. Performance evaluation using both synthetic-level and phased array system-level simulation confirms the effectiveness of our proposed detector over selected baseline methods.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation
Authors:
Saikat Roy,
Gregor Koehler,
Constantin Ulrich,
Michael Baumgartner,
Jens Petersen,
Fabian Isensee,
Paul F. Jaeger,
Klaus Maier-Hein
Abstract:
There has been exploding interest in embracing Transformer-based architectures for medical image segmentation. However, the lack of large-scale annotated medical datasets make achieving performances equivalent to those in natural images challenging. Convolutional networks, in contrast, have higher inductive biases and consequently, are easily trainable to high performance. Recently, the ConvNeXt a…
▽ More
There has been exploding interest in embracing Transformer-based architectures for medical image segmentation. However, the lack of large-scale annotated medical datasets make achieving performances equivalent to those in natural images challenging. Convolutional networks, in contrast, have higher inductive biases and consequently, are easily trainable to high performance. Recently, the ConvNeXt architecture attempted to modernize the standard ConvNet by mirroring Transformer blocks. In this work, we improve upon this to design a modernized and scalable convolutional architecture customized to challenges of data-scarce medical settings. We introduce MedNeXt, a Transformer-inspired large kernel segmentation network which introduces - 1) A fully ConvNeXt 3D Encoder-Decoder Network for medical image segmentation, 2) Residual ConvNeXt up and downsampling blocks to preserve semantic richness across scales, 3) A novel technique to iteratively increase kernel sizes by upsampling small kernel networks, to prevent performance saturation on limited medical data, 4) Compound scaling at multiple levels (depth, width, kernel size) of MedNeXt. This leads to state-of-the-art performance on 4 tasks on CT and MRI modalities and varying dataset sizes, representing a modernized deep architecture for medical image segmentation. Our code is made publicly available at: https://github.com/MIC-DKFZ/MedNeXt.
△ Less
Submitted 2 June, 2024; v1 submitted 17 March, 2023;
originally announced March 2023.
-
Graph-Theoretic Analyses and Model Reduction for an Open Jackson Queueing Network
Authors:
Chenyan Zhu,
Sandip Roy
Abstract:
A graph-theoretic analysis of the steady-state behavior of an open Jackson queueing network is developed. In particular, a number of queueing-network performance metrics are shown to exhibit a spatial dependence on local drivers (e.g. increments to local exogenous arrival rates), wherein the impacts fall off across graph cutsets away from a target queue. This graph-theoretic analysis is also used…
▽ More
A graph-theoretic analysis of the steady-state behavior of an open Jackson queueing network is developed. In particular, a number of queueing-network performance metrics are shown to exhibit a spatial dependence on local drivers (e.g. increments to local exogenous arrival rates), wherein the impacts fall off across graph cutsets away from a target queue. This graph-theoretic analysis is also used to motivate a structure-preserving model reduction algorithm, and an algorithm that exactly matches performance statistics of the original model is proposed. The graph-theoretic results and model-reduction method are evaluated via simulations of an example queueing-network model.
△ Less
Submitted 9 February, 2023;
originally announced February 2023.
-
MRAC with Memory for Switched Linear Systems
Authors:
Pritesh Patel,
Sayan Basu Roy,
Shubhendu Bhasin
Abstract:
This work proposes a switched model reference adaptive control (S-MRAC) architecture for a multi-input multi-output (MIMO) switched linear system with memory for enhanced learning. A salient feature of the proposed method that separates it from most previous results is the use of memory that store the estimator states at switching and facilitate parameter learning during both active and inactive p…
▽ More
This work proposes a switched model reference adaptive control (S-MRAC) architecture for a multi-input multi-output (MIMO) switched linear system with memory for enhanced learning. A salient feature of the proposed method that separates it from most previous results is the use of memory that store the estimator states at switching and facilitate parameter learning during both active and inactive phases of a subsystem, thereby improving the tracking performance of the overall switched system. Specifically, the learning experience from the previous active duration of a subsystem is retained in the memory and reused when the subsystem is inactive and when the subsystem becomes active again. Parameter convergence is shown based on an intermittent initial excitation (IIE), which is significantly relaxed than the classical persistence of excitation (PE) condition. A common Lyapunov function is considered to ensure closed-loop stability with S-MRAC. Further under IIE, the exponential stability of tracking and parameter estimation error dynamics are guaranteed.
△ Less
Submitted 28 January, 2023;
originally announced January 2023.
-
BiCurNet: Pre-Movement EEG based Neural Decoder for Biceps Curl Trajectory Estimation
Authors:
Manali Saini,
Anant Jain,
Lalan Kumar,
Suriya Prakash Muthukrishnan,
Shubhendu Bhasin,
Sitikantha Roy
Abstract:
Kinematic parameter (KP) estimation from early electroencephalogram (EEG) signals is essential for positive augmentation using wearable robot. However, work related to early estimation of KPs from surface EEG is sparse. In this work, a deep learning-based model, BiCurNet, is presented for early estimation of biceps curl using collected EEG signal. The model utilizes light-weight architecture with…
▽ More
Kinematic parameter (KP) estimation from early electroencephalogram (EEG) signals is essential for positive augmentation using wearable robot. However, work related to early estimation of KPs from surface EEG is sparse. In this work, a deep learning-based model, BiCurNet, is presented for early estimation of biceps curl using collected EEG signal. The model utilizes light-weight architecture with depth-wise separable convolution layers and customized attention module. The feasibility of early estimation of KPs is demonstrated using brain source imaging. Computationally efficient EEG features in spherical and head harmonics domain is utilized for the first time for KP prediction. The best Pearson correlation coefficient (PCC) between estimated and actual trajectory of $0.7$ is achieved when combined EEG features (spatial and harmonics domain) in delta band is utilized. Robustness of the proposed network is demonstrated for subject-dependent and subject-independent training, using EEG signals with artifacts.
△ Less
Submitted 26 October, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Optimal Beam Training for mmWave Massive MIMO using 802.11ay
Authors:
Lyutianyang Zhang,
Sumit Roy
Abstract:
Beam training of 802.11 ad is a technology that helps accelerate the analog weighting vector (AWV) selection process under the constraint of the existing code-book for AWV. However, 5G milli-meter wave (mmWave) multiple-input-multiple-output (MIMO) system brings challenges to this new technology due to the higher order of complexity of antennae. Hence, the existing codebook of 11ad is unlikely to…
▽ More
Beam training of 802.11 ad is a technology that helps accelerate the analog weighting vector (AWV) selection process under the constraint of the existing code-book for AWV. However, 5G milli-meter wave (mmWave) multiple-input-multiple-output (MIMO) system brings challenges to this new technology due to the higher order of complexity of antennae. Hence, the existing codebook of 11ad is unlikely to even include the near-optimal AWV and the data rate will degrade severely. To cope with this situation, this paper proposed a new beam training protocol combined with the state-of-the-art compressed sensing channel estimation in order to find the AWV to maximize the optimal data-rate. Simulation is implemented to show the data-rate of AWV achieved by 11 ad is worse than the proposed protocol.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Propagation Stability Concepts for Network Synchronization Processes
Authors:
Sandip Roy,
Subir Sarker,
Mengran Xue
Abstract:
A notion of disturbance propagation stability is defined for dynamical network processes, in terms of decrescence of an input-output energy metric along cutsets away from the disturbance source. A characterization of the disturbance propagation notion is developed for a canonical model for synchronization of linearly-coupled homogeneous subsystems. Specifically, propagation stability is equivalenc…
▽ More
A notion of disturbance propagation stability is defined for dynamical network processes, in terms of decrescence of an input-output energy metric along cutsets away from the disturbance source. A characterization of the disturbance propagation notion is developed for a canonical model for synchronization of linearly-coupled homogeneous subsystems. Specifically, propagation stability is equivalenced with the frequency response of a certain local closed-loop model, which is defined from the subsystem model and local network connections, being sub-unity gain. For the case where the subsystem is single-input single-output (SISO), a further simplification in terms of the subsystem's open loop Nyquist plot is obtained. An extension of the disturbance propagation stability concept toward imperviousness of subnetworks to disturbances is briefly developed, and an example focused on networks with planar subsystems is considered.
△ Less
Submitted 9 October, 2022;
originally announced October 2022.
-
Fault Signature Identification for BLDC motor Drive System -A Statistical Signal Fusion Approach
Authors:
Tribeni Prasad Banerjee,
Susanta Roy,
B. K. Panigrahi
Abstract:
A hybrid approach based on multirate signal processing and sensory data fusion is proposed for the condition monitoring and identification of fault signal signatures used in the Flight ECS (Engine Control System) unit. Though motor current signature analysis (MCSA) is widely used for fault detection now-a-days, the proposed hybrid method qualifies as one of the most powerful online/offline techniq…
▽ More
A hybrid approach based on multirate signal processing and sensory data fusion is proposed for the condition monitoring and identification of fault signal signatures used in the Flight ECS (Engine Control System) unit. Though motor current signature analysis (MCSA) is widely used for fault detection now-a-days, the proposed hybrid method qualifies as one of the most powerful online/offline techniques for diagnosing the process faults. Existing approaches have some drawbacks that can degrade the performance and accuracy of a process-diagnosis system. In particular, it is very difficult to detect random stochastic noise due to the nonlinear behavior of valve controller. Using only Short Time Fourier Transform (STFT), frequency leakage and the small amplitude of the current components related to the fault can be observed, but the fault due to the controller behavior cannot be observed. Therefore, a framework of advanced multirate signal and data-processing aided with sensor fusion algorithms is proposed in this article and satisfactory results are obtained. For implementing the system, a DSP-based BLDC motor controller with three-phase inverter module (TMS 320F2812) is used and the performance of the proposed method is validated on real time data.
△ Less
Submitted 7 September, 2022;
originally announced September 2022.
-
Architecture-Algorithmic Trade-offs in Multi-path Channel Estimation for mmWAVE Systems
Authors:
Lyutianyang Zhang,
Sumit Roy,
Liu Cao
Abstract:
5G mmWave massive MIMO systems are likely to be deployed in dense urban scenarios, where increasing network capacity is the primary objective. A key component in mmWave transceiver design is channel estimation which is challenging due to the very large signal bandwidths (order of GHz) implying significant resolved spatial multipath, coupled with large # of Tx/Rx antennas for large-scale MIMO. This…
▽ More
5G mmWave massive MIMO systems are likely to be deployed in dense urban scenarios, where increasing network capacity is the primary objective. A key component in mmWave transceiver design is channel estimation which is challenging due to the very large signal bandwidths (order of GHz) implying significant resolved spatial multipath, coupled with large # of Tx/Rx antennas for large-scale MIMO. This results in significantly increased training overhead that in turn leads to unacceptably high computational complexity and power cost. Our work thus highlights the interplay of transceiver architecture and receiver signal processing algorithm choices that fundamentally address (mobile) handset power consumption, with minimal degradation in performance. We investigate trade-offs enabled by conjunction of hybrid beamforming mmWave receiver and channel estimation algorithms that exploit available sparsity in such wideband scenarios. A compressive sensing (CS) framework for sparse channel estimation -- Binary Iterative Hard Thresholding (BIHT) \cite{jacques2013robust} followed by linear reconstruction method with varying quantization (ADC) levels -- is explored to compare the trade-offs between bit-depth and sampling rate for a given ADC power budget. Performance analysis of the BIHT+ linear reconstruction method is conducted via simulation studies for 5G specified multi-path channel models and compared to oracle-assisted bounds for validation.
△ Less
Submitted 7 September, 2022;
originally announced September 2022.
-
On the Spatial Pattern of Input-Output Metrics for a Network Synchronization Process
Authors:
Subir Sarker,
Sandip Roy
Abstract:
A graph-theoretic analysis is undertaken for a compendium of input-output (transfer) metrics of a standard discrete-time linear synchronization model, including lp gains, frequency responses, frequency-band energy, and Markov parameters. We show that these transfer metrics exhibit a spatial degradation, such that they are monotonically nonincreasing along vertex cutsets away from an exogenous inpu…
▽ More
A graph-theoretic analysis is undertaken for a compendium of input-output (transfer) metrics of a standard discrete-time linear synchronization model, including lp gains, frequency responses, frequency-band energy, and Markov parameters. We show that these transfer metrics exhibit a spatial degradation, such that they are monotonically nonincreasing along vertex cutsets away from an exogenous input. We use this spatial analysis to characterize signal-to-noise ratios (SNRs) in diffusive networks driven by process noise, and to develop a notion of propagation stability for dynamical networks. Finally, the formal results are illustrated through an example.
△ Less
Submitted 22 July, 2022;
originally announced July 2022.
-
Multi-Access Point Coordination for Next-Gen Wi-Fi Networks Aided by Deep Reinforcement Learning
Authors:
Lyutianyang Zhang,
Hao Yin,
Sumit Roy,
Liu Cao
Abstract:
Wi-Fi in the enterprise - characterized by overlapping Wi-Fi cells - constitutes the design challenge for next-generation networks. Standardization for recently started IEEE 802.11be (Wi-Fi 7) Working Groups has focused on significant medium access control layer changes that emphasize the role of the access point (AP) in radio resource management (RRM) for coordinating channel access due to the hi…
▽ More
Wi-Fi in the enterprise - characterized by overlapping Wi-Fi cells - constitutes the design challenge for next-generation networks. Standardization for recently started IEEE 802.11be (Wi-Fi 7) Working Groups has focused on significant medium access control layer changes that emphasize the role of the access point (AP) in radio resource management (RRM) for coordinating channel access due to the high collision probability with the distributed coordination function (DCF), especially in dense overlapping Wi-Fi networks. This paper proposes a novel multi-AP coordination system architecture aided by a centralized AP controller (APC). Meanwhile, a deep reinforcement learning channel access (DLCA) protocol is developed to replace the binary exponential backoff mechanism in DCF to enhance the network throughput by enabling the coordination of APs. First-Order Model-Agnostic Meta-Learning further enhances the network throughput. Subsequently, we also put forward a new greedy algorithm to maintain proportional fairness (PF) among multiple APs. Via the simulation, the performance of DLCA protocol in dense overlapping Wi-Fi networks is verified to have strong stability and outperform baselines such as Shared Transmission Opportunity (SH-TXOP) and Request-to-Send/Clear-to-Send (RTS/CTS) in terms of the network throughput by 10% and 3% as well as the network utility considering proportional fairness by 28.3% and 13.8%, respectively.
△ Less
Submitted 22 June, 2022;
originally announced June 2022.
-
Composite Adaptive Control for Time-varying Systems with Dual Adaptation
Authors:
Raghavv Goel,
Sayan Basu Roy
Abstract:
This paper proposes a composite adaptive control architecture using dual adaptation scheme for dynamical systems comprising time-varying uncertain parameters. While majority of the adaptive control schemes in literature address the case of constant parameters, recent research has conceptualized improved adaptive control techniques for time-varying systems with rigorous stability proofs. The propos…
▽ More
This paper proposes a composite adaptive control architecture using dual adaptation scheme for dynamical systems comprising time-varying uncertain parameters. While majority of the adaptive control schemes in literature address the case of constant parameters, recent research has conceptualized improved adaptive control techniques for time-varying systems with rigorous stability proofs. The proposed work is an effort towards a similar direction, where a novel dual adaptation mechanism is introduced to efficiently tackle the time-varying nature of the parameters. Projection and $σ$-modification algorithms are strategically combined using congelation of variables to claim a global result for the tracking error space. While the classical adaptive systems demand a restrictive condition of persistence of excitation (PE) for accurate parameter estimation, the proposed work relies on a milder condition, called initial excitation (IE) for the same. A rigorous Lyapunov stability analysis is carried out to establish uniformly ultimately bounded (UUB) stability of the closed-loop system. Further it is analytically shown that the proposed work can recover the performance of previously designed IE-based adaptive controller in case of time invariant systems.
△ Less
Submitted 3 June, 2022;
originally announced June 2022.
-
Efficient PHY Layer Abstraction under Imperfect Channel Estimation
Authors:
Liu Cao,
Lyutianyang Zhang,
Sian Jin,
Sumit Roy
Abstract:
As most existing work investigate the PHY layer abstraction under an assumption of perfect channel estimation, it may become unreliable if there exists channel estimation error in a real communication system. This letter improves an efficient PHY layer method, EESM-log-SGN PHY layer abstraction, by considering the presence of channel estimation error. We develop two methods for implementing the EE…
▽ More
As most existing work investigate the PHY layer abstraction under an assumption of perfect channel estimation, it may become unreliable if there exists channel estimation error in a real communication system. This letter improves an efficient PHY layer method, EESM-log-SGN PHY layer abstraction, by considering the presence of channel estimation error. We develop two methods for implementing the EESM-log-SGN PHY abstraction under imperfect channel estimation. We show that the effective SINR is not impacted by the channel estimation error under multiple-input and single-output (MISO)/single-input and single-output (SISO) configuration, which is also verified by the full PHY simulation. The developed methods are then validated under different orthogonal frequency division multiplexing (OFDM) scenarios.
△ Less
Submitted 8 October, 2022; v1 submitted 22 May, 2022;
originally announced May 2022.
-
Online Adaptive Identification of Switched Affine Systems Using a Two-Tier Filter Architecture with Memory
Authors:
Pritesh Patel,
Sayan Basu Roy,
Shubhendu Bhasin
Abstract:
This work proposes an online adaptive identification method for multi-input multi-output (MIMO) switched affine systems with guaranteed parameter convergence. A family of online parameter estimators is used that is equipped with a dual-layer low pass filter architecture to facilitate parameter learning and identification of each subsystem. The filters capture information about the unknown paramete…
▽ More
This work proposes an online adaptive identification method for multi-input multi-output (MIMO) switched affine systems with guaranteed parameter convergence. A family of online parameter estimators is used that is equipped with a dual-layer low pass filter architecture to facilitate parameter learning and identification of each subsystem. The filters capture information about the unknown parameters in the form of a prediction error which is used in the parameter estimation algorithm. A salient feature of the proposed method that distinguishes it from most previous results is the use of a memory bank that stores filter values and promotes parameter learning during both active and inactive phases of a subsystem. Specifically, the learnt experience from the previous active phase of a subsystem is retained in the memory and leveraged for parameter learning in its subsequent active and inactive phases. Further, a new notion of intermittent initial excitation (IIE) is introduced that extends the previously established initial excitation (IE) condition to the switched system framework. IIE is shown to be sufficient to ensure exponential convergence of the switched system parameters.
△ Less
Submitted 7 April, 2022;
originally announced April 2022.
-
Deep Hyperspectral Unmixing using Transformer Network
Authors:
Preetam Ghosh,
Swalpa Kumar Roy,
Bikram Koirala,
Behnood Rasti,
Paul Scheunders
Abstract:
Currently, this paper is under review in IEEE. Transformers have intrigued the vision research community with their state-of-the-art performance in natural language processing. With their superior performance, transformers have found their way in the field of hyperspectral image classification and achieved promising results. In this article, we harness the power of transformers to conquer the task…
▽ More
Currently, this paper is under review in IEEE. Transformers have intrigued the vision research community with their state-of-the-art performance in natural language processing. With their superior performance, transformers have found their way in the field of hyperspectral image classification and achieved promising results. In this article, we harness the power of transformers to conquer the task of hyperspectral unmixing and propose a novel deep unmixing model with transformers. We aim to utilize the ability of transformers to better capture the global feature dependencies in order to enhance the quality of the endmember spectra and the abundance maps. The proposed model is a combination of a convolutional autoencoder and a transformer. The hyperspectral data is encoded by the convolutional encoder. The transformer captures long-range dependencies between the representations derived from the encoder. The data are reconstructed using a convolutional decoder. We applied the proposed unmixing model to three widely used unmixing datasets, i.e., Samson, Apex, and Washington DC mall and compared it with the state-of-the-art in terms of root mean squared error and spectral angle distance. The source code for the proposed model will be made publicly available at \url{https://github.com/preetam22n/DeepTrans-HSU}.
△ Less
Submitted 31 March, 2022;
originally announced March 2022.
-
Multimodal Fusion Transformer for Remote Sensing Image Classification
Authors:
Swalpa Kumar Roy,
Ankur Deria,
Danfeng Hong,
Behnood Rasti,
Antonio Plaza,
Jocelyn Chanussot
Abstract:
Vision transformers (ViTs) have been trending in image classification tasks due to their promising performance when compared to convolutional neural networks (CNNs). As a result, many researchers have tried to incorporate ViTs in hyperspectral image (HSI) classification tasks. To achieve satisfactory performance, close to that of CNNs, transformers need fewer parameters. ViTs and other similar tra…
▽ More
Vision transformers (ViTs) have been trending in image classification tasks due to their promising performance when compared to convolutional neural networks (CNNs). As a result, many researchers have tried to incorporate ViTs in hyperspectral image (HSI) classification tasks. To achieve satisfactory performance, close to that of CNNs, transformers need fewer parameters. ViTs and other similar transformers use an external classification (CLS) token which is randomly initialized and often fails to generalize well, whereas other sources of multimodal datasets, such as light detection and ranging (LiDAR) offer the potential to improve these models by means of a CLS. In this paper, we introduce a new multimodal fusion transformer (MFT) network which comprises a multihead cross patch attention (mCrossPA) for HSI land-cover classification. Our mCrossPA utilizes other sources of complementary information in addition to the HSI in the transformer encoder to achieve better generalization. The concept of tokenization is used to generate CLS and HSI patch tokens, helping to learn a {distinctive representation} in a reduced and hierarchical feature space. Extensive experiments are carried out on {widely used benchmark} datasets {i.e.,} the University of Houston, Trento, University of Southern Mississippi Gulfpark (MUUFL), and Augsburg. We compare the results of the proposed MFT model with other state-of-the-art transformers, classical CNNs, and conventional classifiers models. The superior performance achieved by the proposed model is due to the use of multihead cross patch attention. The source code will be made available publicly at \url{https://github.com/AnkurDeria/MFT}.}
△ Less
Submitted 20 June, 2023; v1 submitted 31 March, 2022;
originally announced March 2022.
-
A Barrier Certificate-based Simplex Architecture with Application to Microgrids
Authors:
Amol Damare,
Shouvik Roy,
Scott A. Smolka,
Scott D. Stoller
Abstract:
We present Barrier Certificate-based Simplex (BC-Simplex), a new, provably correct design for runtime assurance of continuous dynamical systems. BC-Simplex is centered around the Simplex Control Architecture, which consists of a high-performance advanced controller which is not guaranteed to maintain safety of the plant, a verified-safe baseline controller, and a decision module that switches cont…
▽ More
We present Barrier Certificate-based Simplex (BC-Simplex), a new, provably correct design for runtime assurance of continuous dynamical systems. BC-Simplex is centered around the Simplex Control Architecture, which consists of a high-performance advanced controller which is not guaranteed to maintain safety of the plant, a verified-safe baseline controller, and a decision module that switches control of the plant between the two controllers to ensure safety without sacrificing performance. In BC-Simplex, Barrier certificates are used to prove that the baseline controller ensures safety. Furthermore, BC-Simplex features a new automated method for deriving, from the barrier certificate, the conditions for switching between the controllers. Our method is based on the Taylor expansion of the barrier certificate and yields computationally inexpensive switching conditions. We consider a significant application of BC-Simplex to a microgrid featuring an advanced controller in the form of a neural network trained using reinforcement learning. The microgrid is modeled in RTDS, an industry-standard high-fidelity, real-time power systems simulator. Our results demonstrate that BC-Simplex can automatically derive switching conditions for complex systems, the switching conditions are not overly conservative, and BC-Simplex ensures safety even in the presence of adversarial attacks on the neural controller.
△ Less
Submitted 2 June, 2022; v1 submitted 19 February, 2022;
originally announced February 2022.
-
Attention Mechanism Meets with Hybrid Dense Network for Hyperspectral Image Classification
Authors:
Muhammad Ahmad,
Adil Mehmood Khan,
Manuel Mazzara,
Salvatore Distefano,
Swalpa Kumar Roy,
Xin Wu
Abstract:
Convolutional Neural Networks (CNN) are more suitable, indeed. However, fixed kernel sizes make traditional CNN too specific, neither flexible nor conducive to feature learning, thus impacting on the classification accuracy. The convolution of different kernel size networks may overcome this problem by capturing more discriminating and relevant information. In light of this, the proposed solution…
▽ More
Convolutional Neural Networks (CNN) are more suitable, indeed. However, fixed kernel sizes make traditional CNN too specific, neither flexible nor conducive to feature learning, thus impacting on the classification accuracy. The convolution of different kernel size networks may overcome this problem by capturing more discriminating and relevant information. In light of this, the proposed solution aims at combining the core idea of 3D and 2D Inception net with the Attention mechanism to boost the HSIC CNN performance in a hybrid scenario. The resulting \textit{attention-fused hybrid network} (AfNet) is based on three attention-fused parallel hybrid sub-nets with different kernels in each block repeatedly using high-level features to enhance the final ground-truth maps. In short, AfNet is able to selectively filter out the discriminative features critical for classification. Several tests on HSI datasets provided competitive results for AfNet compared to state-of-the-art models. The proposed pipeline achieved, indeed, an overall accuracy of 97\% for the Indian Pines, 100\% for Botswana, 99\% for Pavia University, Pavia Center, and Salinas datasets.
△ Less
Submitted 4 January, 2022;
originally announced January 2022.
-
Learning to Detect Open Carry and Concealed Object with 77GHz Radar
Authors:
Xiangyu Gao,
Hui Liu,
Sumit Roy,
Guanbin Xing,
Ali Alansari,
Youchen Luo
Abstract:
Detecting harmful carried objects plays a key role in intelligent surveillance systems and has widespread applications, for example, in airport security. In this paper, we focus on the relatively unexplored area of using low-cost 77GHz mmWave radar for the carried objects detection problem. The proposed system is capable of real-time detecting three classes of objects - laptop, phone, and knife -…
▽ More
Detecting harmful carried objects plays a key role in intelligent surveillance systems and has widespread applications, for example, in airport security. In this paper, we focus on the relatively unexplored area of using low-cost 77GHz mmWave radar for the carried objects detection problem. The proposed system is capable of real-time detecting three classes of objects - laptop, phone, and knife - under open carry and concealed cases where objects are hidden with clothes or bags. This capability is achieved by the initial signal processing for localization and generating range-azimuth-elevation image cubes, followed by a deep learning-based prediction network and a multi-shot post-processing module for detecting objects. Extensive experiments for validating the system performance on detecting open carry and concealed objects have been presented with a self-built radar-camera testbed and collected dataset. Additionally, the influence of different input formats, factors, and parameters on system performance is analyzed, providing an intuitive understanding of the system. This system would be the very first baseline for other future works aiming to detect carried objects using 77GHz radar.
△ Less
Submitted 26 April, 2022; v1 submitted 31 October, 2021;
originally announced November 2021.
-
EEG based stress analysis using rhythm specific spectral feature for video gameplay
Authors:
Shidhartho Roy,
Monira Islam,
Md. Salah Uddin Yusuf,
Nushrat Jahan
Abstract:
For the emerging significance of mental stress, various research directives have been established over time to better understand the causes of stress and how to deal with it. In recent years, the rise of video gameplay is unprecedented, further triggered by the lockdown imposed due to the COVID-19 pandemic. This paper presents an end-to-end stress analysis for video gaming stimuli using EEG. The P…
▽ More
For the emerging significance of mental stress, various research directives have been established over time to better understand the causes of stress and how to deal with it. In recent years, the rise of video gameplay is unprecedented, further triggered by the lockdown imposed due to the COVID-19 pandemic. This paper presents an end-to-end stress analysis for video gaming stimuli using EEG. The PSD value of the Alpha and Beta bands is computed to calculate the Beta-to-Alpha ratio (BAR). In this article, BAR is used to denote mental stress. Subjects are chosen based on various factors such as gender, gameplay experience, age, and BMI. EEG is recorded using Scan SynAmps2 Express equipment. There are three types of video gameplay: strategic, puzzle, and combinational. Relaxation is accomplished in this study by the use of music of various pitches. Two types of regression analysis are done to mathematically model stress and relaxation curve. Brain topography is rendered to indicate the stressed and relaxed region of the brain. In the relaxed state, the subjects have BAR 0.701, which is considered the baseline value. Non-gamer subjects have an average BAR of 2.403 for 1 hour of strategic video gameplay, whereas gamers have 2.218 BAR concurrently. After 12 minutes of listening to low-pitch music, gamers achieved 0.709 BAR, which is nearly the baseline value. In comparison to Quartic regression, the 4PL symmetrical sigmoid function performs regression analysis with fewer parameters and computational power.
△ Less
Submitted 27 September, 2021;
originally announced September 2021.
-
Probabilistic Verification for Reliability of a Two-by-Two Network-on-Chip System
Authors:
Riley Roberts,
Benjamin Lewis,
Arnd Hartmanns,
Prabal Basu,
Sanghamitra Roy,
Koushik Chakraborty,
Zhen Zhang
Abstract:
Modern network-on-chip (NoC) systems face reliability issues due to process and environmental variations. The power supply noise (PSN) in the power delivery network of a NoC plays a key role in determining reliability. PSN leads to voltage droop, which can cause timing errors in the NoC. This paper makes a novel contribution towards formally analyzing PSN in NoC systems. We present a probabilistic…
▽ More
Modern network-on-chip (NoC) systems face reliability issues due to process and environmental variations. The power supply noise (PSN) in the power delivery network of a NoC plays a key role in determining reliability. PSN leads to voltage droop, which can cause timing errors in the NoC. This paper makes a novel contribution towards formally analyzing PSN in NoC systems. We present a probabilistic model checking approach to observe the PSN in a generic 2x2 mesh NoC with a uniform random traffic load. Key features of PSN are measured at the behavioral level. To tackle state explosion, we apply incremental abstraction techniques, including a novel probabilistic choice abstraction, based on observations of NoC behavior. The Modest Toolset is used for probabilistic modeling and verification. Results are obtained for several flit injection patterns to reveal their impacts on PSN. Our analysis finds an optimal flit pattern generation with zero probability of PSN events and suggests spreading flits rather than releasing them in consecutive cycles in order to minimize PSN.
△ Less
Submitted 28 May, 2021;
originally announced August 2021.
-
Compressive Representations of Weather Scenes for Strategic Air Traffic Flow Management
Authors:
Sandip Roy
Abstract:
Terse representation of high-dimensional weather scene data is explored, in support of strategic air traffic flow management objectives. Specifically, we consider whether aviation-relevant weather scenes are compressible, in the sense that each scene admits a possibly-different sparse representation in a basis of interest. Here, compression of weather scenes extracted from METAR data (including te…
▽ More
Terse representation of high-dimensional weather scene data is explored, in support of strategic air traffic flow management objectives. Specifically, we consider whether aviation-relevant weather scenes are compressible, in the sense that each scene admits a possibly-different sparse representation in a basis of interest. Here, compression of weather scenes extracted from METAR data (including temperature, flight categories, and visibility profiles for the contiguous United States) is examined, for the graph-spectral basis. The scenes are found to be compressible, with 75-95% of the scene content captured using 0.5-4% of the basis vectors. Further, the dominant basis vectors for each scene are seen to identify time-varying spatial characteristics of the weather, and reconstruction from the compressed representation is demonstrated. Finally, potential uses of the compressive representations in strategic TFM design are briefly scoped.
△ Less
Submitted 2 July, 2021;
originally announced July 2021.
-
Semantic-WER: A Unified Metric for the Evaluation of ASR Transcript for End Usability
Authors:
Somnath Roy
Abstract:
Recent advances in supervised, semi-supervised and self-supervised deep learning algorithms have shown significant improvement in the performance of automatic speech recognition(ASR) systems. The state-of-the-art systems have achieved a word error rate (WER) less than 5%. However, in the past, researchers have argued the non-suitability of the WER metric for the evaluation of ASR systems for downs…
▽ More
Recent advances in supervised, semi-supervised and self-supervised deep learning algorithms have shown significant improvement in the performance of automatic speech recognition(ASR) systems. The state-of-the-art systems have achieved a word error rate (WER) less than 5%. However, in the past, researchers have argued the non-suitability of the WER metric for the evaluation of ASR systems for downstream tasks such as spoken language understanding (SLU) and information retrieval. The reason is that the WER works at the surface level and does not include any syntactic and semantic knowledge.The current work proposes Semantic-WER (SWER), a metric to evaluate the ASR transcripts for downstream applications in general. The SWER can be easily customized for any down-stream task.
△ Less
Submitted 15 October, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Distinguishing Aerial Intruders from Trajectory Data: A Model-Based Hypothesis-Testing Approach
Authors:
David Petrizze,
Kasra Koorehdavoudi,
Mengran Xue,
Sandip Roy
Abstract:
Motivated by security needs in unmanned aerial system (UAS) operations, an algorithm for identifying airspace intruders (e.g., birds vs. drones) is developed. The algorithm is structured to use sensed intruder velocity data from Internet-of-Things platforms together with limited knowledge of physical models. The identification problem is posed as a statistical hypothesis testing or detection probl…
▽ More
Motivated by security needs in unmanned aerial system (UAS) operations, an algorithm for identifying airspace intruders (e.g., birds vs. drones) is developed. The algorithm is structured to use sensed intruder velocity data from Internet-of-Things platforms together with limited knowledge of physical models. The identification problem is posed as a statistical hypothesis testing or detection problem, wherein inertial feedback-controlled objects subject to stochastic actuation must be distinguished by speed data. The maximum a posteriori probability detector is obtained, and then is simplified to an explicit computation based on two points in the sample autocorrelation of the data. The simplified form allows computationally-friendly implementation of the algorithm, and simplified learning from archived data. Also, the total probability of error of the detector is computed and characterized. Simulations based on synthesized data are presented to illustrate and supplement the formal analyses.
△ Less
Submitted 16 May, 2021;
originally announced May 2021.
-
Perception Through 2D-MIMO FMCW Automotive Radar Under Adverse Weather
Authors:
Xiangyu Gao,
Sumit Roy,
Guanbin Xing,
Sian Jin
Abstract:
Millimeter-wave (mmWave) radars are being increasingly integrated in commercial vehicles to support new Adaptive Driver Assisted Systems (ADAS) features that require accurate location and Doppler velocity estimates of objects, independent of environmental conditions. To explore radar-based ADAS applications, we have updated our test-bed with Texas Instrument's 4-chip cascaded FMCW radar (TIDEP-010…
▽ More
Millimeter-wave (mmWave) radars are being increasingly integrated in commercial vehicles to support new Adaptive Driver Assisted Systems (ADAS) features that require accurate location and Doppler velocity estimates of objects, independent of environmental conditions. To explore radar-based ADAS applications, we have updated our test-bed with Texas Instrument's 4-chip cascaded FMCW radar (TIDEP-01012) that forms a non-uniform 2D MIMO virtual array. In this paper, we develop the necessary received signal models for applying different direction of arrival (DoA) estimation algorithms and experimentally validating their performance on formed virtual array under controlled scenarios. To test the robustness of mmWave radars under adverse weather conditions, we collected raw radar dataset (I-Q samples post demodulated) for various objects by a driven vehicle-mounted platform, specifically for snowy and foggy situations where cameras are largely ineffective. Initial results from radar imaging algorithms to this dataset are presented.
△ Less
Submitted 21 January, 2023; v1 submitted 4 April, 2021;
originally announced April 2021.
-
Compressibility of Network Opinion and Spread States in the Laplacian-Eigenvector Basis
Authors:
Sandip Roy,
Mengran Xue
Abstract:
Opinion-evolution and spread processes on networks (e.g., infectious disease spread, opinion formation in social networks) are not only high dimensional but also volatile and multiscale in nature. In this study, we explore whether snapshot data from these processes can admit terse representations. Specifically, using three case studies, we explore whether the data are compressible in the Laplacian…
▽ More
Opinion-evolution and spread processes on networks (e.g., infectious disease spread, opinion formation in social networks) are not only high dimensional but also volatile and multiscale in nature. In this study, we explore whether snapshot data from these processes can admit terse representations. Specifically, using three case studies, we explore whether the data are compressible in the Laplacian-eigenvector basis, in the sense that each snapshot can be approximated well using a (possibly different) small set of basis vectors. The first case study is concerned with a linear consensus model that is subject to a stochastic input at an unknown location; both empirical and formal analyses are used to characterize compressibility. Second, compressibility of state snapshots for a stochastic voter model is assessed via an empirical study. Finally, compressibility is studied for state-level daily COVID-19 positivity-rate data. The three case studies indicate that state snapshots from opinion-evolution and spread processes allow terse representations, which nevertheless capture their rich propagative dynamics.
△ Less
Submitted 28 March, 2021;
originally announced March 2021.
-
Source Aware Deep Learning Framework for Hand Kinematic Reconstruction using EEG Signal
Authors:
Sidharth Pancholi,
Amita Giri,
Anant Jain,
Lalan Kumar,
Sitikantha Roy
Abstract:
The ability to reconstruct the kinematic parameters of hand movement using non-invasive electroencephalography (EEG) is essential for strength and endurance augmentation using exosuit/exoskeleton. For system development, the conventional classification based brain computer interface (BCI) controls external devices by providing discrete control signals to the actuator. A continuous kinematic recons…
▽ More
The ability to reconstruct the kinematic parameters of hand movement using non-invasive electroencephalography (EEG) is essential for strength and endurance augmentation using exosuit/exoskeleton. For system development, the conventional classification based brain computer interface (BCI) controls external devices by providing discrete control signals to the actuator. A continuous kinematic reconstruction from EEG signal is better suited for practical BCI applications. The state-of-the-art multi-variable linear regression (mLR) method provides a continuous estimate of hand kinematics, achieving maximum correlation of upto 0.67 between the measured and the estimated hand trajectory. In this work, three novel source aware deep learning models are proposed for motion trajectory prediction (MTP). In particular, multi layer perceptron (MLP), convolutional neural network - long short term memory (CNN-LSTM), and wavelet packet decomposition (WPD) CNN-LSTM are presented. Additional novelty of the work includes utilization of brain source localization (using sLORETA) for the reliable decoding of motor intention mapping (channel selection) and accurate EEG time segment selection. Performance of the proposed models are compared with the traditionally utilised mLR technique on the real grasp and lift (GAL) dataset. Effectiveness of the proposed framework is established using the Pearson correlation coefficient and trajectory analysis. A significant improvement in the correlation coefficient is observed when compared with state-of-the-art mLR model. Our work bridges the gap between the control and the actuator block, enabling real time BCI implementation.
△ Less
Submitted 4 January, 2022; v1 submitted 25 March, 2021;
originally announced March 2021.
-
Observability-Blocking Control using Sparser and Regional Feedback for Network Synchronization Processes
Authors:
Abdullah Al Maruf,
Sandip Roy
Abstract:
The design of feedback control systems to block observability in a network synchronization model, i.e. to make the dynamics unobservable from measurements at a subset of the network's nodes, is studied. First, a general design algorithm is presented for blocking observability at any specified group of $m$ nodes, by applying state feedback controls at $m+2$ specified actuation nodes. The algorithm…
▽ More
The design of feedback control systems to block observability in a network synchronization model, i.e. to make the dynamics unobservable from measurements at a subset of the network's nodes, is studied. First, a general design algorithm is presented for blocking observability at any specified group of $m$ nodes, by applying state feedback controls at $m+2$ specified actuation nodes. The algorithm is based on a method for eigenstructure assignment, which allows surgical modification of particular eigenvectors to block observability while preserving the remaining open-loop eigenstructure. Next, the topological structure of the network is exploited to reduce the number of controllers required for blocking observability; the result is based on blocking observability on the nodes associated with a vertex-cutset separating the actuation and measurement locations. Also, the design is modified to encompass regional feedback controls, which only use data from a subset of accessible nodes. The regional feedback design does not maintain the open-loop eigenstructure, but can be guaranteed to preserve stability via a time-scale argument. The results are illustrated with numerical examples.
△ Less
Submitted 10 September, 2021; v1 submitted 15 March, 2021;
originally announced March 2021.
-
Contrast Adaptive Tissue Classification by Alternating Segmentation and Synthesis
Authors:
Dzung L. Pham,
Yi-Yu Chou,
Blake E. Dewey,
Daniel S. Reich,
John A. Butman,
Snehashis Roy
Abstract:
Deep learning approaches to the segmentation of magnetic resonance images have shown significant promise in automating the quantitative analysis of brain images. However, a continuing challenge has been its sensitivity to the variability of acquisition protocols. Attempting to segment images that have different contrast properties from those within the training data generally leads to significantl…
▽ More
Deep learning approaches to the segmentation of magnetic resonance images have shown significant promise in automating the quantitative analysis of brain images. However, a continuing challenge has been its sensitivity to the variability of acquisition protocols. Attempting to segment images that have different contrast properties from those within the training data generally leads to significantly reduced performance. Furthermore, heterogeneous data sets cannot be easily evaluated because the quantitative variation due to acquisition differences often dwarfs the variation due to the biological differences that one seeks to measure. In this work, we describe an approach using alternating segmentation and synthesis steps that adapts the contrast properties of the training data to the input image. This allows input images that do not resemble the training data to be more consistently segmented. A notable advantage of this approach is that only a single example of the acquisition protocol is required to adapt to its contrast properties. We demonstrate the efficacy of our approaching using brain images from a set of human subjects scanned with two different T1-weighted volumetric protocols.
△ Less
Submitted 3 March, 2021;
originally announced March 2021.