-
ARA-O-RAN: End-to-End Programmable O-RAN Living Lab for Agriculture and Rural Communities
Authors:
Tianyi Zhang,
Joshua Ofori Boateng,
Taimoor UI Islam,
Arsalan Ahmad,
Hongwei Zhang,
Daji Qiao
Abstract:
As wireless networks evolve towards open architectures like O-RAN, testing, and integration platforms are crucial to address challenges like interoperability. This paper describes ARA-O-RAN, a novel O-RAN testbed established through the NSF Platforms for Advanced Wireless Research (PAWR) ARA platform. ARA provides an at-scale rural wireless living lab focused on technologies for digital agricultur…
▽ More
As wireless networks evolve towards open architectures like O-RAN, testing, and integration platforms are crucial to address challenges like interoperability. This paper describes ARA-O-RAN, a novel O-RAN testbed established through the NSF Platforms for Advanced Wireless Research (PAWR) ARA platform. ARA provides an at-scale rural wireless living lab focused on technologies for digital agriculture and rural communities. As an O-RAN Alliance certified Open Testing and Integration Centre (OTIC), ARA launched ARA-O-RAN -- the first public O-RAN testbed tailored to rural and agriculture use cases, together with the end-to-end, whole-stack programmability. ARA-O-RAN uniquely combines support for outdoor testing across a university campus, surrounding farmlands, and rural communities with a 50-node indoor sandbox. The testbed facilitates vital R\&D to implement open architectures that can meet rural connectivity needs. The paper outlines ARA-O-RAN's hardware system design, software architecture, and enabled research experiments. It also discusses plans aligned with national spectrum policy and rural spectrum innovation. ARA-O-RAN exemplifies the value of purpose-built wireless testbeds in accelerating impactful wireless research.
△ Less
Submitted 14 June, 2024;
originally announced July 2024.
-
Quantifying distribution system resilience from utility data: large event risk and benefits of investments
Authors:
Arslan Ahmad,
Ian Dobson
Abstract:
We focus on large blackouts in electric distribution systems caused by extreme winds. Such events have a large cost and impact on customers. To quantify resilience to these events, we formulate large event risk and show how to calculate it from the historical outage data routinely collected by utilities' outage management systems. Risk is defined using an event cost exceedance curve. The tail of t…
▽ More
We focus on large blackouts in electric distribution systems caused by extreme winds. Such events have a large cost and impact on customers. To quantify resilience to these events, we formulate large event risk and show how to calculate it from the historical outage data routinely collected by utilities' outage management systems. Risk is defined using an event cost exceedance curve. The tail of this curve and the large event risk is described by the probability of a large cost event and the slope magnitude of the tail on a log-log plot. Resilience can be improved by planned investments to upgrade system components or speed up restoration. The benefits that these investments would have had if they had been made in the past can be quantified by "rerunning history" with the effects of the investment included, and then recalculating the large event risk to find the improvement in resilience. An example using utility data shows a 12% and 22% reduction in the probability of a large cost event due to 10% wind hardening and 10% faster restoration respectively. This new data-driven approach to quantify resilience and resilience investments is realistic and much easier to apply than complicated approaches based on modeling all the phases of resilience. Moreover, an appeal to improvements to past lived experience may well be persuasive to customers and regulators in making the case for resilience investments.
△ Less
Submitted 15 July, 2024;
originally announced July 2024.
-
Automated detection of gibbon calls from passive acoustic monitoring data using convolutional neural networks in the "torch for R" ecosystem
Authors:
Dena J. Clink,
Jinsung Kim,
Hope Cross-Jaya,
Abdul Hamid Ahmad,
Moeurk Hong,
Roeun Sala,
Hélène Birot,
Cain Agger,
Thinh Tien Vu,
Hoa Nguyen Thi,
Thanh Nguyen Chi,
Holger Klinck
Abstract:
Automated detection of acoustic signals is crucial for effective monitoring of vocal animals and their habitats across ecologically-relevant spatial and temporal scales. Recent advances in deep learning have made these approaches more accessible. However, there are few deep learning approaches that can be implemented natively in the R programming environment; approaches that run natively in R may…
▽ More
Automated detection of acoustic signals is crucial for effective monitoring of vocal animals and their habitats across ecologically-relevant spatial and temporal scales. Recent advances in deep learning have made these approaches more accessible. However, there are few deep learning approaches that can be implemented natively in the R programming environment; approaches that run natively in R may be more accessible for ecologists. The "torch for R" ecosystem has made the use of transfer learning with convolutional neural networks accessible for R users. Here, we evaluate a workflow that uses transfer learning for the automated detection of acoustic signals from passive acoustic monitoring (PAM) data. Our specific goals include: 1) present a method for automated detection of gibbon calls from PAM data using the "torch for R" ecosystem; 2) compare the results of transfer learning for six pretrained CNN architectures; and 3) investigate how well the different architectures perform on datasets of the female calls from two different gibbon species: the northern grey gibbon (Hylobates funereus) and the southern yellow-cheeked crested gibbon (Nomascus gabriellae). We found that the highest performing architecture depended on the test dataset. We successfully deployed the top performing model for each gibbon species to investigate spatial of variation in gibbon calling behavior across two grids of autonomous recording units in Danum Valley Conservation Area, Malaysia and Keo Seima Wildlife Sanctuary, Cambodia. The fields of deep learning and automated detection are rapidly evolving, and we provide the methods and datasets as benchmarks for future work.
△ Less
Submitted 13 July, 2024;
originally announced July 2024.
-
Looks can be Deceptive: Distinguishing Repetition Disfluency from Reduplication
Authors:
Arif Ahmad,
Mothika Gayathri Khyathi,
Pushpak Bhattacharyya
Abstract:
Reduplication and repetition, though similar in form, serve distinct linguistic purposes. Reduplication is a deliberate morphological process used to express grammatical, semantic, or pragmatic nuances, while repetition is often unintentional and indicative of disfluency. This paper presents the first large-scale study of reduplication and repetition in speech using computational linguistics. We i…
▽ More
Reduplication and repetition, though similar in form, serve distinct linguistic purposes. Reduplication is a deliberate morphological process used to express grammatical, semantic, or pragmatic nuances, while repetition is often unintentional and indicative of disfluency. This paper presents the first large-scale study of reduplication and repetition in speech using computational linguistics. We introduce IndicRedRep, a new publicly available dataset containing Hindi, Telugu, and Marathi text annotated with reduplication and repetition at the word level. We evaluate transformer-based models for multi-class reduplication and repetition token classification, utilizing the Reparandum-Interregnum-Repair structure to distinguish between the two phenomena. Our models achieve macro F1 scores of up to 85.62% in Hindi, 83.95% in Telugu, and 84.82% in Marathi for reduplication-repetition classification.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Beyond Aesthetics: Cultural Competence in Text-to-Image Models
Authors:
Nithish Kannen,
Arif Ahmad,
Marco Andreetto,
Vinodkumar Prabhakaran,
Utsav Prabhu,
Adji Bousso Dieng,
Pushpak Bhattacharyya,
Shachi Dave
Abstract:
Text-to-Image (T2I) models are being increasingly adopted in diverse global communities where they create visual representations of their unique cultures. Current T2I benchmarks primarily focus on faithfulness, aesthetics, and realism of generated images, overlooking the critical dimension of cultural competence. In this work, we introduce a framework to evaluate cultural competence of T2I models…
▽ More
Text-to-Image (T2I) models are being increasingly adopted in diverse global communities where they create visual representations of their unique cultures. Current T2I benchmarks primarily focus on faithfulness, aesthetics, and realism of generated images, overlooking the critical dimension of cultural competence. In this work, we introduce a framework to evaluate cultural competence of T2I models along two crucial dimensions: cultural awareness and cultural diversity, and present a scalable approach using a combination of structured knowledge bases and large language models to build a large dataset of cultural artifacts to enable this evaluation. In particular, we apply this approach to build CUBE (CUltural BEnchmark for Text-to-Image models), a first-of-its-kind benchmark to evaluate cultural competence of T2I models. CUBE covers cultural artifacts associated with 8 countries across different geo-cultural regions and along 3 concepts: cuisine, landmarks, and art. CUBE consists of 1) CUBE-1K, a set of high-quality prompts that enable the evaluation of cultural awareness, and 2) CUBE-CSpace, a larger dataset of cultural artifacts that serves as grounding to evaluate cultural diversity. We also introduce cultural diversity as a novel T2I evaluation component, leveraging quality-weighted Vendi score. Our evaluations reveal significant gaps in the cultural awareness of existing models across countries and provide valuable insights into the cultural diversity of T2I outputs for under-specified prompts. Our methodology is extendable to other cultural regions and concepts, and can facilitate the development of T2I models that better cater to the global population.
△ Less
Submitted 11 July, 2024; v1 submitted 9 July, 2024;
originally announced July 2024.
-
Analysis of genetic diversity among some Iraqi durum wheat cultivars revealed by different molecular markers
Authors:
Mihraban Sharif Maeruf,
Djshwar Dhahir Lateef,
Kamil Mahmood Mustafa,
Hero Fatih Hamakareem,
Shang Hasseb Abdalqadir,
Dastan Ahmad Ahmad,
Shokhan Mahmood Sleman,
Kamaran Salh Rasul
Abstract:
Durum wheat has been cultivated since the beginning of crop domestication, occupying now the tenth ranking among the global most significant cultivated crops. Despite the fact that, the extent of the crop genetic diversity has not yet fully incorporated into modern varieties through breeding programs. In this study, a total of 35 markers (11 RAPD, 12 ISSR, and 12 CDDP) were utilized to assess the…
▽ More
Durum wheat has been cultivated since the beginning of crop domestication, occupying now the tenth ranking among the global most significant cultivated crops. Despite the fact that, the extent of the crop genetic diversity has not yet fully incorporated into modern varieties through breeding programs. In this study, a total of 35 markers (11 RAPD, 12 ISSR, and 12 CDDP) were utilized to assess the genetic variability and population structure of sixteen different cultivars of Iraqi durum wheat. Out of 294 bands obtained, 171 were identified as polymorphic: 47.00 polymorphic alleles from 98 RAPD bands, 53 polymorphic alleles from a total of 89 ISSR bands, and 71 alleles from 107 CDDP bands. The average number of observed alleles (Na), effective number of alleles (Ne), Shannon's information index (I), expected heterozygosity or gene diversity (He), unbiased expected heterozygosity (uHe), and polymorphic information content (PIC) (1.45, 1.38, 0.32, 0.22, 0.24, and 0.28, respectively) were obtained for RAPDs , (1.63, 1.45, 0.40, 0.27, 0.29, and 0.32, respectively) ISSRs and (1.35, 1.35, 0.31, 0.21, 0.23, and 0.30, respectively) for the CDDP markers. A dendrogram of two main clades (unweighted pair group method with arithmetic mean; UPGMA) and three populations of structure analysis, were obtained based on the three markers data. The analysis of molecular variance indicated 97.00%, 97.00%, and 90.00% variability within populations, applying RAPD, ISSR, and CDDP markers, respectively. The highest diversity indices were revealed in population 2 under the RAPD and CDDP markers, whereas population 1 had the highest values of these indices according to the ISSR markers. The results provide greater knowledge on the genetic makeup of Iraqi durum wheat cultivars, that facilitate future breeding programs of this crop.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Wireless Spectrum in Rural Farmlands: Status, Challenges and Opportunities
Authors:
Mukaram Shahid,
Kunal Das,
Taimoor Ul Islam,
Christ Somiah,
Daji Qiao,
Arsalan Ahmad,
Jimming Song,
Zhengyuan Zhu,
Sarath Babu,
Yong Guan,
Tusher Chakraborty,
Suraj Jog,
Ranveer Chandra,
Hongwei Zhang
Abstract:
Due to factors such as low population density and expansive geographical distances, network deployment falls behind in rural regions, leading to a broadband divide. Wireless spectrum serves as the blood and flesh of wireless communications. Shared white spaces such as those in the TVWS and CBRS spectrum bands offer opportunities to expand connectivity, innovate, and provide affordable access to hi…
▽ More
Due to factors such as low population density and expansive geographical distances, network deployment falls behind in rural regions, leading to a broadband divide. Wireless spectrum serves as the blood and flesh of wireless communications. Shared white spaces such as those in the TVWS and CBRS spectrum bands offer opportunities to expand connectivity, innovate, and provide affordable access to high-speed Internet in under-served areas without additional cost to expensive licensed spectrum. However, the current methods to utilize these white spaces are inefficient due to very conservative models and spectrum policies, causing under-utilization of valuable spectrum resources. This hampers the full potential of innovative wireless technologies that could benefit farmers, small Internet Service Providers (ISPs) or Mobile Network Operators (MNOs) operating in rural regions. This study explores the challenges faced by farmers and service providers when using shared spectrum bands to deploy their networks while ensuring maximum system performance and minimizing interference with other users. Additionally, we discuss how spatiotemporal spectrum models, in conjunction with database-driven spectrum-sharing solutions, can enhance the allocation and management of spectrum resources, ultimately improving the efficiency and reliability of wireless networks operating in shared spectrum bands.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Semifinite von Neumann algebras in gauge theory and gravity
Authors:
Shadi Ali Ahmad,
Marc S. Klinger,
Simon Lin
Abstract:
von Neumann algebras have been playing an increasingly important role in the context of gauge theories and gravity. The crossed product presents a natural method for implementing constraints through the commutation theorem, rendering it a useful tool for constructing gauge invariant algebras. The crossed product of a Type III algebra with its modular automorphism group is semifinite, which means t…
▽ More
von Neumann algebras have been playing an increasingly important role in the context of gauge theories and gravity. The crossed product presents a natural method for implementing constraints through the commutation theorem, rendering it a useful tool for constructing gauge invariant algebras. The crossed product of a Type III algebra with its modular automorphism group is semifinite, which means that the crossed product regulates divergences in local quantum field theories. In this letter, we find a sufficient condition for the semifiniteness of the crossed product of a type III algebra with any locally compact group containing the modular automorphism group. Our condition surprisingly implies the centrality of the modular flow in the symmetry group, and we provide evidence for the necessity of this condition. Under these conditions, we construct an associated trace which computes physical expectation values. We comment on the importance of this result and and its implications for subregion physics in gauge theory and gravity.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
A Machine Learning Approach for Identifying Anatomical Biomarkers of Early Mild Cognitive Impairment
Authors:
Alwani Liyana Ahmad,
Jose Sanchez-Bornot,
Roberto C. Sotero,
Damien Coyle,
Zamzuri Idris,
Ibrahima Faye
Abstract:
Alzheimer's Disease (AD) is a progressive neurodegenerative disorder that primarily affects the aging population by impairing cognitive and motor functions. Early detection of AD through accessible methodologies like magnetic resonance imaging (MRI) is vital for developing effective interventions to halt or slow the disease's progression. This study aims to perform a comprehensive analysis of mach…
▽ More
Alzheimer's Disease (AD) is a progressive neurodegenerative disorder that primarily affects the aging population by impairing cognitive and motor functions. Early detection of AD through accessible methodologies like magnetic resonance imaging (MRI) is vital for developing effective interventions to halt or slow the disease's progression. This study aims to perform a comprehensive analysis of machine learning techniques for selecting MRI-based biomarkers and classifying individuals into healthy controls (HC) and unstable controls (uHC) who later show mild cognitive impairment within five years. The research utilizes MRI data from the Alzheimer's Disease Neuroinformatics Initiative (ADNI) and the Open Access Series of Imaging Studies 3 (OASIS-3), focusing on both HC and uHC participants. The study addresses the challenges of imbalanced data by testing classification methods on balanced and unbalanced datasets, and harmonizes data using polynomial regression to mitigate nuisance variables like age, gender, and intracranial volume. Results indicate that Gaussian Naive Bayes and RusBoost classifiers shows an optimal performance, achieving accuracies of up to 76.46% and 72.48% respectively on the ADNI dataset. For the OASIS-3 dataset, Kernel Naive Bayes and RusBoost yield accuracies ranging from 64.66% to 75.71%, improving further in age-matched datasets. Brain regions like the entorhinal cortex, hippocampus, lateral ventricle, and lateral orbitofrontal cortex are identified as significantly impacted during early cognitive decline. Despite limitations such as small sample sizes, the study's harmonization approach enhances the robustness of biomarker selection, suggesting the potential of this semi-automatic machine learning pipeline for early AD detection using MRI.
△ Less
Submitted 29 May, 2024;
originally announced July 2024.
-
Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter
Authors:
M. Aamir,
B. Acar,
G. Adamov,
T. Adams,
C. Adloff,
S. Afanasiev,
C. Agrawal,
C. Agrawal,
A. Ahmad,
H. A. Ahmed,
S. Akbar,
N. Akchurin,
B. Akgul,
B. Akgun,
R. O. Akpinar,
E. Aktas,
A. AlKadhim,
V. Alexakhin,
J. Alimena,
J. Alison,
A. Alpana,
W. Alshehri,
P. Alvarez Dominguez,
M. Alyari,
C. Amendola
, et al. (550 additional authors not shown)
Abstract:
A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr…
▽ More
A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadronic section. The shower reconstruction method is based on graph neural networks and it makes use of a dynamic reduction network architecture. It is shown that the algorithm is able to capture and mitigate the main effects that normally hinder the reconstruction of hadronic showers using classical reconstruction methods, by compensating for fluctuations in the multiplicity, energy, and spatial distributions of the shower's constituents. The performance of the algorithm is evaluated using test beam data collected in 2018 prototype of the CMS HGCAL accompanied by a section of the CALICE AHCAL prototype. The capability of the method to mitigate the impact of energy leakage from the calorimeter is also demonstrated.
△ Less
Submitted 30 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Robust Communication and Computation using Deep Learning via Joint Uncertainty Injection
Authors:
Robert-Jeron Reifert,
Hayssam Dahrouj,
Alaa Alameer Ahmad,
Haris Gacanin,
Aydin Sezgin
Abstract:
The convergence of communication and computation, along with the integration of machine learning and artificial intelligence, stand as key empowering pillars for the sixth-generation of communication systems (6G). This paper considers a network of one base station serving a number of devices simultaneously using spatial multiplexing. The paper then presents an innovative deep learning-based approa…
▽ More
The convergence of communication and computation, along with the integration of machine learning and artificial intelligence, stand as key empowering pillars for the sixth-generation of communication systems (6G). This paper considers a network of one base station serving a number of devices simultaneously using spatial multiplexing. The paper then presents an innovative deep learning-based approach to simultaneously manage the transmit and computing powers, alongside computation allocation, amidst uncertainties in both channel and computing states information. More specifically, the paper aims at proposing a robust solution that minimizes the worst-case delay across the served devices subject to computation and power constraints. The paper uses a deep neural network (DNN)-based solution that maps estimated channels and computation requirements to optimized resource allocations. During training, uncertainty samples are injected after the DNN output to jointly account for both communication and computation estimation errors. The DNN is then trained via backpropagation using the robust utility, thus implicitly learning the uncertainty distributions. Our results validate the enhanced robust delay performance of the joint uncertainty injection versus the classical DNN approach, especially in high channel and computational uncertainty regimes.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Fast and Practical Strassen's Matrix Multiplication using FPGAs
Authors:
Afzal Ahmad,
Linfeng Du,
Wei Zhang
Abstract:
Matrix multiplication is a cornerstone operation in a wide array of scientific fields, including machine learning and computer graphics. The standard algorithm for matrix multiplication has a complexity of $\mathcal{O}(n^3)$ for $n\times n$ matrices. Strassen's algorithm improves this to $\mathcal{O}(n^{2.807})$, but its practicality is limited for small to medium matrix sizes due to the large num…
▽ More
Matrix multiplication is a cornerstone operation in a wide array of scientific fields, including machine learning and computer graphics. The standard algorithm for matrix multiplication has a complexity of $\mathcal{O}(n^3)$ for $n\times n$ matrices. Strassen's algorithm improves this to $\mathcal{O}(n^{2.807})$, but its practicality is limited for small to medium matrix sizes due to the large number of additions it introduces. This paper presents a novel FPGA-based implementation of Strassen's algorithm that achieves superior speed over an optimized General Matrix Multiply (GeMM) implementation for matrices as small as $n=256$. Our design, tested extensively on two high-performance FPGA accelerators (Alveo U50 and U280) across various data types, matches or surpasses the performance of a highly optimized baseline across a range of matrix sizes.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Geometry, anomaly, topology, and transport in Weyl fermions
Authors:
Azaz Ahmad,
Gautham Varma K.,
Gargee Sharma
Abstract:
Weyl fermions are one of the simplest objects that link ideas in geometry and topology to highenergy physics and condensed matter physics. Although the existence of Weyl fermions as elementary particles remains dubious, there is mounting evidence of their existence as quasiparticles in certain condensed matter systems. Such systems are termed Weyl semimetals (WSMs). Needless to say, WSMs have emer…
▽ More
Weyl fermions are one of the simplest objects that link ideas in geometry and topology to highenergy physics and condensed matter physics. Although the existence of Weyl fermions as elementary particles remains dubious, there is mounting evidence of their existence as quasiparticles in certain condensed matter systems. Such systems are termed Weyl semimetals (WSMs). Needless to say, WSMs have emerged as a fascinating class of materials with unique electronic properties, offering a rich playground for both fundamental research and potential technological applications. This review examines recent advancements in understanding electron transport in Weyl semimetals (WSMs). We begin with a pedagogical introduction to the geometric and topological concepts critical to understanding quantum transport in Weyl fermions. We then explore chiral anomaly (CA), a defining feature of WSMs, and its impact on transport phenomena such as longitudinal magnetoconductance (LMC) and the planar Hall effect (PHE). The Maxwell-Boltzmann transport theory extended beyond the standard relaxation-time approximation is then discussed in the context of Weyl fermions, which is used to evaluate various transport properties. Attention is also given to the effects of strain-induced gauge fields and external magnetic fields in both time-reversal broken and inversion asymmetric inhomogeneous WSMs. The review synthesizes theoretical insights, experimental observations, and numerical simulations to provide a comprehensive understanding of the complex transport behaviors in WSMs, aiming to bridge the gap between theoretical predictions and experimental verification.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Enhancing Plant Disease Detection: A Novel CNN-Based Approach with Tensor Subspace Learning and HOWSVD-MD
Authors:
Abdelmalik Ouamane,
Ammar Chouchane,
Yassine Himeur,
Abderrazak Debilou,
Abbes Amira,
Shadi Atalla,
Wathiq Mansoor,
Hussain Al Ahmad
Abstract:
Machine learning has revolutionized the field of agricultural science, particularly in the early detection and management of plant diseases, which are crucial for maintaining crop health and productivity. Leveraging advanced algorithms and imaging technologies, researchers are now able to identify and classify plant diseases with unprecedented accuracy and speed. Effective management of tomato dis…
▽ More
Machine learning has revolutionized the field of agricultural science, particularly in the early detection and management of plant diseases, which are crucial for maintaining crop health and productivity. Leveraging advanced algorithms and imaging technologies, researchers are now able to identify and classify plant diseases with unprecedented accuracy and speed. Effective management of tomato diseases is crucial for enhancing agricultural productivity. The development and application of tomato disease classification methods are central to this objective. This paper introduces a cutting-edge technique for the detection and classification of tomato leaf diseases, utilizing insights from the latest pre-trained Convolutional Neural Network (CNN) models. We propose a sophisticated approach within the domain of tensor subspace learning, known as Higher-Order Whitened Singular Value Decomposition (HOWSVD), designed to boost the discriminatory power of the system. Our approach to Tensor Subspace Learning is methodically executed in two phases, beginning with HOWSVD and culminating in Multilinear Discriminant Analysis (MDA). The efficacy of this innovative method was rigorously tested through comprehensive experiments on two distinct datasets, namely PlantVillage and the Taiwan dataset. The findings reveal that HOWSVD-MDA outperforms existing methods, underscoring its capability to markedly enhance the precision and dependability of diagnosing tomato leaf diseases. For instance, up to 98.36\% and 89.39\% accuracy scores have been achieved under PlantVillage and the Taiwan datasets, respectively.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Free-Space Optical Channel Turbulence Prediction: A Machine Learning Approach
Authors:
Md Zobaer Islam,
Ethan Abele,
Fahim Ferdous Hossain,
Arsalan Ahmad,
Sabit Ekin,
John F. O'Hara
Abstract:
Channel turbulence presents a formidable obstacle for free-space optical (FSO) communication. Anticipation of turbulence levels is highly important for mitigating disruptions. We study the application of machine learning (ML) to FSO data streams to rapidly predict channel turbulence levels with no additional sensing hardware. An optical bit stream was transmitted through a controlled channel in th…
▽ More
Channel turbulence presents a formidable obstacle for free-space optical (FSO) communication. Anticipation of turbulence levels is highly important for mitigating disruptions. We study the application of machine learning (ML) to FSO data streams to rapidly predict channel turbulence levels with no additional sensing hardware. An optical bit stream was transmitted through a controlled channel in the lab under six distinct turbulence levels, and the efficacy of using ML to classify turbulence levels was examined. ML-based turbulence level classification was found to be >98% accurate with multiple ML training parameters, but highly dependent upon the timescale of changes between turbulence levels.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Magnetic single wall CrI3 nanotubes encapsulated within multiwall Carbon Nanotubes
Authors:
Ihsan Caha,
Loukya Boddapatti,
Aqrab ul Ahmad,
Manuel Banobre,
Antonio T. Costa,
Andrey N. Enyashin,
Weibin Li,
Pierluigi Gargiani,
Manuel Valvidares,
Joaquin Fernandez-Rossier,
Francis Leonard Deepak
Abstract:
CrI3 is a layered ferromagnetic insulator that has recently attracted enormous interest as it was the first example of a stand-alone monolayer ferromagnet, paving the way towards the study of two-dimensional magnetic materials and their use as building blocks of hybrid van der Waals layered heterostructures. Here we go one step down in the dimensionality ladder and report the synthesis and charact…
▽ More
CrI3 is a layered ferromagnetic insulator that has recently attracted enormous interest as it was the first example of a stand-alone monolayer ferromagnet, paving the way towards the study of two-dimensional magnetic materials and their use as building blocks of hybrid van der Waals layered heterostructures. Here we go one step down in the dimensionality ladder and report the synthesis and characterization of a tubular one-dimensional van der Waals heterostructure where CrI3 nanotubes are encapsulated within multiwall carbon nanotubes, integrating a magnetic insulator and a conductor. By means of the capillary filling of multi-wall carbon nanotubes (MWCNT), we obtained single-wall CrI3 nanotubes with diameters ranging between 2 nm and 10 nm, with an average of 5.3 nm. Using aberration corrected electron microscopy in combination with spectroscopic techniques we confirm the structure and chemical composition of the nanotubes. SQUID measurements, combined with element-specific X-ray magnetic circular dichroism (XMCD) indicate unequivocally that the Cr atoms in encapsulated CrI3 nanotubes are magnetic with a collective state compatible with a radial magnetization state predicted both by first-principles calculations and a model Hamiltonian. Our results represent a step forward in establishing 1D van der Waals heterostructures as a playground for the exploration of non-collinear magnetic states arising from the interplay between magnetic anisotropy and curvature in tubular geometries.
△ Less
Submitted 1 June, 2024; v1 submitted 23 May, 2024;
originally announced May 2024.
-
Quantum Reference Frames from Top-Down Crossed Products
Authors:
Shadi Ali Ahmad,
Wissam Chemissany,
Marc S. Klinger,
Robert G. Leigh
Abstract:
All physical observations are made relative to a reference frame, which is a system in its own right. If the system of interest admits a group symmetry, the reference frame observing it must transform commensurately under the group to ensure the covariance of the combined system. We point out that the crossed product is a way to realize quantum reference frames from the bottom-up; adjoining a quan…
▽ More
All physical observations are made relative to a reference frame, which is a system in its own right. If the system of interest admits a group symmetry, the reference frame observing it must transform commensurately under the group to ensure the covariance of the combined system. We point out that the crossed product is a way to realize quantum reference frames from the bottom-up; adjoining a quantum reference frame and imposing constraints generates a crossed product algebra. We provide a top-down specification of crossed product algebras and show that one cannot obtain inequivalent quantum reference frames using this approach. As a remedy, we define an abstract algebra associated to the system and symmetry group built out of relational crossed product algebras associated with different choices of quantum reference frames. We term this object the G-framed algebra, and show how potentially inequivalent frames are realized within this object. We comment on this algebra's analog of the classical Gribov problem in gauge theory, its importance in gravity where we show that it is relevant for semiclassical de Sitter and potentially beyond the semiclassical limit, and its utility for understanding the frame-dependence of physical notions like observables, density states, and entropies.
△ Less
Submitted 1 July, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
Anticipating Optical Availability in Hybrid RF/FSO Links Using RF Beacons and Deep Learning
Authors:
Mostafa Ibrahim,
Arsalan Ahmad,
Sabit Ekin,
Peter LoPresti,
Serhat Altunc,
Obadiah Kegege,
John F. O'Hara
Abstract:
Radio frequency (RF) communications offer reliable but low data rates and energy-inefficient satellite links, while free-space optical (FSO) promises high bandwidth but struggles with disturbances imposed by atmospheric effects. A hybrid RF/FSO architecture aims to achieve optimal reliability along with high data rates for space communications. Accurate prediction of dynamic ground-to-satellite FS…
▽ More
Radio frequency (RF) communications offer reliable but low data rates and energy-inefficient satellite links, while free-space optical (FSO) promises high bandwidth but struggles with disturbances imposed by atmospheric effects. A hybrid RF/FSO architecture aims to achieve optimal reliability along with high data rates for space communications. Accurate prediction of dynamic ground-to-satellite FSO link availability is critical for routing decisions in low-earth orbit constellations. In this paper, we propose a system leveraging ubiquitous RF links to proactively forecast FSO link degradation prior to signal drops below threshold levels. This enables pre-calculation of rerouting to maximally maintain high data rate FSO links throughout the duration of weather effects. We implement a supervised learning model to anticipate FSO attenuation based on the analysis of RF patterns. Through the simulation of a dense lower earth orbit (LEO) satellite constellation, we demonstrate the efficacy of our approach in a simulated satellite network, highlighting the balance between predictive accuracy and prediction duration. An emulated cloud attenuation model is proposed which provides insight into the temporal profiles of RF signals and their correlation to FSO channel dynamics. Our investigation sheds light on the trade-offs between prediction horizon and accuracy arising from RF beacon proximity, achieving a prediction accuracy of 86\% with 16 RF beacons.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Adaptive Reinforcement Learning for Robot Control
Authors:
Yu Tang Liu,
Nilaksh Singh,
Aamir Ahmad
Abstract:
Deep reinforcement learning (DRL) has shown remarkable success in simulation domains, yet its application in designing robot controllers remains limited, due to its single-task orientation and insufficient adaptability to environmental changes. To overcome these limitations, we present a novel adaptive agent that leverages transfer learning techniques to dynamically adapt policy in response to dif…
▽ More
Deep reinforcement learning (DRL) has shown remarkable success in simulation domains, yet its application in designing robot controllers remains limited, due to its single-task orientation and insufficient adaptability to environmental changes. To overcome these limitations, we present a novel adaptive agent that leverages transfer learning techniques to dynamically adapt policy in response to different tasks and environmental conditions. The approach is validated through the blimp control challenge, where multitasking capabilities and environmental adaptability are essential. The agent is trained using a custom, highly parallelized simulator built on IsaacGym. We perform zero-shot transfer to fly the blimp in the real world to solve various tasks. We share our code at \url{https://github.com/robot-perception-group/adaptive\_agent/}.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
Formation of low mass protostars and their circumstellar disks
Authors:
Adnan Ali Ahmad,
Matthias González,
Patrick Hennebelle,
Benoît Commerçon
Abstract:
The birth process of circumstellar disks remains poorly constrained due to observational and numerical challenges. Recent numerical works have shown that the small-scale physics, often wrapped into a sub-grid model, play a crucial role in disk formation and evolution. This calls for a combined approach in which both the protostar and circumstellar disk are studied in concert. We aim to elucidate t…
▽ More
The birth process of circumstellar disks remains poorly constrained due to observational and numerical challenges. Recent numerical works have shown that the small-scale physics, often wrapped into a sub-grid model, play a crucial role in disk formation and evolution. This calls for a combined approach in which both the protostar and circumstellar disk are studied in concert. We aim to elucidate the small scale physics and constrain sub-grid parameters commonly chosen in the literature by resolving the star-disk interaction. We carry out a set of very high resolution 3D radiative-hydrodynamics simulations that self-consistently describe the collapse of a turbulent dense molecular cloud core to stellar densities. We study the birth of the protostar, the circumstellar disk, and its early evolution (< 6 yr after protostellar formation). Following the second gravitational collapse, the nascent protostar quickly reaches breakup velocity and sheds its surface material, thus forming a hot ($\sim 10^{3}$ K), dense, and highly flared circumstellar disk. The protostar is embedded within the disk, such that material can flow without crossing any shock fronts. The circumstellar disk mass quickly exceeds that of the protostar, and its kinematics are dominated by self-gravity. Accretion onto the disk is highly anisotropic, and accretion onto the protostar mainly occurs through material that slides on the disk surface. The polar mass flux is negligible in comparison. The radiative behavior also displays a strong anisotropy, as the polar accretion shock is shown to be supercritical whereas its equatorial counterpart is subcritical. We also find a remarkable convergence of our results with respect to initial conditions. These results reveal the structure and kinematics in the smallest spatial scales relevant to protostellar and circumstellar disk evolution.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
PACNav: Enhancing Collective Navigation for UAV Swarms in Communication-Challenged Environments
Authors:
Afzal Ahmad,
Daniel Bonilla Licea,
Giuseppe Silano,
Tomas Baca,
Martin Saska
Abstract:
This article presents Persistence Administered Collective Navigation (PACNav) as an approach for achieving decentralized collective navigation of Unmanned Aerial Vehicle (UAV) swarms. The technique is inspired by the flocking and collective navigation behavior observed in natural swarms, such as cattle herds, bird flocks, and even large groups of humans. PACNav relies solely on local observations…
▽ More
This article presents Persistence Administered Collective Navigation (PACNav) as an approach for achieving decentralized collective navigation of Unmanned Aerial Vehicle (UAV) swarms. The technique is inspired by the flocking and collective navigation behavior observed in natural swarms, such as cattle herds, bird flocks, and even large groups of humans. PACNav relies solely on local observations of relative positions of UAVs, making it suitable for large swarms deprived of communication capabilities and external localization systems. We introduce the novel concepts of path persistence and path similarity, which allow each swarm member to analyze the motion of others. PACNav is grounded on two main principles: (1) UAVs with little variation in motion direction exhibit high path persistence and are considered reliable leaders by other UAVs; (2) groups of UAVs that move in a similar direction demonstrate high path similarity, and such groups are assumed to contain a reliable leader. The proposed approach also incorporates a reactive collision avoidance mechanism to prevent collisions with swarm members and environmental obstacles. The method is validated through simulated and real-world experiments conducted in a natural forest.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
GEOBIND: Binding Text, Image, and Audio through Satellite Images
Authors:
Aayush Dhakal,
Subash Khanal,
Srikumar Sastry,
Adeel Ahmad,
Nathan Jacobs
Abstract:
In remote sensing, we are interested in modeling various modalities for some geographic location. Several works have focused on learning the relationship between a location and type of landscape, habitability, audio, textual descriptions, etc. Recently, a common way to approach these problems is to train a deep-learning model that uses satellite images to infer some unique characteristics of the l…
▽ More
In remote sensing, we are interested in modeling various modalities for some geographic location. Several works have focused on learning the relationship between a location and type of landscape, habitability, audio, textual descriptions, etc. Recently, a common way to approach these problems is to train a deep-learning model that uses satellite images to infer some unique characteristics of the location. In this work, we present a deep-learning model, GeoBind, that can infer about multiple modalities, specifically text, image, and audio, from satellite imagery of a location. To do this, we use satellite images as the binding element and contrastively align all other modalities to the satellite image data. Our training results in a joint embedding space with multiple types of data: satellite image, ground-level image, audio, and text. Furthermore, our approach does not require a single complex dataset that contains all the modalities mentioned above. Rather it only requires multiple satellite-image paired data. While we only align three modalities in this paper, we present a general framework that can be used to create an embedding space with any number of modalities by using satellite images as the binding element. Our results show that, unlike traditional unimodal models, GeoBind is versatile and can reason about multiple modalities for a given satellite image input.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Periodicity in New York State COVID-19 Hospitalizations Leveraged from the Variable Bandpass Periodic Block Bootstrap
Authors:
Asmaa Ahmad,
Edward Valachovic
Abstract:
The outbreak of the SARS-CoV-2 virus, which led to an unprecedented global pandemic, has underscored the critical importance of understanding seasonal patterns. This knowledge is fundamental for decision-making in healthcare and public health domains. Investigating the presence, intensity, and precise nature of seasonal trends, as well as these temporal patterns, is essential for forecasting futur…
▽ More
The outbreak of the SARS-CoV-2 virus, which led to an unprecedented global pandemic, has underscored the critical importance of understanding seasonal patterns. This knowledge is fundamental for decision-making in healthcare and public health domains. Investigating the presence, intensity, and precise nature of seasonal trends, as well as these temporal patterns, is essential for forecasting future occurrences, planning interventions, and making informed decisions based on the evolution of events over time. This study employs the Variable Bandpass Periodic Block Bootstrap (VBPBB) to separate and analyze different periodic components by frequency in time series data, focusing on annually correlated (PC) principal components. Bootstrapping, a method used to estimate statistical sampling distributions through random sampling with replacement, is particularly useful in this context. Specifically, block bootstrapping, a model-independent resampling method suitable for time series data, is utilized. Its extensions are aimed at preserving the correlation structures inherent in PC processes. The VBPBB applies a bandpass filter to isolate the relevant PC frequency, thereby minimizing contamination from extraneous frequencies and noise. This approach significantly narrows the confidence intervals, enhancing the precision of estimated sampling distributions for the investigated periodic characteristics. Furthermore, we compared the outcomes of block bootstrapping for periodically correlated time series with VBPBB against those from more traditional bootstrapping methods. Our analysis shows VBPBB provides strong evidence of the existence of an annual seasonal PC pattern in hospitalization rates not detectible by other methods, providing timing and confidence intervals for their impact.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Airship Formations for Animal Motion Capture and Behavior Analysis
Authors:
Eric Price,
Aamir Ahmad
Abstract:
Using UAVs for wildlife observation and motion capture offers manifold advantages for studying animals in the wild, especially grazing herds in open terrain. The aerial perspective allows observation at a scale and depth that is not possible on the ground, offering new insights into group behavior. However, the very nature of wildlife field-studies puts traditional fixed wing and multi-copter syst…
▽ More
Using UAVs for wildlife observation and motion capture offers manifold advantages for studying animals in the wild, especially grazing herds in open terrain. The aerial perspective allows observation at a scale and depth that is not possible on the ground, offering new insights into group behavior. However, the very nature of wildlife field-studies puts traditional fixed wing and multi-copter systems to their limits: limited flight time, noise and safety aspects affect their efficacy, where lighter than air systems can remain on station for many hours. Nevertheless, airships are challenging from a ground handling perspective as well as from a control point of view, being voluminous and highly affected by wind. In this work, we showcase a system designed to use airship formations to track, follow, and visually record wild horses from multiple angles, including airship design, simulation, control, on board computer vision, autonomous operation and practical aspects of field experiments.
△ Less
Submitted 24 May, 2024; v1 submitted 13 April, 2024;
originally announced April 2024.
-
Toward FAIR Semantic Publishing of Research Dataset Metadata in the Open Research Knowledge Graph
Authors:
Raia Abu Ahmad,
Jennifer D'Souza,
Matthäus Zloch,
Wolfgang Otto,
Georg Rehm,
Allard Oelen,
Stefan Dietze,
Sören Auer
Abstract:
Search engines these days can serve datasets as search results. Datasets get picked up by search technologies based on structured descriptions on their official web pages, informed by metadata ontologies such as the Dataset content type of schema.org. Despite this promotion of the content type dataset as a first-class citizen of search results, a vast proportion of datasets, particularly research…
▽ More
Search engines these days can serve datasets as search results. Datasets get picked up by search technologies based on structured descriptions on their official web pages, informed by metadata ontologies such as the Dataset content type of schema.org. Despite this promotion of the content type dataset as a first-class citizen of search results, a vast proportion of datasets, particularly research datasets, still need to be made discoverable and, therefore, largely remain unused. This is due to the sheer volume of datasets released every day and the inability of metadata to reflect a dataset's content and context accurately. This work seeks to improve this situation for a specific class of datasets, namely research datasets, which are the result of research endeavors and are accompanied by a scholarly publication. We propose the ORKG-Dataset content type, a specialized branch of the Open Research Knowledge Graoh (ORKG) platform, which provides descriptive information and a semantic model for research datasets, integrating them with their accompanying scholarly publications. This work aims to establish a standardized framework for recording and reporting research datasets within the ORKG-Dataset content type. This, in turn, increases research dataset transparency on the web for their improved discoverability and applied use. In this paper, we present a proposal -- the minimum FAIR, comparable, semantic description of research datasets in terms of salient properties of their supporting publication. We design a specific application of the ORKG-Dataset semantic model based on 40 diverse research datasets on scientific information extraction.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Accel-NASBench: Sustainable Benchmarking for Accelerator-Aware NAS
Authors:
Afzal Ahmad,
Linfeng Du,
Zhiyao Xie,
Wei Zhang
Abstract:
One of the primary challenges impeding the progress of Neural Architecture Search (NAS) is its extensive reliance on exorbitant computational resources. NAS benchmarks aim to simulate runs of NAS experiments at zero cost, remediating the need for extensive compute. However, existing NAS benchmarks use synthetic datasets and model proxies that make simplified assumptions about the characteristics o…
▽ More
One of the primary challenges impeding the progress of Neural Architecture Search (NAS) is its extensive reliance on exorbitant computational resources. NAS benchmarks aim to simulate runs of NAS experiments at zero cost, remediating the need for extensive compute. However, existing NAS benchmarks use synthetic datasets and model proxies that make simplified assumptions about the characteristics of these datasets and models, leading to unrealistic evaluations. We present a technique that allows searching for training proxies that reduce the cost of benchmark construction by significant margins, making it possible to construct realistic NAS benchmarks for large-scale datasets. Using this technique, we construct an open-source bi-objective NAS benchmark for the ImageNet2012 dataset combined with the on-device performance of accelerators, including GPUs, TPUs, and FPGAs. Through extensive experimentation with various NAS optimizers and hardware platforms, we show that the benchmark is accurate and allows searching for state-of-the-art hardware-aware models at zero cost.
△ Less
Submitted 18 June, 2024; v1 submitted 9 April, 2024;
originally announced April 2024.
-
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context
Authors:
Nihar Ranjan Sahoo,
Pranamya Prashant Kulkarni,
Narjis Asad,
Arif Ahmad,
Tanu Goyal,
Aparna Garimella,
Pushpak Bhattacharyya
Abstract:
The pervasive influence of social biases in language data has sparked the need for benchmark datasets that capture and evaluate these biases in Large Language Models (LLMs). Existing efforts predominantly focus on English language and the Western context, leaving a void for a reliable dataset that encapsulates India's unique socio-cultural nuances. To bridge this gap, we introduce IndiBias, a comp…
▽ More
The pervasive influence of social biases in language data has sparked the need for benchmark datasets that capture and evaluate these biases in Large Language Models (LLMs). Existing efforts predominantly focus on English language and the Western context, leaving a void for a reliable dataset that encapsulates India's unique socio-cultural nuances. To bridge this gap, we introduce IndiBias, a comprehensive benchmarking dataset designed specifically for evaluating social biases in the Indian context. We filter and translate the existing CrowS-Pairs dataset to create a benchmark dataset suited to the Indian context in Hindi language. Additionally, we leverage LLMs including ChatGPT and InstructGPT to augment our dataset with diverse societal biases and stereotypes prevalent in India. The included bias dimensions encompass gender, religion, caste, age, region, physical appearance, and occupation. We also build a resource to address intersectional biases along three intersectional dimensions. Our dataset contains 800 sentence pairs and 300 tuples for bias measurement across different demographics. The dataset is available in English and Hindi, providing a size comparable to existing benchmark datasets. Furthermore, using IndiBias we compare ten different language models on multiple bias measurement metrics. We observed that the language models exhibit more bias across a majority of the intersectional groups.
△ Less
Submitted 3 April, 2024; v1 submitted 29 March, 2024;
originally announced March 2024.
-
Hypothesis-Driven Deep Learning for Out of Distribution Detection
Authors:
Yasith Jayawardana,
Azeem Ahmad,
Balpreet S. Ahluwalia,
Rafi Ahmad,
Sampath Jayarathna,
Dushan N. Wadduwage
Abstract:
Predictions of opaque black-box systems are frequently deployed in high-stakes applications such as healthcare. For such applications, it is crucial to assess how models handle samples beyond the domain of training data. While several metrics and tests exist to detect out-of-distribution (OoD) data from in-distribution (InD) data to a deep neural network (DNN), their performance varies significant…
▽ More
Predictions of opaque black-box systems are frequently deployed in high-stakes applications such as healthcare. For such applications, it is crucial to assess how models handle samples beyond the domain of training data. While several metrics and tests exist to detect out-of-distribution (OoD) data from in-distribution (InD) data to a deep neural network (DNN), their performance varies significantly across datasets, models, and tasks, which limits their practical use. In this paper, we propose a hypothesis-driven approach to quantify whether a new sample is InD or OoD. Given a trained DNN and some input, we first feed the input through the DNN and compute an ensemble of OoD metrics, which we term latent responses. We then formulate the OoD detection problem as a hypothesis test between latent responses of different groups, and use permutation-based resampling to infer the significance of the observed latent responses under a null hypothesis. We adapt our method to detect an unseen sample of bacteria to a trained deep learning model, and show that it reveals interpretable differences between InD and OoD latent responses. Our work has implications for systematic novelty detection and informed decision-making from classifiers trained on a subset of labels.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Predicting Confinement Effect of Carbon Fiber Reinforced Polymers on Strength of Concrete using Metaheuristics-based Artificial Neural Networks
Authors:
Sarmed Wahab,
Mohamed Suleiman,
Faisal Shabbir,
Nasim Shakouri Mahmoudabadi,
Sarmad Waqas,
Nouman Herl,
Afaq Ahmad
Abstract:
This article deals with the study of predicting the confinement effect of carbon fiber reinforced polymers (CFRPs) on concrete cylinder strength using metaheuristics-based artificial neural networks. A detailed database of 708 CFRP confined concrete cylinders is developed from previously published research with information on 8 parameters including geometrical parameters like the diameter (d) and…
▽ More
This article deals with the study of predicting the confinement effect of carbon fiber reinforced polymers (CFRPs) on concrete cylinder strength using metaheuristics-based artificial neural networks. A detailed database of 708 CFRP confined concrete cylinders is developed from previously published research with information on 8 parameters including geometrical parameters like the diameter (d) and height (h) of a cylinder, unconfined compressive strength of concrete (fco'), thickness (nt), the elastic modulus of CFRP (Ef), unconfined concrete strain confined concrete strain and the ultimate compressive strength of confined concrete fcc'. Three metaheuristic models are implemented including particle swarm optimization (PSO), grey wolf optimizer (GWO), and bat algorithm (BA). These algorithms are trained on the data using an objective function of mean square error and their predicted results are validated against the experimental studies and finite element analysis. The study shows that the hybrid model of PSO predicted the strength of CFRP-confined concrete cylinders with maximum accuracy of 99.13% and GWO predicted the results with an accuracy of 98.17%. The high accuracy of axial compressive strength predictions demonstrated that these prediction models are a reliable solution to the empirical methods. The prediction models are especially suitable for avoiding full-scale time-consuming experimental tests that make the process quick and economical.
△ Less
Submitted 22 December, 2023;
originally announced March 2024.
-
Containerization in Multi-Cloud Environment: Roles, Strategies, Challenges, and Solutions for Effective Implementation
Authors:
Muhammad Waseem,
Aakash Ahmad,
Peng Liang,
Muhammad Azeem Akbar,
Arif Ali Khan,
Iftikhar Ahmad,
Manu Setälä,
Tommi Mikkonen
Abstract:
Containerization in a multi-cloud environment facilitates workload portability and optimized resource utilization. Containerization in multi-cloud environments has received significant attention in recent years both from academic research and industrial development perspectives. However, there exists no effort to systematically investigate the state of research on this topic. The aim of this resea…
▽ More
Containerization in a multi-cloud environment facilitates workload portability and optimized resource utilization. Containerization in multi-cloud environments has received significant attention in recent years both from academic research and industrial development perspectives. However, there exists no effort to systematically investigate the state of research on this topic. The aim of this research is to systematically identify and categorize the multiple aspects of container utilization in multi-cloud environment. We conduct the Systematic Mapping Study (SMS) on the literature published between January 2013 and March 2023. Eighty-six studies were finally selected and the key results are: (1) Four leading themes on cloud computing and network systems research were identified: 'Scalability and High Availability', 'Performance and Optimization', 'Security and Privacy', and 'Multi-Cloud Container Monitoring and Adaptation'. (2) Seventy-four patterns and strategies for containerization in multi-cloud environment were classified across 10 subcategories and 4 categories. (3) Ten quality attributes considered were identified with 47 associated tactics. (4) Four distinct frameworks were introduced based on the analysis of identified challenges and solutions: a security challenge-solution framework, an automation challenge-solution framework, a deployment challenge-solution framework, and a monitoring challenge-solution framework. The results of this SMS will assist researchers and practitioners in pursuing further studies on containerization in multi-cloud environment and developing specialized solutions for challenges related to containerization applications in multi-cloud environment.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Lifelong LERF: Local 3D Semantic Inventory Monitoring Using FogROS2
Authors:
Adam Rashid,
Chung Min Kim,
Justin Kerr,
Letian Fu,
Kush Hari,
Ayah Ahmad,
Kaiyuan Chen,
Huang Huang,
Marcus Gualtieri,
Michael Wang,
Christian Juette,
Nan Tian,
Liu Ren,
Ken Goldberg
Abstract:
Inventory monitoring in homes, factories, and retail stores relies on maintaining data despite objects being swapped, added, removed, or moved. We introduce Lifelong LERF, a method that allows a mobile robot with minimal compute to jointly optimize a dense language and geometric representation of its surroundings. Lifelong LERF maintains this representation over time by detecting semantic changes…
▽ More
Inventory monitoring in homes, factories, and retail stores relies on maintaining data despite objects being swapped, added, removed, or moved. We introduce Lifelong LERF, a method that allows a mobile robot with minimal compute to jointly optimize a dense language and geometric representation of its surroundings. Lifelong LERF maintains this representation over time by detecting semantic changes and selectively updating these regions of the environment, avoiding the need to exhaustively remap. Human users can query inventory by providing natural language queries and receiving a 3D heatmap of potential object locations. To manage the computational load, we use Fog-ROS2, a cloud robotics platform, to offload resource-intensive tasks. Lifelong LERF obtains poses from a monocular RGBD SLAM backend, and uses these poses to progressively optimize a Language Embedded Radiance Field (LERF) for semantic monitoring. Experiments with 3-5 objects arranged on a tabletop and a Turtlebot with a RealSense camera suggest that Lifelong LERF can persistently adapt to changes in objects with up to 91% accuracy.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction
Authors:
Senjuti Dutta,
Sherol Chen,
Sunny Mak,
Amnah Ahmad,
Katherine Collins,
Alena Butryna,
Deepak Ramachandran,
Krishnamurthy Dvijotham,
Ellie Pavlick,
Ravi Rajakumar
Abstract:
Image generation models are poised to become ubiquitous in a range of applications. These models are often fine-tuned and evaluated using human quality judgments that assume a universal standard, failing to consider the subjectivity of such tasks. To investigate how to quantify subjectivity, and the scale of its impact, we measure how assessments differ among human annotators across different use…
▽ More
Image generation models are poised to become ubiquitous in a range of applications. These models are often fine-tuned and evaluated using human quality judgments that assume a universal standard, failing to consider the subjectivity of such tasks. To investigate how to quantify subjectivity, and the scale of its impact, we measure how assessments differ among human annotators across different use cases. Simulating the effects of ordinarily latent elements of annotators subjectivity, we contrive a set of motivations (t-shirt graphics, presentation visuals, and phone background images) to contextualize a set of crowdsourcing tasks. Our results show that human evaluations of images vary within individual contexts and across combinations of contexts. Three key factors affecting this subjectivity are image appearance, image alignment with text, and representation of objects mentioned in the text. Our study highlights the importance of taking individual users and contexts into account, both when building and evaluating generative models
△ Less
Submitted 26 February, 2024;
originally announced March 2024.
-
Multiplicity dependence of the freezeout parameters in high energy hadron-hadron collisions
Authors:
Muhammad Ajaz,
Majid Shehzad,
Muhammad Waqas,
Haifa I. Alrebdi,
Momhammad Ayaz Ahmad,
Antalov Jagnandan,
Shawn Jagnandan,
Murad Badshah,
Jalal Hasan Baker,
Abdul Mosawir Quraishi
Abstract:
We examined the transverse momentum spectra of various identified particles, across different multiplicity classes in proton-proton collisions at a center-of-mass energy of $\sqrt{s}$ = 7 TeV. Utilizing the Tsallis and Hagedorn models, parameters relevant to the bulk properties of nuclear matter were extracted. Both models exhibit good agreement with experimental data. In our analyses, we observed…
▽ More
We examined the transverse momentum spectra of various identified particles, across different multiplicity classes in proton-proton collisions at a center-of-mass energy of $\sqrt{s}$ = 7 TeV. Utilizing the Tsallis and Hagedorn models, parameters relevant to the bulk properties of nuclear matter were extracted. Both models exhibit good agreement with experimental data. In our analyses, we observed a consistent decrease in the effective temperature for the Tsallis model and the kinetic or thermal freeze-out temperature for the Hagedorn model, as we transition from higher multiplicity (class-I) to lower multiplicity (class-X). Additionally, the transverse flow velocity experiences a decline from class-I to class-X. The normalization constant which represents the multiplicity of produced particles is observed to decrease as we move towards higher multiplicity classes. While the effective and kinetic freeze-out temperatures, as well as the transverse flow velocity, show a mild dependency on multiplicity for lighter particles, this relationship becomes more pronounced for heavier particles. Various particle species are observed to undergo decoupling from the fireball at distinct temperatures: lighter particles exhibit lower temperatures, while heavier ones show higher temperatures, thereby supporting the concept of multiple freeze-out scenarios. Moreover, we identified a positive correlation between the kinetic freeze-out temperature and transverse flow velocity, a scenario where particles experience stronger collective motion at higher freeze-out temperature. The reason for this positive correlation is that as the multiplicity increases, more energy is transferred into the system. This heightened energy causes greater excitation and pressure within the system, leading to a quick expansion.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Can Large Language Models Serve as Data Analysts? A Multi-Agent Assisted Approach for Qualitative Data Analysis
Authors:
Zeeshan Rasheed,
Muhammad Waseem,
Aakash Ahmad,
Kai-Kristian Kemell,
Wang Xiaofeng,
Anh Nguyen Duc,
Pekka Abrahamsson
Abstract:
Recent advancements in Large Language Models (LLMs) have enabled collaborative human-bot interactions in Software Engineering (SE), similar to many other professions. However, the potential benefits and implications of incorporating LLMs into qualitative data analysis in SE have not been completely explored. For instance, conducting qualitative data analysis manually can be a time-consuming, effor…
▽ More
Recent advancements in Large Language Models (LLMs) have enabled collaborative human-bot interactions in Software Engineering (SE), similar to many other professions. However, the potential benefits and implications of incorporating LLMs into qualitative data analysis in SE have not been completely explored. For instance, conducting qualitative data analysis manually can be a time-consuming, effort-intensive, and error-prone task for researchers. LLM-based solutions, such as generative AI models trained on massive datasets, can be utilized to automate tasks in software development as well as in qualitative data analysis. To this end, we utilized LLMs to automate and expedite the qualitative data analysis processes. We employed a multi-agent model, where each agent was tasked with executing distinct, individual research related activities. Our proposed model interpreted large quantities of textual documents and interview transcripts to perform several common tasks used in qualitative analysis. The results show that this technical assistant speeds up significantly the data analysis process, enabling researchers to manage larger datasets much more effectively. Furthermore, this approach introduces a new dimension of scalability and accuracy in qualitative research, potentially transforming data interpretation methodologies in SE.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
The 3-3-1 model with exotic electric charges, right-handed neutrinos with type-I+II seesaw mechanism and its effects on LFV
Authors:
Abrar Ahmad,
Shakeel Mahmood,
Farida Tahir,
Wasi Uz Zaman,
Fizza Atif
Abstract:
In this research, we propose a modified version of the 3-3-1 model, incorporating a type I+II seesaw mechanism and Z4 discrete symmetry, as a framework for investigating lepton flavor-violating (LFV) decays. This model successfully yields the left-handed neutrinos mass square difference in the eV scale, with specific values of mass square differences; and generates mixing angles that align with ex…
▽ More
In this research, we propose a modified version of the 3-3-1 model, incorporating a type I+II seesaw mechanism and Z4 discrete symmetry, as a framework for investigating lepton flavor-violating (LFV) decays. This model successfully yields the left-handed neutrinos mass square difference in the eV scale, with specific values of mass square differences; and generates mixing angles that align with experimental data. Furthermore, we develop SARAH and SPheno algorithms tailored for our modified model, enabling us to estimate the magnitude of LFV observables. Our calculations indicate favorable results for various LFV branching ratios. These findings demonstrate improved agreement with experimental measurements compared to previously reported results, which typically fall within the range of 10^-2 to 10^-6.
△ Less
Submitted 22 January, 2024; v1 submitted 22 January, 2024;
originally announced January 2024.
-
$π$- and $K$-meson properties for large $N_f$ and $N_c$
Authors:
Aftab Ahmad,
Mumtaz Khan
Abstract:
Dynamical chiral symmetry restoration for higher number of light quark flavors $N_f$ and breaking for higher number of colors $N_c$ implies the suppression and enhancement of the dynamically generated quark mass. The study of various larger values of number of colors and flavors may have greater impact on the internal structure of light hadrons. In this work, we study the properties of the pion an…
▽ More
Dynamical chiral symmetry restoration for higher number of light quark flavors $N_f$ and breaking for higher number of colors $N_c$ implies the suppression and enhancement of the dynamically generated quark mass. The study of various larger values of number of colors and flavors may have greater impact on the internal structure of light hadrons. In this work, we study the properties of the pion and kaon, such as mass, condensate, and leptonic decay constant, for various $N_f$ and $N_c$. We use the symmetry-preserving vector-vector flavor-dependent contact interaction model of quark. The dynamical quark masses are calculated by using the Schwinger-Dyson equation (SDE). The masses of the pion and kaon for different values of $N_f$ and $N_c$ are determined using the homogeneous Bethe-Salpeter equation. For fixed $N_f=2$ and $N_c$ is increased, the dynamically generated quark mass ( mass of up and down quarks), strange quark mass, meson in-condensate, and decay constant, all increases. The pion mass remains approximately constant until $N_c$ reaches around 6.5, after which it grows rapidly. On the other hand, the kaon mass increases slowly with increasing $N_c$ until it reaches approximately $N_c=7.5$, beyond which it rises quickly. When $N_c=3$ is fixed at and various values of $N_f$ are considered, all the parameter values decrease as a function of $N_f$, except for the pion and kaon mass, which increase above a critical value of $N_f$ around $8$. This is the region where chiral symmetry is restored, and the pion and kaon behave as free particles, similar to their behavior in the presence of a heat bath. The results obtained for fixed $N_f=2$ and $N_c=3$ are fairly in decent agreement with experimentally calculated statistics and previous model calculations based on the Schwinger-Dyson equation (SDE) and Bethe-Salpeter equation (BSE).
△ Less
Submitted 20 January, 2024;
originally announced January 2024.
-
Behavior Trees with Dataflow: Coordinating Reactive Tasks in Lingua Franca
Authors:
Alexander Schulz-Rosengarten,
Akash Ahmad,
Malte Clement,
Reinhard von Hanxleden,
Benjamin Asch,
Marten Lohstroh,
Edward A. Lee,
Gustavo Quiros Araya,
Ankit Shukla
Abstract:
Behavior Trees (BTs) provide a lean set of control flow elements that are easily composable in a modular tree structure. They are well established for modeling the high-level behavior of non-player characters in computer games and recently gained popularity in other areas such as industrial automation. While BTs nicely express control, data handling aspects so far must be provided separately, e. g…
▽ More
Behavior Trees (BTs) provide a lean set of control flow elements that are easily composable in a modular tree structure. They are well established for modeling the high-level behavior of non-player characters in computer games and recently gained popularity in other areas such as industrial automation. While BTs nicely express control, data handling aspects so far must be provided separately, e. g. in the form of blackboards. This may hamper reusability and can be a source of nondeterminism. We here present a dataflow extension to BTs that explicitly models data relations and communication. We provide a combined textual/graphical approach in line with modern, productivity-enhancing pragmatics-aware modeling techniques. We realized and validated that approach in the recently introduced polyglot coordination language Lingua Franca (LF).
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Efficient UAVs Deployment and Resource Allocation in UAV-Relay Assisted Public Safety Networks for Video Transmission
Authors:
Naveed Khan,
Ayaz Ahmad,
Abdul Wakeel,
Zeeshan Kaleem,
Bushra Rashid,
Waqas Khalid
Abstract:
Wireless communication highly depends on the cellular ground base station (GBS). A failure of the cellular GBS, fully or partially, during natural or man-made disasters creates a communication gap in the disaster-affected areas. In such situations, public safety communication (PSC) can significantly save the national infrastructure, property, and lives. Throughout emergencies, the PSC can provide…
▽ More
Wireless communication highly depends on the cellular ground base station (GBS). A failure of the cellular GBS, fully or partially, during natural or man-made disasters creates a communication gap in the disaster-affected areas. In such situations, public safety communication (PSC) can significantly save the national infrastructure, property, and lives. Throughout emergencies, the PSC can provide mission-critical communication and video transmission services in the affected area. Unmanned aerial vehicles (UAVs) as flying base stations (UAV-BSs) are particularly suitable for PSC services as they are flexible, mobile, and easily deployable. This manuscript considers a multi-UAV-assisted PSC network with an observational UAV receiving videos from the affected area's ground users (AGUs) and transmitting them to the nearby GBS via a relay UAV. The objective of the proposed study is to maximize the average utility of the video streams generated by the AGUs upon reaching the GBS. This is achieved by optimizing the positions of the observational and relay UAVs, as well as the distribution of communication resources, such as bandwidth, and transmit power, while satisfying the system-designed constraints, such as transmission rate, rate outage probability, transmit power budget, and available bandwidth. To this end, a joint UAVs placement and resource allocation problem is mathematically formulated. The proposed problem poses a significant challenge for a solution. Considering the block coordinate descent and successive convex approximation techniques, an efficient iterative algorithm is proposed. Finally, simulation results are provided which show that our proposed approach outperforms the existing methods.
△ Less
Submitted 3 January, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
LD-SDM: Language-Driven Hierarchical Species Distribution Modeling
Authors:
Srikumar Sastry,
Xin Xing,
Aayush Dhakal,
Subash Khanal,
Adeel Ahmad,
Nathan Jacobs
Abstract:
We focus on the problem of species distribution modeling using global-scale presence-only data. Most previous studies have mapped the range of a given species using geographical and environmental features alone. To capture a stronger implicit relationship between species, we encode the taxonomic hierarchy of species using a large language model. This enables range mapping for any taxonomic rank an…
▽ More
We focus on the problem of species distribution modeling using global-scale presence-only data. Most previous studies have mapped the range of a given species using geographical and environmental features alone. To capture a stronger implicit relationship between species, we encode the taxonomic hierarchy of species using a large language model. This enables range mapping for any taxonomic rank and unseen species without additional supervision. Further, we propose a novel proximity-aware evaluation metric that enables evaluating species distribution models using any pixel-level representation of ground-truth species range map. The proposed metric penalizes the predictions of a model based on its proximity to the ground truth. We describe the effectiveness of our model by systematically evaluating on the task of species range prediction, zero-shot prediction and geo-feature regression against the state-of-the-art. Results show our model outperforms the strong baselines when trained with a variety of multi-label learning losses.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
Fast as Potoroo: Radio Continuum Detection of a Bow-Shock Pulsar Wind Nebula Powered by Pulsar J1638-4713
Authors:
Sanja Lazarević,
Miroslav D. Filipović,
Shi Dai,
Roland Kothes,
Adeel Ahmad,
Rami Z. E. Alsaberi,
Joel C. F. Balzan,
Luke A. Barnes,
William D. Cotton,
Philip G. Edwards,
Yjan A. Gordon,
Frank Haberl,
Andrew M. Hopkins,
Bärbel S. Koribalski,
Denis Leahy,
Chandreyee Maitra,
Marko Mićić,
Gavin Rowell,
Manami Sasaki,
Nicholas F. H. Tothill,
Grazia Umana,
Velibor Velović
Abstract:
We report the discovery of a bow-shock pulsar wind nebula (PWN), named Potoroo, and the detection of a young pulsar J1638-4713 that powers the nebula. We present a radio continuum study of the PWN based on 20-cm observations obtained from the Australian Square Kilometre Array Pathfinder (ASKAP) and MeerKAT. PSR J1638-4713 was identified using Parkes radio telescope observations at frequencies abov…
▽ More
We report the discovery of a bow-shock pulsar wind nebula (PWN), named Potoroo, and the detection of a young pulsar J1638-4713 that powers the nebula. We present a radio continuum study of the PWN based on 20-cm observations obtained from the Australian Square Kilometre Array Pathfinder (ASKAP) and MeerKAT. PSR J1638-4713 was identified using Parkes radio telescope observations at frequencies above 3 GHz. The pulsar has the second-highest dispersion measure of all known radio pulsars (1553 pc/cm^3), a spin period of 65.74 ms and a spin-down luminosity of 6.1x10^36 erg/s. The PWN has a cometary morphology and one of the greatest projected lengths among all the observed pulsar radio tails, measuring over 21 pc for an assumed distance of 10 kpc. The remarkably long tail and atypically steep radio spectral index are attributed to the interplay of a supernova reverse shock and the PWN. The originating supernova remnant is not known so far. We estimated the pulsar kick velocity to be in the range of 1000-2000 km/s for ages between 23 and 10 kyr. The X-ray counterpart found in Chandra data, CXOU J163802.6-471358, shows the same tail morphology as the radio source but is shorter by a factor of 10. The peak of the X-ray emission is offset from the peak of the radio total intensity (Stokes I) emission by approximately 4.7", but coincides well with circularly polarised (Stokes V) emission. No infrared counterpart was found.
△ Less
Submitted 27 April, 2024; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Energy and system size dependence of strongly intensive fluctuation measures in heavy-ion collisions at FAIR energies
Authors:
Bushra Ali,
Shakeel Ahmad,
A. Ahmad
Abstract:
Event-by-event fluctuations of multiplicity and transverse momentum of charged hadrons produced in heavy-ion collisions at FAIR energies, 10A, 20A, 30A and 40A GeV are studied in the framework of relativistic transport model, URQMD. Dependence of two families of strongly intensive measures of multiplicity($N$) and transverse momentum($p_{\rm T}$) fluctuations, $Δ[p_{\rm T},N]$ and…
▽ More
Event-by-event fluctuations of multiplicity and transverse momentum of charged hadrons produced in heavy-ion collisions at FAIR energies, 10A, 20A, 30A and 40A GeV are studied in the framework of relativistic transport model, URQMD. Dependence of two families of strongly intensive measures of multiplicity($N$) and transverse momentum($p_{\rm T}$) fluctuations, $Δ[p_{\rm T},N]$ and $Σ[p_{\rm T},N]$, on collision centrality, centrality bin-widths and pseudorapidity windows are examined. Attempts are also made to study $NN$, $N$$p_{\rm T}$ and $p_{\rm T}$$p_{\rm T}$ fluctuations using two window analysis method. The findings suggest that the measure, $Δ[p_{\rm T},N]$ be dealt with proper selection of centrality intervals. This measure also exhibits a strong dependence on the widths of $η$ windows. The variable $Σ[p_{\rm T},N]$, however, is observed to be insensitive to the centrality bin-widths and shows a variation of $< 5\%$ with the widths of $η$ windows. The analysis of data after event mixing gives $Δ[p_{\rm T},N]$ and $Σ[p_{\rm T},N]$ values as $\sim 1$ irrespective of the widths of $η$ windows and collision centrality, as predicted by model of independent particle emission, IPM. The study of joint fluctuations of the two quantities on two $η$ windows separated in $η$ space, reveals that $Σ[N_{\rm F},N_{\rm B}]$ values are $\sim 1$ irrespective of the position of $η$ windows whereas, the values of $Σ[N_{\rm F},p_{\rm T_B}]$ and $Σ[p_{\rm T_F},p_{\rm T_B}]$ firstly increase with $η_{sep}$ and later acquire saturations. The observed trend of centrality dependence of $Σ[N_{\rm F},N_{\rm B}], Σ[N_{\rm F},p_{\rm T_B}]$ and $Σ[p_{\rm T_F},p_{\rm T_B}]$ agrees fairly well with those observed in MC simulated studies carried out for AA collisions at LHC energies in the framework model of string fusion.
△ Less
Submitted 9 December, 2023;
originally announced December 2023.
-
Architecture Decisions in Quantum Software Systems: An Empirical Study on Stack Exchange and GitHub
Authors:
Mst Shamima Aktar,
Peng Liang,
Muhammad Waseem,
Amjed Tahir,
Aakash Ahmad,
Beiqi Zhang,
Zengyang Li
Abstract:
Quantum computing provides a new dimension in computation, utilizing the principles of quantum mechanics to potentially solve complex problems that are currently intractable for classical computers. However, little research has been conducted about the architecture decisions made in quantum software development, which have a significant influence on the functionality, performance, scalability, and…
▽ More
Quantum computing provides a new dimension in computation, utilizing the principles of quantum mechanics to potentially solve complex problems that are currently intractable for classical computers. However, little research has been conducted about the architecture decisions made in quantum software development, which have a significant influence on the functionality, performance, scalability, and reliability of these systems. The study aims to empirically investigate and analyze architecture decisions made during the development of quantum software systems, identifying prevalent challenges and limitations by using the posts and issues from Stack Exchange and GitHub. We used a qualitative approach to analyze the obtained data from Stack Exchange Sites and GitHub projects. Specifically, we collected data from 385 issues (from 87 GitHub projects) and 70 posts (from three Stack Exchange sites) related to architecture decisions in quantum software development. The results show that in quantum software development (1) architecture decisions are articulated in six linguistic patterns, the most common of which are Solution Proposal and Information Giving, (2) the two major categories of architectural decisions are Implementation Decision and Technology Decision, (3) Softwar Development Tools are the most common application domain among the twenty application domains identified, (4) Maintainability is the most frequently considered quality attribute, and (5) Design Issues and High Error Rates are the major limitations and challenges that practitioners face when making architecture decisions in quantum software development. Our results show that the limitations and challenges encountered in architecture decision-making during the development of quantum software systems are strongly linked to the particular features (e.g., quantum entanglement, superposition, and decoherence) of those systems.
△ Less
Submitted 8 July, 2024; v1 submitted 8 December, 2023;
originally announced December 2023.
-
Isospin Decomposition of D Mesons
Authors:
Shakeel Mahmood,
Mudassir Hussain,
Abrar Ahmad
Abstract:
This work focuses on decomposition of isospin amplitude of D meson non-leptonic decays. Isospin vector algebra is used to show the equivalency of two amplitude decompositions. We restrict to the transitions involving only Del I = 1 and Del I = 0: The isospin symmetry is relating charge channel (D+ -> K+,Pi+ Pi-; D+ -> K+, Pi0, Pi0 and D+-> K0, Pi0,Pi+); and neutral channel (D0 -> K0,Pi+,Pi- D0 ->…
▽ More
This work focuses on decomposition of isospin amplitude of D meson non-leptonic decays. Isospin vector algebra is used to show the equivalency of two amplitude decompositions. We restrict to the transitions involving only Del I = 1 and Del I = 0: The isospin symmetry is relating charge channel (D+ -> K+,Pi+ Pi-; D+ -> K+, Pi0, Pi0 and D+-> K0, Pi0,Pi+); and neutral channel (D0 -> K0,Pi+,Pi- D0 -> K0, Pi0, Pi0 and D0 -> K+, Pi0, Pi-) with each other. Equivalent triangle relations are obtained for both channels.
△ Less
Submitted 24 November, 2023;
originally announced November 2023.
-
Comparative Analysis of Shear Strength Prediction Models for Reinforced Concrete Slab-Column Connections
Authors:
Sarmed Wahab,
Nasim Shakouri Mahmoudabadi,
Sarmad Waqas,
Nouman Herl,
Muhammad Iqbal,
Khurshid Alam,
Afaq Ahmad
Abstract:
This research aims at comparative analysis of shear strength prediction at slab-column connection, unifying machine learning, design codes and Finite Element Analysis. Current design codes (CDCs) of ACI 318-19 (ACI), Eurocode 2 (EC2), Compressive Force Path (CFP) method, Feed Forward Neural Network (FNN) based Artificial Neural Network (ANN), PSO-based FNN (PSOFNN), and BAT algorithm-based BATFNN…
▽ More
This research aims at comparative analysis of shear strength prediction at slab-column connection, unifying machine learning, design codes and Finite Element Analysis. Current design codes (CDCs) of ACI 318-19 (ACI), Eurocode 2 (EC2), Compressive Force Path (CFP) method, Feed Forward Neural Network (FNN) based Artificial Neural Network (ANN), PSO-based FNN (PSOFNN), and BAT algorithm-based BATFNN are used. The study is complemented with FEA of slab for validating the experimental results and machine learning predictions.In the case of hybrid models of PSOFNN and BATFNN, mean square error is used as an objective function to obtain the optimized values of the weights, that are used by Feed Forward Neural Network to perform predictions on the slab data. Seven different models of PSOFNN, BATFNN, and FNN are trained on this data and the results exhibited that PSOFNN is the best model overall. PSOFNN has the best results for SCS=1 with highest value of R as 99.37% and lowest of MSE, and MAE values of 0.0275%, and 1.214% respectively which are better than the best FNN model for SCS=4 having the values of R, MSE, and MAE as 97.464%, 0.0492%, and 1.43%, respectively.
△ Less
Submitted 28 November, 2023; v1 submitted 29 September, 2023;
originally announced November 2023.
-
Effect of some plant extracts on hardwood cuttings of Bottlebrush (Callistemon viminalis)
Authors:
Hemn Abdalla Mustafa,
Tariq Abubakr Ahmad,
Aram Akram Mohammed,
Zainab Sabah Lazim,
Chopi Omer Ibrahim,
Roshna Faeq Kak bra,
Shvan Ramzi Salih
Abstract:
The study was conducted at the Collage of Agricultural Engineering Sciences, University of Sulaimani, Kurdistan Region-Iraq so as to investigate response hardwood cuttings of Callistemon viminalis to some plant extracts. The hardwood cuttings were taken on 11 March 2021 and soaked separately in 3 and 6 g/L aqueous extracts of moringa leaf, licorice root, willow shoot, fenugreek seed and cinnamon b…
▽ More
The study was conducted at the Collage of Agricultural Engineering Sciences, University of Sulaimani, Kurdistan Region-Iraq so as to investigate response hardwood cuttings of Callistemon viminalis to some plant extracts. The hardwood cuttings were taken on 11 March 2021 and soaked separately in 3 and 6 g/L aqueous extracts of moringa leaf, licorice root, willow shoot, fenugreek seed and cinnamon bark for 1 hour. They were compared to the cuttings dipped in 3000 ppm IBA for 10s and control cuttings which were soaked in distilled water for 1 hour. The experiment laid out in CRD with three replications in a greenhouse, and each replication included six cuttings which planted in a mixture of sand and rice husk medium. The results showed that the highest (86.66%) rooting was achieved in the cuttings treated with 6 g/L licorice extract and they were significantly different with control cuttings (53.33%), but they were not significantly different with 3000 ppm IBA (66.66%). Cinnamon 3g/L and fenugreek 3g/L extracts gave the lowest (6.66% and 33.33%, respectively) rooting and other studied parameters. The cuttings dipped in 3000 ppm IBA gave the highest (18.91) root number and the highest (66.66%) survival cuttings after transplanting. The longest root (15.54 cm) was found in cuttings were treated with 6 g/L moringa extract. The longest (5.83 cm) shoot was observed in treated cuttings with 3 g/L willow extract. The highest chlorophyll a and b (10.08 and 4.62 mg/L, respectively) were observed in cuttings treated with 6 g/L willow extract. Moreover, 3000 ppm IBA gave the highest (20.23%) total carbohydrate and (1.77 mg/g) IAA content along with 6 g/L licorice, moringa and fenugreek extracts, after 30 days from planting of the cuttings. Licorice root extract at 6 g/L fairly improved the measurements similar to 3000 ppm IBA throughout the study.
△ Less
Submitted 9 September, 2023;
originally announced November 2023.
-
Efficient Continual Pre-training for Building Domain Specific Large Language Models
Authors:
Yong Xie,
Karan Aggarwal,
Aitzaz Ahmad
Abstract:
Large language models (LLMs) have demonstrated remarkable open-domain capabilities. Traditionally, LLMs tailored for a domain are trained from scratch to excel at handling domain-specific tasks. In this work, we explore an alternative strategy of continual pre-training as a means to develop domain-specific LLMs. We introduce FinPythia-6.9B, developed through domain-adaptive continual pre-training…
▽ More
Large language models (LLMs) have demonstrated remarkable open-domain capabilities. Traditionally, LLMs tailored for a domain are trained from scratch to excel at handling domain-specific tasks. In this work, we explore an alternative strategy of continual pre-training as a means to develop domain-specific LLMs. We introduce FinPythia-6.9B, developed through domain-adaptive continual pre-training on the financial domain. Continual pre-trained FinPythia showcases consistent improvements on financial tasks over the original foundational model. We further explore simple but effective data selection strategies for continual pre-training. Our data selection strategies outperforms vanilla continual pre-training's performance with just 10% of corpus size and cost, without any degradation on open-domain standard tasks. Our work proposes an alternative solution to building domain-specific LLMs from scratch in a cost-effective manner.
△ Less
Submitted 14 November, 2023;
originally announced November 2023.
-
Rooting capacity of hardwood cuttings of some fruit trees in relation to cutting pattern
Authors:
Aram Akram Mohammed,
Rasul Rafiq Aziz,
Faraydwn Karim Ahmad,
Ibrahim Maaroof Noori,
Tariq Abubakr Ahmad
Abstract:
Study two cut patterns in hardwood cuttings of (Cydonia oblonga), (Punica granatum) and (Ficus carica). The cuttings have been cut either straight with different internode stub lengths [0 (just onto the basal node as control), 0.5, 1.0, 2.0 or 3.0 cm below the basal node], or slant with 45 degree angle for each length mentioned above (except the first length (0 cm). Effect of the basal cut directi…
▽ More
Study two cut patterns in hardwood cuttings of (Cydonia oblonga), (Punica granatum) and (Ficus carica). The cuttings have been cut either straight with different internode stub lengths [0 (just onto the basal node as control), 0.5, 1.0, 2.0 or 3.0 cm below the basal node], or slant with 45 degree angle for each length mentioned above (except the first length (0 cm). Effect of the basal cut directions on rooting percentage and other shoot and root characteristics were not significantly different, while the effect of slant cut pattern on one-side rooting at the basal margin observed in some quince cuttings but it was rarely observed in pomegranate and fig cuttings. Quince cuttings gave no different rooting percentage and other shoot and root characteristics significantly with different internode stub lengths. While, internode stub 1 and 2 cm in pomegranate cuttings, and 0 cm in fig cuttings gave the best rooting percentages 44.44% and 100%, respectively. Also, interaction effects of the two factors on rooting percentage and other shoot and root characteristics were just significantly different in pomegranate and fig cuttings. The best rooting capacity achieved in pomegranate cuttings (49.99%) in those were cut straightly at the base with 1 and 2 cm basal internode stub lengths, and fig cuttings straightly cut at the base with 0 and 1 cm basal internode stub lengths gave the highest rooting capacity (100%).
△ Less
Submitted 8 November, 2023;
originally announced November 2023.
-
Attention based Dual-Branch Complex Feature Fusion Network for Hyperspectral Image Classification
Authors:
Mohammed Q. Alkhatib,
Mina Al-Saad,
Nour Aburaed,
M. Sami Zitouni,
Hussain Al Ahmad
Abstract:
This research work presents a novel dual-branch model for hyperspectral image classification that combines two streams: one for processing standard hyperspectral patches using Real-Valued Neural Network (RVNN) and the other for processing their corresponding Fourier transforms using Complex-Valued Neural Network (CVNN). The proposed model is evaluated on the Pavia University and Salinas datasets.…
▽ More
This research work presents a novel dual-branch model for hyperspectral image classification that combines two streams: one for processing standard hyperspectral patches using Real-Valued Neural Network (RVNN) and the other for processing their corresponding Fourier transforms using Complex-Valued Neural Network (CVNN). The proposed model is evaluated on the Pavia University and Salinas datasets. Results show that the proposed model outperforms state-of-the-art methods in terms of overall accuracy, average accuracy, and Kappa. Through the incorporation of Fourier transforms in the second stream, the model is able to extract frequency information, which complements the spatial information extracted by the first stream. The combination of these two streams improves the overall performance of the model. Furthermore, to enhance the model performance, the Squeeze and Excitation (SE) mechanism has been utilized. Experimental evidence show that SE block improves the models overall accuracy by almost 1\%.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Creating Trustworthy LLMs: Dealing with Hallucinations in Healthcare AI
Authors:
Muhammad Aurangzeb Ahmad,
Ilker Yaramis,
Taposh Dutta Roy
Abstract:
Large language models have proliferated across multiple domains in as short period of time. There is however hesitation in the medical and healthcare domain towards their adoption because of issues like factuality, coherence, and hallucinations. Give the high stakes nature of healthcare, many researchers have even cautioned against its usage until these issues are resolved. The key to the implemen…
▽ More
Large language models have proliferated across multiple domains in as short period of time. There is however hesitation in the medical and healthcare domain towards their adoption because of issues like factuality, coherence, and hallucinations. Give the high stakes nature of healthcare, many researchers have even cautioned against its usage until these issues are resolved. The key to the implementation and deployment of LLMs in healthcare is to make these models trustworthy, transparent (as much possible) and explainable. In this paper we describe the key elements in creating reliable, trustworthy, and unbiased models as a necessary condition for their adoption in healthcare. Specifically we focus on the quantification, validation, and mitigation of hallucinations in the context in healthcare. Lastly, we discuss how the future of LLMs in healthcare may look like.
△ Less
Submitted 26 September, 2023;
originally announced November 2023.
-
Exploring the Problems, their Causes and Solutions of AI Pair Programming: A Study with Practitioners of GitHub Copilot
Authors:
Xiyu Zhou,
Peng Liang,
Beiqi Zhang,
Zengyang Li,
Aakash Ahmad,
Mojtaba Shahin,
Muhammad Waseem
Abstract:
With the recent advancement of Artificial Intelligence (AI) and Large Language Models (LLMs), AI-based code generation tools become a practical solution for software development. GitHub Copilot, the AI pair programmer, utilizes machine learning models trained on a large corpus of code snippets to generate code suggestions using natural language processing. Despite its popularity in software develo…
▽ More
With the recent advancement of Artificial Intelligence (AI) and Large Language Models (LLMs), AI-based code generation tools become a practical solution for software development. GitHub Copilot, the AI pair programmer, utilizes machine learning models trained on a large corpus of code snippets to generate code suggestions using natural language processing. Despite its popularity in software development, there is limited empirical evidence on the actual experiences of practitioners who work with Copilot. To this end, we conducted an empirical study to understand the problems that practitioners face when using Copilot, as well as their underlying causes and potential solutions. We collected data from 476 GitHub issues, 706 GitHub discussions, and 142 Stack Overflow posts. Our results reveal that (1) Operation Issue and Compatibility Issue are the most common problems faced by Copilot users, (2) Copilot Internal Error, Network Connection Error, and Editor/IDE Compatibility Issue are identified as the most frequent causes, and (3) Bug Fixed by Copilot, Modify Configuration/Setting, and Use Suitable Version are the predominant solutions. Based on the results, we discuss the potential areas of Copilot for enhancement, and provide the implications for the Copilot users, the Copilot team, and researchers.
△ Less
Submitted 28 April, 2024; v1 submitted 2 November, 2023;
originally announced November 2023.