subscribe to arXiv mailings

SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions

Authors: Shicheng Liu, Sina J. Semnani, Harold Triedman, Jialiang Xu, Isaac Dan Zhao, Monica S. Lam

Abstract: Recent work integrating Large Language Models (LLMs) has led to significant improvements in the Knowledge Base Question Answering (KBQA) task. However, we posit that existing KBQA datasets that either have simple questions, use synthetically generated logical forms, or are based on small knowledge base (KB) schemas, do not capture the true complexity of KBQA tasks. To address this, we introduce… ▽ More Recent work integrating Large Language Models (LLMs) has led to significant improvements in the Knowledge Base Question Answering (KBQA) task. However, we posit that existing KBQA datasets that either have simple questions, use synthetically generated logical forms, or are based on small knowledge base (KB) schemas, do not capture the true complexity of KBQA tasks. To address this, we introduce the SPINACH dataset, an expert-annotated KBQA dataset collected from forum discussions on Wikidata's "Request a Query" forum with 320 decontextualized question-SPARQL pairs. Much more complex than existing datasets, SPINACH calls for strong KBQA systems that do not rely on training data to learn the KB schema, but can dynamically explore large and often incomplete schemas and reason about them. Along with the dataset, we introduce the SPINACH agent, a new KBQA approach that mimics how a human expert would write SPARQLs for such challenging questions. Experiments on existing datasets show SPINACH's capability in KBQA, achieving a new state of the art on the QALD-7, QALD-9 Plus and QALD-10 datasets by 30.1%, 27.0%, and 10.0% in F1, respectively, and coming within 1.6% of the fine-tuned LLaMA SOTA model on WikiWebQuestions. On our new SPINACH dataset, SPINACH agent outperforms all baselines, including the best GPT-4-based KBQA agent, by 38.1% in F1. △ Less

Submitted 16 July, 2024; originally announced July 2024.

arXiv:2407.11130 [pdf, other]

Geometric additivity of modular commutator for multipartite entanglement

Authors: Sung-Min Park, Isaac H. Kim, Eun-Gook Moon

Abstract: A recent surge of research in many-body quantum entanglement has uncovered intriguing properties of quantum many-body systems. A prime example is the modular commutator, which can extract a topological invariant from a single wave function. Here, we unveil novel geometric properties of many-body entanglement via a modular commutator of two-dimensional gapped quantum many-body systems. We obtain th… ▽ More A recent surge of research in many-body quantum entanglement has uncovered intriguing properties of quantum many-body systems. A prime example is the modular commutator, which can extract a topological invariant from a single wave function. Here, we unveil novel geometric properties of many-body entanglement via a modular commutator of two-dimensional gapped quantum many-body systems. We obtain the geometric additivity of a modular commutator, indicating that modular commutator for a multipartite system may be an integer multiple of the one for tripartite systems. Using our additivity formula, we also derive a curious identity for the modular commutators involving disconnected intervals in a certain class of conformal field theories. We further illustrate this geometric additivity for both bulk and edge subsystems using numerical calculations of the Haldane and $π$-flux models. △ Less

Submitted 15 July, 2024; originally announced July 2024.

Comments: 4+8 pages, 6+10 figures

arXiv:2407.10381 [pdf, other]

Hybrid Oscillator-Qubit Quantum Processors: Instruction Set Architectures, Abstract Machine Models, and Applications

Authors: Yuan Liu, Shraddha Singh, Kevin C. Smith, Eleanor Crane, John M. Martyn, Alec Eickbusch, Alexander Schuckert, Richard D. Li, Jasmine Sinanan-Singh, Micheline B. Soley, Takahiro Tsunoda, Isaac L. Chuang, Nathan Wiebe, Steven M. Girvin

Abstract: Quantum computing with discrete variable (DV, qubit) hardware is approaching the large scales necessary for computations beyond the reach of classical computers. However, important use cases such as quantum simulations of physical models containing bosonic modes, and quantum error correction are challenging for DV-only systems. Separately, hardware containing native continuous-variable (CV, oscill… ▽ More Quantum computing with discrete variable (DV, qubit) hardware is approaching the large scales necessary for computations beyond the reach of classical computers. However, important use cases such as quantum simulations of physical models containing bosonic modes, and quantum error correction are challenging for DV-only systems. Separately, hardware containing native continuous-variable (CV, oscillator) systems has received attention as an alternative approach, yet the universal control of such systems is non-trivial. In this work, we show that hybrid CV-DV hardware offers a great advantage in meeting these challenges, offering a powerful computational paradigm that inherits the strengths of both DV and CV processors. We provide a pedagogical introduction to CV-DV systems and the multiple abstraction layers needed to produce a full software stack connecting applications to hardware. We present a variety of new hybrid CV-DV compilation techniques, algorithms, and applications, including the extension of quantum signal processing concepts to CV-DV systems and strategies to simulate systems of interacting spins, fermions, and bosons. To facilitate the development of hybrid CV-DV processor systems, we introduce formal Abstract Machine Models and Instruction Set Architectures -- essential abstractions that enable developers to formulate applications, compile algorithms, and explore the potential of current and future hardware for realizing fault-tolerant circuits, modules, and processors. Hybrid CV-DV quantum computations are beginning to be performed in superconducting, trapped ion, and neutral atom platforms, and large-scale experiments are set to be demonstrated in the near future. We present a timely and comprehensive guide to this relatively unexplored yet promising approach to quantum computation and providing an architectural backbone to guide future development. △ Less

Submitted 14 July, 2024; originally announced July 2024.

Comments: 154 pages, 51 figures, 646 references

arXiv:2407.08921 [pdf, other]

Observational bounds on a possible electron-to-proton mass ratio variation and constraints in the lepton-specific 2HDM

Authors: R. G. Albuquerque, R. F. L. Holanda, I. E. T. R. Mendonça, P. S. Rodrigues da Silva

Abstract: In this work, we test a possible redshift variation of the electron-to-proton mass ratio, $μ= m_e/m_p$, directly from galaxy cluster gas mass fraction measurements and type Ia Supernovae observations. Our analysis is completely independent of any cosmological model. Our result reveals no variation of $μ$ within 1 $σ$ confidence level. From the point of view of Particle Physics, we can use the prec… ▽ More In this work, we test a possible redshift variation of the electron-to-proton mass ratio, $μ= m_e/m_p$, directly from galaxy cluster gas mass fraction measurements and type Ia Supernovae observations. Our analysis is completely independent of any cosmological model. Our result reveals no variation of $μ$ within 1 $σ$ confidence level. From the point of view of Particle Physics, we can use the precision on these results to constrain the parameter space of models beyond the Standard Model of electroweak interactions. We exemplify this by focusing in a specific Two Higgs Doublet model (2HDM), where the second scalar doublet couples exclusively to leptons. An important parameter in the model concerns the ratio between its vacuum expectation values, defined by $\tanβ$. In our approach we can constrain the inverse parameter (cot$β$) to an optimal value, (tan$β)^{-1}=$ 0.02127 $\pm$ 0.0029, with the largest vacuum expectation value for 2HDM, $v_2$, estimated at around 240.033 $\pm$ 0.21~GeV. Also, by taking into account the $(g-2)_μ$ discrepancy found between theory and experiment, we can reduce the validity region for this model and establish bounds on the scalar masses, in the light of our findings from galaxy clusters data for $μ$. This study contributes valuable insights to the understanding of Particle Physics and Astrophysics interface, establishing a new interplay between data from large scale structure of the Universe and subatomic Physics. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 14 pages, 8 figures

arXiv:2407.08874 [pdf]

Implications of mappings between ICD clinical diagnosis codes and Human Phenotype Ontology terms

Authors: Amelia LM Tan, Rafael S Gonçalves, William Yuan, Gabriel A Brat, The Consortium for Clinical Characterization of COVID-19 by EHR, Robert Gentleman, Isaac S Kohane

Abstract: Objective: Integrating EHR data with other resources is essential in rare disease research due to low disease prevalence. Such integration is dependent on the alignment of ontologies used for data annotation. The International Classification of Diseases (ICD) is used to annotate clinical diagnoses; the Human Phenotype Ontology (HPO) to annotate phenotypes. Although these ontologies overlap in biom… ▽ More Objective: Integrating EHR data with other resources is essential in rare disease research due to low disease prevalence. Such integration is dependent on the alignment of ontologies used for data annotation. The International Classification of Diseases (ICD) is used to annotate clinical diagnoses; the Human Phenotype Ontology (HPO) to annotate phenotypes. Although these ontologies overlap in biomedical entities described, the extent to which they are interoperable is unknown. We investigate how well aligned these ontologies are and whether such alignments facilitate EHR data integration. Materials and Methods: We conducted an empirical analysis of the coverage of mappings between ICD and HPO. We interpret this mapping coverage as a proxy for how easily clinical data can be integrated with research ontologies such as HPO. We quantify how exhaustively ICD codes are mapped to HPO by analyzing mappings in the UMLS Metathesaurus. We analyze the proportion of ICD codes mapped to HPO within a real-world EHR dataset. Results and Discussion: Our analysis revealed that only 2.2% of ICD codes have direct mappings to HPO in UMLS. Within our EHR dataset, less than 50% of ICD codes have mappings to HPO terms. ICD codes that are used frequently in EHR data tend to have mappings to HPO; ICD codes that represent rarer medical conditions are seldom mapped. Conclusion: We find that interoperability between ICD and HPO via UMLS is limited. While other mapping sources could be incorporated, there are no established conventions for what resources should be used to complement UMLS. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08745 [pdf, other]

Evolutionary Computation for the Design and Enrichment of General-Purpose Artificial Intelligence Systems: Survey and Prospects

Authors: Javier Poyatos, Javier Del Ser, Salvador Garcia, Hisao Ishibuchi, Daniel Molina, Isaac Triguero, Bing Xue, Xin Yao, Francisco Herrera

Abstract: In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal de… ▽ More In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal design of traditional Machine Learning models. Evolutionary Computation (EC) has been a useful tool for both the design and optimization of Machine Learning models, endowing them with the capability to configure and/or adapt themselves to the task under consideration. Therefore, their application to GPAIS is a natural choice. This paper aims to analyze the role of EC in the field of GPAIS, exploring the use of EC for their design or enrichment. We also match GPAIS properties to Machine Learning areas in which EC has had a notable contribution, highlighting recent milestones of EC for GPAIS. Furthermore, we discuss the challenges of harnessing the benefits of EC for GPAIS, presenting different strategies to both design and improve GPAIS with EC, covering tangential areas, identifying research niches, and outlining potential research directions for EC and GPAIS. △ Less

Submitted 3 June, 2024; originally announced July 2024.

arXiv:2407.08728 [pdf, other]

The Potential Impact of Noise Correlation in Next-generation Gravitational Wave Detectors

Authors: Isaac C. F. Wong, Peter T. H. Pang, Milan Wils, Francesco Cireddu, Walter Del Pozzo, Tjonnie G. F. Li

Abstract: Building upon the statistical formulation for parameter estimation in the presence of correlated noise proposed by Cireddu et al., we present an initial study to incorporate the effects of correlated noise into the analyses of various detector designs' performance. We consider a two L-shaped detector configuration located in the European Union, and compare the expectation of parameter estimation b… ▽ More Building upon the statistical formulation for parameter estimation in the presence of correlated noise proposed by Cireddu et al., we present an initial study to incorporate the effects of correlated noise into the analyses of various detector designs' performance. We consider a two L-shaped detector configuration located in the European Union, and compare the expectation of parameter estimation between the non-colocated and a hypothetical colocated configurations. In our study, we posit the existence of low-frequency correlated noise within the $5\text{ Hz}$ to $10\text{ Hz}$ range for the colocated detector configuration, with a varying degree of correlation. In this specific detector setup, our observations indicate an enhancement in the precision of intrinsic parameter measurements as the degree of correlation increases. This trend suggests that higher degrees of noise correlation may beneficially influence the accuracy of parameter estimation. In particular, when the noise is highly correlated, the uncertainty on chirp mass decreases by up to $30\%$. The absence of an inter-European baseline does hinder the estimation of the extrinsic parameters. However, given a realistic global network with the additional detector located in the United States, the uncertainty of extrinsic parameters is significantly reduced. This reduction is further amplified as the degree of noise correlation increases. When noise correlation exceeds a certain level, the colocated configuration outperforms the non-colocated one, reducing the $90\%$ credible area of sky location by up to $10\%$. We conclude that noise correlation significantly impacts detector performance, potentially altering both quantitative and qualitative outcomes. Thus, we recommend including noise correlation in comprehensive assessments of third-generation gravitational wave detector designs. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 10 pages, 2 figures

arXiv:2407.08611 [pdf]

Source-Independent Fault Detection Method for Transmission Lines in IBR-Dominated Grids

Authors: Julio Rodriguez, Isaac Kofi Otchere, Reza Jalilzadeh Hamidi

Abstract: This paper proposes a source-independent method for the detection and classification of faults along Transmission Lines (TLs). It aims to reduce the protection issues arising from Inverter-Based Resources (IBRs). Inspired by Power Line Communication (PLC), the proposed method utilizes high-frequency carrier waves which are sent from either side of a TL over each phase. As faults disrupt the propag… ▽ More This paper proposes a source-independent method for the detection and classification of faults along Transmission Lines (TLs). It aims to reduce the protection issues arising from Inverter-Based Resources (IBRs). Inspired by Power Line Communication (PLC), the proposed method utilizes high-frequency carrier waves which are sent from either side of a TL over each phase. As faults disrupt the propagation of carriers, the receiving carrier waves before and during faults exhibit differences. Based on this principle, the proposed method continuously compares the receiving carrier waves with a short history of them to detect and classify faults. The performance of the proposed method was evaluated using EMTP-RV and MATLAB, and compared to traditional phasor-based distance relays. The simulation results confirm the capability of the proposed method in detection and classification of different faults regardless of power sources types. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 5 pages. This is the preprint of a conference paper which is accepted for presentation at IEEE PES-GM2024. Based on the IEEE regulations, publishing preprints without the full format of IEEE (index number, DOI, etc) is allowed

arXiv:2407.08310 [pdf]

Robust quantum engineering of current flow in carbon nanostructures at room temperature

Authors: Gaetano Calogero, Isaac Alcón, Onurcan Kaya, Nick Papior, Aron W. Cummings, Mads Brandbyge, Stephan Roche

Abstract: Bottom-up on-surface synthesis enables the fabrication of carbon nanostructures with atomic precision. Good examples are graphene nanoribbons (GNRs), 1D conjugated polymers, and nanoporous graphenes (NPGs), which are gathering increasing attention for future carbon nanoelectronics. A key step is the ability to manipulate current flow within these nanomaterials. Destructive quantum interference (QI… ▽ More Bottom-up on-surface synthesis enables the fabrication of carbon nanostructures with atomic precision. Good examples are graphene nanoribbons (GNRs), 1D conjugated polymers, and nanoporous graphenes (NPGs), which are gathering increasing attention for future carbon nanoelectronics. A key step is the ability to manipulate current flow within these nanomaterials. Destructive quantum interference (QI), long studied in the field of single-molecule electronics, has been proposed as the most effective way to achieve such control with molecular-scale precision. However, for practical applications, it is essential that such QI-engineering remains effective near or above room temperature. To assess this important point, here we combine large-scale molecular dynamics simulations and quantum transport calculations and focus our study on NPGs formed as arrays of laterally bonded GNRs. By considering various NPGs with different inter-GNR chemical connections we disentangle the different factors determining electronic transport in these carbon nanomaterials at 300 K. Our findings unequivocally demonstrate that QI survives at room temperature, with thermal vibrations weakly restricting current flow along GNRs while completely blocking transport across GNRs. Our results thus pave the way towards the future realization of QI-engineered carbon nanocircuitry operating at room temperature, which is a fundamental step towards carbon-based nanoelectronics and quantum technologies. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 21 pages, 4 figures

arXiv:2407.08065 [pdf, other]

Towards Interpretable Foundation Models of Robot Behavior: A Task Specific Policy Generation Approach

Authors: Isaac Sheidlower, Reuben Aronson, Elaine Schaertl Short

Abstract: Foundation models are a promising path toward general-purpose and user-friendly robots. The prevalent approach involves training a generalist policy that, like a reinforcement learning policy, uses observations to output actions. Although this approach has seen much success, several concerns arise when considering deployment and end-user interaction with these systems. In particular, the lack of m… ▽ More Foundation models are a promising path toward general-purpose and user-friendly robots. The prevalent approach involves training a generalist policy that, like a reinforcement learning policy, uses observations to output actions. Although this approach has seen much success, several concerns arise when considering deployment and end-user interaction with these systems. In particular, the lack of modularity between tasks means that when model weights are updated (e.g., when a user provides feedback), the behavior in other, unrelated tasks may be affected. This can negatively impact the system's interpretability and usability. We present an alternative approach to the design of robot foundation models, Diffusion for Policy Parameters (DPP), which generates stand-alone, task-specific policies. Since these policies are detached from the foundation model, they are updated only when a user wants, either through feedback or personalization, allowing them to gain a high degree of familiarity with that policy. We demonstrate a proof-of-concept of DPP in simulation then discuss its limitations and the future of interpretable foundation models. △ Less

Submitted 10 July, 2024; originally announced July 2024.

Comments: Short Paper accepted to RLC 2024 Workshop on Training Agents with Foundation Models

arXiv:2407.07742 [pdf, other]

Science-Informed Deep Learning (ScIDL) With Applications to Wireless Communications

Authors: Atefeh Termehchi, Ekram Hossain, Isaac Woungang

Abstract: Given the extensive and growing capabilities offered by deep learning (DL), more researchers are turning to DL to address complex challenges in next-generation (xG) communications. However, despite its progress, DL also reveals several limitations that are becoming increasingly evident. One significant issue is its lack of interpretability, which is especially critical for safety-sensitive applica… ▽ More Given the extensive and growing capabilities offered by deep learning (DL), more researchers are turning to DL to address complex challenges in next-generation (xG) communications. However, despite its progress, DL also reveals several limitations that are becoming increasingly evident. One significant issue is its lack of interpretability, which is especially critical for safety-sensitive applications. Another significant consideration is that DL may not comply with the constraints set by physics laws or given security standards, which are essential for reliable DL. Additionally, DL models often struggle outside their training data distributions, which is known as poor generalization. Moreover, there is a scarcity of theoretical guidance on designing DL algorithms. These challenges have prompted the emergence of a burgeoning field known as science-informed DL (ScIDL). ScIDL aims to integrate existing scientific knowledge with DL techniques to develop more powerful algorithms. The core objective of this article is to provide a brief tutorial on ScIDL that illustrates its building blocks and distinguishes it from conventional DL. Furthermore, we discuss both recent applications of ScIDL and potential future research directions in the field of wireless communications. △ Less

Submitted 28 June, 2024; originally announced July 2024.

arXiv:2407.07186 [pdf, other]

Barely-Visible Surface Crack Detection for Wind Turbine Sustainability

Authors: Sourav Agrawal, Isaac Corley, Conor Wallace, Clovis Vaughn, Jonathan Lwowski

Abstract: The production of wind energy is a crucial part of sustainable development and reducing the reliance on fossil fuels. Maintaining the integrity of wind turbines to produce this energy is a costly and time-consuming task requiring repeated inspection and maintenance. While autonomous drones have proven to make this process more efficient, the algorithms for detecting anomalies to prevent catastroph… ▽ More The production of wind energy is a crucial part of sustainable development and reducing the reliance on fossil fuels. Maintaining the integrity of wind turbines to produce this energy is a costly and time-consuming task requiring repeated inspection and maintenance. While autonomous drones have proven to make this process more efficient, the algorithms for detecting anomalies to prevent catastrophic damage to turbine blades have fallen behind due to some dangerous defects, such as hairline cracks, being barely-visible. Existing datasets and literature are lacking and tend towards detecting obvious and visible defects in addition to not being geographically diverse. In this paper we introduce a novel and diverse dataset of barely-visible hairline cracks collected from numerous wind turbine inspections. To prove the efficacy of our dataset, we detail our end-to-end deployed turbine crack detection pipeline from the image acquisition stage to the use of predictions in providing automated maintenance recommendations to extend the life and efficiency of wind turbines. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.06173 [pdf, other]

Large Row-Constrained Supersaturated Designs for High-throughput Screening

Authors: Byran J. Smucker, Stephen E. Wright, Isaac Williams, Richard C. Page, Andor J. Kiss, Surendra Bikram Silwal, Maria Weese, David J. Edwards

Abstract: High-throughput screening, in which multiwell plates are used to test large numbers of compounds against specific targets, is widely used across many areas of the biological sciences and most prominently in drug discovery. We propose a statistically principled approach to these screening experiments, using the machinery of supersaturated designs and the Lasso. To accommodate limitations on the num… ▽ More High-throughput screening, in which multiwell plates are used to test large numbers of compounds against specific targets, is widely used across many areas of the biological sciences and most prominently in drug discovery. We propose a statistically principled approach to these screening experiments, using the machinery of supersaturated designs and the Lasso. To accommodate limitations on the number of biological entities that can be applied to a single microplate well, we present a new class of row-constrained supersaturated designs. We develop a computational procedure to construct these designs, provide some initial lower bounds on the average squared off-diagonal values of their main-effects information matrix, and study the impact of the constraint on design quality. We also show via simulation that the proposed constrained row screening method is statistically superior to existing methods and demonstrate the use of the new methodology on a real drug-discovery system. △ Less

Submitted 8 July, 2024; originally announced July 2024.

Comments: Supplementary materials can be found at https://sites.miamioh.edu/byran-smucker/research-2/

arXiv:2407.03820 [pdf]

doi 10.1109/TLA.2016.7430109

Use of Mobile Devices in the Classroom to Increase Motivation and Participation of Engineering University Students

Authors: Carlos Guerrero, Antoni Jaume-i-Capó, Carlos Juiz, Isaac Lera

Abstract: The aim of this study was to see whether student participation increased when mobile devices were used in the classroom. We measured the amount of student participative actions when the Socrative tool was used and when it was not used. Our experiment involved a total of 192 students, corresponding to 4 different subjects of Computer Engineering at the Universitat de les Illes Balears, during 2012/… ▽ More The aim of this study was to see whether student participation increased when mobile devices were used in the classroom. We measured the amount of student participative actions when the Socrative tool was used and when it was not used. Our experiment involved a total of 192 students, corresponding to 4 different subjects of Computer Engineering at the Universitat de les Illes Balears, during 2012/2013 and 2013/2014 courses. An independent paired t-test was performed on the measurements. The analysis results show that student participation increases with the use of mobile devices for theory classes and students are willing to participate in class activities and share their own results. △ Less

Submitted 4 July, 2024; originally announced July 2024.

Comments: in Spanish language

Journal ref: IEEE Latin America Transactions. Volume: 14, Issue: 1, January 2016. Pages 411 - 416. ISSN: 1548-0992

arXiv:2407.03573 [pdf, other]

An analytic, moment-based method to estimate orthopositronium lifetimes in positron annihilation lifetime spectroscopy measurements

Authors: Lucas Berens, Isaac Hsu, Chin-Tu Chen, Howard Halpern, Chien-Min Kao

Abstract: The presence of tumor hypoxia is known to correlate with poor patient prognosis. Measurement of tissue oxygen concentration can be challenging, but recent advancements using positron annihilation lifetime spectroscopy (PALS) in three-dimensional positron emission tomography (PET) scans have shown promise for hypoxia detection. In this work, a novel method for estimating the orthopositronium lifeti… ▽ More The presence of tumor hypoxia is known to correlate with poor patient prognosis. Measurement of tissue oxygen concentration can be challenging, but recent advancements using positron annihilation lifetime spectroscopy (PALS) in three-dimensional positron emission tomography (PET) scans have shown promise for hypoxia detection. In this work, a novel method for estimating the orthopositronium lifetime in PALS is presented. This method is analytical and uses moments of the time-difference histogram from photon arrival times. For sufficient statistical power, the method produces monotonic, stable estimates. For cases with a lower number of photon counts, the method was characterized and solutions are presented to correct for bias and estimation variability. △ Less

Submitted 3 July, 2024; originally announced July 2024.

arXiv:2407.00576 [pdf, other]

The In-Medium Similarity Renormalization Group at Finite Temperature

Authors: Isaac G. Smith, Heiko Hergert, Scott K. Bogner

Abstract: The study of nuclei at finite temperature is of immense interest for many areas of nuclear astrophysics and nuclear-reaction science. A variety of ab initio methods are now available for computing the properties of nuclei from interactions rooted in Quantum Chromodynamics, but applications have largely been limited to zero temperature. In the present work, we extend one such method, the In-Medium… ▽ More The study of nuclei at finite temperature is of immense interest for many areas of nuclear astrophysics and nuclear-reaction science. A variety of ab initio methods are now available for computing the properties of nuclei from interactions rooted in Quantum Chromodynamics, but applications have largely been limited to zero temperature. In the present work, we extend one such method, the In-Medium Similarity Renormalization Group (IMSRG), to finite temperature. Using an exactly-solvable schematic model that captures essential features of nuclear interactions, we show that the FT-IMSRG can accurately determine the energetics of nuclei at finite temperature, and we explore the accuracy of the FT-IMSRG in different parameter regimes, e.g., strong and weak pairing. In anticipation of FT-IMSRG applications for finite nuclei and infinite matter, we discuss differences arising from the choice of working with the canonical and the grand canonical ensembles. In future work, we will apply the FT-IMSRG with realistic nuclear interactions to compute nuclear structure and reaction properties at finite temperature, which are important ingredients for understanding nucleosynthesis in stellar environments, or modeling reactions of hot compound nuclei. △ Less

Submitted 29 June, 2024; originally announced July 2024.

Comments: 14 pages, 11 figures

arXiv:2407.00031 [pdf, other]

Supercharging Federated Learning with Flower and NVIDIA FLARE

Authors: Holger R. Roth, Daniel J. Beutel, Yan Cheng, Javier Fernandez Marques, Heng Pan, Chester Chen, Zhihong Zhang, Yuhong Wen, Sean Yang, Isaac, Yang, Yuan-Ting Hsieh, Ziyue Xu, Daguang Xu, Nicholas D. Lane, Andrew Feng

Abstract: Several open-source systems, such as Flower and NVIDIA FLARE, have been developed in recent years while focusing on different aspects of federated learning (FL). Flower is dedicated to implementing a cohesive approach to FL, analytics, and evaluation. Over time, Flower has cultivated extensive strategies and algorithms tailored for FL application development, fostering a vibrant FL community in re… ▽ More Several open-source systems, such as Flower and NVIDIA FLARE, have been developed in recent years while focusing on different aspects of federated learning (FL). Flower is dedicated to implementing a cohesive approach to FL, analytics, and evaluation. Over time, Flower has cultivated extensive strategies and algorithms tailored for FL application development, fostering a vibrant FL community in research and industry. Conversely, FLARE has prioritized the creation of an enterprise-ready, resilient runtime environment explicitly designed for FL applications in production environments. In this paper, we describe our initial integration of both frameworks and show how they can work together to supercharge the FL ecosystem as a whole. Through the seamless integration of Flower and FLARE, applications crafted within the Flower framework can effortlessly operate within the FLARE runtime environment without necessitating any modifications. This initial integration streamlines the process, eliminating complexities and ensuring smooth interoperability between the two platforms, thus enhancing the overall efficiency and accessibility of FL applications. △ Less

Submitted 21 May, 2024; originally announced July 2024.

arXiv:2406.19038 [pdf, other]

Binary neutron star mergers using a discontinuous Galerkin-finite difference hybrid method

Authors: Nils Deppe, Francois Foucart, Marceline S. Bonilla, Michael Boyle, Nicholas J. Corso, Matthew D. Duez, Matthew Giesler, François Hébert, Lawrence E. Kidder, Yoonsoo Kim, Prayush Kumar, Isaac Legred, Geoffrey Lovelace, Elias R. Most, Jordan Moxon, Kyle C. Nelli, Harald P. Pfeiffer, Mark A. Scheel, Saul A. Teukolsky, William Throwe, Nils L. Vu

Abstract: We present a discontinuous Galerkin-finite difference hybrid scheme that allows high-order shock capturing with the discontinuous Galerkin method for general relativistic magnetohydrodynamics in dynamical spacetimes. We present several optimizations and stability improvements to our algorithm that allow the hybrid method to successfully simulate single, rotating, and binary neutron stars. The hybr… ▽ More We present a discontinuous Galerkin-finite difference hybrid scheme that allows high-order shock capturing with the discontinuous Galerkin method for general relativistic magnetohydrodynamics in dynamical spacetimes. We present several optimizations and stability improvements to our algorithm that allow the hybrid method to successfully simulate single, rotating, and binary neutron stars. The hybrid method achieves the efficiency of discontinuous Galerkin methods throughout almost the entire spacetime during the inspiral phase, while being able to robustly capture shocks and resolve the stellar surfaces. We also use Cauchy-Characteristic evolution to compute the first gravitational waveforms at future null infinity from binary neutron star mergers. The simulations presented here are the first successful binary neutron star inspiral and merger simulations using discontinuous Galerkin methods. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 31 pages, 8 figures, comments welcome!

arXiv:2406.18665 [pdf, other]

RouteLLM: Learning to Route LLMs with Preference Data

Authors: Isaac Ong, Amjad Almahairi, Vincent Wu, Wei-Lin Chiang, Tianhao Wu, Joseph E. Gonzalez, M Waleed Kadous, Ion Stoica

Abstract: Large language models (LLMs) exhibit impressive capabilities across a wide range of tasks, yet the choice of which model to use often involves a trade-off between performance and cost. More powerful models, though effective, come with higher expenses, while less capable models are more cost-effective. To address this dilemma, we propose several efficient router models that dynamically select betwe… ▽ More Large language models (LLMs) exhibit impressive capabilities across a wide range of tasks, yet the choice of which model to use often involves a trade-off between performance and cost. More powerful models, though effective, come with higher expenses, while less capable models are more cost-effective. To address this dilemma, we propose several efficient router models that dynamically select between a stronger and a weaker LLM during inference, aiming to optimize the balance between cost and response quality. We develop a training framework for these routers leveraging human preference data and data augmentation techniques to enhance performance. Our evaluation on widely-recognized benchmarks shows that our approach significantly reduces costs-by over 2 times in certain cases-without compromising the quality of responses. Interestingly, our router models also demonstrate significant transfer learning capabilities, maintaining their performance even when the strong and weak models are changed at test time. This highlights the potential of these routers to provide a cost-effective yet high-performance solution for deploying LLMs. △ Less

Submitted 1 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

arXiv:2406.18424 [pdf, other]

Upgrading SPHERE with the second stage AO system SAXO+: non-common path aberrations estimation and correction

Authors: Johan Mazoyer, Charles Goulas, Fabrice Vidal, Isaac Bernardino Dinis, Julien Milli, Michel Tallon, Raphaël Galicher, Oliver Absil, Clémentine Béchet, Anthony Boccaletti, Florian Ferreira, Maud Langlois, Patrice Martinez, Laurent Mugnier, Mamadou N'diaye, Gilles Orban de Xivry, Axel Potier, Isabelle Tallon-Bosc, Arthur Vigan

Abstract: SAXO+ is a planned enhancement of the existing SAXO, the VLT/ SPHERE adaptive optics system, deployed on ESO's Very Large Telescope. This upgrade is designed to significantly enhance the instrument's capacity to detect and analyze young Jupiter-like planets. The pivotal addition in SAXO+ is a second-stage adaptive optics system featuring a dedicated near-infrared pyramid wavefront sensor and a sec… ▽ More SAXO+ is a planned enhancement of the existing SAXO, the VLT/ SPHERE adaptive optics system, deployed on ESO's Very Large Telescope. This upgrade is designed to significantly enhance the instrument's capacity to detect and analyze young Jupiter-like planets. The pivotal addition in SAXO+ is a second-stage adaptive optics system featuring a dedicated near-infrared pyramid wavefront sensor and a second deformable mirror. This secondary stage is strategically integrated to address any residual wavefront errors persisting after the initial correction performed by the current primary AO loop, SAXO. However, several recent studies clearly showed that in good conditions, even in the current system SAXO, non-common path aberrations (NCPAs) are the limiting factor of the final normalized intensity in focal plane, which is the final metric for ground-based high-contrast instruments. This is likely to be even more so the case with the new AO system, with which the AO residuals will be minimized. Several techniques have already been extensively tested on SPHERE in internal source and/or on-sky and will be presented in this paper. However, the use of a new type of sensor for the second stage, a pyramid wavefront sensor, will likely complicate the correction of these aberrations. Using an end-to-end AO simulation tool, we conducted simulations to gauge the effect of measured SPHERE NCPAs in the coronagraphic image on the second loop system and their correction using focal plane wavefront sensing systems. We finally analyzed how the chosen position of SAXO+ in the beam will impact the evolution of the NCPAs in the new instrument. △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: 16 pages, 10 figures, submitted to the proceedings of SPIE Astronomical Telescopes + Instrumentation 2024, 13096-357

arXiv:2406.17759 [pdf, other]

Interpreting Attention Layer Outputs with Sparse Autoencoders

Authors: Connor Kissane, Robert Krzyzanowski, Joseph Isaac Bloom, Arthur Conmy, Neel Nanda

Abstract: Decomposing model activations into interpretable components is a key open problem in mechanistic interpretability. Sparse autoencoders (SAEs) are a popular method for decomposing the internal activations of trained transformers into sparse, interpretable features, and have been applied to MLP layers and the residual stream. In this work we train SAEs on attention layer outputs and show that also h… ▽ More Decomposing model activations into interpretable components is a key open problem in mechanistic interpretability. Sparse autoencoders (SAEs) are a popular method for decomposing the internal activations of trained transformers into sparse, interpretable features, and have been applied to MLP layers and the residual stream. In this work we train SAEs on attention layer outputs and show that also here SAEs find a sparse, interpretable decomposition. We demonstrate this on transformers from several model families and up to 2B parameters. We perform a qualitative study of the features computed by attention layers, and find multiple families: long-range context, short-range context and induction features. We qualitatively study the role of every head in GPT-2 Small, and estimate that at least 90% of the heads are polysemantic, i.e. have multiple unrelated roles. Further, we show that Sparse Autoencoders are a useful tool that enable researchers to explain model behavior in greater detail than prior work. For example, we explore the mystery of why models have so many seemingly redundant induction heads, use SAEs to motivate the hypothesis that some are long-prefix whereas others are short-prefix, and confirm this with more rigorous analysis. We use our SAEs to analyze the computation performed by the Indirect Object Identification circuit (Wang et al.), validating that the SAEs find causally meaningful intermediate variables, and deepening our understanding of the semantics of the circuit. We open-source the trained SAEs and a tool for exploring arbitrary prompts through the lens of Attention Output SAEs. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.17644 [pdf, other]

Numerical simulations for the SAXO+ upgrade: Performance analysis of the adaptive optics system

Authors: Charles Goulas, Raphaël Galicher, Fabrice Vidal, Johan Mazoyer, Florian Ferreira, Arnaud Sevin, Anthony Boccaletti, Eric Gendron, Clémentine Béchet, Michel Tallon, Maud Langlois, Caroline Kulcsár, Henri-François Raynaud, Nicolas Galland, Laura Schreiber, Isaac Bernardino Dinis, François Wildi, Gaël Chauvin, Julien Milli

Abstract: SPHERE, operating at the VLT since 2014, is currently one of the high-contrast instruments with a higher performance. Its adaptive optics system, known as SAXO, will be upgraded to SAXO+, which features the addition of a second stage of adaptive optics. This stage will use a near-infrared pyramid wavefront sensor to record images of fainter exoplanets around redder stars. In this work, we compare… ▽ More SPHERE, operating at the VLT since 2014, is currently one of the high-contrast instruments with a higher performance. Its adaptive optics system, known as SAXO, will be upgraded to SAXO+, which features the addition of a second stage of adaptive optics. This stage will use a near-infrared pyramid wavefront sensor to record images of fainter exoplanets around redder stars. In this work, we compare the performance of SAXO and SAXO+. We look for the optimal values of the key system parameters of SAXO+ for various science cases and turbulence conditions. We performed numerical simulations using COMPASS, an end-to-end adaptive optics simulation tool. We simulated perfect coronagraph images of an on-axis point source, and we minimized the residual starlight intensity between 3 and 5 $λ/D$ as a performance criterion. The explored parameter space includes science cases, turbulence conditions, and key system parameters. In every science case and turbulence condition, SAXO+ reduces the residual starlight intensity inside the correction zone of the second stage by a factor of ten compared to SAXO. The optimal first stage gain is lower for SAXO+ than for SAXO alone. We quantified the gain in performance of SAXO+ when changing the second stage frequency from 2 kHz to 3 kHz, and we conclude that 2 kHz may be sufficient for most realistic conditions. We give the optimal first stage gain as well as the first and second stage frequencies for every seeing, coherence time, and science case. Finally, we find that a 2 ${λ_{\mathrm{WFS}}}/D$ pyramid modulation radius is a good trade-off between performance and robustness against varying turbulence conditions. This study shows that the future SAXO+ system will outperform the current SAXO system in all studied cases. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: 14 pages, 11 figures

arXiv:2406.17625 [pdf]

Using skateboarding to develop a culturally relevant tutorial on static equilibrium

Authors: Gian Viray, Isaac Cheney, Tong Wan

Abstract: Culturally relevant pedagogy (CRP), initially developed by Ladson-Billings, is an instructional framework for supporting diverse learners by drawing on their cultural backgrounds and experiences. In line with the CRP framework, we developed a tutorial on static equilibrium using skateboarding, a popular activity on university campuses, as a culturally relevant context. To address specific student… ▽ More Culturally relevant pedagogy (CRP), initially developed by Ladson-Billings, is an instructional framework for supporting diverse learners by drawing on their cultural backgrounds and experiences. In line with the CRP framework, we developed a tutorial on static equilibrium using skateboarding, a popular activity on university campuses, as a culturally relevant context. To address specific student conceptions about static equilibrium documented in the physics education research (PER) literature, we used the elicit-confront-resolve (ECR) strategy to develop the tutorial. In this paper, we provide a detailed account of how we operationalized the ECR strategy in designing the sequences of questions in the tutorial. Additionally, we present anecdotal evidence to show that the culturally relevant tutorial appears to effectively engage students and motivate their interest in learning physics. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.17142 [pdf, other]

Continuous drive heterodyne microwave sensing with spin qubits in hexagonal boron nitride

Authors: Charlie J. Patrickson, Valentin Haemmerli, Shi Guo, Andrew J. Ramsay, Isaac J. Luxmoore

Abstract: Quantum sensors that use solid state spin defects have emerged as effective probes of weak alternating magnetic signals. By recording the phase of a signal relative to an external clock, these devices can resolve signal frequencies to a precision orders of magnitude longer than the spin state lifetime. However, these quantum heterodyne protocols suffer from sub-optimal sensitivity, as they are cur… ▽ More Quantum sensors that use solid state spin defects have emerged as effective probes of weak alternating magnetic signals. By recording the phase of a signal relative to an external clock, these devices can resolve signal frequencies to a precision orders of magnitude longer than the spin state lifetime. However, these quantum heterodyne protocols suffer from sub-optimal sensitivity, as they are currently limited to pulsed spin control techniques, which are susceptible to cumulative pulse-area errors, or single continuous drives which offer no protection of the spin coherence. Here, we present a control scheme based on a continuous microwave drive that extends spin coherence towards the effective $T_2 \approx \frac{1}{2}T_1$ limit and can resolve the frequency, amplitude and phase of GHz magnetic fields. The scheme is demonstrated using an ensemble of boron vacancies in hexagonal boron nitride, and achieves an amplitude sensitivity of $η\approx 3-5 \:\mathrm{μT \sqrt{Hz}}$ and phase sensitivity of $η_φ \approx 0.076 \:\mathrm{rads \sqrt{Hz}}$. By repeatedly referencing the phase of a resonant signal against the coherent continuous microwave drive in a quantum heterodyne demonstration, we measure a GHz signal with a resolution $<$1 Hz over a 10 s measurement. Achieving this level of performance in a two-dimensional material platform could have broad applications, from probing nanoscale condensed matter systems to integration into heterostructures for quantum networking. △ Less

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.15855 [pdf, other]

Irida-Graphene Phonon Thermal Transport via Non-equilibrium Molecular Dynamics Simulations

Authors: Isaac M. Felix, Raphael M. Tromer, Leonardo D. Machado, Douglas S. Galvão, Luiz A. Ribeiro Jr, Marcelo L. Pereira Jr

Abstract: Recently, a new 2D carbon allotrope called Irida-Graphene (Irida-G) was proposed. Irida-G consists of a flat sheet topologically arranged into 3-6-8 carbon rings exhibiting metallic and non-magnetic properties. In this study, we investigated the thermal transport properties of Irida-G using classical reactive molecular dynamics simulations. The findings indicate that Irida-G has an intrinsic therm… ▽ More Recently, a new 2D carbon allotrope called Irida-Graphene (Irida-G) was proposed. Irida-G consists of a flat sheet topologically arranged into 3-6-8 carbon rings exhibiting metallic and non-magnetic properties. In this study, we investigated the thermal transport properties of Irida-G using classical reactive molecular dynamics simulations. The findings indicate that Irida-G has an intrinsic thermal conductivity of approximately 215 W/mK at room temperature, significantly lower than that of pristine graphene. This decrease is due to characteristic phonon scattering within Irida-G's porous structure. Additionally, the phonon group velocities and vibrational density of states for Irida-G were analyzed, revealing reduced average phonon group velocities compared to graphene. The thermal conductivity of Irida-G is isotropic and shows significant size effects, transitioning from ballistic to diffusive heat transport regimes as the system length increases. These results suggest that while Irida-G has lower thermal conductivity than graphene, it still holds potential for specific thermal management applications, sharing characteristics with other two-dimensional materials. △ Less

Submitted 28 June, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

Comments: 09 pages, 06 figures

MSC Class: 00-xx ACM Class: J.2; I.6

arXiv:2406.15543 [pdf, other]

doi 10.3847/1538-3881/ad58e0

Multiple Clues for Dayside Aerosols and Temperature Gradients in WASP-69 b from a Panchromatic JWST Emission Spectrum

Authors: Everett Schlawin, Sagnick Mukherjee, Kazumasa Ohno, Taylor Bell, Thomas G. Beatty, Thomas P. Greene, Michael Line, Ryan C. Challener, Vivien Parmentier, Jonathan J. Fortney, Emily Rauscher, Lindsey Wiser, Luis Welbanks, Matthew Murphy, Isaac Edelman, Natasha Batalha, Sarah E. Moran, Nishil Mehta, Marcia Rieke

Abstract: WASP-69 b is a hot, inflated, Saturn-mass planet 0.26 Mjup with a zero-albedo equilibrium temperature of 963 K. Here, we report the JWST 2 to 12 um emission spectrum of the planet consisting of two eclipses observed with NIRCam grism time series and one eclipse observed with MIRI LRS. The emission spectrum shows absorption features of water vapor, carbon dioxide and carbon monoxide, but no strong… ▽ More WASP-69 b is a hot, inflated, Saturn-mass planet 0.26 Mjup with a zero-albedo equilibrium temperature of 963 K. Here, we report the JWST 2 to 12 um emission spectrum of the planet consisting of two eclipses observed with NIRCam grism time series and one eclipse observed with MIRI LRS. The emission spectrum shows absorption features of water vapor, carbon dioxide and carbon monoxide, but no strong evidence for methane. WASP-69 b's emission spectrum is poorly fit by cloud-free homogeneous models. We find three possible model scenarios for the planet: 1) a Scattering Model that raises the brightness at short wavelengths with a free Geometric Albedo parameter 2) a Cloud Layer model that includes high altitude silicate aerosols to moderate long wavelength emission and 3) a Two-Region model that includes significant dayside inhomogeneity and cloud opacity with two different temperature-pressure profiles. In all cases, aerosols are needed to fit the spectrum of the planet. The Scattering model requires an unexpectedly high Geometric Albedo of 0.64. Our atmospheric retrievals indicate inefficient redistribution of heat and an inhomogeneous dayside distribution, which is tentatively supported by MIRI LRS broadband eclipse maps that show a central concentration of brightness. Our more plausible models (2 and 3) retrieve chemical abundances enriched in heavy elements relative to solar composition by 6x to 14x solar and a C/O ratio of 0.65 to 0.94, whereas the less plausible highly reflective scenario (1) retrieves a slightly lower metallicity and lower C/O ratio. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: 41 pages, 19 figures, accepted to the Astronomical Journal

arXiv:2406.13843 [pdf, other]

Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data

Authors: Nahema Marchal, Rachel Xu, Rasmi Elasmar, Iason Gabriel, Beth Goldberg, William Isaac

Abstract: Generative, multimodal artificial intelligence (GenAI) offers transformative potential across industries, but its misuse poses significant risks. Prior research has shed light on the potential of advanced AI systems to be exploited for malicious purposes. However, we still lack a concrete understanding of how GenAI models are specifically exploited or abused in practice, including the tactics empl… ▽ More Generative, multimodal artificial intelligence (GenAI) offers transformative potential across industries, but its misuse poses significant risks. Prior research has shed light on the potential of advanced AI systems to be exploited for malicious purposes. However, we still lack a concrete understanding of how GenAI models are specifically exploited or abused in practice, including the tactics employed to inflict harm. In this paper, we present a taxonomy of GenAI misuse tactics, informed by existing academic literature and a qualitative analysis of approximately 200 observed incidents of misuse reported between January 2023 and March 2024. Through this analysis, we illuminate key and novel patterns in misuse during this time period, including potential motivations, strategies, and how attackers leverage and abuse system capabilities across modalities (e.g. image, text, audio, video) in the wild. △ Less

Submitted 21 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.13711 [pdf, other]

Imagining In-distribution States: How Predictable Robot Behavior Can Enable User Control Over Learned Policies

Authors: Isaac Sheidlower, Emma Bethel, Douglas Lilly, Reuben M. Aronson, Elaine Schaertl Short

Abstract: It is crucial that users are empowered to take advantage of the functionality of a robot and use their understanding of that functionality to perform novel and creative tasks. Given a robot trained with Reinforcement Learning (RL), a user may wish to leverage that autonomy along with their familiarity of how they expect the robot to behave to collaborate with the robot. One technique is for the us… ▽ More It is crucial that users are empowered to take advantage of the functionality of a robot and use their understanding of that functionality to perform novel and creative tasks. Given a robot trained with Reinforcement Learning (RL), a user may wish to leverage that autonomy along with their familiarity of how they expect the robot to behave to collaborate with the robot. One technique is for the user to take control of some of the robot's action space through teleoperation, allowing the RL policy to simultaneously control the rest. We formalize this type of shared control as Partitioned Control (PC). However, this may not be possible using an out-of-the-box RL policy. For example, a user's control may bring the robot into a failure state from the policy's perspective, causing it to act unexpectedly and hindering the success of the user's desired task. In this work, we formalize this problem and present Imaginary Out-of-Distribution Actions, IODA, an initial algorithm which empowers users to leverage their expectations of a robot's behavior to accomplish new tasks. We deploy IODA in a user study with a real robot and find that IODA leads to both better task performance and a higher degree of alignment between robot behavior and user expectation. We also show that in PC, there is a strong and significant correlation between task performance and the robot's ability to meet user expectations, highlighting the need for approaches like IODA. Code is available at https://github.com/AABL-Lab/ioda_roman_2024 △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: Accepted to IEEE RO-MAN 2024 as a regular paper. arXiv admin note: substantial text overlap with arXiv:2312.05991

arXiv:2406.13549 [pdf]

Hardware Realization of Neuromorphic Computing with a 4-Port Photonic Reservoir for Modulation Format Identification

Authors: Enes Şeker, Rijil Thomas, Guillermo von Hünefeld, Stephan Suckow, Mahdi Kaveh, Gregor Ronniger, Pooyan Safari, Isaac Sackey, David Stahl, Colja Schubert, Johannes Karl Fischer, Ronald Freund, Max C. Lemme

Abstract: The fields of machine learning and artificial intelligence drive researchers to explore energy-efficient, brain-inspired new hardware. Reservoir computing encompasses recurrent neural networks for sequential data processing and matches the performance of other recurrent networks with less training and lower costs. However, traditional software-based neural networks suffer from high energy consumpt… ▽ More The fields of machine learning and artificial intelligence drive researchers to explore energy-efficient, brain-inspired new hardware. Reservoir computing encompasses recurrent neural networks for sequential data processing and matches the performance of other recurrent networks with less training and lower costs. However, traditional software-based neural networks suffer from high energy consumption due to computational demands and massive data transfer needs. Photonic reservoir computing overcomes this challenge with energy-efficient neuromorphic photonic integrated circuits or NeuroPICs. Here, we introduce a reservoir NeuroPIC used for modulation format identification in C-band telecommunication network monitoring. It is built on a silicon-on-insulator platform with a 4-port reservoir architecture consisting of a set of physical nodes connected via delay lines. We comprehensively describe the NeuroPIC design and fabrication, experimentally demonstrate its performance, and compare it with simulations. The NeuroPIC incorporates non-linearity through a simple digital readout and achieves close to 100% accuracy in identifying several configurations of quadrature amplitude modulation formats transmitted over 20 km of optical fiber at 32 GBaud symbol rate. The NeuroPIC performance is robust against fabrication imperfections like waveguide propagation loss, phase randomization, etc. and delay line length variations. Furthermore, the experimental results exceeded numerical simulations, which we attribute to enhanced signal interference in the experimental NeuroPIC output. Our energy-efficient photonic approach has the potential for high-speed temporal data processing in a variety of applications. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: 32 pages, including supporting information

arXiv:2406.13407

Predicting BN analogue of 8-16-4 graphyne: \textit{In silico} insights into its structural, electronic, optical, and thermal transport properties

Authors: Isaac M. Félix, Jessé M. Pontes, Djardiel S. Gomes, Thiago B. G. Guerra, Sérgio A. F. Azevedo, Leonardo D. Machado, Lídia C. Gomes, Raphael M. Tromer

Abstract: The boron nitride (BN) analogue of 8-16-4 graphyne, termed SBNyne, is proposed for the first time. Its physical properties were explored using first-principles calculations and classical molecular dynamics (MD) simulations. Thermal stability assessments reveal that SBNyne maintains structural integrity up to 1000 K. We found that SBNyne exhibits a wide indirect bandgap of 4.58 eV using HSE06 and 3… ▽ More The boron nitride (BN) analogue of 8-16-4 graphyne, termed SBNyne, is proposed for the first time. Its physical properties were explored using first-principles calculations and classical molecular dynamics (MD) simulations. Thermal stability assessments reveal that SBNyne maintains structural integrity up to 1000 K. We found that SBNyne exhibits a wide indirect bandgap of 4.58 eV using HSE06 and 3.20 eV using PBE. It displays strong optical absorption in the ultraviolet region while remaining transparent in the infrared and visible regions. Additionally, SBNyne exhibits significantly lower thermal conductivity compared to h-BN. Phonon spectrum analysis indicates that out-of-plane phonons predominantly contribute to the vibrational density of states only at very low frequencies, explaining its low thermal conductivity. These findings expand the knowledge of BN-based 2D materials and open new avenues for their design and advanced technological applications. △ Less

Submitted 2 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

Comments: We have reviewed the thermal stability calculation and found that the SBNyne structure is metastable and undergoes a transition to a new phase. We are currently investigating this new phase, and to avoid misunderstandings, we need to remove the preprint

arXiv:2406.12981 [pdf, other]

Quantum geometry of bosonic Bogoliubov quasiparticles

Authors: Isaac Tesfaye, André Eckardt

Abstract: Topological and geometrical features arising in bosonic Bogoliubov-de Gennes (BdG) systems have mainly been studied by utilizing a generalized symplectic version of the Berry curvature and related Chern numbers. Here, we propose a symplectic quantum geometric tensor (SQGT), whose imaginary part leads to the previously studied symplectic Berry curvature, while the real part gives rise to a symplect… ▽ More Topological and geometrical features arising in bosonic Bogoliubov-de Gennes (BdG) systems have mainly been studied by utilizing a generalized symplectic version of the Berry curvature and related Chern numbers. Here, we propose a symplectic quantum geometric tensor (SQGT), whose imaginary part leads to the previously studied symplectic Berry curvature, while the real part gives rise to a symplectic quantum metric, providing a natural distance measure in the space of bosonic Bogoliubov modes. We propose how to measure all components of the SQGT by extracting excitation rates in response to periodic modulations of the systems' parameters. Moreover, we connect the symplectic Berry curvature to a generalized symplectic anomalous velocity term for Bogoliubov Bloch wave packets. We test our results for a bosonic Bogoliubov-Haldane model. △ Less

Submitted 16 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

Comments: 8+23 pages, 3+2 figures

arXiv:2406.11757 [pdf, other]

STAR: SocioTechnical Approach to Red Teaming Language Models

Authors: Laura Weidinger, John Mellor, Bernat Guillen Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, Stevie Bergman, Mikel Rodriguez, Verena Rieser, William Isaac

Abstract: This research introduces STAR, a sociotechnical framework that improves on current best practices for red teaming safety of large language models. STAR makes two key contributions: it enhances steerability by generating parameterised instructions for human red teamers, leading to improved coverage of the risk surface. Parameterised instructions also provide more detailed insights into model failur… ▽ More This research introduces STAR, a sociotechnical framework that improves on current best practices for red teaming safety of large language models. STAR makes two key contributions: it enhances steerability by generating parameterised instructions for human red teamers, leading to improved coverage of the risk surface. Parameterised instructions also provide more detailed insights into model failures at no increased cost. Second, STAR improves signal quality by matching demographics to assess harms for specific groups, resulting in more sensitive annotations. STAR further employs a novel step of arbitration to leverage diverse viewpoints and improve label reliability, treating disagreement not as noise but as a valuable contribution to signal quality. △ Less

Submitted 10 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

Comments: 8 pages, 5 figures, 5 pages appendix. * denotes equal contribution

arXiv:2406.11574 [pdf, ps, other]

Non-unitary Coupled Cluster Enabled by Mid-circuit Measurements on Quantum Computers

Authors: Alexandre Fleury, James Brown, Erika Lloyd, Maritza Hernandez, Isaac H. Kim

Abstract: Many quantum algorithms rely on a quality initial state for optimal performance. Preparing an initial state for specific applications can considerably reduce the cost of probabilistic algorithms such as the well studied quantum phase estimation (QPE). Fortunately, in the application space of quantum chemistry, generating approximate wave functions for molecular systems is well studied, and quantum… ▽ More Many quantum algorithms rely on a quality initial state for optimal performance. Preparing an initial state for specific applications can considerably reduce the cost of probabilistic algorithms such as the well studied quantum phase estimation (QPE). Fortunately, in the application space of quantum chemistry, generating approximate wave functions for molecular systems is well studied, and quantum computing algorithms stand to benefit from importing these classical methods directly into a quantum circuit. In this work, we propose a state preparation method based on coupled cluster (CC) theory, which is a pillar of quantum chemistry on classical computers, by incorporating mid-circuit measurements into the circuit construction. Currently, the most well studied state preparation method for quantum chemistry on quantum computers is the variational quantum eigensolver (VQE) with a unitary-CC with single- and double-electron excitation terms (UCCSD) ansatz whose operations are limited to unitary gates. We verify the accuracy of our state preparation protocol using mid-circuit measurements by performing energy evaluation and state overlap computation for a set of small chemical systems. We further demonstrate that our approach leads to a reduction of the classical computation overhead, and the number of CNOT and T gates by 28% and 57% on average when compared against the standard VQE-UCCSD protocol. △ Less

Submitted 28 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

Comments: 26 pages, 6 figures; title changed, references added

arXiv:2406.11413 [pdf]

doi 10.1109/TLA.2019.8931204

A platform for lightweight deployment of IoT applications based on a Function-as-a-Service model

Authors: Sebastià Sansó, Carlos Guerrero, Isaac Lera, Carlos Juiz

Abstract: This paper presents a platform to facilitate the deployment of applications in Internet of Things (IoT) devices. The platform allows to the programmers to use a Function-as-a-Service programming paradigm that are managed and configured in a Platform-as-a-Service web tool. The tool also allows to establish interoperability between the functions of the applications. The proposed platform obtained fa… ▽ More This paper presents a platform to facilitate the deployment of applications in Internet of Things (IoT) devices. The platform allows to the programmers to use a Function-as-a-Service programming paradigm that are managed and configured in a Platform-as-a-Service web tool. The tool also allows to establish interoperability between the functions of the applications. The proposed platform obtained faster and easier deployments of the applications and the resource usages of the IoT devices also were lower in relation to a deployment process based in containers of Docker. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: in Spanish language

Journal ref: IEEE Latin America Transactions. Volume: 17, Issue: 07, July 2019

arXiv:2406.09893 [pdf]

The nucleosynthetic fingerprint of the outermost protoplanetary disk and early Solar System dynamics

Authors: Elishevah van Kooten, Xuchao Zhao, Ian Franchi, Po-Yen Tung, Simon Fairclough, John Walmsley, Isaac Onyett, Martin Schiller, Martin Bizzarro

Abstract: Knowledge of the nucleosynthetic isotope composition of the outermost protoplanetary disk is critical to understand the formation and early dynamical evolution of the Solar System. We report the discovery of outer disk material preserved in a pristine meteorite based on its chemical composition, organic-rich petrology, and 15N-rich, deuterium-rich, and 16O-poor isotope signatures. We infer that th… ▽ More Knowledge of the nucleosynthetic isotope composition of the outermost protoplanetary disk is critical to understand the formation and early dynamical evolution of the Solar System. We report the discovery of outer disk material preserved in a pristine meteorite based on its chemical composition, organic-rich petrology, and 15N-rich, deuterium-rich, and 16O-poor isotope signatures. We infer that this outer disk material originated in the comet-forming region. The nucleosynthetic Fe, Mg, Si and Cr compositions of this material reveal that, contrary to current belief, the isotope signature of the comet-forming region is ubiquitous amongst outer Solar System bodies, possibly reflecting an important planetary building block in the outer Solar System. This nucleosynthetic component represents fresh material added to the outer disk by late accretion streamers connected to the ambient molecular cloud. Our results show that most Solar System carbonaceous asteroids accreted material from the comet-forming region, a signature lacking in the terrestrial planet region. △ Less

Submitted 14 June, 2024; originally announced June 2024.

Comments: Accepted manuscript, pre-print version

Journal ref: Science Advances, 2024

arXiv:2406.09824 [pdf, other]

doi 10.1002/cpe.5343

Optimization policy for file replica placement in fog domains

Authors: Carlos Guerrero, Isaac Lera, Carlos Juiz

Abstract: Fog computing architectures distribute computational and storage resources along the continuum from the cloud to things. Therefore, the execution of services or the storage of files can be closer to the users. The main objectives of fog computing domains are to reduce the user latency and the network usage. Availability is also an issue in fog architectures because the topology of the network does… ▽ More Fog computing architectures distribute computational and storage resources along the continuum from the cloud to things. Therefore, the execution of services or the storage of files can be closer to the users. The main objectives of fog computing domains are to reduce the user latency and the network usage. Availability is also an issue in fog architectures because the topology of the network does not guarantee redundant links between devices. Consequently, the definition of placement polices is a key challenge. We propose a placement policy for data replication to increase data availability that contrasts with other storage policies that only consider a single replica of the files. The system is modeled with complex weighted networks and topological features, such as centrality indices. Graph partition algorithms are evaluated to select the fog devices that store data replicas. Our approach is compared with two other placement policies: one that stores only one replica and FogStore, which also stores file replicas but uses a greedy approach (the shortest path). We analyze 22 experiments with simulations. The results show that our approach obtains the shortest latency times, mainly for writing operations, a smaller network usage increase, and a similar file availability to FogStore. △ Less

Submitted 14 June, 2024; originally announced June 2024.

Journal ref: Concurrency Computat Pract Exper. 2020; 32:e5343

arXiv:2406.09714 [pdf, other]

Large language model validity via enhanced conformal prediction methods

Authors: John J. Cherian, Isaac Gibbs, Emmanuel J. Candès

Abstract: We develop new conformal inference methods for obtaining validity guarantees on the output of large language models (LLMs). Prior work in conformal language modeling identifies a subset of the text that satisfies a high-probability guarantee of correctness. These methods work by filtering claims from the LLM's original response if a scoring function evaluated on the claim fails to exceed a thresho… ▽ More We develop new conformal inference methods for obtaining validity guarantees on the output of large language models (LLMs). Prior work in conformal language modeling identifies a subset of the text that satisfies a high-probability guarantee of correctness. These methods work by filtering claims from the LLM's original response if a scoring function evaluated on the claim fails to exceed a threshold calibrated via split conformal prediction. Existing methods in this area suffer from two deficiencies. First, the guarantee stated is not conditionally valid. The trustworthiness of the filtering step may vary based on the topic of the response. Second, because the scoring function is imperfect, the filtering step can remove many valuable and accurate claims. We address both of these challenges via two new conformal methods. First, we generalize the conditional conformal procedure of Gibbs et al. (2023) in order to adaptively issue weaker guarantees when they are required to preserve the utility of the output. Second, we show how to systematically improve the quality of the scoring function via a novel algorithm for differentiating through the conditional conformal procedure. We demonstrate the efficacy of our approach on both synthetic and real-world datasets. △ Less

Submitted 14 June, 2024; originally announced June 2024.

Comments: 20 pages, 8 figures

arXiv:2406.09478 [pdf, other]

doi 10.1016/j.future.2024.05.044

Distributed genetic algorithm for application placement in the compute continuum leveraging infrastructure nodes for optimization

Authors: Carlos Guerrero, Isaac Lera, Carlos Juiz

Abstract: The increasing complexity of fog computing environments calls for efficient resource optimization techniques. In this paper, we propose and evaluate three distributed designs of a genetic algorithm (GA) for resource optimization in fog computing, within an increasing degree of distribution. The designs leverage the execution of the GA in the fog devices themselves by dealing with the specific feat… ▽ More The increasing complexity of fog computing environments calls for efficient resource optimization techniques. In this paper, we propose and evaluate three distributed designs of a genetic algorithm (GA) for resource optimization in fog computing, within an increasing degree of distribution. The designs leverage the execution of the GA in the fog devices themselves by dealing with the specific features of this domain: constrained resources and widely geographical distribution of the devices. For their evaluation, we implemented a benchmark case using the NSGA-II for the specific problem of optimizing the fog service placement, according to the guidelines of our three distributed designs. These three experimental scenarios were compared with a control case, a traditional centralized version of this GA algorithm, considering solution quality and network overhead. The results show that the design with the lowest distribution degree, which keeps centralized storage of the objective space, achieves comparable solution quality to the traditional approach but incurs a higher network load. The second design, which completely distributes the population between the workers, reduces network overhead but exhibits lower solution diversity while keeping enough good results in terms of optimization objective minimization. Finally, the proposal with a distributed population and that only interchanges solution between the workers' neighbors achieves the lowest network load but with compromised solution quality. △ Less

Submitted 13 June, 2024; originally announced June 2024.

Journal ref: Future Generation Computer Systems, Volume 160, 2024, Pages 154-170, ISSN 0167-739X

arXiv:2406.09346 [pdf, other]

Scoreformer: A Surrogate Model For Large-Scale Prediction of Docking Scores

Authors: Álvaro Ciudad, Adrián Morales-Pastor, Laura Malo, Isaac Filella-Mercè, Victor Guallar, Alexis Molina

Abstract: In this study, we present ScoreFormer, a novel graph transformer model designed to accurately predict molecular docking scores, thereby optimizing high-throughput virtual screening (HTVS) in drug discovery. The architecture integrates Principal Neighborhood Aggregation (PNA) and Learnable Random Walk Positional Encodings (LRWPE), enhancing the model's ability to understand complex molecular struct… ▽ More In this study, we present ScoreFormer, a novel graph transformer model designed to accurately predict molecular docking scores, thereby optimizing high-throughput virtual screening (HTVS) in drug discovery. The architecture integrates Principal Neighborhood Aggregation (PNA) and Learnable Random Walk Positional Encodings (LRWPE), enhancing the model's ability to understand complex molecular structures and their relationship with their respective docking scores. This approach significantly surpasses traditional HTVS methods and recent Graph Neural Network (GNN) models in both recovery and efficiency due to a wider coverage of the chemical space and enhanced performance. Our results demonstrate that ScoreFormer achieves competitive performance in docking score prediction and offers a substantial 1.65-fold reduction in inference time compared to existing models. We evaluated ScoreFormer across multiple datasets under various conditions, confirming its robustness and reliability in identifying potential drug candidates rapidly. △ Less

Submitted 25 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

Comments: Accepted at the 1st Machine Learning for Life and Material Sciences Workshop at ICML 2024

arXiv:2406.08646 [pdf, other]

PETSc/TAO Developments for Early Exascale Systems

Authors: Richard Tran Mills, Mark Adams, Satish Balay, Jed Brown, Jacob Faibussowitsch, Toby Isaac, Matthew Knepley, Todd Munson, Hansol Suh, Stefano Zampini, Hong Zhang, Junchao Zhang

Abstract: The Portable Extensible Toolkit for Scientific Computation (PETSc) library provides scalable solvers for nonlinear time-dependent differential and algebraic equations and for numerical optimization via the Toolkit for Advanced Optimization (TAO). PETSc is used in dozens of scientific fields and is an important building block for many simulation codes. During the U.S. Department of Energy's Exascal… ▽ More The Portable Extensible Toolkit for Scientific Computation (PETSc) library provides scalable solvers for nonlinear time-dependent differential and algebraic equations and for numerical optimization via the Toolkit for Advanced Optimization (TAO). PETSc is used in dozens of scientific fields and is an important building block for many simulation codes. During the U.S. Department of Energy's Exascale Computing Project, the PETSc team has made substantial efforts to enable efficient utilization of the massive fine-grain parallelism present within exascale compute nodes and to enable performance portability across exascale architectures. We recap some of the challenges that designers of numerical libraries face in such an endeavor, and then discuss the many developments we have made, which include the addition of new GPU backends, features supporting efficient on-device matrix assembly, better support for asynchronicity and GPU kernel concurrency, and new communication infrastructure. We evaluate the performance of these developments on some pre-exascale systems as well the early exascale systems Frontier and Aurora, using compute kernel, communication layer, solver, and mini-application benchmark studies, and then close with a few observations drawn from our experiences on the tension between portable performance and other goals of numerical libraries. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 15 pages, submitted to IJHPCA

MSC Class: 00A69

arXiv:2406.08636 [pdf, other]

Towards Integrating Personal Knowledge into Test-Time Predictions

Authors: Isaac Lage, Sonali Parbhoo, Finale Doshi-Velez

Abstract: Machine learning (ML) models can make decisions based on large amounts of data, but they can be missing personal knowledge available to human users about whom predictions are made. For example, a model trained to predict psychiatric outcomes may know nothing about a patient's social support system, and social support may look different for different patients. In this work, we introduce the problem… ▽ More Machine learning (ML) models can make decisions based on large amounts of data, but they can be missing personal knowledge available to human users about whom predictions are made. For example, a model trained to predict psychiatric outcomes may know nothing about a patient's social support system, and social support may look different for different patients. In this work, we introduce the problem of human feature integration, which provides a way to incorporate important personal-knowledge from users without domain expertise into ML predictions. We characterize this problem through illustrative user stories and comparisons to existing approaches; we formally describe this problem in a way that paves the ground for future technical solutions; and we provide a proof-of-concept study of a simple version of a solution to this problem in a semi-realistic setting. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.07843 [pdf, other]

Incremental Learning and Self-Attention Mechanisms Improve Neural System Identification

Authors: Isaac Lin, Tianye Wang, Shang Gao, Shiming Tang, Tai Sing Lee

Abstract: Convolutional neural networks (CNNs) have been shown to be the state-of-the-art approach for modeling the transfer functions of visual cortical neurons. Cortical neurons in the primary visual cortex are are sensitive to contextual information mediated by extensive horizontal and feedback connections. Standard CNNs can integrate global spatial image information to model such contextual modulation v… ▽ More Convolutional neural networks (CNNs) have been shown to be the state-of-the-art approach for modeling the transfer functions of visual cortical neurons. Cortical neurons in the primary visual cortex are are sensitive to contextual information mediated by extensive horizontal and feedback connections. Standard CNNs can integrate global spatial image information to model such contextual modulation via two mechanisms: successive rounds of convolutions and a fully connected readout layer. In this paper, we find that non-local networks or self-attention (SA) mechanisms, theoretically related to context-dependent flexible gating mechanisms observed in the primary visual cortex, improve neural response predictions over parameter-matched CNNs in two key metrics: tuning curve correlation and tuning peak. We factorize networks to determine the relative contribution of each context mechanism. This reveals that information in the local receptive field is most important for modeling the overall tuning curve, but surround information is critically necessary for characterizing the tuning peak. We find that self-attention can replace subsequent spatial-integration convolutions when learned in an incremental manner, and is further enhanced in the presence of a fully connected readout layer, suggesting that the two context mechanisms are complementary. Finally, we find that learning a receptive-field-centric model with self-attention, before incrementally learning a fully connected readout, yields a more biologically realistic model in terms of center-surround contributions. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: Preprint NeurIPS 2024

arXiv:2406.06716 [pdf, other]

Metal-Poor Stars in the MW Disk: Resonant Cooling of Vertical Oscillations of Halo Stars in Barred Galaxies

Authors: Xingchen Li, Isaac Shlosman, Daniel Pfenniger, Clayton Heller

Abstract: Using numerical simulations of a barred disk galaxy embedded in nonspinning and spinning dark matter (DM) halos, we present a novel mechanism of `cooling' the vertical oscillations of halo DM particles, which acquire the disk kinematics. The underlying mechanism consists of resonant interactions between halo particles and the stellar bar. The cooling mechanism acts both on dynamical and secular ti… ▽ More Using numerical simulations of a barred disk galaxy embedded in nonspinning and spinning dark matter (DM) halos, we present a novel mechanism of `cooling' the vertical oscillations of halo DM particles, which acquire the disk kinematics. The underlying mechanism consists of resonant interactions between halo particles and the stellar bar. The cooling mechanism acts both on dynamical and secular timescales, i.e., from ~ 0.5 Gyr to few Gyr, and the stellar bar acts to absorb the kinetic energy of the vertical motions. Using a Milky Way-type stellar halo, we estimate the population of metal-poor disk stars which have been trapped by the MW disk and analyze its kinematics. We find that the population of metal-poor MW disk stars with $|z|\ltorder 3$\,kpc detected by the Gaia DR3 and other surveys can have their origin in the stellar halo. The cooled population also migrates radially outwards compared by acquiring energy from the spinning bar, and prograde-moving stars have a different distribution from the retrograde ones. Next, we have calculated the ratio of the prograde-to-retrograde orbits of the cooled population and found that this ratio varies radially, with the fast-spinning stellar halo resulting in the shallower radial increase of this ratio outside of the corotation. The nonspinning stellar halo shows a monotonic increase of this ratio with radius outside the corotation. Together with analyzed radial migration of these halo stars, the cooling phenomenon of halo metal-poor stars can explain their current disk population, and has corollaries for the chemical evolution of disk galaxies in general. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: 12 pages, 9 figures, 1 table. A shortened version has been submitted on ApJ Letters

arXiv:2406.05727 [pdf, other]

A Variational Approach to Learning Photonic Unitary Operators

Authors: Hadrian Bezuidenhout, Mwezi Koni, Jonathan Leach, Paola Concha Obando, Andrew Forbes, Isaac Nape

Abstract: Structured light, light tailored in its internal degrees of freedom, has become topical in numerous quantum and classical information processing protocols. In this work, we harness the high dimensional nature of structured light modulated in the transverse spatial degree of freedom to realise an adaptable scheme for learning unitary operations. Our approach borrows from concepts in variational qua… ▽ More Structured light, light tailored in its internal degrees of freedom, has become topical in numerous quantum and classical information processing protocols. In this work, we harness the high dimensional nature of structured light modulated in the transverse spatial degree of freedom to realise an adaptable scheme for learning unitary operations. Our approach borrows from concepts in variational quantum computing, where a search or optimisation problem is mapped onto the task of finding a minimum ground state energy for a given energy/goal function. We achieve this by a pseudo-random walk procedure over the parameter space of the unitary operation, implemented with optical matrix-vector multiplication enacted on arrays of Gaussian modes by exploiting the partial Fourier transforming capabilities of a cylindrical lens in the transverse degree of freedom for the measurement. We outline the concept theoretically, and experimentally demonstrate that we are able to learn optical unitary matrices for dimensions d = 2, 4, 8 and 16 with average fidelities of >90%. Our work advances high dimensional information processing and can be adapted to both process and quantum state tomography of unknown states and channels. △ Less

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2406.04515 [pdf, other]

Considerations for extracting moiré-level strain from dark field intensities in transmission electron microscopy

Authors: Isaac M. Craig, Madeline Van Winkle, Colin Ophus, D. Kwabena Bediako

Abstract: Bragg interferometry (BI) is an imaging technique based on four-dimensional scanning electron microscopy (4D-STEM) wherein the intensities of select overlapping Bragg disks are fit or more qualitatively analyzed in the context of simple trigonometric equations to determine local stacking order. In 4D-STEM based approaches, the collection of full diffraction patterns at each real-space position of… ▽ More Bragg interferometry (BI) is an imaging technique based on four-dimensional scanning electron microscopy (4D-STEM) wherein the intensities of select overlapping Bragg disks are fit or more qualitatively analyzed in the context of simple trigonometric equations to determine local stacking order. In 4D-STEM based approaches, the collection of full diffraction patterns at each real-space position of the scanning probe allows the use of precise virtual apertures much smaller and more variable in shape than those used in conventional dark field imaging, such that even buried interfaces marginally twisted from other layers can be targeted. A coarse-grained form of dark field ptychography, BI uses simple physically derived fitting functions to extract the average structure within the illumination region and is therefore viable over large fields of view. BI has shown a particular advantage for selectively investigating the interlayer stacking and associated moiré reconstruction of bilayer interfaces within complex multi-layered structures. This has enabled investigation of reconstruction and substrate effects in bilayers through encapsulating hexagonal boron nitride and of select bilayer interfaces within trilayer stacks. However, the technique can be improved to provide a greater spatial resolution and probe a wider range of twisted structures, for which current limitations on acquisition parameters can lead to large illumination regions and the computationally involved post-processing can fail. Here we analyze these limitations and the computational processing in greater depth, presenting a few methods for improvement over previous works, discussing potential areas for further expansion, and illustrating the current capabilities of this approach for extracting moiré-scale strain. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: 10 pages, 5 figures

arXiv:2406.04450 [pdf, other]

Sulfur Dioxide and Other Molecular Species in the Atmosphere of the Sub-Neptune GJ 3470 b

Authors: Thomas G. Beatty, Luis Welbanks, Everett Schlawin, Taylor J. Bell, Michael R. Line, Matthew Murphy, Isaac Edelman, Thomas P. Greene, Jonathan J. Fortney, Gregory W. Henry, Sagnick Mukherjee, Kazumasa Ohno, Vivien Parmentier, Emily Rauscher, Lindsey S. Wiser, Kenneth E. Arnold

Abstract: We report observations of the atmospheric transmission spectrum of the sub-Neptune exoplanet GJ 3470 b taken using the Near-Infrared Camera (NIRCam) on JWST. Combined with two archival HST/WFC3 transit observations and fifteen archival Spitzer transit observations, we detect water, methane, sulfur dioxide, and carbon dioxide in the atmosphere of GJ 3470 b, each with a significance of >3-sigma. GJ… ▽ More We report observations of the atmospheric transmission spectrum of the sub-Neptune exoplanet GJ 3470 b taken using the Near-Infrared Camera (NIRCam) on JWST. Combined with two archival HST/WFC3 transit observations and fifteen archival Spitzer transit observations, we detect water, methane, sulfur dioxide, and carbon dioxide in the atmosphere of GJ 3470 b, each with a significance of >3-sigma. GJ 3470 b is the lowest mass -- and coldest -- exoplanet known to show a substantial sulfur dioxide feature in its spectrum, at $M_{p}$=11.2${\,{\rm M}_{\oplus}}$ and $T_{eq}$=600$\,$K. This indicates disequilibrium photochemistry drives sulfur dioxide production in exoplanet atmospheres over a wider range of masses and temperatures than has been reported or expected. The water, carbon dioxide, and sulfur dioxide abundances we measure indicate an atmospheric metallicity of approximately $100\times$ Solar. We see further evidence for disequilibrium chemistry in our inferred methane abundance, which is significantly lower than expected from equilibrium models consistent with our measured water and carbon dioxide abundances. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: 25 pages, 9 figures, 6 tables. Accepted in Astrophysical Journal Letters

arXiv:2406.04364 [pdf]

doi 10.1109/LSENS.2024.3408320

Use of a Multiscale Vision Transformer to predict Nursing Activities Score from Low Resolution Thermal Videos in an Intensive Care Unit

Authors: Isaac YL Lee, Thanh Nguyen-Duc, Ryo Ueno, Jesse Smith, Peter Y Chan

Abstract: Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the Intensive Care Unit (ICU) is often done using the Nursing Activities Score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of Ambient Intelligence (AmI) by using computer vision to passively derive car… ▽ More Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the Intensive Care Unit (ICU) is often done using the Nursing Activities Score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of Ambient Intelligence (AmI) by using computer vision to passively derive caregiver-patient interaction times to monitor staff workload. In this letter, we propose using a Multiscale Vision Transformer (MViT) to passively predict the NAS from low-resolution thermal videos recorded in an ICU. 458 videos were obtained from an ICU in Melbourne, Australia and used to train a MViTv2 model using an indirect prediction and a direct prediction method. The indirect method predicted 1 of 8 potentially identifiable NAS activities from the video before inferring the NAS. The direct method predicted the NAS score immediately from the video. The indirect method yielded an average 5-fold accuracy of 57.21%, an area under the receiver operating characteristic curve (ROC AUC) of 0.865, a F1 score of 0.570 and a mean squared error (MSE) of 28.16. The direct method yielded a MSE of 18.16. We also showed that the MViTv2 outperforms similar models such as R(2+1)D and ResNet50-LSTM under identical settings. This study shows the feasibility of using a MViTv2 to passively predict the NAS in an ICU and monitor staff workload automatically. Our results above also show an increased accuracy in predicting NAS directly versus predicting NAS indirectly. We hope that our study can provide a direction for future work and further improve the accuracy of passive NAS monitoring. △ Less

Submitted 30 May, 2024; originally announced June 2024.

Comments: 4 pages, 1 figure

arXiv:2406.02811 [pdf, other]

Changes in boiling controlled by molar concentration-dependent diffusion of surfactants

Authors: Mario R. Mata, Matic Može, Armin Hadžić, Giseop Lee, Blake Naccarato, Isaac Berk, Iztok Golobič, H. Jeremy Cho

Abstract: Boiling is a prevalent phase-change process that plays a vital role in facilitating efficient heat transfer from a heating surface. While this heat transfer mechanism is generally effective, a rapid increase in surface temperature can lead to hydrodynamic instabilities, resulting in a boiling crisis. Previous studies have shown that surfactants often improve boiling performance and change the boil… ▽ More Boiling is a prevalent phase-change process that plays a vital role in facilitating efficient heat transfer from a heating surface. While this heat transfer mechanism is generally effective, a rapid increase in surface temperature can lead to hydrodynamic instabilities, resulting in a boiling crisis. Previous studies have shown that surfactants often improve boiling performance and change the boiling crisis behavior. Conventional wisdom in this field attributes that these changes in boiling behavior are tied to the critical micelle concentration (CMC) of the particular surfactant. However, our work reveals that these changes in boiling behavior are independent of the CMC for three nonionic surfactants across a wide range of molar concentrations. In addition, visual snapshots of the bubbling behavior indicate changes in bubble formation, such as bubble size and nucleation site density, influenced by the molar concentration-dependent diffusion timescale of surfactants. Hence, these findings offer compelling evidence that boiling behavior, encompassing both boiling performance and boiling crisis, is governed by the dynamic adsorption of surfactants rather than dictated by the CMC. This becomes evident when quantifying the heat transfer coefficient (HTC) and critical heat flux (CHF) using the logarithm of molar concentration, as predicted by theory. Building upon these findings, we propose insights for controlling when CHF modification occurs in specific scenarios involving any surfactants. These insights hold significant potential for optimizing heat transfer processes and leveraging surfactants in energy-related applications to maximize boiling efficiency. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 21 pages, 7 figures, 2 appendices

arXiv:2406.02161 [pdf, other]

An Observability-Constrained Magnetic-Field-Aided Inertial Navigation System

Authors: Chuan Huang, Gustaf Hendeby, Isaac Skog

Abstract: A method to construct an observability-constrained magnetic-field-aided inertial navigation system is proposed. The proposed method builds upon the previously proposed observability-constrained extended Kalman filter and extends it to work with a magnetic-field-based odometry-aided inertial navigation system. The proposed method is evaluated using simulation and real-world data, showing that (i) t… ▽ More A method to construct an observability-constrained magnetic-field-aided inertial navigation system is proposed. The proposed method builds upon the previously proposed observability-constrained extended Kalman filter and extends it to work with a magnetic-field-based odometry-aided inertial navigation system. The proposed method is evaluated using simulation and real-world data, showing that (i) the system observability properties are preserved, (ii) the estimation accuracy increases, and (iii) the perceived uncertainty calculated by the EKF is more consistent with the true uncertainty of the filter estimates. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2406.00162 [pdf, ps, other]

doi 10.1145/3663408.366582

Optical-computing-enabled Network: A New Dawn for Optical-layer Intelligence?

Authors: Dao Thanh Hai, Minh Nguyen, Isaac Woungang

Abstract: Inspired by the renaissance of optical computing recently, this poster presents a disruptive outlook on the possibility of seamless integration between optical communications and optical computing infrastructures, paving the way for achieving optical-layer intelligence and consequently boosting the capacity efficiency. This entails a paradigm shift in optical node architecture from the currently u… ▽ More Inspired by the renaissance of optical computing recently, this poster presents a disruptive outlook on the possibility of seamless integration between optical communications and optical computing infrastructures, paving the way for achieving optical-layer intelligence and consequently boosting the capacity efficiency. This entails a paradigm shift in optical node architecture from the currently used optical-bypass to a novel one, entitled, optical-computing-enabled mode, where in addition to the traditional add-drop and cross-connect functionalities, optical nodes are upgraded to account for optical-computing capabilities between the lightpath entities directly at the optical layer. A preliminary study focusing on the optical aggregation operation is examined and early simulation results indicate a promising spectral saving enabled by the optical-computing-enabled mode compared with the optical-bypass one. △ Less

Submitted 31 May, 2024; originally announced June 2024.

Comments: 2-page poster, to be appeared in Proceedings of the 8th Asia-Pacific Workshop on Networking (APNet 2024)

Showing 1–50 of 2,177 results for author: Isaac