-
SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions
Authors:
Shicheng Liu,
Sina J. Semnani,
Harold Triedman,
Jialiang Xu,
Isaac Dan Zhao,
Monica S. Lam
Abstract:
Recent work integrating Large Language Models (LLMs) has led to significant improvements in the Knowledge Base Question Answering (KBQA) task. However, we posit that existing KBQA datasets that either have simple questions, use synthetically generated logical forms, or are based on small knowledge base (KB) schemas, do not capture the true complexity of KBQA tasks.
To address this, we introduce…
▽ More
Recent work integrating Large Language Models (LLMs) has led to significant improvements in the Knowledge Base Question Answering (KBQA) task. However, we posit that existing KBQA datasets that either have simple questions, use synthetically generated logical forms, or are based on small knowledge base (KB) schemas, do not capture the true complexity of KBQA tasks.
To address this, we introduce the SPINACH dataset, an expert-annotated KBQA dataset collected from forum discussions on Wikidata's "Request a Query" forum with 320 decontextualized question-SPARQL pairs. Much more complex than existing datasets, SPINACH calls for strong KBQA systems that do not rely on training data to learn the KB schema, but can dynamically explore large and often incomplete schemas and reason about them.
Along with the dataset, we introduce the SPINACH agent, a new KBQA approach that mimics how a human expert would write SPARQLs for such challenging questions. Experiments on existing datasets show SPINACH's capability in KBQA, achieving a new state of the art on the QALD-7, QALD-9 Plus and QALD-10 datasets by 30.1%, 27.0%, and 10.0% in F1, respectively, and coming within 1.6% of the fine-tuned LLaMA SOTA model on WikiWebQuestions. On our new SPINACH dataset, SPINACH agent outperforms all baselines, including the best GPT-4-based KBQA agent, by 38.1% in F1.
△ Less
Submitted 16 July, 2024;
originally announced July 2024.
-
Geometric additivity of modular commutator for multipartite entanglement
Authors:
Sung-Min Park,
Isaac H. Kim,
Eun-Gook Moon
Abstract:
A recent surge of research in many-body quantum entanglement has uncovered intriguing properties of quantum many-body systems. A prime example is the modular commutator, which can extract a topological invariant from a single wave function. Here, we unveil novel geometric properties of many-body entanglement via a modular commutator of two-dimensional gapped quantum many-body systems. We obtain th…
▽ More
A recent surge of research in many-body quantum entanglement has uncovered intriguing properties of quantum many-body systems. A prime example is the modular commutator, which can extract a topological invariant from a single wave function. Here, we unveil novel geometric properties of many-body entanglement via a modular commutator of two-dimensional gapped quantum many-body systems. We obtain the geometric additivity of a modular commutator, indicating that modular commutator for a multipartite system may be an integer multiple of the one for tripartite systems. Using our additivity formula, we also derive a curious identity for the modular commutators involving disconnected intervals in a certain class of conformal field theories. We further illustrate this geometric additivity for both bulk and edge subsystems using numerical calculations of the Haldane and $π$-flux models.
△ Less
Submitted 15 July, 2024;
originally announced July 2024.
-
Hybrid Oscillator-Qubit Quantum Processors: Instruction Set Architectures, Abstract Machine Models, and Applications
Authors:
Yuan Liu,
Shraddha Singh,
Kevin C. Smith,
Eleanor Crane,
John M. Martyn,
Alec Eickbusch,
Alexander Schuckert,
Richard D. Li,
Jasmine Sinanan-Singh,
Micheline B. Soley,
Takahiro Tsunoda,
Isaac L. Chuang,
Nathan Wiebe,
Steven M. Girvin
Abstract:
Quantum computing with discrete variable (DV, qubit) hardware is approaching the large scales necessary for computations beyond the reach of classical computers. However, important use cases such as quantum simulations of physical models containing bosonic modes, and quantum error correction are challenging for DV-only systems. Separately, hardware containing native continuous-variable (CV, oscill…
▽ More
Quantum computing with discrete variable (DV, qubit) hardware is approaching the large scales necessary for computations beyond the reach of classical computers. However, important use cases such as quantum simulations of physical models containing bosonic modes, and quantum error correction are challenging for DV-only systems. Separately, hardware containing native continuous-variable (CV, oscillator) systems has received attention as an alternative approach, yet the universal control of such systems is non-trivial. In this work, we show that hybrid CV-DV hardware offers a great advantage in meeting these challenges, offering a powerful computational paradigm that inherits the strengths of both DV and CV processors. We provide a pedagogical introduction to CV-DV systems and the multiple abstraction layers needed to produce a full software stack connecting applications to hardware. We present a variety of new hybrid CV-DV compilation techniques, algorithms, and applications, including the extension of quantum signal processing concepts to CV-DV systems and strategies to simulate systems of interacting spins, fermions, and bosons. To facilitate the development of hybrid CV-DV processor systems, we introduce formal Abstract Machine Models and Instruction Set Architectures -- essential abstractions that enable developers to formulate applications, compile algorithms, and explore the potential of current and future hardware for realizing fault-tolerant circuits, modules, and processors. Hybrid CV-DV quantum computations are beginning to be performed in superconducting, trapped ion, and neutral atom platforms, and large-scale experiments are set to be demonstrated in the near future. We present a timely and comprehensive guide to this relatively unexplored yet promising approach to quantum computation and providing an architectural backbone to guide future development.
△ Less
Submitted 14 July, 2024;
originally announced July 2024.
-
Observational bounds on a possible electron-to-proton mass ratio variation and constraints in the lepton-specific 2HDM
Authors:
R. G. Albuquerque,
R. F. L. Holanda,
I. E. T. R. Mendonça,
P. S. Rodrigues da Silva
Abstract:
In this work, we test a possible redshift variation of the electron-to-proton mass ratio, $μ= m_e/m_p$, directly from galaxy cluster gas mass fraction measurements and type Ia Supernovae observations. Our analysis is completely independent of any cosmological model. Our result reveals no variation of $μ$ within 1 $σ$ confidence level. From the point of view of Particle Physics, we can use the prec…
▽ More
In this work, we test a possible redshift variation of the electron-to-proton mass ratio, $μ= m_e/m_p$, directly from galaxy cluster gas mass fraction measurements and type Ia Supernovae observations. Our analysis is completely independent of any cosmological model. Our result reveals no variation of $μ$ within 1 $σ$ confidence level. From the point of view of Particle Physics, we can use the precision on these results to constrain the parameter space of models beyond the Standard Model of electroweak interactions. We exemplify this by focusing in a specific Two Higgs Doublet model (2HDM), where the second scalar doublet couples exclusively to leptons. An important parameter in the model concerns the ratio between its vacuum expectation values, defined by $\tanβ$. In our approach we can constrain the inverse parameter (cot$β$) to an optimal value, (tan$β)^{-1}=$ 0.02127 $\pm$ 0.0029, with the largest vacuum expectation value for 2HDM, $v_2$, estimated at around 240.033 $\pm$ 0.21~GeV. Also, by taking into account the $(g-2)_μ$ discrepancy found between theory and experiment, we can reduce the validity region for this model and establish bounds on the scalar masses, in the light of our findings from galaxy clusters data for $μ$. This study contributes valuable insights to the understanding of Particle Physics and Astrophysics interface, establishing a new interplay between data from large scale structure of the Universe and subatomic Physics.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Implications of mappings between ICD clinical diagnosis codes and Human Phenotype Ontology terms
Authors:
Amelia LM Tan,
Rafael S Gonçalves,
William Yuan,
Gabriel A Brat,
The Consortium for Clinical Characterization of COVID-19 by EHR,
Robert Gentleman,
Isaac S Kohane
Abstract:
Objective: Integrating EHR data with other resources is essential in rare disease research due to low disease prevalence. Such integration is dependent on the alignment of ontologies used for data annotation. The International Classification of Diseases (ICD) is used to annotate clinical diagnoses; the Human Phenotype Ontology (HPO) to annotate phenotypes. Although these ontologies overlap in biom…
▽ More
Objective: Integrating EHR data with other resources is essential in rare disease research due to low disease prevalence. Such integration is dependent on the alignment of ontologies used for data annotation. The International Classification of Diseases (ICD) is used to annotate clinical diagnoses; the Human Phenotype Ontology (HPO) to annotate phenotypes. Although these ontologies overlap in biomedical entities described, the extent to which they are interoperable is unknown. We investigate how well aligned these ontologies are and whether such alignments facilitate EHR data integration.
Materials and Methods: We conducted an empirical analysis of the coverage of mappings between ICD and HPO. We interpret this mapping coverage as a proxy for how easily clinical data can be integrated with research ontologies such as HPO. We quantify how exhaustively ICD codes are mapped to HPO by analyzing mappings in the UMLS Metathesaurus. We analyze the proportion of ICD codes mapped to HPO within a real-world EHR dataset.
Results and Discussion: Our analysis revealed that only 2.2% of ICD codes have direct mappings to HPO in UMLS. Within our EHR dataset, less than 50% of ICD codes have mappings to HPO terms. ICD codes that are used frequently in EHR data tend to have mappings to HPO; ICD codes that represent rarer medical conditions are seldom mapped.
Conclusion: We find that interoperability between ICD and HPO via UMLS is limited. While other mapping sources could be incorporated, there are no established conventions for what resources should be used to complement UMLS.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Evolutionary Computation for the Design and Enrichment of General-Purpose Artificial Intelligence Systems: Survey and Prospects
Authors:
Javier Poyatos,
Javier Del Ser,
Salvador Garcia,
Hisao Ishibuchi,
Daniel Molina,
Isaac Triguero,
Bing Xue,
Xin Yao,
Francisco Herrera
Abstract:
In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal de…
▽ More
In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal design of traditional Machine Learning models. Evolutionary Computation (EC) has been a useful tool for both the design and optimization of Machine Learning models, endowing them with the capability to configure and/or adapt themselves to the task under consideration. Therefore, their application to GPAIS is a natural choice. This paper aims to analyze the role of EC in the field of GPAIS, exploring the use of EC for their design or enrichment. We also match GPAIS properties to Machine Learning areas in which EC has had a notable contribution, highlighting recent milestones of EC for GPAIS. Furthermore, we discuss the challenges of harnessing the benefits of EC for GPAIS, presenting different strategies to both design and improve GPAIS with EC, covering tangential areas, identifying research niches, and outlining potential research directions for EC and GPAIS.
△ Less
Submitted 3 June, 2024;
originally announced July 2024.
-
The Potential Impact of Noise Correlation in Next-generation Gravitational Wave Detectors
Authors:
Isaac C. F. Wong,
Peter T. H. Pang,
Milan Wils,
Francesco Cireddu,
Walter Del Pozzo,
Tjonnie G. F. Li
Abstract:
Building upon the statistical formulation for parameter estimation in the presence of correlated noise proposed by Cireddu et al., we present an initial study to incorporate the effects of correlated noise into the analyses of various detector designs' performance. We consider a two L-shaped detector configuration located in the European Union, and compare the expectation of parameter estimation b…
▽ More
Building upon the statistical formulation for parameter estimation in the presence of correlated noise proposed by Cireddu et al., we present an initial study to incorporate the effects of correlated noise into the analyses of various detector designs' performance. We consider a two L-shaped detector configuration located in the European Union, and compare the expectation of parameter estimation between the non-colocated and a hypothetical colocated configurations. In our study, we posit the existence of low-frequency correlated noise within the $5\text{ Hz}$ to $10\text{ Hz}$ range for the colocated detector configuration, with a varying degree of correlation. In this specific detector setup, our observations indicate an enhancement in the precision of intrinsic parameter measurements as the degree of correlation increases. This trend suggests that higher degrees of noise correlation may beneficially influence the accuracy of parameter estimation. In particular, when the noise is highly correlated, the uncertainty on chirp mass decreases by up to $30\%$. The absence of an inter-European baseline does hinder the estimation of the extrinsic parameters. However, given a realistic global network with the additional detector located in the United States, the uncertainty of extrinsic parameters is significantly reduced. This reduction is further amplified as the degree of noise correlation increases. When noise correlation exceeds a certain level, the colocated configuration outperforms the non-colocated one, reducing the $90\%$ credible area of sky location by up to $10\%$. We conclude that noise correlation significantly impacts detector performance, potentially altering both quantitative and qualitative outcomes. Thus, we recommend including noise correlation in comprehensive assessments of third-generation gravitational wave detector designs.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Source-Independent Fault Detection Method for Transmission Lines in IBR-Dominated Grids
Authors:
Julio Rodriguez,
Isaac Kofi Otchere,
Reza Jalilzadeh Hamidi
Abstract:
This paper proposes a source-independent method for the detection and classification of faults along Transmission Lines (TLs). It aims to reduce the protection issues arising from Inverter-Based Resources (IBRs). Inspired by Power Line Communication (PLC), the proposed method utilizes high-frequency carrier waves which are sent from either side of a TL over each phase. As faults disrupt the propag…
▽ More
This paper proposes a source-independent method for the detection and classification of faults along Transmission Lines (TLs). It aims to reduce the protection issues arising from Inverter-Based Resources (IBRs). Inspired by Power Line Communication (PLC), the proposed method utilizes high-frequency carrier waves which are sent from either side of a TL over each phase. As faults disrupt the propagation of carriers, the receiving carrier waves before and during faults exhibit differences. Based on this principle, the proposed method continuously compares the receiving carrier waves with a short history of them to detect and classify faults. The performance of the proposed method was evaluated using EMTP-RV and MATLAB, and compared to traditional phasor-based distance relays. The simulation results confirm the capability of the proposed method in detection and classification of different faults regardless of power sources types.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Robust quantum engineering of current flow in carbon nanostructures at room temperature
Authors:
Gaetano Calogero,
Isaac Alcón,
Onurcan Kaya,
Nick Papior,
Aron W. Cummings,
Mads Brandbyge,
Stephan Roche
Abstract:
Bottom-up on-surface synthesis enables the fabrication of carbon nanostructures with atomic precision. Good examples are graphene nanoribbons (GNRs), 1D conjugated polymers, and nanoporous graphenes (NPGs), which are gathering increasing attention for future carbon nanoelectronics. A key step is the ability to manipulate current flow within these nanomaterials. Destructive quantum interference (QI…
▽ More
Bottom-up on-surface synthesis enables the fabrication of carbon nanostructures with atomic precision. Good examples are graphene nanoribbons (GNRs), 1D conjugated polymers, and nanoporous graphenes (NPGs), which are gathering increasing attention for future carbon nanoelectronics. A key step is the ability to manipulate current flow within these nanomaterials. Destructive quantum interference (QI), long studied in the field of single-molecule electronics, has been proposed as the most effective way to achieve such control with molecular-scale precision. However, for practical applications, it is essential that such QI-engineering remains effective near or above room temperature. To assess this important point, here we combine large-scale molecular dynamics simulations and quantum transport calculations and focus our study on NPGs formed as arrays of laterally bonded GNRs. By considering various NPGs with different inter-GNR chemical connections we disentangle the different factors determining electronic transport in these carbon nanomaterials at 300 K. Our findings unequivocally demonstrate that QI survives at room temperature, with thermal vibrations weakly restricting current flow along GNRs while completely blocking transport across GNRs. Our results thus pave the way towards the future realization of QI-engineered carbon nanocircuitry operating at room temperature, which is a fundamental step towards carbon-based nanoelectronics and quantum technologies.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Towards Interpretable Foundation Models of Robot Behavior: A Task Specific Policy Generation Approach
Authors:
Isaac Sheidlower,
Reuben Aronson,
Elaine Schaertl Short
Abstract:
Foundation models are a promising path toward general-purpose and user-friendly robots. The prevalent approach involves training a generalist policy that, like a reinforcement learning policy, uses observations to output actions. Although this approach has seen much success, several concerns arise when considering deployment and end-user interaction with these systems. In particular, the lack of m…
▽ More
Foundation models are a promising path toward general-purpose and user-friendly robots. The prevalent approach involves training a generalist policy that, like a reinforcement learning policy, uses observations to output actions. Although this approach has seen much success, several concerns arise when considering deployment and end-user interaction with these systems. In particular, the lack of modularity between tasks means that when model weights are updated (e.g., when a user provides feedback), the behavior in other, unrelated tasks may be affected. This can negatively impact the system's interpretability and usability. We present an alternative approach to the design of robot foundation models, Diffusion for Policy Parameters (DPP), which generates stand-alone, task-specific policies. Since these policies are detached from the foundation model, they are updated only when a user wants, either through feedback or personalization, allowing them to gain a high degree of familiarity with that policy. We demonstrate a proof-of-concept of DPP in simulation then discuss its limitations and the future of interpretable foundation models.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Science-Informed Deep Learning (ScIDL) With Applications to Wireless Communications
Authors:
Atefeh Termehchi,
Ekram Hossain,
Isaac Woungang
Abstract:
Given the extensive and growing capabilities offered by deep learning (DL), more researchers are turning to DL to address complex challenges in next-generation (xG) communications. However, despite its progress, DL also reveals several limitations that are becoming increasingly evident. One significant issue is its lack of interpretability, which is especially critical for safety-sensitive applica…
▽ More
Given the extensive and growing capabilities offered by deep learning (DL), more researchers are turning to DL to address complex challenges in next-generation (xG) communications. However, despite its progress, DL also reveals several limitations that are becoming increasingly evident. One significant issue is its lack of interpretability, which is especially critical for safety-sensitive applications. Another significant consideration is that DL may not comply with the constraints set by physics laws or given security standards, which are essential for reliable DL. Additionally, DL models often struggle outside their training data distributions, which is known as poor generalization. Moreover, there is a scarcity of theoretical guidance on designing DL algorithms. These challenges have prompted the emergence of a burgeoning field known as science-informed DL (ScIDL). ScIDL aims to integrate existing scientific knowledge with DL techniques to develop more powerful algorithms. The core objective of this article is to provide a brief tutorial on ScIDL that illustrates its building blocks and distinguishes it from conventional DL. Furthermore, we discuss both recent applications of ScIDL and potential future research directions in the field of wireless communications.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
Barely-Visible Surface Crack Detection for Wind Turbine Sustainability
Authors:
Sourav Agrawal,
Isaac Corley,
Conor Wallace,
Clovis Vaughn,
Jonathan Lwowski
Abstract:
The production of wind energy is a crucial part of sustainable development and reducing the reliance on fossil fuels. Maintaining the integrity of wind turbines to produce this energy is a costly and time-consuming task requiring repeated inspection and maintenance. While autonomous drones have proven to make this process more efficient, the algorithms for detecting anomalies to prevent catastroph…
▽ More
The production of wind energy is a crucial part of sustainable development and reducing the reliance on fossil fuels. Maintaining the integrity of wind turbines to produce this energy is a costly and time-consuming task requiring repeated inspection and maintenance. While autonomous drones have proven to make this process more efficient, the algorithms for detecting anomalies to prevent catastrophic damage to turbine blades have fallen behind due to some dangerous defects, such as hairline cracks, being barely-visible. Existing datasets and literature are lacking and tend towards detecting obvious and visible defects in addition to not being geographically diverse. In this paper we introduce a novel and diverse dataset of barely-visible hairline cracks collected from numerous wind turbine inspections. To prove the efficacy of our dataset, we detail our end-to-end deployed turbine crack detection pipeline from the image acquisition stage to the use of predictions in providing automated maintenance recommendations to extend the life and efficiency of wind turbines.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
Large Row-Constrained Supersaturated Designs for High-throughput Screening
Authors:
Byran J. Smucker,
Stephen E. Wright,
Isaac Williams,
Richard C. Page,
Andor J. Kiss,
Surendra Bikram Silwal,
Maria Weese,
David J. Edwards
Abstract:
High-throughput screening, in which multiwell plates are used to test large numbers of compounds against specific targets, is widely used across many areas of the biological sciences and most prominently in drug discovery. We propose a statistically principled approach to these screening experiments, using the machinery of supersaturated designs and the Lasso. To accommodate limitations on the num…
▽ More
High-throughput screening, in which multiwell plates are used to test large numbers of compounds against specific targets, is widely used across many areas of the biological sciences and most prominently in drug discovery. We propose a statistically principled approach to these screening experiments, using the machinery of supersaturated designs and the Lasso. To accommodate limitations on the number of biological entities that can be applied to a single microplate well, we present a new class of row-constrained supersaturated designs. We develop a computational procedure to construct these designs, provide some initial lower bounds on the average squared off-diagonal values of their main-effects information matrix, and study the impact of the constraint on design quality. We also show via simulation that the proposed constrained row screening method is statistically superior to existing methods and demonstrate the use of the new methodology on a real drug-discovery system.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Use of Mobile Devices in the Classroom to Increase Motivation and Participation of Engineering University Students
Authors:
Carlos Guerrero,
Antoni Jaume-i-Capó,
Carlos Juiz,
Isaac Lera
Abstract:
The aim of this study was to see whether student participation increased when mobile devices were used in the classroom. We measured the amount of student participative actions when the Socrative tool was used and when it was not used. Our experiment involved a total of 192 students, corresponding to 4 different subjects of Computer Engineering at the Universitat de les Illes Balears, during 2012/…
▽ More
The aim of this study was to see whether student participation increased when mobile devices were used in the classroom. We measured the amount of student participative actions when the Socrative tool was used and when it was not used. Our experiment involved a total of 192 students, corresponding to 4 different subjects of Computer Engineering at the Universitat de les Illes Balears, during 2012/2013 and 2013/2014 courses. An independent paired t-test was performed on the measurements. The analysis results show that student participation increases with the use of mobile devices for theory classes and students are willing to participate in class activities and share their own results.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
An analytic, moment-based method to estimate orthopositronium lifetimes in positron annihilation lifetime spectroscopy measurements
Authors:
Lucas Berens,
Isaac Hsu,
Chin-Tu Chen,
Howard Halpern,
Chien-Min Kao
Abstract:
The presence of tumor hypoxia is known to correlate with poor patient prognosis. Measurement of tissue oxygen concentration can be challenging, but recent advancements using positron annihilation lifetime spectroscopy (PALS) in three-dimensional positron emission tomography (PET) scans have shown promise for hypoxia detection. In this work, a novel method for estimating the orthopositronium lifeti…
▽ More
The presence of tumor hypoxia is known to correlate with poor patient prognosis. Measurement of tissue oxygen concentration can be challenging, but recent advancements using positron annihilation lifetime spectroscopy (PALS) in three-dimensional positron emission tomography (PET) scans have shown promise for hypoxia detection. In this work, a novel method for estimating the orthopositronium lifetime in PALS is presented. This method is analytical and uses moments of the time-difference histogram from photon arrival times. For sufficient statistical power, the method produces monotonic, stable estimates. For cases with a lower number of photon counts, the method was characterized and solutions are presented to correct for bias and estimation variability.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
The In-Medium Similarity Renormalization Group at Finite Temperature
Authors:
Isaac G. Smith,
Heiko Hergert,
Scott K. Bogner
Abstract:
The study of nuclei at finite temperature is of immense interest for many areas of nuclear astrophysics and nuclear-reaction science. A variety of ab initio methods are now available for computing the properties of nuclei from interactions rooted in Quantum Chromodynamics, but applications have largely been limited to zero temperature. In the present work, we extend one such method, the In-Medium…
▽ More
The study of nuclei at finite temperature is of immense interest for many areas of nuclear astrophysics and nuclear-reaction science. A variety of ab initio methods are now available for computing the properties of nuclei from interactions rooted in Quantum Chromodynamics, but applications have largely been limited to zero temperature. In the present work, we extend one such method, the In-Medium Similarity Renormalization Group (IMSRG), to finite temperature. Using an exactly-solvable schematic model that captures essential features of nuclear interactions, we show that the FT-IMSRG can accurately determine the energetics of nuclei at finite temperature, and we explore the accuracy of the FT-IMSRG in different parameter regimes, e.g., strong and weak pairing. In anticipation of FT-IMSRG applications for finite nuclei and infinite matter, we discuss differences arising from the choice of working with the canonical and the grand canonical ensembles. In future work, we will apply the FT-IMSRG with realistic nuclear interactions to compute nuclear structure and reaction properties at finite temperature, which are important ingredients for understanding nucleosynthesis in stellar environments, or modeling reactions of hot compound nuclei.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Supercharging Federated Learning with Flower and NVIDIA FLARE
Authors:
Holger R. Roth,
Daniel J. Beutel,
Yan Cheng,
Javier Fernandez Marques,
Heng Pan,
Chester Chen,
Zhihong Zhang,
Yuhong Wen,
Sean Yang,
Isaac,
Yang,
Yuan-Ting Hsieh,
Ziyue Xu,
Daguang Xu,
Nicholas D. Lane,
Andrew Feng
Abstract:
Several open-source systems, such as Flower and NVIDIA FLARE, have been developed in recent years while focusing on different aspects of federated learning (FL). Flower is dedicated to implementing a cohesive approach to FL, analytics, and evaluation. Over time, Flower has cultivated extensive strategies and algorithms tailored for FL application development, fostering a vibrant FL community in re…
▽ More
Several open-source systems, such as Flower and NVIDIA FLARE, have been developed in recent years while focusing on different aspects of federated learning (FL). Flower is dedicated to implementing a cohesive approach to FL, analytics, and evaluation. Over time, Flower has cultivated extensive strategies and algorithms tailored for FL application development, fostering a vibrant FL community in research and industry. Conversely, FLARE has prioritized the creation of an enterprise-ready, resilient runtime environment explicitly designed for FL applications in production environments. In this paper, we describe our initial integration of both frameworks and show how they can work together to supercharge the FL ecosystem as a whole. Through the seamless integration of Flower and FLARE, applications crafted within the Flower framework can effortlessly operate within the FLARE runtime environment without necessitating any modifications. This initial integration streamlines the process, eliminating complexities and ensuring smooth interoperability between the two platforms, thus enhancing the overall efficiency and accessibility of FL applications.
△ Less
Submitted 21 May, 2024;
originally announced July 2024.
-
Binary neutron star mergers using a discontinuous Galerkin-finite difference hybrid method
Authors:
Nils Deppe,
Francois Foucart,
Marceline S. Bonilla,
Michael Boyle,
Nicholas J. Corso,
Matthew D. Duez,
Matthew Giesler,
François Hébert,
Lawrence E. Kidder,
Yoonsoo Kim,
Prayush Kumar,
Isaac Legred,
Geoffrey Lovelace,
Elias R. Most,
Jordan Moxon,
Kyle C. Nelli,
Harald P. Pfeiffer,
Mark A. Scheel,
Saul A. Teukolsky,
William Throwe,
Nils L. Vu
Abstract:
We present a discontinuous Galerkin-finite difference hybrid scheme that allows high-order shock capturing with the discontinuous Galerkin method for general relativistic magnetohydrodynamics in dynamical spacetimes. We present several optimizations and stability improvements to our algorithm that allow the hybrid method to successfully simulate single, rotating, and binary neutron stars. The hybr…
▽ More
We present a discontinuous Galerkin-finite difference hybrid scheme that allows high-order shock capturing with the discontinuous Galerkin method for general relativistic magnetohydrodynamics in dynamical spacetimes. We present several optimizations and stability improvements to our algorithm that allow the hybrid method to successfully simulate single, rotating, and binary neutron stars. The hybrid method achieves the efficiency of discontinuous Galerkin methods throughout almost the entire spacetime during the inspiral phase, while being able to robustly capture shocks and resolve the stellar surfaces. We also use Cauchy-Characteristic evolution to compute the first gravitational waveforms at future null infinity from binary neutron star mergers. The simulations presented here are the first successful binary neutron star inspiral and merger simulations using discontinuous Galerkin methods.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
RouteLLM: Learning to Route LLMs with Preference Data
Authors:
Isaac Ong,
Amjad Almahairi,
Vincent Wu,
Wei-Lin Chiang,
Tianhao Wu,
Joseph E. Gonzalez,
M Waleed Kadous,
Ion Stoica
Abstract:
Large language models (LLMs) exhibit impressive capabilities across a wide range of tasks, yet the choice of which model to use often involves a trade-off between performance and cost. More powerful models, though effective, come with higher expenses, while less capable models are more cost-effective. To address this dilemma, we propose several efficient router models that dynamically select betwe…
▽ More
Large language models (LLMs) exhibit impressive capabilities across a wide range of tasks, yet the choice of which model to use often involves a trade-off between performance and cost. More powerful models, though effective, come with higher expenses, while less capable models are more cost-effective. To address this dilemma, we propose several efficient router models that dynamically select between a stronger and a weaker LLM during inference, aiming to optimize the balance between cost and response quality. We develop a training framework for these routers leveraging human preference data and data augmentation techniques to enhance performance. Our evaluation on widely-recognized benchmarks shows that our approach significantly reduces costs-by over 2 times in certain cases-without compromising the quality of responses. Interestingly, our router models also demonstrate significant transfer learning capabilities, maintaining their performance even when the strong and weak models are changed at test time. This highlights the potential of these routers to provide a cost-effective yet high-performance solution for deploying LLMs.
△ Less
Submitted 1 July, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Upgrading SPHERE with the second stage AO system SAXO+: non-common path aberrations estimation and correction
Authors:
Johan Mazoyer,
Charles Goulas,
Fabrice Vidal,
Isaac Bernardino Dinis,
Julien Milli,
Michel Tallon,
Raphaël Galicher,
Oliver Absil,
Clémentine Béchet,
Anthony Boccaletti,
Florian Ferreira,
Maud Langlois,
Patrice Martinez,
Laurent Mugnier,
Mamadou N'diaye,
Gilles Orban de Xivry,
Axel Potier,
Isabelle Tallon-Bosc,
Arthur Vigan
Abstract:
SAXO+ is a planned enhancement of the existing SAXO, the VLT/ SPHERE adaptive optics system, deployed on ESO's Very Large Telescope. This upgrade is designed to significantly enhance the instrument's capacity to detect and analyze young Jupiter-like planets. The pivotal addition in SAXO+ is a second-stage adaptive optics system featuring a dedicated near-infrared pyramid wavefront sensor and a sec…
▽ More
SAXO+ is a planned enhancement of the existing SAXO, the VLT/ SPHERE adaptive optics system, deployed on ESO's Very Large Telescope. This upgrade is designed to significantly enhance the instrument's capacity to detect and analyze young Jupiter-like planets. The pivotal addition in SAXO+ is a second-stage adaptive optics system featuring a dedicated near-infrared pyramid wavefront sensor and a second deformable mirror. This secondary stage is strategically integrated to address any residual wavefront errors persisting after the initial correction performed by the current primary AO loop, SAXO. However, several recent studies clearly showed that in good conditions, even in the current system SAXO, non-common path aberrations (NCPAs) are the limiting factor of the final normalized intensity in focal plane, which is the final metric for ground-based high-contrast instruments. This is likely to be even more so the case with the new AO system, with which the AO residuals will be minimized. Several techniques have already been extensively tested on SPHERE in internal source and/or on-sky and will be presented in this paper. However, the use of a new type of sensor for the second stage, a pyramid wavefront sensor, will likely complicate the correction of these aberrations. Using an end-to-end AO simulation tool, we conducted simulations to gauge the effect of measured SPHERE NCPAs in the coronagraphic image on the second loop system and their correction using focal plane wavefront sensing systems. We finally analyzed how the chosen position of SAXO+ in the beam will impact the evolution of the NCPAs in the new instrument.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Interpreting Attention Layer Outputs with Sparse Autoencoders
Authors:
Connor Kissane,
Robert Krzyzanowski,
Joseph Isaac Bloom,
Arthur Conmy,
Neel Nanda
Abstract:
Decomposing model activations into interpretable components is a key open problem in mechanistic interpretability. Sparse autoencoders (SAEs) are a popular method for decomposing the internal activations of trained transformers into sparse, interpretable features, and have been applied to MLP layers and the residual stream. In this work we train SAEs on attention layer outputs and show that also h…
▽ More
Decomposing model activations into interpretable components is a key open problem in mechanistic interpretability. Sparse autoencoders (SAEs) are a popular method for decomposing the internal activations of trained transformers into sparse, interpretable features, and have been applied to MLP layers and the residual stream. In this work we train SAEs on attention layer outputs and show that also here SAEs find a sparse, interpretable decomposition. We demonstrate this on transformers from several model families and up to 2B parameters.
We perform a qualitative study of the features computed by attention layers, and find multiple families: long-range context, short-range context and induction features. We qualitatively study the role of every head in GPT-2 Small, and estimate that at least 90% of the heads are polysemantic, i.e. have multiple unrelated roles.
Further, we show that Sparse Autoencoders are a useful tool that enable researchers to explain model behavior in greater detail than prior work. For example, we explore the mystery of why models have so many seemingly redundant induction heads, use SAEs to motivate the hypothesis that some are long-prefix whereas others are short-prefix, and confirm this with more rigorous analysis. We use our SAEs to analyze the computation performed by the Indirect Object Identification circuit (Wang et al.), validating that the SAEs find causally meaningful intermediate variables, and deepening our understanding of the semantics of the circuit. We open-source the trained SAEs and a tool for exploring arbitrary prompts through the lens of Attention Output SAEs.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Numerical simulations for the SAXO+ upgrade: Performance analysis of the adaptive optics system
Authors:
Charles Goulas,
Raphaël Galicher,
Fabrice Vidal,
Johan Mazoyer,
Florian Ferreira,
Arnaud Sevin,
Anthony Boccaletti,
Eric Gendron,
Clémentine Béchet,
Michel Tallon,
Maud Langlois,
Caroline Kulcsár,
Henri-François Raynaud,
Nicolas Galland,
Laura Schreiber,
Isaac Bernardino Dinis,
François Wildi,
Gaël Chauvin,
Julien Milli
Abstract:
SPHERE, operating at the VLT since 2014, is currently one of the high-contrast instruments with a higher performance. Its adaptive optics system, known as SAXO, will be upgraded to SAXO+, which features the addition of a second stage of adaptive optics. This stage will use a near-infrared pyramid wavefront sensor to record images of fainter exoplanets around redder stars. In this work, we compare…
▽ More
SPHERE, operating at the VLT since 2014, is currently one of the high-contrast instruments with a higher performance. Its adaptive optics system, known as SAXO, will be upgraded to SAXO+, which features the addition of a second stage of adaptive optics. This stage will use a near-infrared pyramid wavefront sensor to record images of fainter exoplanets around redder stars. In this work, we compare the performance of SAXO and SAXO+. We look for the optimal values of the key system parameters of SAXO+ for various science cases and turbulence conditions. We performed numerical simulations using COMPASS, an end-to-end adaptive optics simulation tool. We simulated perfect coronagraph images of an on-axis point source, and we minimized the residual starlight intensity between 3 and 5 $λ/D$ as a performance criterion. The explored parameter space includes science cases, turbulence conditions, and key system parameters. In every science case and turbulence condition, SAXO+ reduces the residual starlight intensity inside the correction zone of the second stage by a factor of ten compared to SAXO. The optimal first stage gain is lower for SAXO+ than for SAXO alone. We quantified the gain in performance of SAXO+ when changing the second stage frequency from 2 kHz to 3 kHz, and we conclude that 2 kHz may be sufficient for most realistic conditions. We give the optimal first stage gain as well as the first and second stage frequencies for every seeing, coherence time, and science case. Finally, we find that a 2 ${λ_{\mathrm{WFS}}}/D$ pyramid modulation radius is a good trade-off between performance and robustness against varying turbulence conditions. This study shows that the future SAXO+ system will outperform the current SAXO system in all studied cases.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Using skateboarding to develop a culturally relevant tutorial on static equilibrium
Authors:
Gian Viray,
Isaac Cheney,
Tong Wan
Abstract:
Culturally relevant pedagogy (CRP), initially developed by Ladson-Billings, is an instructional framework for supporting diverse learners by drawing on their cultural backgrounds and experiences. In line with the CRP framework, we developed a tutorial on static equilibrium using skateboarding, a popular activity on university campuses, as a culturally relevant context. To address specific student…
▽ More
Culturally relevant pedagogy (CRP), initially developed by Ladson-Billings, is an instructional framework for supporting diverse learners by drawing on their cultural backgrounds and experiences. In line with the CRP framework, we developed a tutorial on static equilibrium using skateboarding, a popular activity on university campuses, as a culturally relevant context. To address specific student conceptions about static equilibrium documented in the physics education research (PER) literature, we used the elicit-confront-resolve (ECR) strategy to develop the tutorial. In this paper, we provide a detailed account of how we operationalized the ECR strategy in designing the sequences of questions in the tutorial. Additionally, we present anecdotal evidence to show that the culturally relevant tutorial appears to effectively engage students and motivate their interest in learning physics.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Continuous drive heterodyne microwave sensing with spin qubits in hexagonal boron nitride
Authors:
Charlie J. Patrickson,
Valentin Haemmerli,
Shi Guo,
Andrew J. Ramsay,
Isaac J. Luxmoore
Abstract:
Quantum sensors that use solid state spin defects have emerged as effective probes of weak alternating magnetic signals. By recording the phase of a signal relative to an external clock, these devices can resolve signal frequencies to a precision orders of magnitude longer than the spin state lifetime. However, these quantum heterodyne protocols suffer from sub-optimal sensitivity, as they are cur…
▽ More
Quantum sensors that use solid state spin defects have emerged as effective probes of weak alternating magnetic signals. By recording the phase of a signal relative to an external clock, these devices can resolve signal frequencies to a precision orders of magnitude longer than the spin state lifetime. However, these quantum heterodyne protocols suffer from sub-optimal sensitivity, as they are currently limited to pulsed spin control techniques, which are susceptible to cumulative pulse-area errors, or single continuous drives which offer no protection of the spin coherence. Here, we present a control scheme based on a continuous microwave drive that extends spin coherence towards the effective $T_2 \approx \frac{1}{2}T_1$ limit and can resolve the frequency, amplitude and phase of GHz magnetic fields. The scheme is demonstrated using an ensemble of boron vacancies in hexagonal boron nitride, and achieves an amplitude sensitivity of $η\approx 3-5 \:\mathrm{μT \sqrt{Hz}}$ and phase sensitivity of $η_φ \approx 0.076 \:\mathrm{rads \sqrt{Hz}}$. By repeatedly referencing the phase of a resonant signal against the coherent continuous microwave drive in a quantum heterodyne demonstration, we measure a GHz signal with a resolution $<$1 Hz over a 10 s measurement. Achieving this level of performance in a two-dimensional material platform could have broad applications, from probing nanoscale condensed matter systems to integration into heterostructures for quantum networking.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Irida-Graphene Phonon Thermal Transport via Non-equilibrium Molecular Dynamics Simulations
Authors:
Isaac M. Felix,
Raphael M. Tromer,
Leonardo D. Machado,
Douglas S. Galvão,
Luiz A. Ribeiro Jr,
Marcelo L. Pereira Jr
Abstract:
Recently, a new 2D carbon allotrope called Irida-Graphene (Irida-G) was proposed. Irida-G consists of a flat sheet topologically arranged into 3-6-8 carbon rings exhibiting metallic and non-magnetic properties. In this study, we investigated the thermal transport properties of Irida-G using classical reactive molecular dynamics simulations. The findings indicate that Irida-G has an intrinsic therm…
▽ More
Recently, a new 2D carbon allotrope called Irida-Graphene (Irida-G) was proposed. Irida-G consists of a flat sheet topologically arranged into 3-6-8 carbon rings exhibiting metallic and non-magnetic properties. In this study, we investigated the thermal transport properties of Irida-G using classical reactive molecular dynamics simulations. The findings indicate that Irida-G has an intrinsic thermal conductivity of approximately 215 W/mK at room temperature, significantly lower than that of pristine graphene. This decrease is due to characteristic phonon scattering within Irida-G's porous structure. Additionally, the phonon group velocities and vibrational density of states for Irida-G were analyzed, revealing reduced average phonon group velocities compared to graphene. The thermal conductivity of Irida-G is isotropic and shows significant size effects, transitioning from ballistic to diffusive heat transport regimes as the system length increases. These results suggest that while Irida-G has lower thermal conductivity than graphene, it still holds potential for specific thermal management applications, sharing characteristics with other two-dimensional materials.
△ Less
Submitted 28 June, 2024; v1 submitted 22 June, 2024;
originally announced June 2024.
-
Multiple Clues for Dayside Aerosols and Temperature Gradients in WASP-69 b from a Panchromatic JWST Emission Spectrum
Authors:
Everett Schlawin,
Sagnick Mukherjee,
Kazumasa Ohno,
Taylor Bell,
Thomas G. Beatty,
Thomas P. Greene,
Michael Line,
Ryan C. Challener,
Vivien Parmentier,
Jonathan J. Fortney,
Emily Rauscher,
Lindsey Wiser,
Luis Welbanks,
Matthew Murphy,
Isaac Edelman,
Natasha Batalha,
Sarah E. Moran,
Nishil Mehta,
Marcia Rieke
Abstract:
WASP-69 b is a hot, inflated, Saturn-mass planet 0.26 Mjup with a zero-albedo equilibrium temperature of 963 K. Here, we report the JWST 2 to 12 um emission spectrum of the planet consisting of two eclipses observed with NIRCam grism time series and one eclipse observed with MIRI LRS. The emission spectrum shows absorption features of water vapor, carbon dioxide and carbon monoxide, but no strong…
▽ More
WASP-69 b is a hot, inflated, Saturn-mass planet 0.26 Mjup with a zero-albedo equilibrium temperature of 963 K. Here, we report the JWST 2 to 12 um emission spectrum of the planet consisting of two eclipses observed with NIRCam grism time series and one eclipse observed with MIRI LRS. The emission spectrum shows absorption features of water vapor, carbon dioxide and carbon monoxide, but no strong evidence for methane. WASP-69 b's emission spectrum is poorly fit by cloud-free homogeneous models. We find three possible model scenarios for the planet: 1) a Scattering Model that raises the brightness at short wavelengths with a free Geometric Albedo parameter 2) a Cloud Layer model that includes high altitude silicate aerosols to moderate long wavelength emission and 3) a Two-Region model that includes significant dayside inhomogeneity and cloud opacity with two different temperature-pressure profiles. In all cases, aerosols are needed to fit the spectrum of the planet. The Scattering model requires an unexpectedly high Geometric Albedo of 0.64. Our atmospheric retrievals indicate inefficient redistribution of heat and an inhomogeneous dayside distribution, which is tentatively supported by MIRI LRS broadband eclipse maps that show a central concentration of brightness. Our more plausible models (2 and 3) retrieve chemical abundances enriched in heavy elements relative to solar composition by 6x to 14x solar and a C/O ratio of 0.65 to 0.94, whereas the less plausible highly reflective scenario (1) retrieves a slightly lower metallicity and lower C/O ratio.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data
Authors:
Nahema Marchal,
Rachel Xu,
Rasmi Elasmar,
Iason Gabriel,
Beth Goldberg,
William Isaac
Abstract:
Generative, multimodal artificial intelligence (GenAI) offers transformative potential across industries, but its misuse poses significant risks. Prior research has shed light on the potential of advanced AI systems to be exploited for malicious purposes. However, we still lack a concrete understanding of how GenAI models are specifically exploited or abused in practice, including the tactics empl…
▽ More
Generative, multimodal artificial intelligence (GenAI) offers transformative potential across industries, but its misuse poses significant risks. Prior research has shed light on the potential of advanced AI systems to be exploited for malicious purposes. However, we still lack a concrete understanding of how GenAI models are specifically exploited or abused in practice, including the tactics employed to inflict harm. In this paper, we present a taxonomy of GenAI misuse tactics, informed by existing academic literature and a qualitative analysis of approximately 200 observed incidents of misuse reported between January 2023 and March 2024. Through this analysis, we illuminate key and novel patterns in misuse during this time period, including potential motivations, strategies, and how attackers leverage and abuse system capabilities across modalities (e.g. image, text, audio, video) in the wild.
△ Less
Submitted 21 June, 2024; v1 submitted 19 June, 2024;
originally announced June 2024.
-
Imagining In-distribution States: How Predictable Robot Behavior Can Enable User Control Over Learned Policies
Authors:
Isaac Sheidlower,
Emma Bethel,
Douglas Lilly,
Reuben M. Aronson,
Elaine Schaertl Short
Abstract:
It is crucial that users are empowered to take advantage of the functionality of a robot and use their understanding of that functionality to perform novel and creative tasks. Given a robot trained with Reinforcement Learning (RL), a user may wish to leverage that autonomy along with their familiarity of how they expect the robot to behave to collaborate with the robot. One technique is for the us…
▽ More
It is crucial that users are empowered to take advantage of the functionality of a robot and use their understanding of that functionality to perform novel and creative tasks. Given a robot trained with Reinforcement Learning (RL), a user may wish to leverage that autonomy along with their familiarity of how they expect the robot to behave to collaborate with the robot. One technique is for the user to take control of some of the robot's action space through teleoperation, allowing the RL policy to simultaneously control the rest. We formalize this type of shared control as Partitioned Control (PC). However, this may not be possible using an out-of-the-box RL policy. For example, a user's control may bring the robot into a failure state from the policy's perspective, causing it to act unexpectedly and hindering the success of the user's desired task. In this work, we formalize this problem and present Imaginary Out-of-Distribution Actions, IODA, an initial algorithm which empowers users to leverage their expectations of a robot's behavior to accomplish new tasks. We deploy IODA in a user study with a real robot and find that IODA leads to both better task performance and a higher degree of alignment between robot behavior and user expectation. We also show that in PC, there is a strong and significant correlation between task performance and the robot's ability to meet user expectations, highlighting the need for approaches like IODA. Code is available at https://github.com/AABL-Lab/ioda_roman_2024
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Hardware Realization of Neuromorphic Computing with a 4-Port Photonic Reservoir for Modulation Format Identification
Authors:
Enes Şeker,
Rijil Thomas,
Guillermo von Hünefeld,
Stephan Suckow,
Mahdi Kaveh,
Gregor Ronniger,
Pooyan Safari,
Isaac Sackey,
David Stahl,
Colja Schubert,
Johannes Karl Fischer,
Ronald Freund,
Max C. Lemme
Abstract:
The fields of machine learning and artificial intelligence drive researchers to explore energy-efficient, brain-inspired new hardware. Reservoir computing encompasses recurrent neural networks for sequential data processing and matches the performance of other recurrent networks with less training and lower costs. However, traditional software-based neural networks suffer from high energy consumpt…
▽ More
The fields of machine learning and artificial intelligence drive researchers to explore energy-efficient, brain-inspired new hardware. Reservoir computing encompasses recurrent neural networks for sequential data processing and matches the performance of other recurrent networks with less training and lower costs. However, traditional software-based neural networks suffer from high energy consumption due to computational demands and massive data transfer needs. Photonic reservoir computing overcomes this challenge with energy-efficient neuromorphic photonic integrated circuits or NeuroPICs. Here, we introduce a reservoir NeuroPIC used for modulation format identification in C-band telecommunication network monitoring. It is built on a silicon-on-insulator platform with a 4-port reservoir architecture consisting of a set of physical nodes connected via delay lines. We comprehensively describe the NeuroPIC design and fabrication, experimentally demonstrate its performance, and compare it with simulations. The NeuroPIC incorporates non-linearity through a simple digital readout and achieves close to 100% accuracy in identifying several configurations of quadrature amplitude modulation formats transmitted over 20 km of optical fiber at 32 GBaud symbol rate. The NeuroPIC performance is robust against fabrication imperfections like waveguide propagation loss, phase randomization, etc. and delay line length variations. Furthermore, the experimental results exceeded numerical simulations, which we attribute to enhanced signal interference in the experimental NeuroPIC output. Our energy-efficient photonic approach has the potential for high-speed temporal data processing in a variety of applications.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Predicting BN analogue of 8-16-4 graphyne: \textit{In silico} insights into its structural, electronic, optical, and thermal transport properties
Authors:
Isaac M. Félix,
Jessé M. Pontes,
Djardiel S. Gomes,
Thiago B. G. Guerra,
Sérgio A. F. Azevedo,
Leonardo D. Machado,
Lídia C. Gomes,
Raphael M. Tromer
Abstract:
The boron nitride (BN) analogue of 8-16-4 graphyne, termed SBNyne, is proposed for the first time. Its physical properties were explored using first-principles calculations and classical molecular dynamics (MD) simulations. Thermal stability assessments reveal that SBNyne maintains structural integrity up to 1000 K. We found that SBNyne exhibits a wide indirect bandgap of 4.58 eV using HSE06 and 3…
▽ More
The boron nitride (BN) analogue of 8-16-4 graphyne, termed SBNyne, is proposed for the first time. Its physical properties were explored using first-principles calculations and classical molecular dynamics (MD) simulations. Thermal stability assessments reveal that SBNyne maintains structural integrity up to 1000 K. We found that SBNyne exhibits a wide indirect bandgap of 4.58 eV using HSE06 and 3.20 eV using PBE. It displays strong optical absorption in the ultraviolet region while remaining transparent in the infrared and visible regions. Additionally, SBNyne exhibits significantly lower thermal conductivity compared to h-BN. Phonon spectrum analysis indicates that out-of-plane phonons predominantly contribute to the vibrational density of states only at very low frequencies, explaining its low thermal conductivity. These findings expand the knowledge of BN-based 2D materials and open new avenues for their design and advanced technological applications.
△ Less
Submitted 2 July, 2024; v1 submitted 19 June, 2024;
originally announced June 2024.
-
Quantum geometry of bosonic Bogoliubov quasiparticles
Authors:
Isaac Tesfaye,
André Eckardt
Abstract:
Topological and geometrical features arising in bosonic Bogoliubov-de Gennes (BdG) systems have mainly been studied by utilizing a generalized symplectic version of the Berry curvature and related Chern numbers. Here, we propose a symplectic quantum geometric tensor (SQGT), whose imaginary part leads to the previously studied symplectic Berry curvature, while the real part gives rise to a symplect…
▽ More
Topological and geometrical features arising in bosonic Bogoliubov-de Gennes (BdG) systems have mainly been studied by utilizing a generalized symplectic version of the Berry curvature and related Chern numbers. Here, we propose a symplectic quantum geometric tensor (SQGT), whose imaginary part leads to the previously studied symplectic Berry curvature, while the real part gives rise to a symplectic quantum metric, providing a natural distance measure in the space of bosonic Bogoliubov modes. We propose how to measure all components of the SQGT by extracting excitation rates in response to periodic modulations of the systems' parameters. Moreover, we connect the symplectic Berry curvature to a generalized symplectic anomalous velocity term for Bogoliubov Bloch wave packets. We test our results for a bosonic Bogoliubov-Haldane model.
△ Less
Submitted 16 July, 2024; v1 submitted 18 June, 2024;
originally announced June 2024.
-
STAR: SocioTechnical Approach to Red Teaming Language Models
Authors:
Laura Weidinger,
John Mellor,
Bernat Guillen Pegueroles,
Nahema Marchal,
Ravin Kumar,
Kristian Lum,
Canfer Akbulut,
Mark Diaz,
Stevie Bergman,
Mikel Rodriguez,
Verena Rieser,
William Isaac
Abstract:
This research introduces STAR, a sociotechnical framework that improves on current best practices for red teaming safety of large language models. STAR makes two key contributions: it enhances steerability by generating parameterised instructions for human red teamers, leading to improved coverage of the risk surface. Parameterised instructions also provide more detailed insights into model failur…
▽ More
This research introduces STAR, a sociotechnical framework that improves on current best practices for red teaming safety of large language models. STAR makes two key contributions: it enhances steerability by generating parameterised instructions for human red teamers, leading to improved coverage of the risk surface. Parameterised instructions also provide more detailed insights into model failures at no increased cost. Second, STAR improves signal quality by matching demographics to assess harms for specific groups, resulting in more sensitive annotations. STAR further employs a novel step of arbitration to leverage diverse viewpoints and improve label reliability, treating disagreement not as noise but as a valuable contribution to signal quality.
△ Less
Submitted 10 July, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Non-unitary Coupled Cluster Enabled by Mid-circuit Measurements on Quantum Computers
Authors:
Alexandre Fleury,
James Brown,
Erika Lloyd,
Maritza Hernandez,
Isaac H. Kim
Abstract:
Many quantum algorithms rely on a quality initial state for optimal performance. Preparing an initial state for specific applications can considerably reduce the cost of probabilistic algorithms such as the well studied quantum phase estimation (QPE). Fortunately, in the application space of quantum chemistry, generating approximate wave functions for molecular systems is well studied, and quantum…
▽ More
Many quantum algorithms rely on a quality initial state for optimal performance. Preparing an initial state for specific applications can considerably reduce the cost of probabilistic algorithms such as the well studied quantum phase estimation (QPE). Fortunately, in the application space of quantum chemistry, generating approximate wave functions for molecular systems is well studied, and quantum computing algorithms stand to benefit from importing these classical methods directly into a quantum circuit. In this work, we propose a state preparation method based on coupled cluster (CC) theory, which is a pillar of quantum chemistry on classical computers, by incorporating mid-circuit measurements into the circuit construction. Currently, the most well studied state preparation method for quantum chemistry on quantum computers is the variational quantum eigensolver (VQE) with a unitary-CC with single- and double-electron excitation terms (UCCSD) ansatz whose operations are limited to unitary gates. We verify the accuracy of our state preparation protocol using mid-circuit measurements by performing energy evaluation and state overlap computation for a set of small chemical systems. We further demonstrate that our approach leads to a reduction of the classical computation overhead, and the number of CNOT and T gates by 28% and 57% on average when compared against the standard VQE-UCCSD protocol.
△ Less
Submitted 28 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
A platform for lightweight deployment of IoT applications based on a Function-as-a-Service model
Authors:
Sebastià Sansó,
Carlos Guerrero,
Isaac Lera,
Carlos Juiz
Abstract:
This paper presents a platform to facilitate the deployment of applications in Internet of Things (IoT) devices. The platform allows to the programmers to use a Function-as-a-Service programming paradigm that are managed and configured in a Platform-as-a-Service web tool. The tool also allows to establish interoperability between the functions of the applications. The proposed platform obtained fa…
▽ More
This paper presents a platform to facilitate the deployment of applications in Internet of Things (IoT) devices. The platform allows to the programmers to use a Function-as-a-Service programming paradigm that are managed and configured in a Platform-as-a-Service web tool. The tool also allows to establish interoperability between the functions of the applications. The proposed platform obtained faster and easier deployments of the applications and the resource usages of the IoT devices also were lower in relation to a deployment process based in containers of Docker.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
The nucleosynthetic fingerprint of the outermost protoplanetary disk and early Solar System dynamics
Authors:
Elishevah van Kooten,
Xuchao Zhao,
Ian Franchi,
Po-Yen Tung,
Simon Fairclough,
John Walmsley,
Isaac Onyett,
Martin Schiller,
Martin Bizzarro
Abstract:
Knowledge of the nucleosynthetic isotope composition of the outermost protoplanetary disk is critical to understand the formation and early dynamical evolution of the Solar System. We report the discovery of outer disk material preserved in a pristine meteorite based on its chemical composition, organic-rich petrology, and 15N-rich, deuterium-rich, and 16O-poor isotope signatures. We infer that th…
▽ More
Knowledge of the nucleosynthetic isotope composition of the outermost protoplanetary disk is critical to understand the formation and early dynamical evolution of the Solar System. We report the discovery of outer disk material preserved in a pristine meteorite based on its chemical composition, organic-rich petrology, and 15N-rich, deuterium-rich, and 16O-poor isotope signatures. We infer that this outer disk material originated in the comet-forming region. The nucleosynthetic Fe, Mg, Si and Cr compositions of this material reveal that, contrary to current belief, the isotope signature of the comet-forming region is ubiquitous amongst outer Solar System bodies, possibly reflecting an important planetary building block in the outer Solar System. This nucleosynthetic component represents fresh material added to the outer disk by late accretion streamers connected to the ambient molecular cloud. Our results show that most Solar System carbonaceous asteroids accreted material from the comet-forming region, a signature lacking in the terrestrial planet region.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Optimization policy for file replica placement in fog domains
Authors:
Carlos Guerrero,
Isaac Lera,
Carlos Juiz
Abstract:
Fog computing architectures distribute computational and storage resources along the continuum from the cloud to things. Therefore, the execution of services or the storage of files can be closer to the users. The main objectives of fog computing domains are to reduce the user latency and the network usage. Availability is also an issue in fog architectures because the topology of the network does…
▽ More
Fog computing architectures distribute computational and storage resources along the continuum from the cloud to things. Therefore, the execution of services or the storage of files can be closer to the users. The main objectives of fog computing domains are to reduce the user latency and the network usage. Availability is also an issue in fog architectures because the topology of the network does not guarantee redundant links between devices. Consequently, the definition of placement polices is a key challenge. We propose a placement policy for data replication to increase data availability that contrasts with other storage policies that only consider a single replica of the files. The system is modeled with complex weighted networks and topological features, such as centrality indices. Graph partition algorithms are evaluated to select the fog devices that store data replicas. Our approach is compared with two other placement policies: one that stores only one replica and FogStore, which also stores file replicas but uses a greedy approach (the shortest path). We analyze 22 experiments with simulations. The results show that our approach obtains the shortest latency times, mainly for writing operations, a smaller network usage increase, and a similar file availability to FogStore.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Large language model validity via enhanced conformal prediction methods
Authors:
John J. Cherian,
Isaac Gibbs,
Emmanuel J. Candès
Abstract:
We develop new conformal inference methods for obtaining validity guarantees on the output of large language models (LLMs). Prior work in conformal language modeling identifies a subset of the text that satisfies a high-probability guarantee of correctness. These methods work by filtering claims from the LLM's original response if a scoring function evaluated on the claim fails to exceed a thresho…
▽ More
We develop new conformal inference methods for obtaining validity guarantees on the output of large language models (LLMs). Prior work in conformal language modeling identifies a subset of the text that satisfies a high-probability guarantee of correctness. These methods work by filtering claims from the LLM's original response if a scoring function evaluated on the claim fails to exceed a threshold calibrated via split conformal prediction. Existing methods in this area suffer from two deficiencies. First, the guarantee stated is not conditionally valid. The trustworthiness of the filtering step may vary based on the topic of the response. Second, because the scoring function is imperfect, the filtering step can remove many valuable and accurate claims. We address both of these challenges via two new conformal methods. First, we generalize the conditional conformal procedure of Gibbs et al. (2023) in order to adaptively issue weaker guarantees when they are required to preserve the utility of the output. Second, we show how to systematically improve the quality of the scoring function via a novel algorithm for differentiating through the conditional conformal procedure. We demonstrate the efficacy of our approach on both synthetic and real-world datasets.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Distributed genetic algorithm for application placement in the compute continuum leveraging infrastructure nodes for optimization
Authors:
Carlos Guerrero,
Isaac Lera,
Carlos Juiz
Abstract:
The increasing complexity of fog computing environments calls for efficient resource optimization techniques. In this paper, we propose and evaluate three distributed designs of a genetic algorithm (GA) for resource optimization in fog computing, within an increasing degree of distribution. The designs leverage the execution of the GA in the fog devices themselves by dealing with the specific feat…
▽ More
The increasing complexity of fog computing environments calls for efficient resource optimization techniques. In this paper, we propose and evaluate three distributed designs of a genetic algorithm (GA) for resource optimization in fog computing, within an increasing degree of distribution. The designs leverage the execution of the GA in the fog devices themselves by dealing with the specific features of this domain: constrained resources and widely geographical distribution of the devices. For their evaluation, we implemented a benchmark case using the NSGA-II for the specific problem of optimizing the fog service placement, according to the guidelines of our three distributed designs. These three experimental scenarios were compared with a control case, a traditional centralized version of this GA algorithm, considering solution quality and network overhead. The results show that the design with the lowest distribution degree, which keeps centralized storage of the objective space, achieves comparable solution quality to the traditional approach but incurs a higher network load. The second design, which completely distributes the population between the workers, reduces network overhead but exhibits lower solution diversity while keeping enough good results in terms of optimization objective minimization. Finally, the proposal with a distributed population and that only interchanges solution between the workers' neighbors achieves the lowest network load but with compromised solution quality.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Scoreformer: A Surrogate Model For Large-Scale Prediction of Docking Scores
Authors:
Álvaro Ciudad,
Adrián Morales-Pastor,
Laura Malo,
Isaac Filella-Mercè,
Victor Guallar,
Alexis Molina
Abstract:
In this study, we present ScoreFormer, a novel graph transformer model designed to accurately predict molecular docking scores, thereby optimizing high-throughput virtual screening (HTVS) in drug discovery. The architecture integrates Principal Neighborhood Aggregation (PNA) and Learnable Random Walk Positional Encodings (LRWPE), enhancing the model's ability to understand complex molecular struct…
▽ More
In this study, we present ScoreFormer, a novel graph transformer model designed to accurately predict molecular docking scores, thereby optimizing high-throughput virtual screening (HTVS) in drug discovery. The architecture integrates Principal Neighborhood Aggregation (PNA) and Learnable Random Walk Positional Encodings (LRWPE), enhancing the model's ability to understand complex molecular structures and their relationship with their respective docking scores. This approach significantly surpasses traditional HTVS methods and recent Graph Neural Network (GNN) models in both recovery and efficiency due to a wider coverage of the chemical space and enhanced performance. Our results demonstrate that ScoreFormer achieves competitive performance in docking score prediction and offers a substantial 1.65-fold reduction in inference time compared to existing models. We evaluated ScoreFormer across multiple datasets under various conditions, confirming its robustness and reliability in identifying potential drug candidates rapidly.
△ Less
Submitted 25 June, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
PETSc/TAO Developments for Early Exascale Systems
Authors:
Richard Tran Mills,
Mark Adams,
Satish Balay,
Jed Brown,
Jacob Faibussowitsch,
Toby Isaac,
Matthew Knepley,
Todd Munson,
Hansol Suh,
Stefano Zampini,
Hong Zhang,
Junchao Zhang
Abstract:
The Portable Extensible Toolkit for Scientific Computation (PETSc) library provides scalable solvers for nonlinear time-dependent differential and algebraic equations and for numerical optimization via the Toolkit for Advanced Optimization (TAO). PETSc is used in dozens of scientific fields and is an important building block for many simulation codes. During the U.S. Department of Energy's Exascal…
▽ More
The Portable Extensible Toolkit for Scientific Computation (PETSc) library provides scalable solvers for nonlinear time-dependent differential and algebraic equations and for numerical optimization via the Toolkit for Advanced Optimization (TAO). PETSc is used in dozens of scientific fields and is an important building block for many simulation codes. During the U.S. Department of Energy's Exascale Computing Project, the PETSc team has made substantial efforts to enable efficient utilization of the massive fine-grain parallelism present within exascale compute nodes and to enable performance portability across exascale architectures. We recap some of the challenges that designers of numerical libraries face in such an endeavor, and then discuss the many developments we have made, which include the addition of new GPU backends, features supporting efficient on-device matrix assembly, better support for asynchronicity and GPU kernel concurrency, and new communication infrastructure. We evaluate the performance of these developments on some pre-exascale systems as well the early exascale systems Frontier and Aurora, using compute kernel, communication layer, solver, and mini-application benchmark studies, and then close with a few observations drawn from our experiences on the tension between portable performance and other goals of numerical libraries.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Towards Integrating Personal Knowledge into Test-Time Predictions
Authors:
Isaac Lage,
Sonali Parbhoo,
Finale Doshi-Velez
Abstract:
Machine learning (ML) models can make decisions based on large amounts of data, but they can be missing personal knowledge available to human users about whom predictions are made. For example, a model trained to predict psychiatric outcomes may know nothing about a patient's social support system, and social support may look different for different patients. In this work, we introduce the problem…
▽ More
Machine learning (ML) models can make decisions based on large amounts of data, but they can be missing personal knowledge available to human users about whom predictions are made. For example, a model trained to predict psychiatric outcomes may know nothing about a patient's social support system, and social support may look different for different patients. In this work, we introduce the problem of human feature integration, which provides a way to incorporate important personal-knowledge from users without domain expertise into ML predictions. We characterize this problem through illustrative user stories and comparisons to existing approaches; we formally describe this problem in a way that paves the ground for future technical solutions; and we provide a proof-of-concept study of a simple version of a solution to this problem in a semi-realistic setting.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Incremental Learning and Self-Attention Mechanisms Improve Neural System Identification
Authors:
Isaac Lin,
Tianye Wang,
Shang Gao,
Shiming Tang,
Tai Sing Lee
Abstract:
Convolutional neural networks (CNNs) have been shown to be the state-of-the-art approach for modeling the transfer functions of visual cortical neurons. Cortical neurons in the primary visual cortex are are sensitive to contextual information mediated by extensive horizontal and feedback connections. Standard CNNs can integrate global spatial image information to model such contextual modulation v…
▽ More
Convolutional neural networks (CNNs) have been shown to be the state-of-the-art approach for modeling the transfer functions of visual cortical neurons. Cortical neurons in the primary visual cortex are are sensitive to contextual information mediated by extensive horizontal and feedback connections. Standard CNNs can integrate global spatial image information to model such contextual modulation via two mechanisms: successive rounds of convolutions and a fully connected readout layer. In this paper, we find that non-local networks or self-attention (SA) mechanisms, theoretically related to context-dependent flexible gating mechanisms observed in the primary visual cortex, improve neural response predictions over parameter-matched CNNs in two key metrics: tuning curve correlation and tuning peak. We factorize networks to determine the relative contribution of each context mechanism. This reveals that information in the local receptive field is most important for modeling the overall tuning curve, but surround information is critically necessary for characterizing the tuning peak. We find that self-attention can replace subsequent spatial-integration convolutions when learned in an incremental manner, and is further enhanced in the presence of a fully connected readout layer, suggesting that the two context mechanisms are complementary. Finally, we find that learning a receptive-field-centric model with self-attention, before incrementally learning a fully connected readout, yields a more biologically realistic model in terms of center-surround contributions.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Metal-Poor Stars in the MW Disk: Resonant Cooling of Vertical Oscillations of Halo Stars in Barred Galaxies
Authors:
Xingchen Li,
Isaac Shlosman,
Daniel Pfenniger,
Clayton Heller
Abstract:
Using numerical simulations of a barred disk galaxy embedded in nonspinning and spinning dark matter (DM) halos, we present a novel mechanism of `cooling' the vertical oscillations of halo DM particles, which acquire the disk kinematics. The underlying mechanism consists of resonant interactions between halo particles and the stellar bar. The cooling mechanism acts both on dynamical and secular ti…
▽ More
Using numerical simulations of a barred disk galaxy embedded in nonspinning and spinning dark matter (DM) halos, we present a novel mechanism of `cooling' the vertical oscillations of halo DM particles, which acquire the disk kinematics. The underlying mechanism consists of resonant interactions between halo particles and the stellar bar. The cooling mechanism acts both on dynamical and secular timescales, i.e., from ~ 0.5 Gyr to few Gyr, and the stellar bar acts to absorb the kinetic energy of the vertical motions. Using a Milky Way-type stellar halo, we estimate the population of metal-poor disk stars which have been trapped by the MW disk and analyze its kinematics. We find that the population of metal-poor MW disk stars with $|z|\ltorder 3$\,kpc detected by the Gaia DR3 and other surveys can have their origin in the stellar halo. The cooled population also migrates radially outwards compared by acquiring energy from the spinning bar, and prograde-moving stars have a different distribution from the retrograde ones. Next, we have calculated the ratio of the prograde-to-retrograde orbits of the cooled population and found that this ratio varies radially, with the fast-spinning stellar halo resulting in the shallower radial increase of this ratio outside of the corotation. The nonspinning stellar halo shows a monotonic increase of this ratio with radius outside the corotation. Together with analyzed radial migration of these halo stars, the cooling phenomenon of halo metal-poor stars can explain their current disk population, and has corollaries for the chemical evolution of disk galaxies in general.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
A Variational Approach to Learning Photonic Unitary Operators
Authors:
Hadrian Bezuidenhout,
Mwezi Koni,
Jonathan Leach,
Paola Concha Obando,
Andrew Forbes,
Isaac Nape
Abstract:
Structured light, light tailored in its internal degrees of freedom, has become topical in numerous quantum and classical information processing protocols. In this work, we harness the high dimensional nature of structured light modulated in the transverse spatial degree of freedom to realise an adaptable scheme for learning unitary operations. Our approach borrows from concepts in variational qua…
▽ More
Structured light, light tailored in its internal degrees of freedom, has become topical in numerous quantum and classical information processing protocols. In this work, we harness the high dimensional nature of structured light modulated in the transverse spatial degree of freedom to realise an adaptable scheme for learning unitary operations. Our approach borrows from concepts in variational quantum computing, where a search or optimisation problem is mapped onto the task of finding a minimum ground state energy for a given energy/goal function. We achieve this by a pseudo-random walk procedure over the parameter space of the unitary operation, implemented with optical matrix-vector multiplication enacted on arrays of Gaussian modes by exploiting the partial Fourier transforming capabilities of a cylindrical lens in the transverse degree of freedom for the measurement. We outline the concept theoretically, and experimentally demonstrate that we are able to learn optical unitary matrices for dimensions d = 2, 4, 8 and 16 with average fidelities of >90%. Our work advances high dimensional information processing and can be adapted to both process and quantum state tomography of unknown states and channels.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Considerations for extracting moiré-level strain from dark field intensities in transmission electron microscopy
Authors:
Isaac M. Craig,
Madeline Van Winkle,
Colin Ophus,
D. Kwabena Bediako
Abstract:
Bragg interferometry (BI) is an imaging technique based on four-dimensional scanning electron microscopy (4D-STEM) wherein the intensities of select overlapping Bragg disks are fit or more qualitatively analyzed in the context of simple trigonometric equations to determine local stacking order. In 4D-STEM based approaches, the collection of full diffraction patterns at each real-space position of…
▽ More
Bragg interferometry (BI) is an imaging technique based on four-dimensional scanning electron microscopy (4D-STEM) wherein the intensities of select overlapping Bragg disks are fit or more qualitatively analyzed in the context of simple trigonometric equations to determine local stacking order. In 4D-STEM based approaches, the collection of full diffraction patterns at each real-space position of the scanning probe allows the use of precise virtual apertures much smaller and more variable in shape than those used in conventional dark field imaging, such that even buried interfaces marginally twisted from other layers can be targeted. A coarse-grained form of dark field ptychography, BI uses simple physically derived fitting functions to extract the average structure within the illumination region and is therefore viable over large fields of view. BI has shown a particular advantage for selectively investigating the interlayer stacking and associated moiré reconstruction of bilayer interfaces within complex multi-layered structures. This has enabled investigation of reconstruction and substrate effects in bilayers through encapsulating hexagonal boron nitride and of select bilayer interfaces within trilayer stacks. However, the technique can be improved to provide a greater spatial resolution and probe a wider range of twisted structures, for which current limitations on acquisition parameters can lead to large illumination regions and the computationally involved post-processing can fail. Here we analyze these limitations and the computational processing in greater depth, presenting a few methods for improvement over previous works, discussing potential areas for further expansion, and illustrating the current capabilities of this approach for extracting moiré-scale strain.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Sulfur Dioxide and Other Molecular Species in the Atmosphere of the Sub-Neptune GJ 3470 b
Authors:
Thomas G. Beatty,
Luis Welbanks,
Everett Schlawin,
Taylor J. Bell,
Michael R. Line,
Matthew Murphy,
Isaac Edelman,
Thomas P. Greene,
Jonathan J. Fortney,
Gregory W. Henry,
Sagnick Mukherjee,
Kazumasa Ohno,
Vivien Parmentier,
Emily Rauscher,
Lindsey S. Wiser,
Kenneth E. Arnold
Abstract:
We report observations of the atmospheric transmission spectrum of the sub-Neptune exoplanet GJ 3470 b taken using the Near-Infrared Camera (NIRCam) on JWST. Combined with two archival HST/WFC3 transit observations and fifteen archival Spitzer transit observations, we detect water, methane, sulfur dioxide, and carbon dioxide in the atmosphere of GJ 3470 b, each with a significance of >3-sigma. GJ…
▽ More
We report observations of the atmospheric transmission spectrum of the sub-Neptune exoplanet GJ 3470 b taken using the Near-Infrared Camera (NIRCam) on JWST. Combined with two archival HST/WFC3 transit observations and fifteen archival Spitzer transit observations, we detect water, methane, sulfur dioxide, and carbon dioxide in the atmosphere of GJ 3470 b, each with a significance of >3-sigma. GJ 3470 b is the lowest mass -- and coldest -- exoplanet known to show a substantial sulfur dioxide feature in its spectrum, at $M_{p}$=11.2${\,{\rm M}_{\oplus}}$ and $T_{eq}$=600$\,$K. This indicates disequilibrium photochemistry drives sulfur dioxide production in exoplanet atmospheres over a wider range of masses and temperatures than has been reported or expected. The water, carbon dioxide, and sulfur dioxide abundances we measure indicate an atmospheric metallicity of approximately $100\times$ Solar. We see further evidence for disequilibrium chemistry in our inferred methane abundance, which is significantly lower than expected from equilibrium models consistent with our measured water and carbon dioxide abundances.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Use of a Multiscale Vision Transformer to predict Nursing Activities Score from Low Resolution Thermal Videos in an Intensive Care Unit
Authors:
Isaac YL Lee,
Thanh Nguyen-Duc,
Ryo Ueno,
Jesse Smith,
Peter Y Chan
Abstract:
Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the Intensive Care Unit (ICU) is often done using the Nursing Activities Score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of Ambient Intelligence (AmI) by using computer vision to passively derive car…
▽ More
Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the Intensive Care Unit (ICU) is often done using the Nursing Activities Score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of Ambient Intelligence (AmI) by using computer vision to passively derive caregiver-patient interaction times to monitor staff workload. In this letter, we propose using a Multiscale Vision Transformer (MViT) to passively predict the NAS from low-resolution thermal videos recorded in an ICU. 458 videos were obtained from an ICU in Melbourne, Australia and used to train a MViTv2 model using an indirect prediction and a direct prediction method. The indirect method predicted 1 of 8 potentially identifiable NAS activities from the video before inferring the NAS. The direct method predicted the NAS score immediately from the video. The indirect method yielded an average 5-fold accuracy of 57.21%, an area under the receiver operating characteristic curve (ROC AUC) of 0.865, a F1 score of 0.570 and a mean squared error (MSE) of 28.16. The direct method yielded a MSE of 18.16. We also showed that the MViTv2 outperforms similar models such as R(2+1)D and ResNet50-LSTM under identical settings.
This study shows the feasibility of using a MViTv2 to passively predict the NAS in an ICU and monitor staff workload automatically. Our results above also show an increased accuracy in predicting NAS directly versus predicting NAS indirectly. We hope that our study can provide a direction for future work and further improve the accuracy of passive NAS monitoring.
△ Less
Submitted 30 May, 2024;
originally announced June 2024.
-
Changes in boiling controlled by molar concentration-dependent diffusion of surfactants
Authors:
Mario R. Mata,
Matic Može,
Armin Hadžić,
Giseop Lee,
Blake Naccarato,
Isaac Berk,
Iztok Golobič,
H. Jeremy Cho
Abstract:
Boiling is a prevalent phase-change process that plays a vital role in facilitating efficient heat transfer from a heating surface. While this heat transfer mechanism is generally effective, a rapid increase in surface temperature can lead to hydrodynamic instabilities, resulting in a boiling crisis. Previous studies have shown that surfactants often improve boiling performance and change the boil…
▽ More
Boiling is a prevalent phase-change process that plays a vital role in facilitating efficient heat transfer from a heating surface. While this heat transfer mechanism is generally effective, a rapid increase in surface temperature can lead to hydrodynamic instabilities, resulting in a boiling crisis. Previous studies have shown that surfactants often improve boiling performance and change the boiling crisis behavior. Conventional wisdom in this field attributes that these changes in boiling behavior are tied to the critical micelle concentration (CMC) of the particular surfactant. However, our work reveals that these changes in boiling behavior are independent of the CMC for three nonionic surfactants across a wide range of molar concentrations. In addition, visual snapshots of the bubbling behavior indicate changes in bubble formation, such as bubble size and nucleation site density, influenced by the molar concentration-dependent diffusion timescale of surfactants. Hence, these findings offer compelling evidence that boiling behavior, encompassing both boiling performance and boiling crisis, is governed by the dynamic adsorption of surfactants rather than dictated by the CMC. This becomes evident when quantifying the heat transfer coefficient (HTC) and critical heat flux (CHF) using the logarithm of molar concentration, as predicted by theory. Building upon these findings, we propose insights for controlling when CHF modification occurs in specific scenarios involving any surfactants. These insights hold significant potential for optimizing heat transfer processes and leveraging surfactants in energy-related applications to maximize boiling efficiency.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
An Observability-Constrained Magnetic-Field-Aided Inertial Navigation System
Authors:
Chuan Huang,
Gustaf Hendeby,
Isaac Skog
Abstract:
A method to construct an observability-constrained magnetic-field-aided inertial navigation system is proposed. The proposed method builds upon the previously proposed observability-constrained extended Kalman filter and extends it to work with a magnetic-field-based odometry-aided inertial navigation system. The proposed method is evaluated using simulation and real-world data, showing that (i) t…
▽ More
A method to construct an observability-constrained magnetic-field-aided inertial navigation system is proposed. The proposed method builds upon the previously proposed observability-constrained extended Kalman filter and extends it to work with a magnetic-field-based odometry-aided inertial navigation system. The proposed method is evaluated using simulation and real-world data, showing that (i) the system observability properties are preserved, (ii) the estimation accuracy increases, and (iii) the perceived uncertainty calculated by the EKF is more consistent with the true uncertainty of the filter estimates.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Optical-computing-enabled Network: A New Dawn for Optical-layer Intelligence?
Authors:
Dao Thanh Hai,
Minh Nguyen,
Isaac Woungang
Abstract:
Inspired by the renaissance of optical computing recently, this poster presents a disruptive outlook on the possibility of seamless integration between optical communications and optical computing infrastructures, paving the way for achieving optical-layer intelligence and consequently boosting the capacity efficiency. This entails a paradigm shift in optical node architecture from the currently u…
▽ More
Inspired by the renaissance of optical computing recently, this poster presents a disruptive outlook on the possibility of seamless integration between optical communications and optical computing infrastructures, paving the way for achieving optical-layer intelligence and consequently boosting the capacity efficiency. This entails a paradigm shift in optical node architecture from the currently used optical-bypass to a novel one, entitled, optical-computing-enabled mode, where in addition to the traditional add-drop and cross-connect functionalities, optical nodes are upgraded to account for optical-computing capabilities between the lightpath entities directly at the optical layer. A preliminary study focusing on the optical aggregation operation is examined and early simulation results indicate a promising spectral saving enabled by the optical-computing-enabled mode compared with the optical-bypass one.
△ Less
Submitted 31 May, 2024;
originally announced June 2024.