subscribe to arXiv mailings

Signature of Orbital Driven Finite Momentum Pairing in a 3D Ising Superconductor

Authors: F. Z. Yang, H. D. Zhang, Saswata Mandal, F. Y. Meng, G. Fabbris, A. Said, P. Mercado Lozano, A. Rajapitamahuni, E. Vescovo, C. Nelson, S. Lin, Y. Park, E. M. Clements, T. Z. Ward, H. -N. Lee, H. C. Lei, C. X. Liu, H. Miao

Abstract: The finite momentum superconducting pairing states (FMPs), where Cooper pairs carry non-zero momentum, are believed to give rise to exotic physical phenomena including the pseudogap phase of cuprate high-Tc superconductors and Majorana fermions in topological superconductivity. FMPs can emerge in intertwined electronic liquids with strong spin-spin interactions or be induced by lifting the spin de… ▽ More The finite momentum superconducting pairing states (FMPs), where Cooper pairs carry non-zero momentum, are believed to give rise to exotic physical phenomena including the pseudogap phase of cuprate high-Tc superconductors and Majorana fermions in topological superconductivity. FMPs can emerge in intertwined electronic liquids with strong spin-spin interactions or be induced by lifting the spin degeneracy under magnetic field as originally proposed by Fulde-Ferrell and Larkin-Ovchinnikov. In quantum materials with strong Ising-type spin-orbit coupling, such as the 2D transition metal dichalcogenides (TMDs), the spin degree of freedom is frozen enabling novel orbital driven FMPs via magnetoelectric effect. While evidence of orbital driven FMPs has been revealed in bilayer TMDs, its realization in 3D bulk materials remains an unresolved challenge. Here we report experimental signatures of FMP in a locally noncentrosymmetric bulk superconductor 4Hb-TaS2. Using hard X-ray diffraction and angle-resolved photoemission spectroscopy, we reveal unusual 2D chiral charge density wave (CDW) and weak interlayer hopping in 4Hb-TaS2. Below the superconducting transition temperature, the upper critical field, Hc2, linearly increases via decreasing temperature, and well exceeds the Pauli limit, thus establishing the dominant orbital pair-breaking mechanism. Remarkably, we discover a field-induced superconductivity-to-superconductivity transition that breaks continuous rotational symmetry of the s-wave uniform pairing in the Bardeen-Cooper-Schrieffer theory down to the six-fold rotation symmetry. Combining with a Ginzburg-Landau free energy analysis that incorporates magnetoelectric effect, our observations provide strong evidence of orbital driven FMP in the 3D quantum heterostructure 4Hb-TaS2. △ Less

Submitted 14 July, 2024; originally announced July 2024.

arXiv:2407.10339 [pdf, other]

Supernova Pointing Capabilities of DUNE

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1340 additional authors not shown)

Abstract: The determination of the direction of a stellar core collapse via its neutrino emission is crucial for the identification of the progenitor for a multimessenger follow-up. A highly effective method of reconstructing supernova directions within the Deep Underground Neutrino Experiment (DUNE) is introduced. The supernova neutrino pointing resolution is studied by simulating and reconstructing electr… ▽ More The determination of the direction of a stellar core collapse via its neutrino emission is crucial for the identification of the progenitor for a multimessenger follow-up. A highly effective method of reconstructing supernova directions within the Deep Underground Neutrino Experiment (DUNE) is introduced. The supernova neutrino pointing resolution is studied by simulating and reconstructing electron-neutrino charged-current absorption on $^{40}$Ar and elastic scattering of neutrinos on electrons. Procedures to reconstruct individual interactions, including a newly developed technique called ``brems flipping'', as well as the burst direction from an ensemble of interactions are described. Performance of the burst direction reconstruction is evaluated for supernovae happening at a distance of 10 kpc for a specific supernova burst flux model. The pointing resolution is found to be 3.4 degrees at 68% coverage for a perfect interaction-channel classification and a fiducial mass of 40 kton, and 6.6 degrees for a 10 kton fiducial mass respectively. Assuming a 4% rate of charged-current interactions being misidentified as elastic scattering, DUNE's burst pointing resolution is found to be 4.3 degrees (8.7 degrees) at 68% coverage. △ Less

Submitted 14 July, 2024; originally announced July 2024.

Comments: 25 pages, 16 figures

Report number: FERMILAB-PUB-24-0319-LBNF

arXiv:2407.09911 [pdf, other]

SensEmo: Enabling Affective Learning through Real-time Emotion Recognition with Smartwatches

Authors: Kushan Choksi, Hongkai Chen, Karan Joshi, Sukrutha Jade, Shahriar Nirjon, Shan Lin

Abstract: Recent research has demonstrated the capability of physiological signals to infer both user emotional and attention responses. This presents an opportunity for leveraging widely available physiological sensors in smartwatches, to detect real-time emotional cues in users, such as stress and excitement. In this paper, we introduce SensEmo, a smartwatch-based system designed for affective learning. S… ▽ More Recent research has demonstrated the capability of physiological signals to infer both user emotional and attention responses. This presents an opportunity for leveraging widely available physiological sensors in smartwatches, to detect real-time emotional cues in users, such as stress and excitement. In this paper, we introduce SensEmo, a smartwatch-based system designed for affective learning. SensEmo utilizes multiple physiological sensor data, including heart rate and galvanic skin response, to recognize a student's motivation and concentration levels during class. This recognition is facilitated by a personalized emotion recognition model that predicts emotional states based on degrees of valence and arousal. With real-time emotion and attention feedback from students, we design a Markov decision process-based algorithm to enhance student learning effectiveness and experience by by offering suggestions to the teacher regarding teaching content and pacing. We evaluate SensEmo with 22 participants in real-world classroom environments. Evaluation results show that SensEmo recognizes student emotion with an average of 88.9% accuracy. More importantly, SensEmo assists students to achieve better online learning outcomes, e.g., an average of 40.0% higher grades in quizzes, over the traditional learning without student emotional feedback. △ Less

Submitted 13 July, 2024; originally announced July 2024.

Comments: 7 pages, 7 figures, 2 tables. IEEE MASS 2024

ACM Class: C.3.3; J.3.2; J.4.2

arXiv:2407.08561 [pdf, other]

MapLocNet: Coarse-to-Fine Feature Registration for Visual Re-Localization in Navigation Maps

Authors: Hang Wu, Zhenghao Zhang, Siyuan Lin, Xiangru Mu, Qiang Zhao, Ming Yang, Tong Qin

Abstract: Robust localization is the cornerstone of autonomous driving, especially in challenging urban environments where GPS signals suffer from multipath errors. Traditional localization approaches rely on high-definition (HD) maps, which consist of precisely annotated landmarks. However, building HD map is expensive and challenging to scale up. Given these limitations, leveraging navigation maps has eme… ▽ More Robust localization is the cornerstone of autonomous driving, especially in challenging urban environments where GPS signals suffer from multipath errors. Traditional localization approaches rely on high-definition (HD) maps, which consist of precisely annotated landmarks. However, building HD map is expensive and challenging to scale up. Given these limitations, leveraging navigation maps has emerged as a promising low-cost alternative for localization. Current approaches based on navigation maps can achieve highly accurate localization, but their complex matching strategies lead to unacceptable inference latency that fails to meet the real-time demands. To address these limitations, we propose a novel transformer-based neural re-localization method. Inspired by image registration, our approach performs a coarse-to-fine neural feature registration between navigation map and visual bird's-eye view features. Our method significantly outperforms the current state-of-the-art OrienterNet on both the nuScenes and Argoverse datasets, which is nearly 10%/20% localization accuracy and 30/16 FPS improvement on single-view and surround-view input settings, separately. We highlight that our research presents an HD-map-free localization method for autonomous driving, offering cost-effective, reliable, and scalable performance in challenging driving environments. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: IROS 2024 (Oral)

arXiv:2407.08526 [pdf, other]

BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight

Authors: Hang Wu, Zhenghao Zhang, Siyuan Lin, Tong Qin, Jin Pan, Qiang Zhao, Chunjing Xu, Ming Yang

Abstract: Bird's-eye-view (BEV) representation is crucial for the perception function in autonomous driving tasks. It is difficult to balance the accuracy, efficiency and range of BEV representation. The existing works are restricted to a limited perception range within 50 meters. Extending the BEV representation range can greatly benefit downstream tasks such as topology reasoning, scene understanding, and… ▽ More Bird's-eye-view (BEV) representation is crucial for the perception function in autonomous driving tasks. It is difficult to balance the accuracy, efficiency and range of BEV representation. The existing works are restricted to a limited perception range within 50 meters. Extending the BEV representation range can greatly benefit downstream tasks such as topology reasoning, scene understanding, and planning by offering more comprehensive information and reaction time. The Standard-Definition (SD) navigation maps can provide a lightweight representation of road structure topology, characterized by ease of acquisition and low maintenance costs. An intuitive idea is to combine the close-range visual information from onboard cameras with the beyond line-of-sight (BLOS) environmental priors from SD maps to realize expanded perceptual capabilities. In this paper, we propose BLOS-BEV, a novel BEV segmentation model that incorporates SD maps for accurate beyond line-of-sight perception, up to 200m. Our approach is applicable to common BEV architectures and can achieve excellent results by incorporating information derived from SD maps. We explore various feature fusion schemes to effectively integrate the visual BEV representations and semantic features from the SD map, aiming to leverage the complementary information from both sources optimally. Extensive experiments demonstrate that our approach achieves state-of-the-art performance in BEV segmentation on nuScenes and Argoverse benchmark. Through multi-modal inputs, BEV segmentation is significantly enhanced at close ranges below 50m, while also demonstrating superior performance in long-range scenarios, surpassing other methods by over 20% mIoU at distances ranging from 50-200m. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: IEEE IV 2024

arXiv:2407.07894 [pdf, other]

Quantum metric induced quantum Hall conductance inversion and reentrant transition in fractional Chern insulators

Authors: Ang-Kun Wu, Siddhartha Sarkar, Xiaohan Wan, Kai Sun, Shi-Zeng Lin

Abstract: The quantum metric of single-particle wave functions in topological flatbands plays a crucial role in determining the stability of fractional Chern insulating (FCI) states. Here, we unravel that the quantum metric causes the many-body Chern number of the FCI states to deviate sharply from the expected value associated with partial filling of the single-particle topological flatband. Furthermore, t… ▽ More The quantum metric of single-particle wave functions in topological flatbands plays a crucial role in determining the stability of fractional Chern insulating (FCI) states. Here, we unravel that the quantum metric causes the many-body Chern number of the FCI states to deviate sharply from the expected value associated with partial filling of the single-particle topological flatband. Furthermore, the variation of the quantum metric in momentum space induces band dispersion through interactions, affecting the stability of the FCI states. This causes a reentrant transition into the Fermi liquid from the FCI phase as the interaction strength increases. △ Less

Submitted 10 July, 2024; originally announced July 2024.

Comments: 18 pages, 10 figures

arXiv:2407.06091 [pdf, other]

Light nuclei photoproduction in relativistic heavy ion ultraperipheral collisions

Authors: Jin-Yu Hu, Shuo Lin, Shi Pu, Qun Wang

Abstract: We have investigated light nuclei pair photoproduction in relativistic heavy ion ultraperipheral collisions. As a first attempt, we employ our previously developed quantum electrodynamics model, which incorporates a wave-packet description of initial nuclei, to compute the cross section for proton-antiproton pair photoproduction. The effective vertex for the photon and proton interaction is chosen… ▽ More We have investigated light nuclei pair photoproduction in relativistic heavy ion ultraperipheral collisions. As a first attempt, we employ our previously developed quantum electrodynamics model, which incorporates a wave-packet description of initial nuclei, to compute the cross section for proton-antiproton pair photoproduction. The effective vertex for the photon and proton interaction is chosen based on studies of two-photon exchange effects in hadron physics. We present the transverse momentum, invariant mass, and azimuthal angle distributions of proton-antiproton pairs at $\sqrt{s_{NN}}=200$ GeV in Au+Au ultraperipheral collisions. We observe a $\cos(2φ)$ modulation and an almost negligible $\cos(4φ)$ modulation in the azimuthal angle distribution. Our studies helps us better understand the matter generated by light. △ Less

Submitted 8 July, 2024; originally announced July 2024.

Comments: 6 pages, 3 figures

arXiv:2407.05036 [pdf, other]

Enhance the Robustness of Text-Centric Multimodal Alignments

Authors: Ting-Yu Yen, Yun-Da Tsai, Keng-Te Liao, Shou-De Lin

Abstract: Converting different modalities into general text, serving as input prompts for large language models (LLMs), is a common method to align multimodal models when there is limited pairwise data. This text-centric approach leverages the unique properties of text as a modality space, transforming diverse inputs into a unified textual representation. This enables downstream models to effectively interp… ▽ More Converting different modalities into general text, serving as input prompts for large language models (LLMs), is a common method to align multimodal models when there is limited pairwise data. This text-centric approach leverages the unique properties of text as a modality space, transforming diverse inputs into a unified textual representation. This enables downstream models to effectively interpret various modal inputs. This study assesses the quality and robustness of multimodal representations in the presence of missing entries, noise, or absent modalities, revealing that current text-centric alignment methods compromise downstream robustness. To address this issue, we propose a new text-centric approach that achieves superior robustness compared to previous methods across various modalities in different settings. Our findings highlight the potential of this approach to enhance the robustness and adaptability of multimodal representations, offering a promising solution for dynamic and real-world applications. △ Less

Submitted 6 July, 2024; originally announced July 2024.

arXiv:2407.03566 [pdf, ps, other]

Stacked Intelligent Metasurfaces for Wireless Sensing and Communication: Applications and Challenges

Authors: Hao Liu, Jiancheng An, Xing Jia, Shining Lin, Xianghao Yao, Lu Gan, Bruno Clerckx, Chau Yuen, Mehdi Bennis, Mérouane Debbah

Abstract: The rapid advancement of wireless communication technologies has precipitated an unprecedented demand for high data rates, extremely low latency, and ubiquitous connectivity. In order to achieve these goals, stacked intelligent metasurfaces (SIM) has been developed as a novel solution to perform advanced signal processing tasks directly in the electromagnetic wave domain, thus achieving ultra-fast… ▽ More The rapid advancement of wireless communication technologies has precipitated an unprecedented demand for high data rates, extremely low latency, and ubiquitous connectivity. In order to achieve these goals, stacked intelligent metasurfaces (SIM) has been developed as a novel solution to perform advanced signal processing tasks directly in the electromagnetic wave domain, thus achieving ultra-fast computing speed and reducing hardware complexity. This article provides an overview of the SIM technology by discussing its hardware architectures, advantages, and potential applications for wireless sensing and communication. Specifically, we explore the utilization of SIMs in enabling wave-domain beamforming, channel modeling and estimation in SIM-assisted communication systems. Furthermore, we elaborate on the potential of utilizing a SIM to build a hybrid optical-electronic neural network (HOENN) and demonstrate its efficacy by examining two case studies: disaster monitoring and direction-of-arrival estimation. Finally, we identify key implementation challenges, including practical hardware imperfections, efficient SIM configuration for realizing wave-domain signal processing, and performance analysis to motivate future research on this important and far-reaching topic. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: 8 pages, 5 figures, 1 table

arXiv:2407.03415 [pdf, other]

Theory of quasiparticle interference in Kitaev quantum spin liquids

Authors: Ammar Jahin, Hao Zhang, Gábor B. Halász, Shi-Zeng Lin

Abstract: We study quasiparticle interference (QPI) in the Kitaev quantum spin liquid (QSL) for electrons tunneling into the QSL. The local tunneling conductance around a spin vacancy or localized vison reveals unique features associated with fractionalized Majorana fermions, chargons, and visons. In certain parameter regimes, the single-spinon density of states and momentum dispersion can both be directly… ▽ More We study quasiparticle interference (QPI) in the Kitaev quantum spin liquid (QSL) for electrons tunneling into the QSL. The local tunneling conductance around a spin vacancy or localized vison reveals unique features associated with fractionalized Majorana fermions, chargons, and visons. In certain parameter regimes, the single-spinon density of states and momentum dispersion can both be directly extracted from the tunneling conductance. Our results suggest that QPI is a promising tool for identifying the Kitaev QSL and its fractionalized excitations. △ Less

Submitted 3 July, 2024; originally announced July 2024.

arXiv:2407.01695 [pdf, ps, other]

Semifinite von Neumann algebras in gauge theory and gravity

Authors: Shadi Ali Ahmad, Marc S. Klinger, Simon Lin

Abstract: von Neumann algebras have been playing an increasingly important role in the context of gauge theories and gravity. The crossed product presents a natural method for implementing constraints through the commutation theorem, rendering it a useful tool for constructing gauge invariant algebras. The crossed product of a Type III algebra with its modular automorphism group is semifinite, which means t… ▽ More von Neumann algebras have been playing an increasingly important role in the context of gauge theories and gravity. The crossed product presents a natural method for implementing constraints through the commutation theorem, rendering it a useful tool for constructing gauge invariant algebras. The crossed product of a Type III algebra with its modular automorphism group is semifinite, which means that the crossed product regulates divergences in local quantum field theories. In this letter, we find a sufficient condition for the semifiniteness of the crossed product of a type III algebra with any locally compact group containing the modular automorphism group. Our condition surprisingly implies the centrality of the modular flow in the symmetry group, and we provide evidence for the necessity of this condition. Under these conditions, we construct an associated trace which computes physical expectation values. We comment on the importance of this result and and its implications for subregion physics in gauge theory and gravity. △ Less

Submitted 1 July, 2024; originally announced July 2024.

Comments: 10 pages

arXiv:2407.01457 [pdf, other]

Quantum Nonlinear Acoustic Hall Effect and Inverse Acoustic Faraday Effect in Dirac Insulators

Authors: Ying Su, Alexander V. Balatsky, Shi-Zeng Lin

Abstract: We propose to realize the quantum nonlinear Hall effect and the inverse Faraday effect through the acoustic wave in a time-reversal invariant but inversion broken Dirac insulator. We focus on the acoustic frequency much lower than the Dirac gap such that the interband transition is suppressed and these effects arise solely from the intrinsic valley-contrasting band topology. The corresponding acou… ▽ More We propose to realize the quantum nonlinear Hall effect and the inverse Faraday effect through the acoustic wave in a time-reversal invariant but inversion broken Dirac insulator. We focus on the acoustic frequency much lower than the Dirac gap such that the interband transition is suppressed and these effects arise solely from the intrinsic valley-contrasting band topology. The corresponding acoustoelectric conductivity and magnetoacoustic susceptibility are both proportional to the quantized valley Chern number and independent of the quasiparticle lifetime. The linear and nonlinear components of the longitudinal and transverse topological currents can be tuned by adjusting the polarization and propagation directions of the surface acoustic wave. The static magnetization generated by a circularly polarized acoustic wave scales linearly with the acoustic frequency as well as the strain-induced charge density. Our results unveil a quantized nonlinear topological acoustoelectric response of gapped Dirac materials, like hBN and transition-metal dichalcogenide, paving the way toward room-temperature acoustoelectric devices due to their large band gaps. △ Less

Submitted 1 July, 2024; originally announced July 2024.

Comments: 6 pages, 2 figures in main text, and 7 pages, 5 figures, 2 tables in supplement

arXiv:2407.00772 [pdf, other]

Core-level signature of long-range density-wave order and short-range excitonic correlations probed by attosecond broadband spectroscopy

Authors: Alfred Zong, Sheng-Chih Lin, Shunsuke A. Sato, Emma Berger, Bailey R. Nebgen, Marcus Hui, B. Q. Lv, Yun Cheng, Wei Xia, Yanfeng Guo, Dao Xiang, Michael W. Zuerch

Abstract: Advances in attosecond core-level spectroscopies have successfully unlocked the fastest dynamics involving high-energy electrons. Yet, these techniques are not conventionally regarded as an appropriate probe for low-energy quasiparticle interactions that govern the ground state of quantum materials, nor for studying long-range order because of their limited sensitivity to local charge environments… ▽ More Advances in attosecond core-level spectroscopies have successfully unlocked the fastest dynamics involving high-energy electrons. Yet, these techniques are not conventionally regarded as an appropriate probe for low-energy quasiparticle interactions that govern the ground state of quantum materials, nor for studying long-range order because of their limited sensitivity to local charge environments. Here, by employing a unique cryogenic attosecond beamline, we identified clear core-level signatures of long-range charge-density-wave (CDW) formation in a quasi-2D excitonic insulator candidate, even though equilibrium photoemission and absorption measurements of the same core levels showed no spectroscopic singularity at the phase transition. Leveraging the high time resolution and intrinsic sensitivity to short-range charge excitations in attosecond core-level absorption, we observed compelling time-domain evidence for excitonic correlations in the normal-state of the material, whose presence has been subjected to a long-standing debate in equilibrium experiments because of interfering phonon fluctuations in a similar part of the phase space. Our findings support the scenario that short-range excitonic fluctuations prelude long-range order formation in the ground state, providing important insights in the mechanism of exciton condensation in a quasi-low-dimensional system. These results further demonstrate the importance of a simultaneous access to long- and short-range order with underlying dynamical processes spanning a multitude of time- and energy-scales, making attosecond spectroscopy an indispensable tool for both understanding the equilibrium phase diagram and for discovering novel, nonequilibrium states in strongly correlated materials. △ Less

Submitted 16 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

arXiv:2407.00009 [pdf, other]

An Open-Source Fast Parallel Routing Approach for Commercial FPGAs

Authors: Xinshi Zang, Wenhao Lin, Shiju Lin, Jinwei Liu, Evangeline F. Y. Young

Abstract: In the face of escalating complexity and size of contemporary FPGAs and circuits, routing emerges as a pivotal and time-intensive phase in FPGA compilation flows. In response to this challenge, we present an open-source parallel routing methodology designed to expedite routing procedures for commercial FPGAs. Our approach introduces a novel recursive partitioning ternary tree to augment the parall… ▽ More In the face of escalating complexity and size of contemporary FPGAs and circuits, routing emerges as a pivotal and time-intensive phase in FPGA compilation flows. In response to this challenge, we present an open-source parallel routing methodology designed to expedite routing procedures for commercial FPGAs. Our approach introduces a novel recursive partitioning ternary tree to augment the parallelism of multi-net routing. Additionally, we propose a hybrid updating strategy for congestion coefficients within the routing cost function to accelerate congestion resolution in negotiation-based routing algorithms. Evaluation on public benchmarks from the FPGA24 routing contest demonstrates the efficacy of our parallel router. It achieves a 2x speedup compared to the academic serial router RWRoute. Furthermore, when compared to the industry-standard tool Vivado, our approach not only delivers a 2x acceleration but also yields a notable 31% enhancement in critical-path wirelength. △ Less

Submitted 25 April, 2024; originally announced July 2024.

arXiv:2406.19394 [pdf, other]

HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection

Authors: Liujuan Cao, Jianghang Lin, Zebo Hong, Yunhang Shen, Shaohui Lin, Chao Chen, Rongrong Ji

Abstract: Most WSOD methods rely on traditional object proposals to generate candidate regions and are confronted with unstable training, which easily gets stuck in a poor local optimum. In this paper, we introduce a unified, high-capacity weakly supervised object detection (WSOD) network called HUWSOD, which utilizes a comprehensive self-training framework without needing external modules or additional sup… ▽ More Most WSOD methods rely on traditional object proposals to generate candidate regions and are confronted with unstable training, which easily gets stuck in a poor local optimum. In this paper, we introduce a unified, high-capacity weakly supervised object detection (WSOD) network called HUWSOD, which utilizes a comprehensive self-training framework without needing external modules or additional supervision. HUWSOD innovatively incorporates a self-supervised proposal generator and an autoencoder proposal generator with a multi-rate resampling pyramid to replace traditional object proposals, enabling end-to-end WSOD training and inference. Additionally, we implement a holistic self-training scheme that refines detection scores and coordinates through step-wise entropy minimization and consistency-constraint regularization, ensuring consistent predictions across stochastic augmentations of the same image. Extensive experiments on PASCAL VOC and MS COCO demonstrate that HUWSOD competes with state-of-the-art WSOD methods, eliminating the need for offline proposals and additional data. The peak performance of HUWSOD approaches that of fully-supervised Faster R-CNN. Our findings also indicate that randomly initialized boxes, although significantly different from well-designed offline object proposals, are effective for WSOD training. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.17632 [pdf, other]

Transient spin modes from relaxational axial kinetic theory

Authors: Shu Lin, Haiqin Tang

Abstract: We study the dynamics of spin mode by solving the axial kinetic equations under the relaxation time approximation in the presence of dissipative sources. We find transient spin modes in response to electric field with spacetime inhomogeneity, fluid acceleration and shear. To the lowest order in spatial gradient $k$, we find the responses to electric field and acceleration can be interpreted as ret… ▽ More We study the dynamics of spin mode by solving the axial kinetic equations under the relaxation time approximation in the presence of dissipative sources. We find transient spin modes in response to electric field with spacetime inhomogeneity, fluid acceleration and shear. To the lowest order in spatial gradient $k$, we find the responses to electric field and acceleration can be interpreted as retarded response to time variations of magnetic field and vorticity respectively. The response to shear can lead to a global spin polarization suppressed by powers of $k$. Beyond lowest order, the responses to all three sources are non-local with branch cut in the dispersions. We argue that the non-locality is a consequence of the quasi-particle picture underlying the kinetic description. We also analyze the mixing between spin modes and shear modes alone using the response we have obtained, finding the spin modes split into three with two of them developing oscillatory behavior. The correction to damping dispersions occur at $O(k^{4/3})$, which is parametrically larger than the existing one due to mixing of spin modes with shear and vorticity modes. It also indicates possible breakdown of gradient expansion. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: 20 pages, 1 figure

arXiv:2406.16769 [pdf, ps, other]

Determination of dark matter distribution in Ursa Major III and constraints on dark matter annihilation

Authors: Yi Zhao, Xiao-Jun Bi, Su-Jie Lin, Peng-Fei Yin

Abstract: The recently discovered satellite dwarf galaxy Ursa Major III provides a promising opportunity to explore the signatures resulting from dark matter (DM) annihilation, due to its proximity and large J-factor. Owing to the absence of an excess of $γ$-ray signatures originating from Ursa Major III, observations of $γ$-rays, such as those from Fermi-LAT, can be utilized to set constraints on the DM an… ▽ More The recently discovered satellite dwarf galaxy Ursa Major III provides a promising opportunity to explore the signatures resulting from dark matter (DM) annihilation, due to its proximity and large J-factor. Owing to the absence of an excess of $γ$-ray signatures originating from Ursa Major III, observations of $γ$-rays, such as those from Fermi-LAT, can be utilized to set constraints on the DM annihilation cross section. In this study, we determine the DM density profile, and consider the relationship between DM density and velocity dispersion at different locations within Ursa Major III through Jeans analysis. We calculate the J-factor of Ursa Major III for s-wave annihilation, along with the effective J-factors for p-wave and Sommerfeld enhanced annihilation scenarios. Utilizing these derived J-factors, we set stringent constraints on DM annihilation cross sections in three scenarios. Given the substantial impact of member star identification on the J-factor of Ursa Major III, we further calculate J-factors with the condition of excluding the largest velocity outlier. Our analysis reveals a notable reduction in the median value and an increase in the deviation of J-factors, thereby leading to considerably weaker constraints. △ Less

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.16437 [pdf, other]

Theory on Mixture-of-Experts in Continual Learning

Authors: Hongbo Li, Sen Lin, Lingjie Duan, Yingbin Liang, Ness B. Shroff

Abstract: Continual learning (CL) has garnered significant attention because of its ability to adapt to new tasks that arrive over time. Catastrophic forgetting (of old tasks) has been identified as a major issue in CL, as the model adapts to new tasks. The Mixture-of-Experts (MoE) model has recently been shown to effectively mitigate catastrophic forgetting in CL, by employing a gating network to sparsify… ▽ More Continual learning (CL) has garnered significant attention because of its ability to adapt to new tasks that arrive over time. Catastrophic forgetting (of old tasks) has been identified as a major issue in CL, as the model adapts to new tasks. The Mixture-of-Experts (MoE) model has recently been shown to effectively mitigate catastrophic forgetting in CL, by employing a gating network to sparsify and distribute diverse tasks among multiple experts. However, there is a lack of theoretical analysis of MoE and its impact on the learning performance in CL. This paper provides the first theoretical results to characterize the impact of MoE in CL via the lens of overparameterized linear regression tasks. We establish the benefit of MoE over a single expert by proving that the MoE model can diversify its experts to specialize in different tasks, while its router learns to select the right expert for each task and balance the loads across all experts. Our study further suggests an intriguing fact that the MoE in CL needs to terminate the update of the gating network after sufficient training rounds to attain system convergence, which is not needed in the existing MoE studies that do not consider the continual task arrival. Furthermore, we provide explicit expressions for the expected forgetting and overall generalization error to characterize the benefit of MoE in the learning performance in CL. Interestingly, adding more experts requires additional rounds before convergence, which may not enhance the learning performance. Finally, we conduct experiments on both synthetic and real datasets to extend these insights from linear models to deep neural networks (DNNs), which also shed light on the practical algorithm design for MoE in CL. △ Less

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.15854 [pdf, other]

Multiple misaligned outflows and warped accretion flows in the proto-multiple system Per-emb-8 and 55

Authors: Shang-Jing Lin, Hsi-Wei Yen, Shih-Ping Lai

Abstract: To investigate the formation process of multiple systems, we have analyzed the ALMA archival data of the 1.3 mm continuum, $^{12}$CO (2-1) and C$^{18}$O (2-1) emission in a proto-multiple system consisting of a Class 0 protostar Per-emb-8 and a Class I protobinary Per-emb-55 $A$ and $B$. The 1.3 mm continuum emission is likely to primarily trace their protostellar disks, and the Keplerian disk rot… ▽ More To investigate the formation process of multiple systems, we have analyzed the ALMA archival data of the 1.3 mm continuum, $^{12}$CO (2-1) and C$^{18}$O (2-1) emission in a proto-multiple system consisting of a Class 0 protostar Per-emb-8 and a Class I protobinary Per-emb-55 $A$ and $B$. The 1.3 mm continuum emission is likely to primarily trace their protostellar disks, and the Keplerian disk rotation is observed in Per-emb-8 and Per-emb-55 $A$ in the emission lines. In Per-emb-8, we identify two arm-like structures with a length of $\sim$ 1000 au connecting the eastern and western of its disk in the continuum and C$^{18}$O emission. Our analysis suggests that these arm-like structures are most likely infalling flows. In the $^{12}$CO emission, we discover a second bipolar outflow associated with Per-emb-8. The two bipolar outflows in Per-emb-8 are possibly launched along the normal axes of the misaligned inner and outer parts of its warped protostellar disk. In Per-emb-55, we find that the red- and blueshifted lobes of its bipolar outflow are misaligned by 90$^\circ$. The presence of the warped disk, multiple misaligned outflows, and asymmetric infalling flows suggest complex dynamics in proto-multiple systems, and these could be related to the tidal interactions between the companions and/or the turbulent environments forming this proto-multiple system. △ Less

Submitted 22 June, 2024; originally announced June 2024.

Comments: 17 pages, 10 figures, accepted by AJ

arXiv:2406.12433 [pdf, other]

LLM-enhanced Reranking in Recommender Systems

Authors: Jingtong Gao, Bo Chen, Xiangyu Zhao, Weiwen Liu, Xiangyang Li, Yichao Wang, Zijian Zhang, Wanyu Wang, Yuyang Ye, Shanru Lin, Huifeng Guo, Ruiming Tang

Abstract: Reranking is a critical component in recommender systems, playing an essential role in refining the output of recommendation algorithms. Traditional reranking models have focused predominantly on accuracy, but modern applications demand consideration of additional criteria such as diversity and fairness. Existing reranking approaches often fail to harmonize these diverse criteria effectively at th… ▽ More Reranking is a critical component in recommender systems, playing an essential role in refining the output of recommendation algorithms. Traditional reranking models have focused predominantly on accuracy, but modern applications demand consideration of additional criteria such as diversity and fairness. Existing reranking approaches often fail to harmonize these diverse criteria effectively at the model level. Moreover, these models frequently encounter challenges with scalability and personalization due to their complexity and the varying significance of different reranking criteria in diverse scenarios. In response, we introduce a comprehensive reranking framework enhanced by LLM, designed to seamlessly integrate various reranking criteria while maintaining scalability and facilitating personalized recommendations. This framework employs a fully connected graph structure, allowing the LLM to simultaneously consider multiple aspects such as accuracy, diversity, and fairness through a coherent Chain-of-Thought (CoT) process. A customizable input mechanism is also integrated, enabling the tuning of the language model's focus to meet specific reranking needs. We validate our approach using three popular public datasets, where our framework demonstrates superior performance over existing state-of-the-art reranking models in balancing multiple criteria. The code for this implementation is publicly available. △ Less

Submitted 20 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.12148 [pdf, other]

Complex variable solution on noncircular and asymmetrical tunnelling embedded by bidirectional conformal mapping incorporating Charge Simulation Method

Authors: Luobin Lin, Fuquan Chen, Changjie Zheng, Shangshun Lin

Abstract: Mechanical issues of noncircular and asymmetrical tunnelling can be estimated using complex variable method with suitable conformal mapping. Exsiting solution schemes of conformal mapping for noncircular tunnel generally need iteration or optimization strategy, and are thereby mathematically complicated. This paper proposes a new bidirectional conformal mapping for deep and shallow tunnels of nonc… ▽ More Mechanical issues of noncircular and asymmetrical tunnelling can be estimated using complex variable method with suitable conformal mapping. Exsiting solution schemes of conformal mapping for noncircular tunnel generally need iteration or optimization strategy, and are thereby mathematically complicated. This paper proposes a new bidirectional conformal mapping for deep and shallow tunnels of noncircular and asymmetrical shapes by incorporating Charge Simulation Method. The solution scheme of this new bidirectional conformal mapping only involves a pair of linear systems, and is therefore logically straight-forward, computationally efficient, and practically easy in coding. New numerical strategies are developed to deal with possible sharp corners of cavity by small arc simulation and densified collocation points. Several numerical examples are presented to illustrate the geometrical usage of the new bidirectional conformal mapping. Furthermore, the new bidirectional conformal mapping is embedded into two complex variable solutions of noncircular and asymmetrical shallow tunnelling in gravitational geomaterial with reasonable far-field displacement. The respective result comparisons with finite element solution and exsiting analytical solution show good agreements, indicating the feasible mechanical usage of the new bidirectional conformal mapping. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 41 pages, 13 figures

arXiv:2406.11442 [pdf]

Layer-dependent electromechanical response in twisted graphene moiré superlattices

Authors: Hanhao Zhang, Yuanhao Wei, Yuhao Li, Shengsheng Lin, Jiarui Wang, Takashi Taniguchi, Kenji Watanabe, Jiangyu Li, Yi Shi, Xinran Wang, Yan Shi, Zaiyao Fei

Abstract: The coupling of mechanical deformation and electrical stimuli at the nanoscale has been a subject of intense investigation in the realm of materials science. Recently, twisted van der Waals (vdW) materials have emerged as a platform to explore exotic quantum states. These states are intimately tied to the formation of moiré superlattices, which can be visualized directly exploiting the electromech… ▽ More The coupling of mechanical deformation and electrical stimuli at the nanoscale has been a subject of intense investigation in the realm of materials science. Recently, twisted van der Waals (vdW) materials have emerged as a platform to explore exotic quantum states. These states are intimately tied to the formation of moiré superlattices, which can be visualized directly exploiting the electromechanical response. However, the origin of the response, even in twisted bilayer graphene (tBLG), remains unsettled. Here, employing lateral piezoresponse force microscopy (LPFM), we investigate the electromechanical responses of marginally twisted graphene moiré superlattices with different layer thicknesses. We observe distinct LPFM amplitudes and spatial profiles in tBLG and twisted monolayer-bilayer graphene (tMBG), exhibiting effective in-plane piezoelectric coefficients of 0.05 pm/V and 0.35 pm/V, respectively. Force tuning experiments further underscore a marked divergence in their responses. The contrasting behaviors suggest different electromechanical couplings in tBLG and tMBG. In tBLG, the response near the domain walls is attributed to the flexoelectric effect, while in tMBG, the behaviors can be comprehended within the context of piezoelectric effect. Our results not only provide insights to electromechanical and corporative effects in twisted vdW materials with different stacking symmetries, but may also show their potential for engineering them at the nanoscale. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.11251 [pdf, other]

Unifying Multimodal Retrieval via Document Screenshot Embedding

Authors: Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin

Abstract: In the real world, documents are organized in different formats and varied modalities. Traditional retrieval pipelines require tailored document parsing techniques and content extraction modules to prepare input for indexing. This process is tedious, prone to errors, and has information loss. To this end, we propose Document Screenshot Embedding} (DSE), a novel retrieval paradigm that regards docu… ▽ More In the real world, documents are organized in different formats and varied modalities. Traditional retrieval pipelines require tailored document parsing techniques and content extraction modules to prepare input for indexing. This process is tedious, prone to errors, and has information loss. To this end, we propose Document Screenshot Embedding} (DSE), a novel retrieval paradigm that regards document screenshots as a unified input format, which does not require any content extraction preprocess and preserves all the information in a document (e.g., text, image and layout). DSE leverages a large vision-language model to directly encode document screenshots into dense representations for retrieval. To evaluate our method, we first craft the dataset of Wiki-SS, a 1.3M Wikipedia web page screenshots as the corpus to answer the questions from the Natural Questions dataset. In such a text-intensive document retrieval setting, DSE shows competitive effectiveness compared to other text retrieval methods relying on parsing. For example, DSE outperforms BM25 by 17 points in top-1 retrieval accuracy. Additionally, in a mixed-modality task of slide retrieval, DSE significantly outperforms OCR text retrieval methods by over 15 points in nDCG@10. These experiments show that DSE is an effective document retrieval paradigm for diverse types of documents. Model checkpoints, code, and Wiki-SS collection will be released. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.10961 [pdf, other]

Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP

Authors: Shuyang Lin, Tong Jia, Hao Wang, Bowen Ma, Mingyuan Li, Dongyue Chen

Abstract: X-ray prohibited item detection is an essential component of security check and categories of prohibited item are continuously increasing in accordance with the latest laws. Previous works all focus on close-set scenarios, which can only recognize known categories used for training and often require time-consuming as well as labor-intensive annotations when learning novel categories, resulting in… ▽ More X-ray prohibited item detection is an essential component of security check and categories of prohibited item are continuously increasing in accordance with the latest laws. Previous works all focus on close-set scenarios, which can only recognize known categories used for training and often require time-consuming as well as labor-intensive annotations when learning novel categories, resulting in limited real-world applications. Although the success of vision-language models (e.g. CLIP) provides a new perspectives for open-set X-ray prohibited item detection, directly applying CLIP to X-ray domain leads to a sharp performance drop due to domain shift between X-ray data and general data used for pre-training CLIP. To address aforementioned challenges, in this paper, we introduce distillation-based open-vocabulary object detection (OVOD) task into X-ray security inspection domain by extending CLIP to learn visual representations in our specific X-ray domain, aiming to detect novel prohibited item categories beyond base categories on which the detector is trained. Specifically, we propose X-ray feature adapter and apply it to CLIP within OVOD framework to develop OVXD model. X-ray feature adapter containing three adapter submodules of bottleneck architecture, which is simple but can efficiently integrate new knowledge of X-ray domain with original knowledge, further bridge domain gap and promote alignment between X-ray images and textual concepts. Extensive experiments conducted on PIXray and PIDray datasets demonstrate that proposed method performs favorably against other baseline OVOD methods in detecting novel categories in X-ray scenario. It outperforms previous best result by 15.2 AP50 and 1.5 AP50 on PIXray and PIDray with achieving 21.0 AP50 and 27.8 AP50 respectively. △ Less

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2406.10378 [pdf]

An experimental search for an explanation of the difference between beam and bottle neutron lifetime measurements

Authors: M. F. Blatnik, L. S. Blokland, N. Callahan, J. H. Choi, S. Clayton, C. B Cude-Woods, B. W. Filippone, W. R. Fox, E. Fries, P. Geltenbort, F. M. Gonzalez, L. Hayen, K. P. Hickerson, A. T. Holley, T. M. Ito, A. Komives, S Lin, Chen-Yu Liu, M. F. Makela, C. L. Morris, R. Musedinovic, C. M. O'Shaughnessy, R. W. Pattie Jr., J. C. Ramsey, D. J. Salvat , et al. (10 additional authors not shown)

Abstract: The past two decades have yielded several new measurements and reanalysis of older measurements of the neutron lifetime. These have led to a 4.4 standard deviation discrepancy between the most precise measurements of the neutron decay rate producing protons in cold neutron beams and the most precise lifetime measured in neutron storage experiments. Here we publish an analysis of the recently publi… ▽ More The past two decades have yielded several new measurements and reanalysis of older measurements of the neutron lifetime. These have led to a 4.4 standard deviation discrepancy between the most precise measurements of the neutron decay rate producing protons in cold neutron beams and the most precise lifetime measured in neutron storage experiments. Here we publish an analysis of the recently published UCN aimed a searching for an explanation of this difference using the model proposed by Koch and Hummel. △ Less

Submitted 14 June, 2024; originally announced June 2024.

Report number: LA-UR-24-25619

arXiv:2406.10280 [pdf, other]

Transferable Embedding Inversion Attack: Uncovering Privacy Risks in Text Embeddings without Model Queries

Authors: Yu-Hsiang Huang, Yuche Tsai, Hsiang Hsiao, Hong-Yi Lin, Shou-De Lin

Abstract: This study investigates the privacy risks associated with text embeddings, focusing on the scenario where attackers cannot access the original embedding model. Contrary to previous research requiring direct model access, we explore a more realistic threat model by developing a transfer attack method. This approach uses a surrogate model to mimic the victim model's behavior, allowing the attacker t… ▽ More This study investigates the privacy risks associated with text embeddings, focusing on the scenario where attackers cannot access the original embedding model. Contrary to previous research requiring direct model access, we explore a more realistic threat model by developing a transfer attack method. This approach uses a surrogate model to mimic the victim model's behavior, allowing the attacker to infer sensitive information from text embeddings without direct access. Our experiments across various embedding models and a clinical dataset demonstrate that our transfer attack significantly outperforms traditional methods, revealing the potential privacy vulnerabilities in embedding technologies and emphasizing the need for enhanced security measures. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: Accepted at ACL 2024 Main Conference

arXiv:2406.10003 [pdf, other]

Steady state, displacement current and spin polarization for massless fermion in a shear flow

Authors: Shu Lin, Ziyue Wang

Abstract: We consider spin polarization of massless fermions in a shear flow, whose complete contributions contain magnetization current and side-jump current known from collisional chiral kinetic theory. We argue that the side-jump current adopts interpretation of displacement current. We explicitly determine the displacement current contribution in the steady state reached in shear flow for a QED plasma.… ▽ More We consider spin polarization of massless fermions in a shear flow, whose complete contributions contain magnetization current and side-jump current known from collisional chiral kinetic theory. We argue that the side-jump current adopts interpretation of displacement current. We explicitly determine the displacement current contribution in the steady state reached in shear flow for a QED plasma. We find the displacement contribution enhances the magnetization contribution at small and large momenta, but leads to a suppression effect at intermediate momenta. Major differences from previous studies on collisional effect are: (i) the fermions are in the same steady state as the medium rather than being probes; (ii) Compton scattering and pair annihilation are also included in addition to the Coulomb scattering considered before. △ Less

Submitted 16 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

Comments: 20 pages, 4 figures

arXiv:2406.08698 [pdf, other]

Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes… ▽ More In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes of astrophysical $γ$-ray background while large amount of dark matter. By analyzing more than 700 days observational data at LHAASO, no significant dark matter signal from 1 TeV to 1 EeV is detected. Accordingly we derive the most stringent constraints on the ultra-heavy dark matter annihilation cross-section up to EeV. The constraints on the lifetime of dark matter in decay mode are also derived. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 17 pages, 12 figures, accepted by PRL

arXiv:2406.06253 [pdf, other]

PretVM: Predictable, Efficient Virtual Machine for Real-Time Concurrency

Authors: Shaokai Lin, Erling Jellum, Mirco Theile, Tassilo Tanneberger, Binqi Sun, Chadlia Jerad, Ruomu Xu, Guangyu Feng, Christian Menard, Marten Lohstroh, Jeronimo Castrillon, Sanjit Seshia, Edward Lee

Abstract: This paper introduces the Precision-Timed Virtual Machine (PretVM), an intermediate platform facilitating the execution of quasi-static schedules compiled from a subset of programs written in the Lingua Franca (LF) coordination language. The subset consists of those programs that in principle should have statically verifiable and predictable timing behavior. The PretVM provides a schedule with wel… ▽ More This paper introduces the Precision-Timed Virtual Machine (PretVM), an intermediate platform facilitating the execution of quasi-static schedules compiled from a subset of programs written in the Lingua Franca (LF) coordination language. The subset consists of those programs that in principle should have statically verifiable and predictable timing behavior. The PretVM provides a schedule with well-defined worst-case timing bounds. The PretVM provides a clean separation between application logic and coordination logic, yielding more analyzable program executions. Experiments compare the PretVM against the default (more dynamic) LF scheduler and show that it delivers time-accurate deterministic execution. △ Less

Submitted 25 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

arXiv:2406.03455 [pdf, other]

Topological disclination states and charge fractionalization in a non-Hermitian lattice

Authors: Rimi Banerjee, Subhaskar Mandal, Yun Yong Terh, Shuxin Lin, Gui-Geng Liu, Baile Zhang, Y. D. Chong

Abstract: We show that a non-Hermitian lattice with a disclination can host topological disclination states that are induced by on-site gain and loss. The disclination states are inherently non-Hermitian as they do not exist in the limit of zero gain/loss. They arise from charge fractionalization in the non-Hermitian lattice, which we establish using non-Hermitian Wilson loops calculated with biorthogonal p… ▽ More We show that a non-Hermitian lattice with a disclination can host topological disclination states that are induced by on-site gain and loss. The disclination states are inherently non-Hermitian as they do not exist in the limit of zero gain/loss. They arise from charge fractionalization in the non-Hermitian lattice, which we establish using non-Hermitian Wilson loops calculated with biorthogonal products. The model can be realized using an array of optical resonators, with the emergence of the topological disclination states manifesting as an abrupt shift in emission intensity and frequency upon tuning the gain/loss level. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: 7 pages and 4 figures

arXiv:2406.02787 [pdf, other]

Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities

Authors: Wenyue Hua, Kaijie Zhu, Lingyao Li, Lizhou Fan, Shuhang Lin, Mingyu Jin, Haochen Xue, Zelong Li, JinDong Wang, Yongfeng Zhang

Abstract: This study intends to systematically disentangle pure logic reasoning and text understanding by investigating the contrast across abstract and contextualized logical problems from a comprehensive set of domains. We explore whether LLMs demonstrate genuine reasoning capabilities across various domains when the underlying logical structure remains constant. We focus on two main questions (1) Can abs… ▽ More This study intends to systematically disentangle pure logic reasoning and text understanding by investigating the contrast across abstract and contextualized logical problems from a comprehensive set of domains. We explore whether LLMs demonstrate genuine reasoning capabilities across various domains when the underlying logical structure remains constant. We focus on two main questions (1) Can abstract logical problems alone accurately benchmark an LLM's reasoning ability in real-world scenarios, disentangled from contextual support in practical settings? (2) Does fine-tuning LLMs on abstract logic problem generalize to contextualized logic problems and vice versa? To investigate these questions, we focus on standard propositional logic, specifically propositional deductive and abductive logic reasoning. In particular, we construct instantiated datasets for deductive and abductive reasoning with 4 levels of difficulty, encompassing 12 distinct categories or domains based on the categorization of Wikipedia. Our experiments aim to provide insights into disentangling context in logical reasoning and the true reasoning capabilities of LLMs and their generalization potential. The code and dataset are available at: https://github.com/agiresearch/ContextHub. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 22 pages, 9 figures

arXiv:2406.01304 [pdf, other]

CodeR: Issue Resolving with Multi-Agent and Task Graphs

Authors: Dong Chen, Shaoxin Lin, Muhan Zeng, Daoguang Zan, Jian-Gang Wang, Anton Cheshkov, Jun Sun, Hao Yu, Guoliang Dong, Artem Aliev, Jie Wang, Xiao Cheng, Guangtai Liang, Yuchi Ma, Pan Bian, Tao Xie, Qianxiang Wang

Abstract: GitHub issue resolving recently has attracted significant attention from academia and industry. SWE-bench is proposed to measure the performance in resolving issues. In this paper, we propose CodeR, which adopts a multi-agent framework and pre-defined task graphs to Repair & Resolve reported bugs and add new features within code Repository. On SWE-bench lite, CodeR is able to solve 28.33% of issue… ▽ More GitHub issue resolving recently has attracted significant attention from academia and industry. SWE-bench is proposed to measure the performance in resolving issues. In this paper, we propose CodeR, which adopts a multi-agent framework and pre-defined task graphs to Repair & Resolve reported bugs and add new features within code Repository. On SWE-bench lite, CodeR is able to solve 28.33% of issues, when submitting only once for each issue. We examine the performance impact of each design of CodeR and offer insights to advance this research direction. △ Less

Submitted 10 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: https://github.com/NL2Code/CodeR

arXiv:2406.01007 [pdf, other]

Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng , et al. (177 additional authors not shown)

Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive… ▽ More This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive region, the relative $\overlineν_{e}$ rates and energy spectra variation among the near and far detectors gives $\mathrm{sin}^22θ_{13} = 0.0759_{-0.0049}^{+0.0050}$ and $Δm^2_{32} = (2.72^{+0.14}_{-0.15})\times10^{-3}$ eV$^2$ assuming the normal neutrino mass ordering, and $Δm^2_{32} = (-2.83^{+0.15}_{-0.14})\times10^{-3}$ eV$^2$ for the inverted neutrino mass ordering. This estimate of $\sin^2 2θ_{13}$ is consistent with and essentially independent from the one obtained using the capture-on-gadolinium sample at Daya Bay. The combination of these two results yields $\mathrm{sin}^22θ_{13}= 0.0833\pm0.0022$, which represents an 8% relative improvement in precision regarding the Daya Bay full 3158-day capture-on-gadolinium result. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2406.00427 [pdf, other]

You Only Need Less Attention at Each Stage in Vision Transformers

Authors: Shuoxi Zhang, Hanpeng Liu, Stephen Lin, Kun He

Abstract: The advent of Vision Transformers (ViTs) marks a substantial paradigm shift in the realm of computer vision. ViTs capture the global information of images through self-attention modules, which perform dot product computations among patchified image tokens. While self-attention modules empower ViTs to capture long-range dependencies, the computational complexity grows quadratically with the number… ▽ More The advent of Vision Transformers (ViTs) marks a substantial paradigm shift in the realm of computer vision. ViTs capture the global information of images through self-attention modules, which perform dot product computations among patchified image tokens. While self-attention modules empower ViTs to capture long-range dependencies, the computational complexity grows quadratically with the number of tokens, which is a major hindrance to the practical application of ViTs. Moreover, the self-attention mechanism in deep ViTs is also susceptible to the attention saturation issue. Accordingly, we argue against the necessity of computing the attention scores in every layer, and we propose the Less-Attention Vision Transformer (LaViT), which computes only a few attention operations at each stage and calculates the subsequent feature alignments in other layers via attention transformations that leverage the previously calculated attention scores. This novel approach can mitigate two primary issues plaguing traditional self-attention modules: the heavy computational burden and attention saturation. Our proposed architecture offers superior efficiency and ease of implementation, merely requiring matrix multiplications that are highly optimized in contemporary deep learning frameworks. Moreover, our architecture demonstrates exceptional performance across various vision tasks including classification, detection and segmentation. △ Less

Submitted 1 June, 2024; originally announced June 2024.

Comments: CVPR 2024 Camera-Ready; 10 pages, 3 figures

arXiv:2405.21075 [pdf, other]

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Authors: Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun

Abstract: In the quest for artificial general intelligence, Multi-modal Large Language Models (MLLMs) have emerged as a focal point in recent advancements. However, the predominant focus remains on developing their capabilities in static image understanding. The potential of MLLMs in processing sequential visual data is still insufficiently explored, highlighting the absence of a comprehensive, high-quality… ▽ More In the quest for artificial general intelligence, Multi-modal Large Language Models (MLLMs) have emerged as a focal point in recent advancements. However, the predominant focus remains on developing their capabilities in static image understanding. The potential of MLLMs in processing sequential visual data is still insufficiently explored, highlighting the absence of a comprehensive, high-quality assessment of their performance. In this paper, we introduce Video-MME, the first-ever full-spectrum, Multi-Modal Evaluation benchmark of MLLMs in Video analysis. Our work distinguishes from existing benchmarks through four key features: 1) Diversity in video types, spanning 6 primary visual domains with 30 subfields to ensure broad scenario generalizability; 2) Duration in temporal dimension, encompassing both short-, medium-, and long-term videos, ranging from 11 seconds to 1 hour, for robust contextual dynamics; 3) Breadth in data modalities, integrating multi-modal inputs besides video frames, including subtitles and audios, to unveil the all-round capabilities of MLLMs; 4) Quality in annotations, utilizing rigorous manual labeling by expert annotators to facilitate precise and reliable model assessment. 900 videos with a total of 254 hours are manually selected and annotated by repeatedly viewing all the video content, resulting in 2,700 question-answer pairs. With Video-MME, we extensively evaluate various state-of-the-art MLLMs, including GPT-4 series and Gemini 1.5 Pro, as well as open-source image models like InternVL-Chat-V1.5 and video models like LLaVA-NeXT-Video. Our experiments reveal that Gemini 1.5 Pro is the best-performing commercial model, significantly outperforming the open-source models. Our dataset along with these findings underscores the need for further improvements in handling longer sequences and multi-modal data. Project Page: https://video-mme.github.io △ Less

Submitted 16 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

Comments: Project Page: https://video-mme.github.io

arXiv:2405.18497 [pdf, other]

Capacity Results for Non-Ergodic Multi-Modal Broadcast Channels with Controllable Statistics

Authors: Alireza Vahid, Shih-Chun Lin

Abstract: Movable antennas and reconfigurable intelligent surfaces enable a new paradigm in which channel statistics can be controlled and altered. Further, the known trajectory and operation protocol of communication satellites results in networks with predictable statistics. The predictability of future changes results in a non-ergodic model for which the fundamentals are largely unknown. We consider the… ▽ More Movable antennas and reconfigurable intelligent surfaces enable a new paradigm in which channel statistics can be controlled and altered. Further, the known trajectory and operation protocol of communication satellites results in networks with predictable statistics. The predictability of future changes results in a non-ergodic model for which the fundamentals are largely unknown. We consider the canonical two-user broadcast erasure channel in which channel statistics vary at a priori known points. We consider a multi-modal setting with two non-transient modes (whose lengths scale linearly with the blocklength) and an arbitrary number of transient modes. We provide a new set of outer-bounds on the capacity region of this problem when the encoder has access to causal ACK/NACK feedback. The outer-bounds reveal the significant role of the non-transient mode with higher erasure probability both on the outer and the inner bounds. We show the outer-bounds are achievable in non-trivial regimes, characterizing the capacity region for a wide range of parameters. We also discuss the regimes where the inner and outer bounds diverge and analyze the gap between the two. A key finding of this work is the significant gain of inter-modal coding over the separate treating of individual modes. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Comments: under review

arXiv:2405.18250 [pdf, other]

Spontaneous flows in active smectics with dislocations

Authors: Shao-Zhen Lin, Frank Jülicher, Jacques Prost, Jean-Francois Rupprecht

Abstract: We construct a hydrodynamic theory of active smectics A in two-dimensional space, including the creation/annihilation and motility of dislocations with Burgers' number $\pm1$. We derive analytical criteria on the set of parameters that lead to flows. We show that the motility of dislocations can lead to flow transitions with distinct features from the previously reported active Helfrich--Hurault s… ▽ More We construct a hydrodynamic theory of active smectics A in two-dimensional space, including the creation/annihilation and motility of dislocations with Burgers' number $\pm1$. We derive analytical criteria on the set of parameters that lead to flows. We show that the motility of dislocations can lead to flow transitions with distinct features from the previously reported active Helfrich--Hurault shear instability with, notably, a first-order transition in the velocity from quiescence to turbulence. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Comments: 5 pages, 5 figures

arXiv:2405.17792 [pdf, other]

JUNO Sensitivity to Invisible Decay Modes of Neutrons

Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick , et al. (635 additional authors not shown)

Abstract: We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode… ▽ More We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation modes of the excited residual nuclei can produce a time- and space-correlated triple coincidence signal in the JUNO detector. Based on a full Monte Carlo simulation informed with the latest available data, we estimate all backgrounds, including inverse beta decay events of the reactor antineutrino $\barν_e$, natural radioactivity, cosmogenic isotopes and neutral current interactions of atmospheric neutrinos. Pulse shape discrimination and multivariate analysis techniques are employed to further suppress backgrounds. With two years of exposure, JUNO is expected to give an order of magnitude improvement compared to the current best limits. After 10 years of data taking, the JUNO expected sensitivities at a 90% confidence level are $τ/B( n \rightarrow { inv} ) > 5.0 \times 10^{31} \, {\rm yr}$ and $τ/B( nn \rightarrow { inv} ) > 1.4 \times 10^{32} \, {\rm yr}$. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: 28 pages, 7 figures, 4 tables

arXiv:2405.17477 [pdf, other]

OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning

Authors: Sheng Yue, Xingyuan Hua, Ju Ren, Sen Lin, Junshan Zhang, Yaoxue Zhang

Abstract: In this paper, we study offline-to-online Imitation Learning (IL) that pretrains an imitation policy from static demonstration data, followed by fast finetuning with minimal environmental interaction. We find the naïve combination of existing offline IL and online IL methods tends to behave poorly in this context, because the initial discriminator (often used in online IL) operates randomly and di… ▽ More In this paper, we study offline-to-online Imitation Learning (IL) that pretrains an imitation policy from static demonstration data, followed by fast finetuning with minimal environmental interaction. We find the naïve combination of existing offline IL and online IL methods tends to behave poorly in this context, because the initial discriminator (often used in online IL) operates randomly and discordantly against the policy initialization, leading to misguided policy optimization and $\textit{unlearning}$ of pretraining knowledge. To overcome this challenge, we propose a principled offline-to-online IL method, named $\texttt{OLLIE}$, that simultaneously learns a near-expert policy initialization along with an $\textit{aligned discriminator initialization}$, which can be seamlessly integrated into online IL, achieving smooth and fast finetuning. Empirically, $\texttt{OLLIE}$ consistently and significantly outperforms the baseline methods in $\textbf{20}$ challenging tasks, from continuous control to vision-based domains, in terms of performance, demonstration efficiency, and convergence speed. This work may serve as a foundation for further exploration of pretraining and finetuning in the context of IL. △ Less

Submitted 30 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

Comments: International Conference on Machine Learning (ICML)

arXiv:2405.17476 [pdf, other]

How to Leverage Diverse Demonstrations in Offline Imitation Learning

Authors: Sheng Yue, Jiani Liu, Xingyuan Hua, Ju Ren, Sen Lin, Junshan Zhang, Yaoxue Zhang

Abstract: Offline Imitation Learning (IL) with imperfect demonstrations has garnered increasing attention owing to the scarcity of expert data in many real-world domains. A fundamental problem in this scenario is how to extract positive behaviors from noisy data. In general, current approaches to the problem select data building on state-action similarity to given expert demonstrations, neglecting precious… ▽ More Offline Imitation Learning (IL) with imperfect demonstrations has garnered increasing attention owing to the scarcity of expert data in many real-world domains. A fundamental problem in this scenario is how to extract positive behaviors from noisy data. In general, current approaches to the problem select data building on state-action similarity to given expert demonstrations, neglecting precious information in (potentially abundant) $\textit{diverse}$ state-actions that deviate from expert ones. In this paper, we introduce a simple yet effective data selection method that identifies positive behaviors based on their resultant states -- a more informative criterion enabling explicit utilization of dynamics information and effective extraction of both expert and beneficial diverse behaviors. Further, we devise a lightweight behavior cloning algorithm capable of leveraging the expert and selected data correctly. In the experiments, we evaluate our method on a suite of complex and high-dimensional offline IL benchmarks, including continuous-control and vision-based tasks. The results demonstrate that our method achieves state-of-the-art performance, outperforming existing methods on $\textbf{20/21}$ benchmarks, typically by $\textbf{2-5x}$, while maintaining a comparable runtime to Behavior Cloning ($\texttt{BC}$). △ Less

Submitted 30 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

Comments: International Conference on Machine Learning (ICML)

arXiv:2405.16634 [pdf, other]

Fast and Globally Consistent Normal Orientation based on the Winding Number Normal Consistency

Authors: Siyou Lin, Zuoqiang Shi, Yebin Liu

Abstract: Estimating a consistently oriented normal vector field for an unoriented point cloud enables a number of important downstream applications in computer graphics. While normal estimation for a small patch of points can be done with simple techniques like principal component analysis (PCA), orienting these normals to be globally consistent has been a notoriously difficult problem. Some recent methods… ▽ More Estimating a consistently oriented normal vector field for an unoriented point cloud enables a number of important downstream applications in computer graphics. While normal estimation for a small patch of points can be done with simple techniques like principal component analysis (PCA), orienting these normals to be globally consistent has been a notoriously difficult problem. Some recent methods exploit various properties of the winding number formula to achieve global consistency with state-of-the-art performance. Despite their exciting progress, these algorithms either have high space/time complexity, or do not produce accurate and consistently oriented normals for imperfect data. In this paper, we derive a novel property from the winding number formula to tackle this problem: the normal consistency property of the winding number formula. We refer to this property as the winding number normal consistency (WNNC). The derived property is based on the simple observation that the normals (negative gradients) sampled from the winding number field should be codirectional to the normals used to compute the winding number field. We further propose to turn the WNNC property into a normal update formula, which leads to an embarrassingly simple yet effective iterative algorithm that allows fast and high-quality convergence to a globally consistent normal vector field. Furthermore, our proposed algorithm only involves repeatedly evaluating the winding number formula and its derivatives, which can be accelerated and parallelized using treecode-based approximation algorithms due to their special structures. Exploiting this fact, we implement a GPU-accelerated treecode-based solver. Our GPU (and even CPU) implementation can be significantly faster than the recent state-of-the-art methods for normal orientation from raw points. △ Less

Submitted 26 May, 2024; originally announced May 2024.

arXiv:2405.16491 [pdf, other]

Nuclear deformation effects in photoproduction of $ρ$ mesons in ultraperipheral isobaric collisions

Authors: Shuo Lin, Jin-Yu Hu, Hao-Jie Xu, Shi Pu, Qun Wang

Abstract: We have investigated the $ρ^{0}$ meson photoproduction in ultraperipheral isobaric collisions between $_{44}^{96}\textrm{Ru}+_{44}^{96}\textrm{Ru}$ and $_{40}^{96}\textrm{Zr}+_{40}^{96}\textrm{Zr}$ at $\sqrt{s_{NN}}=200$ GeV, employing the dipole model with the equivalent photon approximation. By implementing the Woods-Saxon distribution to represent the nuclear mass density, which is derived from… ▽ More We have investigated the $ρ^{0}$ meson photoproduction in ultraperipheral isobaric collisions between $_{44}^{96}\textrm{Ru}+_{44}^{96}\textrm{Ru}$ and $_{40}^{96}\textrm{Zr}+_{40}^{96}\textrm{Zr}$ at $\sqrt{s_{NN}}=200$ GeV, employing the dipole model with the equivalent photon approximation. By implementing the Woods-Saxon distribution to represent the nuclear mass density, which is derived from density functional theory with an inclusion of nuclear deformation effects, we have calculated the transverse momentum $q_{T}$ spectra in isobaric collisions. We observe the characteristic dip behavior in these spectra, indicative of diffraction phenomena in high-energy physics. We notice that the deformation effects cause a nearly linear increase with $q_{T}^{2}$ for $q_{T}^{2}\lesssim0.015$ $\textrm{GeV}^{2}$, aligning with experimental observations. We offer a simple explanation for the observed behavior in these spectra by introducing the effective width of the nuclei in the thickness function. We also extend our discussion on the $ρ^{0}$ meson photoproduction with the targets $^{63}\textrm{Cu}$,$^{197}\textrm{Au}$, and $^{238}\textrm{U}$. △ Less

Submitted 26 May, 2024; originally announced May 2024.

Comments: 9 pages, 5 figures

arXiv:2405.15352 [pdf, other]

Measurements of $\boldsymbol{B\rightarrow Kπ}$ and $\boldsymbol{B\rightarrow ππ}$ Branching Fractions and $\boldsymbol{\mathcal{A}_{CP}}$ Asymmetries at Belle II

Authors: Shu-Ping Lin

Abstract: Analyses of $B$ meson decays to charmless hadronic final states are an important part of the Belle II program. They are sensitive to effects from non-standard model physics and provide experimentally precise constraints on the weak interactions of quarks. We present recent Belle II results on branching fractions and direct $CP$-violating asymmetries of the decays $B^0 \rightarrow K^+π^-$,… ▽ More Analyses of $B$ meson decays to charmless hadronic final states are an important part of the Belle II program. They are sensitive to effects from non-standard model physics and provide experimentally precise constraints on the weak interactions of quarks. We present recent Belle II results on branching fractions and direct $CP$-violating asymmetries of the decays $B^0 \rightarrow K^+π^-$, $B^+ \rightarrow K^+π^0$, $B^+ \rightarrow K^0π^+$, and $B^0 \rightarrow K^0π^0$, and use these to test the standard model through an isospin-based sum rule. In addition, we measure the branching fraction and direct $CP$ asymmetry of the decay $B^+ \rightarrow π^+π^0$ and the branching fraction of the decay $B^0 \rightarrow π^+π^-$, which contribute towards the determination of the CKM angle $φ_2$. The data are collected with the Belle II detector from the SuperKEKB asymmetric-energy $e^+e^-$ collider, consisting of $387 \times 10^6$ $Υ(4S)\rightarrow B\bar{B}$ events. We obtain $-0.03 \pm 0.13 \pm 0.04$ for the sum rule, in agreement with the standard model expectation of zero and with a precision comparable to the best existing determinations. △ Less

Submitted 24 May, 2024; originally announced May 2024.

Comments: contribution to the 2024 Electroweak session of the 58th Rencontres de Moriond

arXiv:2405.15334 [pdf, other]

Detection and Positive Reconstruction of Cognitive Distortion sentences: Mandarin Dataset and Evaluation

Authors: Shuya Lin, Yuxiong Wang, Jonathan Dong, Shiguang Ni

Abstract: This research introduces a Positive Reconstruction Framework based on positive psychology theory. Overcoming negative thoughts can be challenging, our objective is to address and reframe them through a positive reinterpretation. To tackle this challenge, a two-fold approach is necessary: identifying cognitive distortions and suggesting a positively reframed alternative while preserving the origina… ▽ More This research introduces a Positive Reconstruction Framework based on positive psychology theory. Overcoming negative thoughts can be challenging, our objective is to address and reframe them through a positive reinterpretation. To tackle this challenge, a two-fold approach is necessary: identifying cognitive distortions and suggesting a positively reframed alternative while preserving the original thought's meaning. Recent studies have investigated the application of Natural Language Processing (NLP) models in English for each stage of this process. In this study, we emphasize the theoretical foundation for the Positive Reconstruction Framework, grounded in broaden-and-build theory. We provide a shared corpus containing 4001 instances for detecting cognitive distortions and 1900 instances for positive reconstruction in Mandarin. Leveraging recent NLP techniques, including transfer learning, fine-tuning pretrained networks, and prompt engineering, we demonstrate the effectiveness of automated tools for both tasks. In summary, our study contributes to multilingual positive reconstruction, highlighting the effectiveness of NLP in cognitive distortion detection and positive reconstruction. △ Less

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.15146 [pdf]

On the acceptance, commissioning, and quality assurance of electron FLASH units

Authors: Allison Palmiero, Kevin Liu, Julie Colnot, Nitish Chopra, Denae Neill, Luke Connell, Brett Velasquez, Albert C. Koong, Steven H. Lin, Peter Balter, Ramesh Tailor, Charlotte Robert, Jean-François Germond, Patrik Gonçalves Jorge, Reiner Geyer, Sam Beddar, Raphael Moeckli, Emil Schüler

Abstract: Background & Purpose: FLASH or ultra-high dose rate (UHDR) radiation therapy (RT) has gained attention in recent years for its ability to spare normal tissues relative to conventional dose rate (CDR) RT in various preclinical trials. However, clinical implementation of this promising treatment option has been limited because of the lack of availability of accelerators capable of delivering UHDR RT… ▽ More Background & Purpose: FLASH or ultra-high dose rate (UHDR) radiation therapy (RT) has gained attention in recent years for its ability to spare normal tissues relative to conventional dose rate (CDR) RT in various preclinical trials. However, clinical implementation of this promising treatment option has been limited because of the lack of availability of accelerators capable of delivering UHDR RT. We established a framework for the acceptance, commissioning, and periodic quality assurance (QA) of electron FLASH units and present an example of commissioning. Methods: A protocol for acceptance, commissioning, and QA of UHDR linear accelerators was established by combining and adapting standards and professional recommendations for standard linear accelerators based on the experience with UHDR at four clinical centers that use different UHDR devices. Non-standard dosimetric beam parameters considered included pulse width, pulse repetition frequency, dose per pulse, and instantaneous dose rate, together with recommendations on how to acquire these measurements. Results: The 6 and 9 MeV beams of an UHDR electron device were commissioned by using this developed protocol. Measurements were acquired with a combination of ion chambers, beam current transformers (BCTs), and dose rate independent passive dosimeters. The unit was calibrated according to the concept of redundant dosimetry using a reference setup. Conclusions: This study provides detailed recommendations for the acceptance testing, commissioning, and routine QA of low-energy electron UHDR linear accelerators. The proposed framework is not limited to any specific unit, making it applicable to all existing eFLASH units in the market. Through practical insights and theoretical discourse, this document establishes a benchmark for the commissioning of UHDR devices for clinical use. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: 22 Pages, 8 Figures

arXiv:2405.14358 [pdf, other]

AI-Olympics: Exploring the Generalization of Agents through Open Competitions

Authors: Chen Wang, Yan Song, Shuai Wu, Sa Wu, Ruizhi Zhang, Shu Lin, Haifeng Zhang

Abstract: Between 2021 and 2023, AI-Olympics, a series of online AI competitions was hosted by the online evaluation platform Jidi in collaboration with the IJCAI committee. In these competitions, an agent is required to accomplish diverse sports tasks in a two-dimensional continuous world, while competing against an opponent. This paper provides a brief overview of the competition series and highlights not… ▽ More Between 2021 and 2023, AI-Olympics, a series of online AI competitions was hosted by the online evaluation platform Jidi in collaboration with the IJCAI committee. In these competitions, an agent is required to accomplish diverse sports tasks in a two-dimensional continuous world, while competing against an opponent. This paper provides a brief overview of the competition series and highlights notable findings. We aim to contribute insights to the field of multi-agent decision-making and explore the generalization of agents through engineering efforts. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: IJCAI 2024 Demo Track Paper

arXiv:2405.13317 [pdf, other]

Deuterium fractionation of the starless core L 1498

Authors: Sheng-Jun Lin, Shih-Ping Lai, Laurent Pagani, Charlène Lefèvre, Travis J. Thieme

Abstract: Molecular deuteration is commonly seen in starless cores and is expected to occur on a timescale comparable to that of the core contraction. Thus, the deuteration serves as a chemical clock, allowing us to investigate dynamical theories of core formation. We aim to provide a 3D cloud description for the starless core L 1498 located in the nearby low-mass star-forming region Taurus, and explore the… ▽ More Molecular deuteration is commonly seen in starless cores and is expected to occur on a timescale comparable to that of the core contraction. Thus, the deuteration serves as a chemical clock, allowing us to investigate dynamical theories of core formation. We aim to provide a 3D cloud description for the starless core L 1498 located in the nearby low-mass star-forming region Taurus, and explore the possible core formation mechanism of L 1498. We carried out non-local thermal equilibrium radiative transfer with multi-transition observations of the high-density tracer N$_2$H$^+$ to derive the density and temperature profiles of the L 1498 core. Combining with the spectral observations of the deuterated species, ortho-H$_2$D$^+$, N$_2$D$^+$, and DCO$^+$, we derived the abundance profiles for observed species and performed chemical modeling of the deuteration profiles across L 1498 to constrain the contraction timescale. We present the first ortho-H$_2$D$^+$ (1$_{10}$-1$_{11}$) detection toward L 1498. We find a peak molecular hydrogen density of $1.6_{-0.3}^{+3.0}\times10^{5}$~cm$^{-3}$, a temperature of 7.5$_{-0.5}^{+0.7}$~K, and a N$_2$H$^+$ deuteration of 0.27$_{-0.15}^{+0.12}$ in the center. We derive a lower limit of the core age for L 1498 of 0.16~Ma which is compatible with the typical free-fall time, indicating that L 1498 likely formed rapidly. △ Less

Submitted 21 May, 2024; originally announced May 2024.

Comments: 21 pages, 12 figures, accepted for publication in A&A

arXiv:2405.12117 [pdf, other]

Strongly-Consistent Distributed Discrete-event Systems

Authors: Peter Donovan, Erling Jellum, Byeonggil Jun, Hokeun Kim, Edward A. Lee, Shaokai Lin, Marten Lohstroh, Anirudh Rengarajan

Abstract: Discrete-event (DE) systems are concurrent programs where components communicate via tagged events, where tags are drawn from a totally ordered set. Reactors are an emerging model of computation based on DE and realized in the open-source coordination language Lingua Franca. Distributed DE (DDE) systems are DE systems where the components (reactors) communicate over networks. The prior art has req… ▽ More Discrete-event (DE) systems are concurrent programs where components communicate via tagged events, where tags are drawn from a totally ordered set. Reactors are an emerging model of computation based on DE and realized in the open-source coordination language Lingua Franca. Distributed DE (DDE) systems are DE systems where the components (reactors) communicate over networks. The prior art has required that for DDE systems with cycles, each cycle must contain at least one logical delay, where the tag of events is incremented. Such delays, however, are not required by the elegant fixed-point semantics of DE. The only requirement is that the program be constructive, meaning it is free of causality cycles. This paper gives a way to coordinate the execution of DDE systems that can execute any constructive program, even one with zero-delay cycles. It provides a formal model that exposes exactly the information that must be shared across networks for such execution to be possible. Furthermore, it describes a concrete implementation that is an extension of the coordination mechanisms in Lingua Franca. △ Less

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.11826 [pdf, other]

Data quality control system and long-term performance monitor of the LHAASO-KM2A

Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To… ▽ More The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To ensure the reliability of the LHAASO-KM2A data, a three-level quality control system has been established. It is used to monitor the status of detector units, stability of reconstructed parameters and the performance of the array based on observations of the Crab Nebula and Moon shadow. This paper will introduce the control system and its application on the LHAASO-KM2A data collected from August 2021 to July 2023. During this period, the pointing and angular resolution of the array were stable. From the observations of the Moon shadow and Crab Nebula, the results achieved using the two methods are consistent with each other. According to the observation of the Crab Nebula at energies from 25 TeV to 100 TeV, the time averaged pointing errors are estimated to be $-0.003^{\circ} \pm 0.005^{\circ}$ and $0.001^{\circ} \pm 0.006^{\circ}$ in the R.A. and Dec directions, respectively. △ Less

Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: 15 pages, 9 figures

arXiv:2405.11734 [pdf, other]

Finite Field Multiple Access for Sourced Massive Random Access with Finite Blocklength

Authors: Qi-yue Yu, Shi-wen Lin, Shu Lin

Abstract: For binary source transmission, this paper proposes an element-pair (EP) coding scheme for supporting sourced massive random access, which is used to solve the finite blocklength (FBL) of multiuser reliability transmission problem. In this paper, we first give the definition of an EP, which is used as a virtual resource. If the Cartesian product of $J$ distinct EPs satisfies the unique sum-pattern… ▽ More For binary source transmission, this paper proposes an element-pair (EP) coding scheme for supporting sourced massive random access, which is used to solve the finite blocklength (FBL) of multiuser reliability transmission problem. In this paper, we first give the definition of an EP, which is used as a virtual resource. If the Cartesian product of $J$ distinct EPs satisfies the unique sum-pattern mapping (USPM) structural property, the $J$ distinct EPs can form an uniquely-decodable EP (UD-EP) code. Then, we introduce a type of orthogonal EP code $Ψ_{\rm o, B}$ constructed over an extension field GF($2^m$). Based on the proposed EP code, we present finite-field multiple-access (FFMA) systems, including both the sparse-form-based and diagonal-form-based forms. Simulation results show that, for the massive random access scenario, the error performance of the proposed FFMA systems over a Gaussian multiple-access channel can provide much better error performance than that of a slotted ALOHA system. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2303.14086

Showing 1–50 of 1,593 results for author: Lin, S