GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity

Authors: Shuo Cao, Yihao Liu, Wenlong Zhang, Yu Qiao, Chao Dong

Abstract: Traditional single-task image restoration methods excel in handling specific degradation types but struggle with multiple degradations. To address this limitation, we propose Grouped Restoration with Image Degradation Similarity (GRIDS), a novel approach that harmonizes the competing objectives inherent in multiple-degradation restoration. We first introduce a quantitative method for assessing relationships between image degradations using statistical modeling of deep degradation representations. This analysis facilitates the strategic grouping of similar tasks, enhancing both the efficiency and effectiveness of the restoration process. Based on the degradation similarity, GRIDS divides restoration tasks into one of the optimal groups, where tasks within the same group are highly correlated. For instance, GRIDS effectively groups 11 degradation types into 4 cohesive groups. Trained models within each group show significant improvements, with an average improvement of 0.09dB over single-task upper bound models and 2.24dB over the mix-training baseline model. GRIDS incorporates an adaptive model selection mechanism for inference, automatically selecting the appropriate grouped-training model based on the input degradation. This mechanism is particularly useful for real-world scenarios with unknown degradations as it does not rely on explicit degradation classification modules. Furthermore, our method can predict model generalization ability without the need for network inference, providing valuable insights for practitioners.

Submitted 16 July, 2024; originally announced July 2024.

Comments: Accepted by ECCV2024

arXiv:2407.11998 [pdf, other]

Custom Cloth Creation and Virtual Try-on for Everyone

Authors: Pei Chen, Heng Wang, Sainan Sun, Zhiyuan Chen, Zhenkun Liu, Shuhua Cao, Li Yang, Minghui Yang

Abstract: This demo showcases a simple tool that utilizes AIGC technology, enabling both professional designers and regular users to easily customize clothing for their digital avatars. Customization options include changing clothing colors, textures, logos, and patterns. Compared with traditional 3D modeling processes, our approach significantly enhances efficiency and interactivity and reduces production costs.

Submitted 13 June, 2024; originally announced July 2024.

arXiv:2407.11008 [pdf, other]

Figuring out Figures: Using Textual References to Caption Scientific Figures

Authors: Stanley Cao, Kevin Liu

Abstract: Figures are essential channels for densely communicating complex ideas in scientific papers. Previous work in automatically generating figure captions has been largely unsuccessful and has defaulted to using single-layer LSTMs, which no longer achieve state-of-the-art performance. In our work, we use the SciCap datasets curated by Hsu et al. and use a variant of a CLIP+GPT-2 encoder-decoder model with cross-attention to generate captions conditioned on the image. Furthermore, we augment our training pipeline by creating a new dataset MetaSciCap that incorporates textual metadata from the original paper relevant to the figure, such as the title, abstract, and in-text references. We use SciBERT to encode the textual metadata and use this encoding alongside the figure embedding. In our experimentation with different models, we found that the CLIP+GPT-2 model performs better when it receives all textual metadata from the SciBERT encoder in addition to the figure, but employing a SciBERT+GPT2 model that uses only the textual metadata achieved optimal performance.

Submitted 25 June, 2024; originally announced July 2024.

arXiv:2407.10540 [pdf, other]

Sudden polarization angle jumps of the repeating fast radio burst FRB 20201124A

Authors: J. R. Niu, W. Y. Wang, J. C. Jiang, Y. Qu, D. J. Zhou, W. W. Zhu, K. J. Lee, J. L. Han, B. Zhang, D. Li, S. Cao, Z. Y. Fang, Y. Feng, Q. Y. Fu, P. Jiang, W. C. Jing, J. Li, Y. Li, R. Luo, L. Q. Meng, C. C. Miao, X. L. Miao, C. H. Niu, Y. C. Pan, B. J. Wang, et al. (19 additional authors not shown)

Abstract: We report the first detection of polarization angle (PA) orthogonal jumps, a phenomenon previously only observed from radio pulsars, from a fast radio burst (FRB) source FRB 20201124A. We find three cases of orthogonal jumps in over two thousand bursts, all resembling those observed in pulsar single pulses. We propose that the jumps are due to the superposition of two orthogonal emission modes that could only be produced in a highly magnetized plasma, and they are caused by the line of sight sweeping across a rotating magnetosphere. The shortest jump timescale is of the order of one millisecond, which hints that the emission modes come from regions smaller than the light cylinder of most pulsars or magnetars. This discovery provides convincing evidence that FRB emission originates from the complex magnetosphere of a magnetar, suggesting an FRB emission mechanism that is analogous to radio pulsars despite a huge luminosity difference between the two types of objects.

Submitted 15 July, 2024; originally announced July 2024.

Comments: 10 pages, 5 figures, submitted to APJL

arXiv:2407.08377 [pdf, other]

Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework

Authors: Shengqi Xu, Run Sun, Yi Chang, Shuning Cao, Xueyao Xiao, Luxin Yan

Abstract: Long-range imaging inevitably suffers from atmospheric turbulence with severe geometric distortions due to random refraction of light. The further the distance, the more severe the disturbance. Although existing research has achieved great progress in tackling short-range turbulence, there is less attention paid to long-range turbulence with significant distortions. To address this dilemma and advance the field, we construct a large-scale real long-range atmospheric turbulence dataset (RLR-AT), including 1500 turbulence sequences spanning distances from 1 km to 13 km. The advantages of RLR-AT compared to existing ones: turbulence with longer distances and higher diversity, scenes with greater variety and larger scale. Moreover, most existing work adopts either registration-based or decomposition-based methods to address distortions through one-step mitigation. However, they fail to effectively handle long-range turbulence due to its significant pixel displacements. In this work, we propose a coarse-to-fine framework to handle severe distortions, which cooperates dynamic turbulence and static background priors (CDSP). On the one hand, we discover the pixel motion statistical prior of turbulence, and propose a frequency-aware reference frame for better large-scale distortion registration, greatly reducing the burden of refinement. On the other hand, we take advantage of the static prior of background, and propose a subspace-based low-rank tensor refinement model to eliminate the misalignments inevitably left by registration while well preserving details. The dynamic and static priors complement each other, enabling us to progressively mitigate long-range turbulence with severe distortions. Extensive experiments demonstrate that the proposed method outperforms SOTA methods on different datasets.

Submitted 17 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

Comments: This paper is accepted by ECCV 2024

arXiv:2407.08148 [pdf, other]

SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning

Authors: Runmin Zhang, Jun Ma, Si-Yuan Cao, Lun Luo, Beinan Yu, Shu-Jie Chen, Junwei Li, Hui-Liang Shen

Abstract: We propose a novel unsupervised cross-modal homography estimation framework based on intra-modal Self-supervised learning, Correlation, and consistent feature map Projection, namely SCPNet. The concept of intra-modal self-supervised learning is first presented to facilitate the unsupervised cross-modal homography estimation. The correlation-based homography estimation network and the consistent feature map projection are combined to form the learnable architecture of SCPNet, boosting the unsupervised learning framework. SCPNet is the first to achieve effective unsupervised homography estimation on the satellite-map image pair cross-modal dataset, GoogleMap, under [-32,+32] offset on a 128x128 image, leading the supervised approach MHN by 14.0% of mean average corner error (MACE). We further conduct extensive experiments on several cross-modal/spectral and manually-made inconsistent datasets, on which SCPNet achieves the state-of-the-art (SOTA) performance among unsupervised approaches, and owns 49.0%, 25.2%, 36.4%, and 10.7% lower MACEs than the supervised approach MHN. Source code is available at https://github.com/RM-Zhang/SCPNet.

Submitted 10 July, 2024; originally announced July 2024.

Comments: Accepted by ECCV 2024

arXiv:2407.08049 [pdf, other]

doi 10.1109/TITS.2024.3421339

Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors

Authors: Lei Cheng, Arindam Sengupta, Siyang Cao

Abstract: Autonomous driving holds great promise in addressing traffic safety concerns by leveraging artificial intelligence and sensor technology. Multi-Object Tracking plays a critical role in ensuring safer and more efficient navigation through complex traffic scenarios. This paper presents a novel deep learning-based method that integrates radar and camera data to enhance the accuracy and robustness of Multi-Object Tracking in autonomous driving systems. The proposed method leverages a Bi-directional Long Short-Term Memory network to incorporate long-term temporal information and improve motion prediction. An appearance feature model inspired by FaceNet is used to establish associations between objects across different frames, ensuring consistent tracking. A tri-output mechanism is employed, consisting of individual outputs for radar and camera sensors and a fusion output, to provide robustness against sensor failures and produce accurate tracking results. Through extensive evaluations of real-world datasets, our approach demonstrates remarkable improvements in tracking accuracy, ensuring reliable performance even in low-visibility scenarios.

Submitted 10 July, 2024; originally announced July 2024.

Comments: Published in IEEE Transactions on Intelligent Transportation Systems

arXiv:2407.03316 [pdf, other]

An Upper Limit on the Photoproduction Cross Section of the Spin-Exotic $π_1(1600)$

Authors: F. Afzal, C. S. Akondi, M. Albrecht, M. Amaryan, S. Arrigo, V. Arroyave, A. Asaturyan, A. Austregesilo, Z. Baldwin, F. Barbosa, J. Barlow, E. Barriga, R. Barsotti, D. Barton, V. Baturin, V. V. Berdnikov, T. Black, W. Boeglin, M. Boer, W. J. Briscoe, T. Britton, S. Cao, E. Chudakov, G. Chung, P. L. Cole, et al. (124 additional authors not shown)

Abstract: The spin-exotic hybrid meson $π_{1}(1600)$ is predicted to have a large decay rate to the $ωππ$ final state. Using 76.6~pb$^{-1}$ of data collected with the GlueX detector, we measure the cross sections for the reactions $γp \to ωπ^+ π^- p$, $γp \to ωπ^0 π^0 p$, and $γp\toωπ^-π^0Δ^{++}$ in the range $E_γ=$ 8-10 GeV. Using isospin conservation, we set the first upper limits on the photoproduction cross sections of the $π^{0}_{1}(1600)$ and $π^{-}_{1}(1600)$. We combine these limits with lattice calculations of decay widths and find that photoproduction of $η'π$ is the most sensitive two-body system to search for the $π_1(1600)$.

Submitted 3 July, 2024; originally announced July 2024.

Comments: 6 pages, 3 figures plus supplemental materials

arXiv:2407.00951 [pdf, other]

Effective Management of Airport Security Queues with Passenger Reassignment

Authors: Shangqing Cao, Aparimit Kasliwal, Masoud Reihanifar, Francesc Robuste, Mark Hansen

Abstract: Airport security queues often suffer from inefficiencies that result in long wait times and decreased throughput, especially at peak departure time, affecting both passengers and airlines. This work addresses the problem of reassigning passengers to specific time slots for crossing security, aiming to mitigate these inefficiencies. We frame this problem as a Minimum Cost Network Flow (MCNF) problem, enabling us to solve it exactly in polynomial time due to its linear programming structure. Our approach redistributes passenger demand across different time intervals. By optimizing the reassignment of passengers to sigma-minute time slots, we achieve significant improvements in throughput and reductions in waiting time. Preliminary results demonstrate the effectiveness of our method in enhancing operational efficiency and passenger satisfaction. The MCNF formulation offers a scalable and adaptable solution, providing long-term benefits for airport security management.

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2407.00947 [pdf, other]

Fleet Size and Spill for UAM Operation under Uncertain Demand

Authors: Shangqing Cao, Xuan Jiang, Emin Burak Onat, Bo Zou, Mark Hansen, Raja Sengupta, Anjan Chakrabarty

Abstract: Variation and imbalance in demand pose significant challenges to Urban Air Mobility (UAM) operations, affecting strategic decisions such as fleet sizing. To study the implications of demand variation on UAM fleet operations, we propose a stochastic passenger arrival time generation model that uses real-world data to infer demand distributions, and two integer programs that compute the zero-spill fleet size and the spill-minimizing flight schedules and charging policies, respectively. Our numerical experiment on a two-vertiport network shows that spill is relatively inelastic to fleet size and that the driving factor behind spill is the imbalance in demand.

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2407.00088 [pdf, other]

T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge

Authors: Jianyu Wei, Shijie Cao, Ting Cao, Lingxiao Ma, Lei Wang, Yanyong Zhang, Mao Yang

Abstract: The deployment of Large Language Models (LLMs) on edge devices is increasingly important to enhance on-device intelligence. Weight quantization is crucial for reducing the memory footprint of LLMs on devices. However, low-bit LLMs necessitate mixed precision matrix multiplication (mpGEMM) of low precision weights and high precision activations during inference. Existing systems, lacking native support for mpGEMM, resort to dequantizing weights for high precision computation. Such an indirect way can lead to a significant inference overhead. In this paper, we introduce T-MAC, an innovative lookup table (LUT)-based method designed for efficient low-bit LLM (i.e., weight-quantized LLM) inference on CPUs. T-MAC directly supports mpGEMM without dequantization, while simultaneously eliminating multiplications and reducing additions required. Specifically, T-MAC transforms the traditional data-type-centric multiplication to bit-wise table lookup, and enables a unified and scalable mpGEMM solution. Our LUT-based kernels scale linearly to the weight bit-width. Evaluated on low-bit Llama and BitNet models, T-MAC demonstrates up to 4x increase in throughput and 70% reduction in energy consumption compared to llama.cpp. For BitNet-b1.58-3B, T-MAC delivers a token generation throughput of 30 tokens/s with a single core and 71 tokens/s with eight cores on M2-Ultra, and 11 tokens/s on lower-end devices like Raspberry Pi 5, which significantly exceeds the adult average reading speed. T-MAC, with its LUT-based computing paradigm, paves the way for the practical deployment of low-bit LLMs on resource-constrained edge devices without compromising computational efficiency. The system is open-sourced at https://github.com/microsoft/T-MAC.
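
The bit-wise table lookup idea in the T-MAC abstract can be illustrated with a small sketch. This is not the authors' kernel: the function names `build_lut` and `lut_dot`, the group size `g=4`, and the use of plain unsigned integer weights (no scales or zero-points) are all assumptions made for illustration. Each weight is split into bit planes, and for every group of activations a table of partial sums replaces the per-element multiplications, so the number of lookups grows linearly with the weight bit-width.

```python
def build_lut(group):
    """Partial sums of one activation group for every on/off bit pattern."""
    g = len(group)
    return [sum(a for j, a in enumerate(group) if (p >> j) & 1)
            for p in range(1 << g)]


def lut_dot(acts, weights, bits, g=4):
    """Dot product of activations with unsigned `bits`-bit integer weights.

    Hypothetical helper: weights are decomposed into bit planes, and each
    bit plane of a group selects one precomputed table entry.
    """
    assert len(acts) == len(weights) and len(acts) % g == 0
    total = 0.0
    for start in range(0, len(acts), g):
        lut = build_lut(acts[start:start + g])
        w_group = weights[start:start + g]
        for i in range(bits):
            # Pack bit i of each weight in the group into a table index.
            idx = 0
            for j, w in enumerate(w_group):
                idx |= ((w >> i) & 1) << j
            # Only a power-of-two scaling remains; no weight multiplication.
            total += lut[idx] * (1 << i)
    return total


acts = [0.5, -1.0, 2.0, 3.0, 1.5, 0.0, -2.5, 4.0]
weights = [3, 0, 1, 2, 1, 3, 2, 0]  # 2-bit unsigned weights
reference = sum(a * w for a, w in zip(acts, weights))
result = lut_dot(acts, weights, bits=2)
```

In this sketch the table is rebuilt for every call. In a matrix multiplication the same activation table would presumably be reused across all weight rows, which is where the savings would come from.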

Submitted 25 June, 2024; originally announced July 2024.

arXiv:2406.19321 [pdf, other]

Fractional Gaussian forms and gauge theory: an overview

Authors: Sky Cao, Scott Sheffield

Abstract: Fractional Gaussian fields are scalar-valued random functions or generalized functions on an $n$-dimensional manifold $M$, indexed by a parameter $s$. They include white noise ($s = 0$), Brownian motion ($s=1, n=1$), the 2D Gaussian free field ($s = 1, n=2$) and the membrane model ($s = 2$). These simple objects are ubiquitous in math and science, and can be used as a starting point for constructing non-Gaussian theories. The $\textit{differential form}$ analogs of these objects are equally natural: for example, instead of considering an instance $h(x)$ of the GFF on $\mathbb R^2$, one might write $h_1(x)dx_1 + h_2(x) dx_2$ where $h_1$ and $h_2$ are independent GFF instances. In general, given $k \in \{0,1,\ldots,n\}$, an instance of the $\textit{fractional Gaussian $k$-form}$ with parameter $s \in \mathbb R$ (abbreviated $\mathrm{FGF}_s^k(M)$) is given by $(-Δ)^{-\frac{s}{2}} W_k,$ where $W_k$ is a $k$-form-valued white noise. We write $$\textrm{FGF}_s^k(M)_{d=0} \quad \textrm{and} \quad \textrm{FGF}_s^k(M)_{d^*=0}$$ for the $L^2$ orthogonal projections of $\textrm{FGF}_s^k(M)$ onto the space of $k$-forms on which $d$ (resp.\ $d^*$) vanishes. We explain how $\mathrm{FGF}_s^k(M)$ and its projections transform under $d$ and $d^*$, as well as wedge/Hodge-star operators, subspace restrictions, and axial projections. We discuss how the $1$-form $\textrm{FGF}_1^1(M)$ and its $\textit{gauge-fixed}$ projection $\textrm{FGF}_1^1(M)_{d^*=0}$ are related to gauge theories, and we formulate several conjectures and open problems about scaling limits, including possible off-critical/non-Gaussian limits, whose construction in the Yang-Mills setting is a famous open problem.

Submitted 27 June, 2024; originally announced June 2024.

Comments: 84 pages, 17 figures

MSC Class: 60G60; 81T13

arXiv:2406.19227 [pdf, other]

Aligning Teacher with Student Preferences for Tailored Training Data Generation

Authors: Yantao Liu, Zhao Zhang, Zijun Yao, Shulin Cao, Lei Hou, Juanzi Li

Abstract: Large Language Models (LLMs) have shown significant promise as copilots in various tasks. Local deployment of LLMs on edge devices is necessary when handling privacy-sensitive data or latency-sensitive tasks. The computational constraints of such devices make direct deployment of powerful large-scale LLMs impractical, necessitating Knowledge Distillation from large-scale models to lightweight models. Much work has been done to elicit diverse, high-quality training examples from LLMs, but little attention has been paid to aligning teacher instructional content based on student preferences, akin to "responsive teaching" in pedagogy. Thus, we propose ARTE, dubbed Aligning TeacheR with StudenT PreferencEs, a framework that aligns the teacher model with student preferences to generate tailored training examples for Knowledge Distillation. Specifically, we elicit draft questions and rationales from the teacher model, then collect student preferences on these questions and rationales using students' performance with in-context learning as a proxy, and finally align the teacher model with student preferences. In the end, we repeat the first step with the aligned teacher model to elicit tailored training examples for the student model on the target task. Extensive experiments on academic benchmarks demonstrate the superiority of ARTE over existing instruction-tuning datasets distilled from powerful LLMs. Moreover, we thoroughly investigate the generalization of ARTE, including the generalization of fine-tuned student models in reasoning ability and the generalization of aligned teacher models to generate tailored training data across tasks and students. In summary, our contributions lie in proposing a novel framework for tailored training example generation, demonstrating its efficacy in experiments, and investigating the generalization of both student & aligned teacher models in ARTE.

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.19215 [pdf, other]

SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation

Authors: Zijun Yao, Weijian Qi, Liangming Pan, Shulin Cao, Linmei Hu, Weichuan Liu, Lei Hou, Juanzi Li

Abstract: This paper introduces Self-aware Knowledge Retrieval (SeaKR), a novel adaptive RAG model that extracts self-aware uncertainty of LLMs from their internal states. SeaKR activates retrieval when the LLMs present high self-aware uncertainty for generation. To effectively integrate retrieved knowledge snippets, SeaKR re-ranks them based on LLM's self-aware uncertainty to preserve the snippet that reduces their uncertainty to the utmost. To facilitate solving complex tasks that require multiple retrievals, SeaKR utilizes their self-aware uncertainty to choose among different reasoning strategies. Our experiments on both complex and simple Question Answering datasets show that SeaKR outperforms existing adaptive RAG methods. We release our code at https://github.com/THU-KEG/SeaKR.

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.18298 [pdf, other]

A model-independent determination of the sound horizon using recent BAO measurements and strong lensing systems

Authors: Tonghua Liu, Shuo Cao, Jieci Wang

Abstract: We propose an improved method to determine the sound horizon in a cosmological model-independent way by using the latest observations of BAO measurements from DES, BOSS/eBOSS, and DESI surveys and gravitationally time-delay lensed quasars from the H0LiCOW collaboration. Combining the 6$D_{Δt}$ plus 4$D_{d}$ measurements and the reconstructed BAO datasets, we obtain a model-independent result of $r_d=139.7^{+5.2}_{-4.5}$ Mpc, with the precision at the $\sim3.7\%$ level, which is in agreement with the result of Planck 2018 within $\sim1.7σ$ uncertainty. Our method is independent of cosmological parameters such as the Hubble constant, dark energy, (and, more importantly, does not involve the cosmic curvature when using the $D_d$ measurements of the lenses, and also avoids the obstacle of mass-sheet degeneracy in gravitational lensing). Meanwhile, it does not need to consider the Eddington relation concerning the transformation of distance. Since only two types of data are considered, the contribution of each can be clearly understood. Our results also highlight the Hubble tension and may give us a better understanding of the discordance between the datasets or reveal new physics beyond the standard model.

Submitted 26 June, 2024; originally announced June 2024.

Comments: 7 pages, 4 figures

arXiv:2406.17145 [pdf, other]

GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism

Authors: Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia

Abstract: Deep neural networks (DNNs) continue to grow rapidly in size, making them infeasible to train on a single device. Pipeline parallelism is commonly used in existing DNN systems to support large-scale DNN training by partitioning a DNN into multiple stages, which concurrently perform DNN training for different micro-batches in a pipeline fashion. However, existing pipeline-parallel approaches only consider sequential pipeline stages and thus ignore the topology of a DNN, resulting in missed model-parallel opportunities. This paper presents graph pipeline parallelism (GPP), a new pipeline-parallel scheme that partitions a DNN into pipeline stages whose dependencies are identified by a directed acyclic graph. GPP generalizes existing sequential pipeline parallelism and preserves the inherent topology of a DNN to enable concurrent execution of computationally-independent operators, resulting in reduced memory requirement and improved GPU performance. In addition, we develop GraphPipe, a distributed system that exploits GPP strategies to enable performant and scalable DNN training. GraphPipe partitions a DNN into a graph of stages, optimizes micro-batch schedules for these stages, and parallelizes DNN training using the discovered GPP strategies. Evaluation on a variety of DNNs shows that GraphPipe outperforms existing pipeline-parallel systems such as PipeDream and Piper by up to 1.6X. GraphPipe also reduces the search time by 9-21X compared to PipeDream and Piper.

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.16439 [pdf, other]

Exploring Test-Time Adaptation for Object Detection in Continually Changing Environments

Authors: Shilei Cao, Yan Liu, Juepeng Zheng, Weijia Li, Runmin Dong, Haohuan Fu

Abstract: For real-world applications, neural network models are commonly deployed in dynamic environments, where the distribution of the target domain undergoes temporal changes. Continual Test-Time Adaptation (CTTA) has recently emerged as a promising technique to gradually adapt a source-trained model to test data drawn from a continually changing target domain. Despite recent advancements in addressing CTTA, two critical issues remain: 1) The use of a fixed threshold for pseudo-labeling in existing methodologies leads to the generation of low-quality pseudo-labels, as model confidence varies across categories and domains; 2) While current solutions utilize stochastic parameter restoration to mitigate catastrophic forgetting, their capacity to preserve critical information is undermined by its intrinsic randomness. To tackle these challenges, we present CTAOD, aiming to enhance the performance of detection models in CTTA scenarios. Inspired by prior CTTA works for effective adaptation, CTAOD is founded on the mean-teacher framework, characterized by three core components. Firstly, the object-level contrastive learning module tailored for object detection extracts object-level features using the teacher's region of interest features and optimizes them through contrastive learning. Secondly, the dynamic threshold strategy updates the category-specific threshold based on predicted confidence scores to improve the quality of pseudo-labels. Lastly, we design a data-driven stochastic restoration mechanism to selectively reset inactive parameters using the gradients as weights for a random mask matrix, thereby ensuring the retention of essential knowledge. We demonstrate the effectiveness of our approach on four CTTA tasks for object detection, where CTAOD outperforms existing methods, especially achieving a 3.0 mAP improvement on the Cityscapes-to-Cityscapes-C CTTA task.

Submitted 24 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.12829 [pdf, other]

Measurement of Spin-Density Matrix Elements in $Δ^{++}(1232)$ photoproduction

Authors: F. Afzal, C. S. Akondi, M. Albrecht, M. Amaryan, S. Arrigo, V. Arroyave, A. Asaturyan, A. Austregesilo, Z. Baldwin, F. Barbosa, J. Barlow, E. Barriga, R. Barsotti, D. Barton, V. Baturin, V. V. Berdnikov, T. Black, W. Boeglin, M. Boer, W. J. Briscoe, T. Britton, S. Cao, E. Chudakov, G. Chung, P. L. Cole, et al. (124 additional authors not shown)

Abstract: We report the measurement of spin-density matrix elements of the $Δ^{++}(1232)$ in the photoproduction reaction $γp \to π^-Δ^{++}(1232)$ with the GlueX experiment in Hall D at Jefferson Lab. The measurement used a linearly polarized photon beam with $E_γ=8.2-8.8$~GeV and the statistical precision exceeds the previous measurement from SLAC by three orders of magnitude for the momentum transfer squared region $-t < 1.4$ GeV$^2$. The data are sensitive to the previously undetermined relative sign between couplings in existing Regge exchange models. Linear combinations of the extracted SDMEs allow for a decomposition into natural and unnatural exchange amplitudes, which shows that the unnatural exchange plays an important role in the low $-t$ region.

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.12793 [pdf, other]

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Authors: Team GLM: Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang, et al. (32 additional authors not shown)

Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained on ten trillion tokens mostly in Chinese and English, along with a small set of corpus from 24 languages, and aligned primarily for Chinese and English usage. The high-quality alignment is achieved via a multi-stage post-training process, which involves supervised fine-tuning and learning from human feedback. Evaluations show that GLM-4 1) closely rivals or outperforms GPT-4 in terms of general metrics such as MMLU, GSM8K, MATH, BBH, GPQA, and HumanEval, 2) gets close to GPT-4-Turbo in instruction following as measured by IFEval, 3) matches GPT-4 Turbo (128K) and Claude 3 for long context tasks, and 4) outperforms GPT-4 in Chinese alignments as measured by AlignBench. The GLM-4 All Tools model is further aligned to understand user intent and autonomously decide when and which tool(s) to use -- including web browser, Python interpreter, text-to-image model, and user-defined functions -- to effectively complete complex tasks. In practical applications, it matches and even surpasses GPT-4 All Tools in tasks like accessing online information via web browsing and solving math problems using Python interpreter. Over the course, we have open-sourced a series of models, including ChatGLM-6B (three generations), GLM-4-9B (128K, 1M), GLM-4V-9B, WebGLM, and CodeGeeX, attracting over 10 million downloads on Hugging Face in the year 2023 alone. The open models can be accessed through https://github.com/THUDM and https://huggingface.co/THUDM.

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.10467 [pdf, other]

Scheduling two types of jobs with minimum makespan

Authors: Song Cao, Kai Jin

Abstract: We consider scheduling two types of jobs (A-job and B-job) to $p$ machines and minimizing their makespan. A group of jobs of the same type processed consecutively by a machine is called a batch. For machine $v$, processing $x$ A-jobs in a batch takes $k^A_vx^2$ time units for a given speed $k^A_v$, and processing $x$ B-jobs in a batch takes $k^B_vx^2$ time units for a given speed $k^B_v$. We give an $O(n^2p\log(n))$ algorithm based on dynamic programming and binary search for solving this problem, where $n$ denotes the maximal number of A-jobs and B-jobs to be distributed to the machines. Our algorithm also fits the easier linear case where each batch of length $x$ of $A$-jobs takes $k^A_v x$ time units and each batch of length $x$ of $B$-jobs takes $k^B_vx$ time units. The running time is the same as the above case.

Submitted 14 June, 2024; originally announced June 2024.

arXiv:2406.08079 [pdf, other]

A$^{2}$-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder

Authors: Lixian Zhang, Yi Zhao, Runmin Dong, Jinxiao Zhang, Shuai Yuan, Shilei Cao, Mengxuan Chen, Juepeng Zheng, Weijia Li, Wei Liu, Wayne Zhang, Litong Feng, Haohuan Fu

Abstract: Vast amounts of remote sensing (RS) data provide Earth observations across multiple dimensions, encompassing critical spatial, temporal, and spectral information which is essential for addressing global-scale challenges such as land use monitoring, disaster prevention, and environmental change mitigation. Despite various pre-training methods tailored to the characteristics of RS data, a key limitation persists: the inability to effectively integrate spatial, temporal, and spectral information within a single unified model. To unlock the potential of RS data, we construct a Spatial-Temporal-Spectral Structured Dataset (STSSD) characterized by the incorporation of multiple RS sources, diverse coverage, unified locations within image sets, and heterogeneity within images. Building upon this structured dataset, we propose an Anchor-Aware Masked AutoEncoder method (A$^{2}$-MAE), leveraging intrinsic complementary information from the different kinds of images and geo-information to reconstruct the masked patches during the pre-training phase. A$^{2}$-MAE integrates an anchor-aware masking strategy and a geographic encoding module to comprehensively exploit the properties of RS images. Specifically, the proposed anchor-aware masking strategy dynamically adapts the masking process based on the meta-information of a pre-selected anchor image, thereby facilitating the training on images captured by diverse types of RS sources within one model. Furthermore, we propose a geographic encoding method to leverage accurate spatial patterns, enhancing the model generalization capabilities for downstream applications that are generally location-related. Extensive experiments demonstrate our method achieves comprehensive improvements across various downstream tasks compared with existing RS pre-training methods, including image classification, semantic segmentation, and change detection tasks.

Submitted 16 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.06125 [pdf, other]

Verifiable Generation with Subsentence-Level Fine-Grained Citations

Authors: Shuyang Cao, Lu Wang

Abstract: Verifiable generation requires large language models (LLMs) to cite source documents supporting their outputs, thereby improving output transparency and trustworthiness. Yet, previous work mainly targets the generation of sentence-level citations, lacking specificity about which parts of a sentence are backed by the cited sources. This work studies verifiable generation with subsentence-level fine-grained citations for more precise location of generated content supported by the cited sources. We first present a dataset, SCiFi, comprising 10K Wikipedia paragraphs with subsentence-level citations. Each paragraph is paired with a set of candidate source documents for citation and a query that triggers the generation of the paragraph content. On SCiFi, we evaluate the performance of state-of-the-art LLMs and strategies for processing long documents designed for these models. Our experimental results reveal key factors that could enhance the quality of citations, including the expansion of the source documents' context accessible to the models and the implementation of specialized model tuning.

Submitted 10 June, 2024; originally announced June 2024.

Comments: NAACL 2024 Findings

arXiv:2406.04271 [pdf, other]

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Authors: Ling Yang, Zhaochen Yu, Tianjun Zhang, Shiyi Cao, Minkai Xu, Wentao Zhang, Joseph E. Gonzalez, Bin Cui

Abstract: We introduce Buffer of Thoughts (BoT), a novel and versatile thought-augmented reasoning approach for enhancing accuracy, efficiency and robustness of large language models (LLMs). Specifically, we propose meta-buffer to store a series of informative high-level thoughts, namely thought-template, distilled from the problem-solving processes across various tasks. Then for each problem, we retrieve a relevant thought-template and adaptively instantiate it with specific reasoning structures to conduct efficient reasoning. To guarantee the scalability and stability, we further propose buffer-manager to dynamically update the meta-buffer, thus enhancing the capacity of meta-buffer as more tasks are solved. We conduct extensive experiments on 10 challenging reasoning-intensive tasks, and achieve significant performance improvements over previous SOTA methods: 11% on Game of 24, 20% on Geometric Shapes and 51% on Checkmate-in-One. Further analysis demonstrates the superior generalization ability and model robustness of our BoT, while requiring only 12% of the cost of multi-query prompting methods (e.g., tree/graph of thoughts) on average. Notably, we find that our Llama3-8B+BoT has the potential to surpass Llama3-70B model. Our project is available at: https://github.com/YangLing0818/buffer-of-thought-llm

Submitted 6 June, 2024; originally announced June 2024.

Comments: Project: https://github.com/YangLing0818/buffer-of-thought-llm

arXiv:2405.17211 [pdf, other]

Spectral-Refiner: Fine-Tuning of Accurate Spatiotemporal Neural Operator for Turbulent Flows

Authors: Shuhao Cao, Francesco Brarda, Ruipeng Li, Yuanzhe Xi

Abstract: Recent advancements in operator-type neural networks have shown promising results in approximating the solutions of spatiotemporal Partial Differential Equations (PDEs). However, these neural networks often entail considerable training expenses, and may not always achieve the desired accuracy required in many scientific and engineering disciplines. In this paper, we propose a new Spatiotemporal Fourier Neural Operator (SFNO) that learns maps between Bochner spaces, and a new learning framework to address these issues. This new paradigm leverages wisdom from traditional numerical PDE theory and techniques to refine the pipeline of commonly adopted end-to-end neural operator training and evaluations. Specifically, in the learning problems for the turbulent flow modeling by the Navier-Stokes Equations (NSE), the proposed architecture initiates the training with a few epochs for SFNO, concluding with the freezing of most model parameters. Then, the last linear spectral convolution layer is fine-tuned without the frequency truncation. The optimization uses a negative Sobolev norm for the first time as the loss in operator learning, defined through a reliable functional-type a posteriori error estimator whose evaluation is almost exact thanks to the Parseval identity. This design allows the neural operators to effectively tackle low-frequency errors while the relief of the de-aliasing filter addresses high-frequency errors. Numerical experiments on commonly used benchmarks for the 2D NSE demonstrate significant improvements in both computational efficiency and accuracy, compared to end-to-end evaluation and traditional numerical PDE solvers.

Submitted 27 May, 2024; originally announced May 2024.

MSC Class: 65M70 (Primary); 35Q30; 76M22; 65M50; 68T07 (Secondary)

arXiv:2405.16038 [pdf, other]

Rethinking Early-Fusion Strategies for Improved Multispectral Object Detection

Authors: Xue Zhang, Si-Yuan Cao, Fang Wang, Runmin Zhang, Zhe Wu, Xiaohan Zhang, Xiaokai Bai, Hui-Liang Shen

Abstract: Most recent multispectral object detectors employ a two-branch structure to extract features from RGB and thermal images. While the two-branch structure achieves better performance than a single-branch structure, it overlooks inference efficiency. This conflict is increasingly aggressive, as recent works solely pursue higher performance rather than both performance and efficiency. In this paper, we address this issue by improving the performance of efficient single-branch structures. We revisit the reasons causing the performance gap between these structures. For the first time, we reveal the information interference problem in the naive early-fusion strategy adopted by previous single-branch structures. Besides, we find that the domain gap between multispectral images, and weak feature representation of the single-branch structure are also key obstacles for performance. Focusing on these three problems, we propose corresponding solutions, including a novel shape-priority early-fusion strategy, a weakly supervised learning method, and a core knowledge distillation technique. Experiments demonstrate that single-branch networks equipped with these three contributions achieve significant performance enhancements while retaining high efficiency. Our code will be available at https://github.com/XueZ-phd/Efficient-RGB-T-Early-Fusion-Detection.

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.13675 [pdf, other]

Context and Geometry Aware Voxel Transformer for Semantic Scene Completion

Authors: Zhu Yu, Runming Zhang, Jiacheng Ying, Junchen Yu, Xiaohai Hu, Lun Luo, Siyuan Cao, Huiliang Shen

Abstract: Vision-based Semantic Scene Completion (SSC) has gained much attention due to its widespread applications in various 3D perception tasks. Existing sparse-to-dense approaches typically employ shared context-independent queries across various input images, which fails to capture distinctions among them as the focal regions of different inputs vary and may result in undirected feature aggregation of cross-attention. Additionally, the absence of depth information may lead to points projected onto the image plane sharing the same 2D position or similar sampling points in the feature map, resulting in depth ambiguity. In this paper, we present a novel context and geometry aware voxel transformer. It utilizes a context aware query generator to initialize context-dependent queries tailored to individual input images, effectively capturing their unique characteristics and aggregating information within the region of interest. Furthermore, it extends deformable cross-attention from 2D to 3D pixel space, enabling the differentiation of points with similar image coordinates based on their depth coordinates. Building upon this module, we introduce a neural network named CGFormer to achieve semantic scene completion. Simultaneously, CGFormer leverages multiple 3D representations (i.e., voxel and TPV) to boost the semantic and geometric representation abilities of the transformed 3D volume from both local and global perspectives. Experimental results demonstrate that CGFormer achieves state-of-the-art performance on the SemanticKITTI and SSCBench-KITTI-360 benchmarks, attaining a mIoU of 16.87 and 20.05, as well as an IoU of 45.99 and 48.07, respectively. Remarkably, CGFormer even outperforms approaches employing temporal images as inputs or much larger image backbone networks. Code for the proposed method is available at https://github.com/pkqbajng/CGFormer.

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.12575 [pdf, other]

Three-dimensional mapping and electronic origin of large altermagnetic splitting near Fermi level in CrSb

Authors: Guowei Yang, Zhanghuan Li, Sai Yang, Jiyuan Li, Hao Zheng, Weifan Zhu, Saizheng Cao, Wenxuan Zhao, Jiawen Zhang, Mao Ye, Yu Song, Lun-Hui Hu, Lexian Yang, Ming Shi, Huiqiu Yuan, Yongjun Zhang, Yuanfeng Xu, Yang Liu

Abstract: Recently, a new kind of collinear magnetism, dubbed altermagnetism, has attracted considerable interest. A key characteristic of an altermagnet is the momentum-dependent band and spin splitting without net magnetization. However, finding altermagnetic materials with large splitting near the Fermi level, which necessarily requires three-dimensional k-space mapping and is crucial for spintronic applications and emergent phenomena, remains challenging. Here by employing synchrotron-based angle-resolved photoemission spectroscopy (ARPES) and model calculations, we uncover a large altermagnetic splitting, up to ~1.0 eV, near the Fermi level in CrSb. We verify its bulk-type g-wave altermagnetism through systematic three-dimensional k-space mapping, which unambiguously reveals the altermagnetic symmetry and associated nodal planes. The ARPES results are well captured by density functional theory calculations. In addition, tight-binding model analysis indicates that the large altermagnetic splitting arises from strong third-nearest-neighbor hopping mediated by Sb ions, which breaks both the space-time reversal symmetry and the translational spin-rotation symmetry. The large band/spin splitting near Fermi level in metallic CrSb, together with its high TN (up to 705 K) and simple spin configuration, paves the way for exploring emergent phenomena and spintronic applications based on altermagnets.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 16 pages, 4 figures and 1 table

arXiv:2405.12488 [pdf, other]

First joint oscillation analysis of Super-Kamiokande atmospheric and T2K accelerator neutrino data

Authors: Super-Kamiokande, T2K collaborations: S. Abe, K. Abe, N. Akhlaq, R. Akutsu, H. Alarakia-Charles, A. Ali, Y. I. Alj Hakim, S. Alonso Monsalve, S. Amanai, C. Andreopoulos, L. H. V. Anthony, M. Antonova, S. Aoki, K. A. Apte, T. Arai, T. Arihara, S. Arimoto, Y. Asada, R. Asaka, Y. Ashida, E. T. Atkin, N. Babu, et al. (524 additional authors not shown)

Abstract: The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlapping in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of $19.7(16.3) \times 10^{20}$ protons on target in (anti)neutrino mode, the analysis finds a 1.9$σ$ exclusion of CP-conservation (defined as $J_{CP}=0$) and a preference for the normal mass ordering.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 10 pages, 3 figures

arXiv:2405.11414 [pdf, other]

High-Resolution Agent-Based Modeling of Campus Population Behaviors for Pandemic Response Planning

Authors: Hiroki Sayama, Shun Cao

Abstract: This paper reports a case study of an application of high-resolution agent-based modeling and simulation to pandemic response planning on a university campus. In the summer of 2020, we were tasked with a COVID-19 pandemic response project to create a detailed behavioral simulation model of the entire campus population at Binghamton University. We conceptualized this problem as an agent migration process on a multilayer transportation network, in which each layer represented a different transportation mode. As no direct data were available about people's behaviors on campus, we collected as much indirect information as possible to inform the agents' behavioral rules. Each agent was assumed to move along the shortest path between two locations within each transportation layer and switch layers at a parking lot or a bus stop, along with several other behavioral assumptions. Using this model, we conducted simulations of the whole campus population behaviors on a typical weekday, involving more than 25,000 agents. We measured the frequency of close social contacts at each spatial location and identified several busy locations and corridors on campus that needed substantial behavioral intervention. Moreover, systematic simulations with varying population density revealed that the effect of population density reduction was nonlinear, and that reducing the population density to 40-45% would be optimal and sufficient to suppress disease spreading on campus. These results were reported to the university administration and utilized in the pandemic response planning, which led to successful outcomes.

Submitted 18 May, 2024; originally announced May 2024.

Comments: 14 pages, 6 figures; submitted to PPAM 2024 (under review)

arXiv:2405.11118 [pdf, other]

A Simulation-Optimization Framework for Developing Wind-Resilient AAM Networks

Authors: Emin Burak Onat, Shangqing Cao, Raiyan Rizwan, Xuan Jiang, Mark Hansen, Raja Sengupta, Anjan Chakrabarty

Abstract: Environmental factors pose a significant challenge to the operational efficiency and safety of advanced air mobility (AAM) networks. This paper presents a simulation-optimization framework that dynamically integrates wind variability into AAM operations. We employ a nonlinear charging model within a multi-vertiport environment to optimize fleet size and scheduling. Our framework assesses the impact of wind on operational parameters, providing strategies to enhance the resilience of AAM ecosystems. The results demonstrate that wind conditions exert significant influence on fleet size even for short-distance flights; their impact on fleet size and energy requirements becomes more pronounced over longer distances. Efficient management of fleet size and charging policies, particularly for long-distance networks, is needed to accommodate the variability of wind conditions effectively.

Submitted 17 May, 2024; originally announced May 2024.

Comments: Accepted to ICRAT 2024

arXiv:2405.07187 [pdf, ps, other]

Two-Plasmon-Decay Instability Stimulated by a Normal- and Large-Angle-Incidence Laser Pair

Authors: C. -W. Lian, Y. Ji, R. Yan, J. Li, S. -H. Cao, C. Ren, L. -F. Wang, Y. -K. Ding, J. Zheng

Abstract: The two-plasmon-decay instability (TPD) is a critical target preheating risk in direct-drive inertial confinement fusion. In this paper, TPD collectively driven by a normal-incidence laser beam (Beam-N) and a large-angle-incidence laser beam (Beam-L) is investigated via particle-in-cell simulations. Significant TPD growth is found to develop in this regime at previously unexpected low laser intensities if the intensity of Beam-L exceeds the large-angle-incidence threshold. Both beams contribute to the growth of TPD in a "seed-amplification" manner where the absolute instability driven by Beam-L provides the seeds that get convectively amplified by Beam-N, making TPD energetically important and causing significant pump depletion and hot electron generation.

Submitted 12 May, 2024; originally announced May 2024.

Comments: 16 pages, 5 figures, submitted

arXiv:2405.03239 [pdf, other]

Deep Learning for Detecting and Early Predicting Chronic Obstructive Pulmonary Disease from Spirogram Time Series: A UK Biobank Study

Authors: Shuhao Mei, Yuxi Zhou, Jiahao Xu, Yuxuan Wan, Shan Cao, Qinghao Zhao, Shijia Geng, Junqing Xie, Shenda Hong

Abstract: Chronic Obstructive Pulmonary Disease (COPD) is a chronic inflammatory lung condition that causes airflow obstruction. The existing methods can only detect patients who already have COPD based on obvious features shown in the spirogram (In this article, the spirogram specifically involves measuring Volume-Flow curve time series). Early prediction of COPD risk is vital for monitoring COPD disease progression, slowing it down, or even preventing its onset. However, these methods fail to early predict an individual's probability of COPD in the future based on subtle features in the spirogram. To address this gap, for the first time, we propose DeepSpiro, a method based on deep learning for early prediction of future COPD risk. DeepSpiro consists of four parts. First, we construct Volume-Flow curves guided by Time-Volume instability smoothing (SpiroSmoother) to enhance the stability of the original Volume-Flow curves precisely. Second, we extract critical features from the evolution of varied-length key patches (SpiroEncoder) to capture the key temporal evolution from original high-dimensional dynamic sequences to a unified low-dimensional temporal representation. Third, we explain the model based on temporal attention and heterogeneous feature fusion (SpiroExplainer), which integrates information from heterogeneous data such as spirogram and demographic information. Fourth, we predict the risk of COPD based on the evolution of key patch concavity (SpiroPredictor), enabling accurate prediction of the risk of disease in high-risk patients who are not yet diagnosed, for up to 1, 2, 3, 4, 5 years, and beyond. We conduct experiments on the UK Biobank dataset. Results show that DeepSpiro achieves an AUC value of 0.8328 in the task of detecting COPD. In early prediction tasks, high-risk and low-risk groups show significant differences in the future, with a p-value of <0.001.

Submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.02690 [pdf, other]

Laser wakefield acceleration of ions with a transverse flying focus

Authors: Zheng Gong, Sida Cao, John P. Palastro, Matthew R. Edwards

Abstract: The extreme electric fields created in high-intensity laser-plasma interactions could generate energetic ions far more compactly than traditional accelerators. Despite this promise, laser-plasma accelerators have remained stagnant at maximum ion energies of 100 MeV/nucleon for the last twenty years. The central challenge is the low charge-to-mass ratio of ions, which has precluded one of the most successful approaches used for electrons: laser wakefield acceleration. Here we show that a laser pulse with a focal spot that moves transverse to the laser propagation direction enables wakefield acceleration of ions to GeV energies in underdense plasma. Three-dimensional particle-in-cell simulations demonstrate that this relativistic-intensity "transverse flying focus" can trap ions in a comoving electrostatic pocket, producing a monoenergetic collimated ion beam. With a peak intensity of $10^{20}\,$W/cm$^2$ and an acceleration distance of $0.44\,$cm, we observe a proton beam with $23.1\,$pC charge, $1.6\,$GeV peak energy, and $3.7\,$% relative energy spread. This approach allows for compact high-repetition-rate production of high-energy ions, highlighting the capability of more generalized spatio-temporal pulse shaping to address open problems in plasma physics.

Submitted 4 May, 2024; originally announced May 2024.

Comments: 11 pages, 6 figures

arXiv:2405.00579 [pdf, other]

LEAP: Optimization Hierarchical Federated Learning on Non-IID Data with Coalition Formation Game

Authors: Jianfeng Lu, Yue Chen, Shuqin Cao, Longbiao Chen, Wei Wang, Yun Xin

Abstract: Although Hierarchical Federated Learning (HFL) utilizes edge servers (ESs) to alleviate communication burdens, its model performance will be degraded by non-IID data and limited communication resources. Current works often assume that data is uniformly distributed, which however contradicts the heterogeneity of IoT. Solutions that use additional model training to check the data distribution inevitably increase computational costs and the risk of privacy leakage. The challenges in solving these issues are how to reduce the impact of non-IID data without involving raw data and how to rationalize the communication resource allocation for addressing the straggler problem. To tackle these challenges, we propose a novel optimization method based on coaLition formation gamE and grAdient Projection, called LEAP. Specifically, we combine edge data distribution with coalition formation game innovatively to adjust the correlations between clients and ESs dynamically, which ensures optimal correlations. We further capture the client heterogeneity to achieve the rational bandwidth allocation from coalition perception and determine the optimal transmission power within specified delay constraints at client level. Experimental results on four real datasets show that LEAP is able to achieve 20.62% improvement in model accuracy compared to the state-of-the-art baselines. Moreover, LEAP effectively reduces transmission energy consumption by at least about 2.24 times.

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2405.00478 [pdf, ps, other]

Dual-frequency optical-microwave atomic clocks based on cesium atoms

Authors: Tiantian Shi, Qiang Wei, Xiaomin Qin, Zhenfeng Liu, Kunkun Chen, Shiying Cao, Hangbo Shi, Zijie Liu, Jingbiao Chen

Abstract: $^{133}$Cs, which is the only stable cesium (Cs) isotope, is one of the most investigated elements in atomic spectroscopy and was used to realize the atomic clock in 1955. Among all atomic clocks, the cesium atomic clock has a special place, since the current unit of time is based on a microwave transition in the Cs atom. In addition, the long lifetime of the $6{\text{P}}_{3/2}$ state and simple preparation technique of Cs vapor cells have great relevance to quantum and atom optics experiments, which suggests the use of the $6{\text{S}} - 6{\text{P}}$ D2 transition as an optical frequency standard. In this work, using one laser as the local oscillator and Cs atoms as the quantum reference, we realized two atomic clocks in the optical and microwave frequencies, respectively. Both clocks could be freely switched or simultaneously output. The optical clock based on the vapor cell continuously operated with a frequency stability of $3.89 \times {10^{ - 13}}$ at 1 s, decreasing to $2.17 \times {10^{ - 13}}$ at 32 s, which was frequency stabilized by modulation transfer spectroscopy and estimated by an optical comb. Then, applying this stabilized laser for an optically pumped Cs beam atomic clock to reduce the laser frequency noise, we obtained a microwave clock with a frequency stability of $1.84 \times {10^{ - 12}}/\sqrt τ$, reaching $5.99 \times {10^{ - 15}}$ at $10^5$ s. This study demonstrates an attractive feature for the commercialization and deployment of optical and microwave clocks and will guide further development of integrated atomic clocks with better stability. Thus, this study lays the groundwork for future quantum metrology and laser physics.

Submitted 1 May, 2024; originally announced May 2024.

Comments: 8 pages, 4 figures

arXiv:2404.18115 [pdf, other]

Exploring system size dependence of jet modification in heavy-ion collisions

Authors: Yang He, Mengxue Zhang, Maowu Nie, Shanshan Cao, Li Yi

Abstract: In relativistic heavy-ion collisions, jet quenching in quark-gluon plasma (QGP) has been extensively studied, revealing important insights into the properties of the color deconfined nuclear matter. Over the past decade, there has been a surge of interest in the exploration of QGP droplets in small collision systems like $p$+$p$ or $p$+A collisions driven by the observation of collective flow phenomena. However, the absence of jet quenching, a key QGP signature, in these systems poses a puzzle. Understanding how jet quenching evolves with system size is crucial for uncovering the underlying physics. In this study, we employ the linear Boltzmann transport (LBT) model to investigate jet modification in $^{96}$Ru+$^{96}$Ru, $^{96}$Zr+$^{96}$Zr, and $^{197}$Au+$^{197}$Au collisions at $\sqrt{s_\mathrm{NN}}=200$ GeV. Our findings highlight the system size sensitivity exhibited by jet nuclear modification factor ($R_\mathrm{AA}$) and jet shape ($ρ$), in contrast to the relatively weak responses of jet mass ($M$), girth ($g$) and momentum dispersion ($p_\mathrm{T}{D}$) to system size variations. These results offer invaluable insights into the system size dependence of the QGP properties and remain to be validated experimentally at the Relativistic Heavy-Ion Collider.

Submitted 28 April, 2024; originally announced April 2024.

arXiv:2404.18084 [pdf, other]

Age-minimal Multicast by Graph Attention Reinforcement Learning

Authors: Yanning Zhang, Guocheng Liao, Shengbin Cao, Ning Yang, Meng Zhang

Abstract: Age of Information (AoI) is an emerging metric used to assess the timeliness of information, gaining research interest in real-time multicast applications such as video streaming and metaverse platforms. In this paper, we consider a dynamic multicast network with energy constraints, where our objective is to minimize the expected time-average AoI through energy-constrained multicast routing and scheduling. The inherent complexity of the problem, given the NP-hardness and intertwined scheduling and routing decisions, makes existing approaches inapplicable. To address these challenges, we decompose the original problem into two subtasks, each amenable to reinforcement learning (RL) methods. Subsequently, we propose an innovative framework based on graph attention networks (GATs) to effectively capture graph information with superior generalization capabilities. To validate our framework, we conduct experiments on three datasets including a real-world dataset called AS-733, and show that our proposed scheme reduces the average weighted AoI by 62.9% and reduces the energy consumption by at most 72.5% compared to baselines.

Submitted 31 May, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

arXiv:2404.12601 [pdf, other]

Study of bottom quark dynamics via non-prompt $D^0$ and $J/ψ$ in Pb+Pb collisions at $\sqrt{s_\mathrm{NN}}=5.02$ TeV

Authors: Wen-Jing Xing, Shu-Qing Li, Shanshan Cao, Guang-You Qin

Abstract: We study bottom quark energy loss via the nuclear modification factor ($R_\mathrm{AA}$) and elliptic flow ($v_2$) of non-prompt $D^0$ and $J/ψ$ in relativistic heavy-ion collisions at the LHC. The space-time profile of quark-gluon plasma is obtained from the CLVisc hydrodynamics simulation, the dynamical evolution of heavy quarks inside the color deconfined QCD medium is simulated using a linear Boltzmann transport model that combines Yukawa and string potentials of heavy-quark-medium interactions, the hadronization of heavy quarks is performed using a hybrid coalescence-fragmentation model, and the decay of $B$ mesons is simulated via PYTHIA. Using this numerical framework, we calculate the transverse momentum ($p_\mathrm{T}$) dependent $R_\mathrm{AA}$ and $v_2$ of direct $D$ mesons, $B$ mesons, and non-prompt $D^0$ and $J/ψ$ from $B$ meson decay in Pb+Pb collisions at $\sqrt{s_\mathrm{NN}}=5.02$ TeV. We find the mass hierarchy of the nuclear modification of prompt $D$ and $B$ mesons depends on their $p_\mathrm{T}$. Both $R_\mathrm{AA}$ and $v_2$ of heavy flavor particles show strong $p_\mathrm{T}$ and centrality dependences due to the interplay between parton energy loss, medium geometry and flow, and hadronization of heavy quarks. Non-prompt $D^0$ and $J/ψ$ share similar patterns of $R_\mathrm{AA}$ and $v_2$ to $B$ mesons except for a $p_\mathrm{T}$ shift during the decay processes. Therefore, more precise future measurements of non-prompt $D^0$ and $J/ψ$ can help further pin down the bottom quark dynamics inside the quark-gluon plasma.

Submitted 18 April, 2024; originally announced April 2024.

Comments: 11 pages, 8 figures

arXiv:2404.12386 [pdf, other]

SOHES: Self-supervised Open-world Hierarchical Entity Segmentation

Authors: Shengcao Cao, Jiuxiang Gu, Jason Kuen, Hao Tan, Ruiyi Zhang, Handong Zhao, Ani Nenkova, Liang-Yan Gui, Tong Sun, Yu-Xiong Wang

Abstract: Open-world entity segmentation, as an emerging computer vision task, aims at segmenting entities in images without being restricted by pre-defined classes, offering impressive generalization capabilities on unseen images and concepts. Despite its promise, existing entity segmentation methods like Segment Anything Model (SAM) rely heavily on costly expert annotators. This work presents Self-supervised Open-world Hierarchical Entity Segmentation (SOHES), a novel approach that eliminates the need for human annotations. SOHES operates in three phases: self-exploration, self-instruction, and self-correction. Given a pre-trained self-supervised representation, we produce abundant high-quality pseudo-labels through visual feature clustering. Then, we train a segmentation model on the pseudo-labels, and rectify the noises in pseudo-labels via a teacher-student mutual-learning procedure. Beyond segmenting entities, SOHES also captures their constituent parts, providing a hierarchical understanding of visual entities. Using raw images as the sole training data, our method achieves unprecedented performance in self-supervised open-world segmentation, marking a significant milestone towards high-quality open-world entity segmentation in the absence of human-annotated masks. Project page: https://SOHES.github.io.

Submitted 18 April, 2024; originally announced April 2024.

Comments: ICLR 2024

arXiv:2404.11436 [pdf, other]

Excitation Transmission through a non-Hermitian traversable wormhole

Authors: Sizheng Cao, Xian-Hui Ge

Abstract: This study explores the intricate real-time dynamics of a non-Hermitian system composed of two interconnected Sachdev-Ye-Kitaev (SYK) models. A central finding reveals that an excitation initially localized in the right SYK subsystem can be efficiently transmitted to the left subsystem subsequent to the characteristic scrambling time, a phenomenon facilitated by the intrinsic non-Hermitian nature of the system. The defining hallmark of non-Hermiticity is manifest in the asymmetric conveyance of quantum states, with the non-Hermitian parameter functioning as a tunable knob that selectively amplifies or dampens propagation modes on either side. Despite this inherent directional bias in state transfer, the system sustains two distinct phases, analogously likened to black holes and wormholes.

Submitted 27 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

Comments: 12 pages, 9 figures

arXiv:2404.10160 [pdf, other]

Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs

Authors: Ruoxi Cheng, Haoxuan Ma, Shuirong Cao, Jiaqi Li, Aihua Pei, Zhiqiang Wang, Pengliang Ji, Haoyu Wang, Jiaqi Huo

Abstract: Bias in LLMs can harm user experience and societal outcomes. However, current bias mitigation methods often require intensive human feedback, lack transferability to other topics or yield overconfident and random outputs. We find that involving LLMs in role-playing scenarios boosts their ability to recognize and mitigate biases. Based on this, we propose Reinforcement Learning from Multi-role Debates as Feedback (RLDF), a novel approach for bias mitigation replacing human feedback in traditional RLHF. We utilize LLMs in multi-role debates to create a dataset that includes both high-bias and low-bias instances for training the reward model in reinforcement learning. Our approach comprises two modes: (1) self-reflection, where the same LLM participates in multi-role debates, and (2) teacher-student, where a more advanced LLM like GPT-3.5-turbo guides the LLM to perform this task. Experimental results across different LLMs demonstrate the effectiveness of our approach in bias mitigation.

Submitted 18 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

Comments: The first three authors contributed equally to this work

arXiv:2404.09920 [pdf, other]

Combined Pre-Supernova Alert System with Kamland and Super-Kamiokande

Authors: KamLAND, Super-Kamiokande Collaborations: Seisho Abe, Minori Eizuka, Sawako Futagi, Azusa Gando, Yoshihito Gando, Shun Goto, Takahiko Hachiya, Kazumi Hata, Koichi Ichimura, Sei Ieki, Haruo Ikeda, Kunio Inoue, Koji Ishidoshiro, Yuto Kamei, Nanami Kawada, Yasuhiro Kishimoto, Masayuki Koga, Maho Kurasawa, Tadao Mitsui, Haruhiko Miyake, Daisuke Morita, Takeshi Nakahata, et al. (290 additional authors not shown)

Abstract: Preceding a core-collapse supernova, various processes produce an increasing amount of neutrinos of all flavors characterized by mounting energies from the interior of massive stars. Among them, the electron antineutrinos are potentially detectable by terrestrial neutrino experiments such as KamLAND and Super-Kamiokande via inverse beta decay interactions. Once these pre-supernova neutrinos are observed, an early warning of the upcoming core-collapse supernova can be provided. In light of this, KamLAND and Super-Kamiokande, both located in the Kamioka mine in Japan, have been monitoring pre-supernova neutrinos since 2015 and 2021, respectively. Recently, we performed a joint study between KamLAND and Super-Kamiokande on pre-supernova neutrino detection. A pre-supernova alert system combining the KamLAND detector and the Super-Kamiokande detector was developed and put into operation, which can provide a supernova alert to the astrophysics community. Fully leveraging the complementary properties of these two detectors, the combined alert is expected to resolve a pre-supernova neutrino signal from a 15 M$_{\odot}$ star within 510 pc of the Earth, at a significance level corresponding to a false alarm rate of no more than 1 per century. For a Betelgeuse-like model with optimistic parameters, it can provide early warnings up to 12 hours in advance.

Submitted 1 July, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

Comments: Resubmitted to ApJ. 22 pages, 16 figures, for more information about the combined pre-supernova alert system, see https://www.lowbg.org/presnalarm/

arXiv:2404.09408 [pdf, other]

A Distributed Scalable Cross-chain State Channel Scheme Based on Recursive State Synchronization

Authors: Xinyu Liang, Ruiying Du, Jing Chen, Yu Zhang, Meng Jia, Shuangxi Cao, Yufeng Wei, Shixiong Yao

Abstract: As cross-chain technology continues to advance, the scale of cross-chain transactions is experiencing significant expansion. To improve scalability, researchers have turned to the study of cross-chain state channels. However, most of the existing schemes rely on trusted parties to support channel operations. To address this issue, we present Interpipe: a distributed cross-chain state channel scheme. Specifically, we propose a real-time cross-chain synchronization scheme to ensure consistent operations between two blockchains to a cross-chain state channel. Moreover, we propose a batch transaction proof scheme based on recursive SNARK to meet the cross-chain verification needs of large-scale users. Based on the above designs, Interpipe offers protocols for opening, updating, closing, and disputing operations to cross-chain state channels. Security analysis shows that Interpipe has consistency and resistance, and experimental results demonstrate that a cross-chain state channel can be nearly as efficient as an existing intra-chain state channel.

Submitted 14 April, 2024; originally announced April 2024.

arXiv:2404.08725 [pdf, other]

Development of a data overflow protection system for Super-Kamiokande to maximize data from nearby supernovae

Authors: M. Mori, K. Abe, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Okamoto, K. Sato, H. Sekiya, H. Shiba, K. Shimizu, et al. (230 additional authors not shown)

Abstract: Neutrinos from very nearby supernovae, such as Betelgeuse, are expected to generate more than ten million events over 10\,s in Super-Kamiokande (SK). At such large event rates, the buffers of the SK analog-to-digital conversion board (QBEE) will overflow, causing random loss of data that is critical for understanding the dynamics of the supernova explosion mechanism. In order to solve this problem, two new DAQ modules were developed to aid in the observation of very nearby supernovae. The first of these, the SN module, is designed to save only the number of hit PMTs during a supernova burst and the second, the Veto module, prescales the high rate neutrino events to prevent the QBEE from overflowing based on information from the SN module. In the event of a very nearby supernova, these modules allow SK to reconstruct the time evolution of the neutrino event rate from beginning to end using both QBEE and SN module data. This paper presents the development and testing of these modules together with an analysis of supernova-like data generated with a flashing laser diode. We demonstrate that the Veto module successfully prevents DAQ overflows for Betelgeuse-like supernovae as well as the long-term stability of the new modules. During normal running the Veto module is found to issue DAQ vetos a few times per month resulting in a total dead time less than 1\,ms, and does not influence ordinary operations. Additionally, using simulation data we find that supernovae closer than 800~pc will trigger the Veto module, resulting in a prescaling of the observed neutrino data.

Submitted 12 April, 2024; originally announced April 2024.

Comments: 28 pages, 18 figures. Submitted to PTEP

arXiv:2404.08697 [pdf, other]

Testing the standardizability of, and deriving cosmological constraints from, a new Amati-correlated gamma-ray burst data compilation

Authors: Shulei Cao, Bharat Ratra

Abstract: By using gamma-ray burst (GRB) data to simultaneously constrain Amati correlation parameters and cosmological parameters in six spatially-flat and nonflat dark energy cosmological models, we show that an updated 220 GRB version of the Jia et al. [Mon. Not. R. Astron. Soc. 516, 2575 (2022)] GRB data compilation is standardizable through the Amati correlation and so can be used for cosmological analyses. However, the resulting GRB data constraints on the current value of the nonrelativistic matter density parameter, $Ω_{m0}$, are in $>2σ$ tension with those from a joint analysis of better-established Hubble parameter [$H(z)$] and baryon acoustic oscillation (BAO) data for most of the cosmological models we consider, indicating that these GRB data cannot be jointly used with better-established $H(z)$ + BAO data to constrain cosmological parameters.

Submitted 10 April, 2024; originally announced April 2024.

Comments: 11 pages, 2 figures, submitted to Physical Review D

arXiv:2404.07671 [pdf]

Deep learning-driven pulmonary arteries and veins segmentation reveals demography-associated pulmonary vasculature anatomy

Authors: Yuetan Chu, Gongning Luo, Longxi Zhou, Shaodong Cao, Guolin Ma, Xianglin Meng, Juexiao Zhou, Changchun Yang, Dexuan Xie, Ricardo Henao, Xigang Xiao, Lianming Wu, Zhaowen Qiu, Xin Gao

Abstract: Pulmonary artery-vein segmentation is crucial for diagnosing pulmonary diseases and surgical planning, and is traditionally achieved by Computed Tomography Pulmonary Angiography (CTPA). However, concerns regarding adverse health effects from contrast agents used in CTPA have constrained its clinical utility. In contrast, identifying arteries and veins using non-contrast CT, a conventional and low-cost clinical examination routine, has long been considered impossible. Here we propose a High-abundant Pulmonary Artery-vein Segmentation (HiPaS) framework achieving accurate artery-vein segmentation on both non-contrast CT and CTPA across various spatial resolutions. HiPaS first performs spatial normalization on raw CT scans via a super-resolution module, and then iteratively achieves segmentation results at different branch levels by utilizing the low-level vessel segmentation as a prior for high-level vessel segmentation. We trained and validated HiPaS on our established multi-centric dataset comprising 1,073 CT volumes with meticulous manual annotation. Both quantitative experiments and clinical evaluation demonstrated the superior performance of HiPaS, achieving a dice score of 91.8% and a sensitivity of 98.0%. Further experiments demonstrated the non-inferiority of HiPaS segmentation on non-contrast CT compared to segmentation on CTPA. Employing HiPaS, we have conducted an anatomical study of pulmonary vasculature on 10,613 participants in China (five sites), discovering a new association between pulmonary vessel abundance and sex and age: vessel abundance is significantly higher in females than in males, and slightly decreases with age, after controlling for lung volume (p < 0.0001). By realizing accurate artery-vein segmentation, HiPaS opens a promising avenue for clinical diagnosis and understanding pulmonary physiology in a non-invasive manner.

Submitted 11 April, 2024; originally announced April 2024.

arXiv:2404.07419 [pdf, other]

doi 10.3847/2041-8213/ad3553

Model-independent way to determine the Hubble constant and the curvature from phase shift of gravitational waves with DECIGO

Authors: Tonghua Liu, Shuo Cao, Marek Biesiada, Yilong Zhang, Jieci Wang

Abstract: In this Letter, we propose a model-independent method to determine the Hubble constant and curvature simultaneously taking advantage of the possibilities of future space-borne gravitational wave (GW) detector DECIGO in combination with the radio quasars as standard rulers. Similarly to the redshift drift in the electromagnetic domain, accelerating expansion of the Universe causes a characteristic phase correction to the gravitational waveform detectable by DECIGO. Hence, one would be able to extract the Hubble parameter $H(z)$. This could be used to recover distance-redshift relation supported by the data not relying on any specific cosmological model. Assuming the FLRW metric, and using intermediate luminosity radio quasars as standard rulers one achieves an interesting opportunity to directly assess $H_0$ and $Ω_k$ parameters. To test this method we simulated a set of acceleration parameters achievable by future DECIGO. Based on the existing sample of 120 intermediate-luminosity radio-quasars calibrated as standard rulers, we simulated much bigger samples of such standard rulers possible to obtain with VLBI. In the case of $N=100$ radio quasars, which is the size of currently available sample, the precision of cosmological parameters determined would be $σ_{H_0}=2.74$ ${\mathrm{~km~s^{-1}~Mpc^{-1}}}$ and $σ_{Ω_k}=0.175$. In the optimistic scenario $(N = 1000)$ achievable by VLBI, the precision of $H_{0}$ would be improved to $1\%$, which is comparable to the result of $σ_{H_0} =0.54$ ${\mathrm{~km~s^{-1}~Mpc^{-1}}}$ from \emph{Planck} 2018 TT, TE, EE+lowE+lensing data, and the precision of $Ω_k$ would be 0.050. Our results demonstrate that such combined analysis, possible in the future, could be helpful to solve the current cosmological issues concerning the Hubble tension and cosmic curvature tension.

Submitted 10 April, 2024; originally announced April 2024.

Comments: 9 pages, 6 figures

Journal ref: ApJL, 965, L11(2024)

arXiv:2404.05825 [pdf, other]

LLM-Augmented Retrieval: Enhancing Retrieval Models Through Language Models and Doc-Level Embedding

Authors: Mingrui Wu, Sheng Cao

Abstract: Recently embedding-based retrieval or dense retrieval have shown state of the art results, compared with traditional sparse or bag-of-words based approaches. This paper introduces a model-agnostic doc-level embedding framework through large language model (LLM) augmentation. In addition, it also improves some important components in the retrieval model training process, such as negative sampling, loss function, etc. By implementing this LLM-augmented retrieval framework, we have been able to significantly improve the effectiveness of widely-used retriever models such as Bi-encoders (Contriever, DRAGON) and late-interaction models (ColBERTv2), thereby achieving state-of-the-art results on LoTTE datasets and BEIR datasets.

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.03577 [pdf, other]

Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models

Authors: Yantao Liu, Zijun Yao, Xin Lv, Yuchen Fan, Shulin Cao, Jifan Yu, Lei Hou, Juanzi Li

Abstract: Providing knowledge documents for large language models (LLMs) has emerged as a promising solution to update the static knowledge inherent in their parameters. However, knowledge in the document may conflict with the memory of LLMs due to outdated or incorrect knowledge in the LLMs' parameters. This leads to the necessity of examining the capability of LLMs to assimilate supplemental external knowledge that conflicts with their memory. While previous studies have explained to what extent LLMs extract conflicting knowledge from the provided text, they neglect the necessity to reason with conflicting knowledge. Furthermore, there lack a detailed analysis on strategies to enable LLMs to resolve conflicting knowledge via prompting, decoding strategy, and supervised fine-tuning. To address these limitations, we construct a new dataset, dubbed KNOT, for knowledge conflict resolution examination in the form of question answering. KNOT facilitates in-depth analysis by dividing reasoning with conflicting knowledge into three levels: (1) Direct Extraction, which directly extracts conflicting knowledge to answer questions. (2) Explicit Reasoning, which reasons with conflicting knowledge when the reasoning path is explicitly provided in the question. (3) Implicit Reasoning, where reasoning with conflicting knowledge requires LLMs to infer the reasoning path independently to answer questions. We also conduct extensive experiments on KNOT to establish empirical guidelines for LLMs to utilize conflicting knowledge in complex circumstances. Dataset and associated codes can be accessed at https://github.com/THU-KEG/KNOT .

Submitted 4 April, 2024; originally announced April 2024.

Comments: Accepted by LREC-COLING 2024 as long paper

arXiv:2404.02525 [pdf, other]

Large Language Model for Vulnerability Detection and Repair: Literature Review and the Road Ahead

Authors: Xin Zhou, Sicong Cao, Xiaobing Sun, David Lo

Abstract: The significant advancements in Large Language Models (LLMs) have resulted in their widespread adoption across various tasks within Software Engineering (SE), including vulnerability detection and repair. Numerous recent studies have investigated the application of LLMs to enhance vulnerability detection and repair tasks. Despite the increasing research interest, there is currently no existing survey that focuses on the utilization of LLMs for vulnerability detection and repair. In this paper, we aim to bridge this gap by offering a systematic literature review of approaches aimed at improving vulnerability detection and repair through the utilization of LLMs. The review encompasses research work from leading SE, AI, and Security conferences and journals, covering 36 papers published at 21 distinct venues. By answering three key research questions, we aim to (1) summarize the LLMs employed in the relevant literature, (2) categorize various LLM adaptation techniques in vulnerability detection, and (3) classify various LLM adaptation techniques in vulnerability repair. Based on our findings, we have identified a series of challenges that still need to be tackled considering existing studies. Additionally, we have outlined a roadmap highlighting potential opportunities that we believe are pertinent and crucial for future research endeavors.

Submitted 6 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

Comments: 11 pages

Showing 1–50 of 789 results for author: Cao, S