Skip to main content

Showing 201–250 of 11,926 results for author: Li, X

  1. arXiv:2406.06716  [pdf, other

    astro-ph.GA

    Metal-Poor Stars in the MW Disk: Resonant Cooling of Vertical Oscillations of Halo Stars in Barred Galaxies

    Authors: Xingchen Li, Isaac Shlosman, Daniel Pfenniger, Clayton Heller

    Abstract: Using numerical simulations of a barred disk galaxy embedded in nonspinning and spinning dark matter (DM) halos, we present a novel mechanism of `cooling' the vertical oscillations of halo DM particles, which acquire the disk kinematics. The underlying mechanism consists of resonant interactions between halo particles and the stellar bar. The cooling mechanism acts both on dynamical and secular ti… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 12 pages, 9 figures, 1 table. A shortened version has been submitted on ApJ Letters

  2. arXiv:2406.06277  [pdf, other

    hep-ex

    Measurement of the branching fractions of $\bar{B}\to D^{(*)} K^- K^{(*)0}_{(S)}$ and $\bar{B}\to D^{(*)}D_s^{-}$ decays at Belle II

    Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer , et al. (382 additional authors not shown)

    Abstract: We present measurements of the branching fractions of eight $\overline B{}^0\to D^{(*)+} K^- K^{(*)0}_{(S)}$, $B^{-}\to D^{(*)0} K^- K^{(*)0}_{(S)}$ decay channels. The results are based on data from SuperKEKB electron-positron collisions at the $Υ(4S)$ resonance collected with the Belle II detector, corresponding to an integrated luminosity of $362~\text{fb}^{-1}$. The event yields are extracted… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Prepared for submission to JHEP. 34 pages, 14 figures

    Report number: Belle II Preprint: 2024-014, KEK Preprint: 2024-8

  3. arXiv:2406.06272  [pdf, ps, other

    math.NA

    Global-in-time energy stability analysis for the exponential time differencing Runge-Kutta scheme for the phase field crystal equation

    Authors: Xiao Li, Zhonghua Qiao, Cheng Wang, Nan Zheng

    Abstract: The global-in-time energy estimate is derived for the second-order accurate exponential time differencing Runge-Kutta (ETDRK2) numerical scheme to the phase field crystal (PFC) equation, a sixth-order parabolic equation modeling crystal evolution. To recover the value of stabilization constant, some local-in-time convergence analysis has been reported, and the energy stability becomes available ov… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2406.06258  [pdf, other

    cs.CV

    Tuning-Free Visual Customization via View Iterative Self-Attention Control

    Authors: Xiaojie Li, Chenghao Gu, Shuzhao Xie, Yunpeng Bai, Weixiang Zhang, Zhi Wang

    Abstract: Fine-Tuning Diffusion Models enable a wide range of personalized generation and editing applications on diverse visual modalities. While Low-Rank Adaptation (LoRA) accelerates the fine-tuning process, it still requires multiple reference images and time-consuming training, which constrains its scalability for large-scale and real-time applications. In this paper, we propose \textit{View Iterative… ▽ More

    Submitted 10 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Under review

  5. arXiv:2406.06137  [pdf, other

    math.ST

    Matrix norm shrinkage estimators and priors

    Authors: Xiao Li, Takeru Matsuda, Fumiyasu Komaki

    Abstract: We develop a class of minimax estimators for a normal mean matrix under the Frobenius loss, which generalizes the James--Stein and Efron--Morris estimators. It shrinks the Schatten norm towards zero and works well for low-rank matrices. We also propose a class of superharmonic priors based on the Schatten norm, which generalizes Stein's prior and the singular value shrinkage prior. The generalized… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  6. arXiv:2406.06119  [pdf, other

    cs.LG

    A Survey on Incomplete Multi-label Learning: Recent Advances and Future Trends

    Authors: Xiang Li, Jiexi Liu, Xinrui Wang, Songcan Chen

    Abstract: In reality, data often exhibit associations with multiple labels, making multi-label learning (MLL) become a prominent research topic. The last two decades have witnessed the success of MLL, which is indispensable from complete and accurate supervised information. However, obtaining such information in practice is always laborious and sometimes even impossible. To circumvent this dilemma, incomple… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 10 pages, 3 figures

  7. arXiv:2406.06118  [pdf, other

    hep-ex

    Strong and weak $CP$ tests in sequential decays of polarized $Σ^0$ hyperons

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: The $J/ψ, ψ(3686) \to Σ^0 \barΣ^{0}$ processes and subsequent decays are studied using the world's largest $J/ψ$ and $ψ(3686)$ data samples collected with the BESIII detector. The strong-$CP$ symmetry is tested in the decays of the $Σ^0$ hyperons for the first time by measuring the decay parameters, $α_{Σ^0} = -0.0017 \pm 0.0021 \pm 0.0018$ and $\barα_{Σ^0} = 0.0021 \pm 0.0020 \pm 0.0022$. The wea… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  8. arXiv:2406.05966  [pdf, other

    eess.SY

    Approximating arrival costs in distributed moving horizon estimation: A recursive method

    Authors: Xiaojie Li, Xunyuan Yin

    Abstract: In this paper, we present a new approach to distributed moving horizon estimation for constrained nonlinear processes. The method involves approximating the arrival costs of local estimators through a recursive framework. First, distributed full-information estimation for linear unconstrained systems is presented, which serves as the foundation for deriving the analytical expression of the arrival… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  9. arXiv:2406.05948  [pdf, other

    cs.CR cs.AI

    Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models

    Authors: Xi Li, Yusen Zhang, Renze Lou, Chen Wu, Jiaqi Wang

    Abstract: Backdoor attacks present significant threats to Large Language Models (LLMs), particularly with the rise of third-party services that offer API integration and prompt engineering. Untrustworthy third parties can plant backdoors into LLMs and pose risks to users by embedding malicious instructions into user queries. The backdoor-compromised LLM will generate malicious output when and input is embed… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  10. arXiv:2406.05855  [pdf, other

    cs.LG cs.AI stat.ML

    Self-Distilled Disentangled Learning for Counterfactual Prediction

    Authors: Xinshu Li, Mingming Gong, Lina Yao

    Abstract: The advancements in disentangled representation learning significantly enhance the accuracy of counterfactual predictions by granting precise control over instrumental variables, confounders, and adjustable variables. An appealing method for achieving the independent separation of these factors is mutual information minimization, a task that presents challenges in numerous machine learning scenari… ▽ More

    Submitted 14 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  11. arXiv:2406.05827  [pdf, ps, other

    hep-ex

    Measurement of the integrated luminosity of the data collected at 3.773 GeV by BESIII from 2021 to 2024

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: We present a measurement of the integrated luminosity of $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at a center-of-mass energy of $E_{\rm cm} = 3.773$~GeV. The integrated luminosities of the data sets taken from December 2021 to June 2022, from November 2022 to June 2023, and from October 2023 to February 2024 are determined to be $4.995 \pm 0.019$~fb$^{-1}$,… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  12. arXiv:2406.05804  [pdf, other

    cs.AI cs.CL cs.SE

    A Survey on LLM-Based Agents: Common Workflows and Reusable LLM-Profiled Components

    Authors: Xinzhe Li

    Abstract: Recent advancements in Large Language Models (LLMs) have catalyzed the development of sophisticated frameworks for developing LLM-based agents. However, the complexity of these frameworks r poses a hurdle for nuanced differentiation at a granular level, a critical aspect for enabling efficient implementations across different frameworks and fostering future research. Hence, the primary purpose of… ▽ More

    Submitted 15 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  13. Efficient Beamforming Feedback Information-Based Wi-Fi Sensing by Feature Selection

    Authors: Xin Li, Jingzhi Hu, Jun Luo

    Abstract: Wi-Fi sensing leveraging plain-text beamforming feedback information (BFI) in multiple-input-multiple-output (MIMO) systems attracts increasing attention. However, due to the implicit relationship between BFI and the channel state information (CSI), quantifying the sensing capability of BFI poses a challenge in building efficient BFI-based sensing algorithms. In this letter, we first derive a math… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  14. arXiv:2406.05637  [pdf, ps, other

    math.OC cs.LG math.PR stat.ML

    A Generalized Version of Chung's Lemma and its Applications

    Authors: Li Jiang, Xiao Li, Andre Milzarek, Junwen Qiu

    Abstract: Chung's lemma is a classical tool for establishing asymptotic convergence rates of (stochastic) optimization methods under strong convexity-type assumptions and appropriate polynomial diminishing step sizes. In this work, we develop a generalized version of Chung's lemma, which provides a simple non-asymptotic convergence framework for a more general family of step size rules. We demonstrate broad… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 43 pages, 5 figures

    MSC Class: 90C15; 90C30; 90C26

  15. arXiv:2406.05631  [pdf, other

    cs.LG cs.AI cs.CV

    CCSI: Continual Class-Specific Impression for Data-free Class Incremental Learning

    Authors: Sana Ayromlou, Teresa Tsang, Purang Abolmaesumi, Xiaoxiao Li

    Abstract: In real-world clinical settings, traditional deep learning-based classification methods struggle with diagnosing newly introduced disease types because they require samples from all disease classes for offline training. Class incremental learning offers a promising solution by adapting a deep network trained on specific disease classes to handle new diseases. However, catastrophic forgetting occur… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  16. arXiv:2406.05565  [pdf, other

    cs.CV

    Medical Vision Generalist: Unifying Medical Imaging Tasks in Context

    Authors: Sucheng Ren, Xiaoke Huang, Xianhang Li, Junfei Xiao, Jieru Mei, Zeyu Wang, Alan Yuille, Yuyin Zhou

    Abstract: This study presents Medical Vision Generalist (MVG), the first foundation model capable of handling various medical imaging tasks -- such as cross-modal synthesis, image segmentation, denoising, and inpainting -- within a unified image-to-image generation framework. Specifically, MVG employs an in-context generation strategy that standardizes the handling of inputs and outputs as images. By treati… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  17. arXiv:2406.05355  [pdf, other

    physics.flu-dyn

    Revisit to the WGVC schemes: a nonlinear order-preserving and spectral-property-optimized methodology and its enhancement

    Authors: Kang He, Hongwei Liu, Tongbiao Guo, Xinliang Li, Zhiwei He

    Abstract: The numerical simulation of supersonic complex flow problems demands capabilities in identifying multiscale structures and capturing shocks, imposing stringent requirements on the numerical scheme. The capability to identify multiscale structures is closely related to the spectral properties of the numerical scheme. Currently, existing methods to improve the spectral properties of finite differenc… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  18. arXiv:2406.05340  [pdf, other

    stat.ME stat.ML

    Selecting the Number of Communities for Weighted Degree-Corrected Stochastic Block Models

    Authors: Yucheng Liu, Xiaodong Li

    Abstract: We investigate how to select the number of communities for weighted networks without a full likelihood modeling. First, we propose a novel weighted degree-corrected stochastic block model (DCSBM), in which the mean adjacency matrix is modeled as the same as in standard DCSBM, while the variance profile matrix is assumed to be related to the mean adjacency matrix through a given variance function.… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 3 figures, 2 tables

  19. arXiv:2406.05271  [pdf, other

    cs.CV

    USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

    Authors: Xiaoqi Wang, Wenbin He, Xiwei Xuan, Clint Sebastian, Jorge Piazentin Ono, Xin Li, Sima Behpour, Thang Doan, Liang Gou, Han Wei Shen, Liu Ren

    Abstract: The open-vocabulary image segmentation task involves partitioning images into semantically meaningful segments and classifying them with flexible text-defined categories. The recent vision-based foundation models such as the Segment Anything Model (SAM) have shown superior performance in generating class-agnostic image segments. The main challenge in open-vocabulary image segmentation now lies in… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  20. arXiv:2406.05223  [pdf, other

    cs.LG cs.AI

    CorDA: Context-Oriented Decomposition Adaptation of Large Language Models

    Authors: Yibo Yang, Xiaojie Li, Zhongzhu Zhou, Shuaiwen Leon Song, Jianlong Wu, Liqiang Nie, Bernard Ghanem

    Abstract: Current parameter-efficient fine-tuning (PEFT) methods build adapters without considering the context of downstream task to learn, or the context of important knowledge to maintain. As a result, there is often a performance gap compared to full-parameter finetuning, and meanwhile the finetuned model suffers from catastrophic forgetting of the pre-trained world knowledge. In this paper, we propose… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  21. arXiv:2406.05222  [pdf, other

    cs.LG cs.NE

    Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation

    Authors: Yibo Yang, Xiaojie Li, Motasem Alfarra, Hasan Hammoud, Adel Bibi, Philip Torr, Bernard Ghanem

    Abstract: Relieving the reliance of neural network training on a global back-propagation (BP) has emerged as a notable research topic due to the biological implausibility and huge memory consumption caused by BP. Among the existing solutions, local learning optimizes gradient-isolated modules of a neural network with local errors and has been proved to be effective even on large-scale datasets. However, the… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  22. arXiv:2406.05127  [pdf, other

    cs.CV

    Towards Semantic Equivalence of Tokenization in Multimodal LLM

    Authors: Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan

    Abstract: Multimodal Large Language Models (MLLMs) have demonstrated exceptional capabilities in processing vision-language tasks. One of the crux of MLLMs lies in vision tokenization, which involves efficiently transforming input visual signals into feature representations that are most beneficial for LLMs. However, existing vision tokenizers, essential for semantic alignment between vision and language, r… ▽ More

    Submitted 27 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Technical Report. The project page: https://chocowu.github.io/SeTok-web/

  23. arXiv:2406.05082  [pdf, other

    cs.CV

    CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion

    Authors: Xingrui Wang, Xin Li, Zhibo Chen

    Abstract: Tuning-free long video diffusion has been proposed to generate extended-duration videos with enriched content by reusing the knowledge from pre-trained short video diffusion model without retraining. However, most works overlook the fine-grained long-term video consistency modeling, resulting in limited scene consistency (i.e., unreasonable object or background transitions), especially with multip… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 21 pages

  24. arXiv:2406.04983  [pdf, other

    cs.CV

    CityCraft: A Real Crafter for 3D City Generation

    Authors: Jie Deng, Wenhao Chai, Junsheng Huang, Zhonghan Zhao, Qixuan Huang, Mingyan Gao, Jianshu Guo, Shengyu Hao, Wenhao Hu, Jenq-Neng Hwang, Xi Li, Gaoang Wang

    Abstract: City scene generation has gained significant attention in autonomous driving, smart city development, and traffic simulation. It helps enhance infrastructure planning and monitoring solutions. Existing methods have employed a two-stage process involving city layout generation, typically using Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), or Transformers, followed by neur… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 20 pages, 9 figures

  25. arXiv:2406.04880  [pdf, ps, other

    math.AP

    The free boundary problem of an epidemic model with nonlocal diffusions and nonlocal reactions: spreading-vanishing dichotomy

    Authors: Xueping Li, Lei Li, Mingxin Wang

    Abstract: This paper concerns the free boundary problem of an epidemic model. The spatial movements of the infectious agents and the infective humans are approximated by nonlocal diffusion operators. Especially, both the growth rate of the agents and the infective rate of humans are represented by nonlocal reaction terms. Thus our model has four integral terms which bring some diffculties for the study of t… ▽ More

    Submitted 10 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  26. arXiv:2406.04815  [pdf, other

    cs.LG cs.AI cs.RO

    Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning

    Authors: Xuehui Yu, Mhairi Dunion, Xin Li, Stefano V. Albrecht

    Abstract: Meta-Reinforcement Learning (Meta-RL) agents can struggle to operate across tasks with varying environmental features that require different optimal skills (i.e., different modes of behaviours). Using context encoders based on contrastive learning to enhance the generalisability of Meta-RL agents is now widely studied but faces challenges such as the requirement for a large sample size, also refer… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  27. arXiv:2406.04699  [pdf, other

    cs.LO cs.AI

    Logic Synthesis with Generative Deep Neural Networks

    Authors: Xihan Li, Xing Li, Lei Chen, Xing Zhang, Mingxuan Yuan, Jun Wang

    Abstract: While deep learning has achieved significant success in various domains, its application to logic circuit design has been limited due to complex constraints and strict feasibility requirement. However, a recent generative deep neural model, "Circuit Transformer", has shown promise in this area by enabling equivalence-preserving circuit transformation on a small scale. In this paper, we introduce a… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: In IWLS 2024

  28. arXiv:2406.04650  [pdf, ps, other

    hep-ph

    The Potential Energy of Heavy Quarkonium in Flavor-Dependent Systems from a Holographic Model

    Authors: Xi Guo, Xun Chen, Dong Xiang, Miguel Angel Martin Contreras, Xiao-Hua Li

    Abstract: Within the framework of the Einstein-Maxwell-Dilaton (EMD) model, which incorporates information on the equation of state and baryon number susceptibility from lattice results, we have conducted a comprehensive analysis of the potential energy, running coupling, and dissociation time for heavy quark-antiquark pairs using gauge/gravity duality. This study encompasses various systems, including pure… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  29. arXiv:2406.04642  [pdf, ps, other

    hep-ex hep-ph

    Measurements of the branching fractions of $Ξ_{c}^{0}\toΞ^{0}π^{0}$, $Ξ_{c}^{0}\toΞ^{0}η$, and $Ξ_{c}^{0}\toΞ^{0}η^{\prime}$ and asymmetry parameter of $Ξ_{c}^{0}\toΞ^{0}π^{0}$

    Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, M. Barrett, J. Baudot, A. Baur, A. Beaubien , et al. (360 additional authors not shown)

    Abstract: We present a study of $Ξ_{c}^{0}\toΞ^{0}π^{0}$, $Ξ_{c}^{0}\toΞ^{0}η$, and $Ξ_{c}^{0}\toΞ^{0}η^{\prime}$ decays using the Belle and Belle~II data samples, which have integrated luminosities of 980~$\mathrm{fb}^{-1}$ and 426~$\mathrm{fb}^{-1}$, respectively. We measure the following relative branching fractions… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 23 pages, 5 figures

    Report number: Belle II Preprint 2024-015; KEK Preprint 2024-9

  30. arXiv:2406.04603  [pdf, ps, other

    cs.CV

    Simplify Implant Depth Prediction as Video Grounding: A Texture Perceive Implant Depth Prediction Network

    Authors: Xinquan Yang, Xuguang Li, Xiaoling Luo, Leilei Zeng, Yudi Zhang, Linlin Shen, Yongqiang Deng

    Abstract: Surgical guide plate is an important tool for the dental implant surgery. However, the design process heavily relies on the dentist to manually simulate the implant angle and depth. When deep neural networks have been applied to assist the dentist quickly locates the implant position, most of them are not able to determine the implant depth. Inspired by the video grounding task which localizes the… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Journal ref: MICCAI'2024

  31. arXiv:2406.04600  [pdf, other

    cs.CV

    1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation

    Authors: Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang

    Abstract: Tracking and segmenting multiple objects in complex scenes has always been a challenge in the field of video object segmentation, especially in scenarios where objects are occluded and split into parts. In such cases, the definition of objects becomes very ambiguous. The motivation behind the MOSE dataset is how to clearly recognize and distinguish objects in complex scenes. In this challenge, we… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  32. arXiv:2406.04594  [pdf, other

    cs.DC cs.AI cs.LG

    Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

    Authors: Jianbo Dong, Bin Luo, Jun Zhang, Pengcheng Zhang, Fei Feng, Yikai Zhu, Ang Liu, Zian Chen, Yi Shi, Hairong Jiao, Gang Lu, Yu Guan, Ennan Zhai, Wencong Xiao, Hanyu Zhao, Man Yuan, Siran Yang, Xiang Li, Jiamang Wang, Rui Men, Jianwei Zhang, Huang Zhong, Dennis Cai, Yuan Xie, Binzhang Fu

    Abstract: The emergence of Large Language Models (LLMs) has necessitated the adoption of parallel training techniques, involving the deployment of thousands of GPUs to train a single model. Unfortunately, we have found that the efficiency of current parallel training is often suboptimal, largely due to the following two main issues. Firstly, hardware failures are inevitable, leading to interruptions in the… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  33. arXiv:2406.04145  [pdf, other

    cs.CL cs.AI

    Every Answer Matters: Evaluating Commonsense with Probabilistic Measures

    Authors: Qi Cheng, Michael Boratko, Pranay Kumar Yelugam, Tim O'Gorman, Nalini Singh, Andrew McCallum, Xiang Lorraine Li

    Abstract: Large language models have demonstrated impressive performance on commonsense tasks; however, these tasks are often posed as multiple-choice questions, allowing models to exploit systematic biases. Commonsense is also inherently probabilistic with multiple correct answers. The purpose of "boiling water" could be making tea and cooking, but it also could be killing germs. Existing tasks do not capt… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Camera Ready

  34. arXiv:2406.04100  [pdf, other

    cs.CV cs.RO

    Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging

    Authors: Zhongliang Jiang, Yunfeng Kang, Yuan Bi, Xuesong Li, Chenyang Li, Nassir Navab

    Abstract: Ultrasound imaging has been widely used in clinical examinations owing to the advantages of being portable, real-time, and radiation-free. Considering the potential of extensive deployment of autonomous examination systems in hospitals, robotic US imaging has attracted increased attention. However, due to the inter-patient variations, it is still challenging to have an optimal path for each patien… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  35. arXiv:2406.03902  [pdf, other

    eess.IV cs.CV

    C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction

    Authors: Yiqun Lin, Jiewen Yang, Hualiang Wang, Xinpeng Ding, Wei Zhao, Xiaomeng Li

    Abstract: Cone beam computed tomography (CBCT) is an important imaging technology widely used in medical scenarios, such as diagnosis and preoperative planning. Using fewer projection views to reconstruct CT, also known as sparse-view reconstruction, can reduce ionizing radiation and further benefit interventional radiology. Compared with sparse-view reconstruction for traditional parallel/fan-beam CT, CBCT… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPR 2024

  36. arXiv:2406.03867  [pdf, other

    quant-ph cs.ET

    A Comprehensive Study of Quantum Arithmetic Circuits

    Authors: Siyi Wang, Xiufan Li, Wei Jie Bryan Lee, Suman Deb, Eugene Lim, Anupam Chattopadhyay

    Abstract: In recent decades, the field of quantum computing has experienced remarkable progress. This progress is marked by the superior performance of many quantum algorithms compared to their classical counterparts, with Shor's algorithm serving as a prominent illustration. Quantum arithmetic circuits, which are the fundamental building blocks in numerous quantum algorithms, have attracted much attention.… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Under review at the Royal Society's Philosophical Transactions A

  37. arXiv:2406.03809  [pdf, other

    astro-ph.HE astro-ph.CO

    Time delay of fast radio burst population with respect to the star formation history

    Authors: Hai-Nan Lin, Xin-Yi Li, Rui Zou

    Abstract: In spite of significant progress in the research of fast radio bursts (FRBs) in recent decade, their origin is still under extensive debate. Investigation on the population of FRBs can provide new insight into this interesting problem. In this paper, based on the first CHIME/FRB catalog, we construct a Bayesian framework to analyze the FRB population, with the selection effect of the CHIME telesco… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 12 pages, 2 figures, 2 tables. Accepted by Astrophysical Journal

  38. arXiv:2406.03763  [pdf, other

    cs.SI physics.soc-ph

    The impact of nodes of information dissemination on epidemic spreading in dynamic multiplex networks

    Authors: Minyu Feng, Xiangxi Li, Yuhan Li, Qin Li

    Abstract: Epidemic spreading processes on dynamic multiplex networks provide a more accurate description of natural spreading processes than those on single layered networks. To describe the influence of different individuals in the awareness layer on epidemic spreading, we propose a two-layer network-based epidemic spreading model, including some individuals who neglect the epidemic, and we explore how ind… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 11 pages, 10 figures

  39. arXiv:2406.03420  [pdf, other

    math.DS

    Dynamic properties of a class of van der Pol-Duffing oscillators

    Authors: Yelei Kuang, Xuemei Li

    Abstract: In this paper, we study the existence of bifurcation of a van der Pol-Duffing oscillator with quintic terms and its quasi-periodic solutions by means of qualitative and bifurcation theories. Firstly, we analyze the autonomous system and find that it has two kinds of local bifurcations and a global bifurcation: pitchfork bifurcation, Hopf bifurcation, homoclinic bifurcation. It is worth noting that… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  40. arXiv:2406.03409  [pdf, other

    cs.LG cs.AI

    Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model

    Authors: Jinyin Chen, Xiaoming Zhao, Haibin Zheng, Xiao Li, Sheng Xiang, Haifeng Guo

    Abstract: Benefiting from well-trained deep neural networks (DNNs), model compression have captured special attention for computing resource limited equipment, especially edge devices. Knowledge distillation (KD) is one of the widely used compression techniques for edge deployment, by obtaining a lightweight student model from a well-trained teacher model released on public platforms. However, it has been e… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  41. arXiv:2406.03394  [pdf, other

    cs.CV

    Gaussian Representation for Deformable Image Registration

    Authors: Jihe Li, Fabian Zhang, Xia Li, Tianhao Zhang, Ye Zhang, Joachim Buhmann

    Abstract: Deformable image registration (DIR) is a fundamental task in radiotherapy, with existing methods often struggling to balance computational efficiency, registration accuracy, and speed effectively. We introduce a novel DIR approach employing parametric 3D Gaussian control points achieving a better tradeoff. It provides an explicit and flexible representation for spatial deformation fields between 3… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  42. arXiv:2406.03228  [pdf, other

    eess.AS

    Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement

    Authors: Wang Dai, Xiaofei Li, Archontis Politis, Tuomas Virtanen

    Abstract: In end-to-end multi-channel speech enhancement, the traditional approach of designating one microphone signal as the reference for processing may not always yield optimal results. The limitation is particularly in scenarios with large distributed microphone arrays with varying speaker-to-microphone distances or compact, highly directional microphone arrays where speaker or microphone positions cha… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted by EUSIPCO 2024

  43. arXiv:2406.03081  [pdf, other

    quant-ph

    A Quantum Neural Network-Based Approach to Power Quality Disturbances Detection and Recognition

    Authors: Guo-Dong Li, Hai-Yan He, Yue Li, Xin-Hao Li, Hao Liu, Qing-Le Wang, Long Cheng

    Abstract: Power quality disturbances (PQDs) significantly impact the stability and reliability of power systems, necessitating accurate and efficient detection and recognition methods. While numerous classical algorithms for PQDs detection and recognition have been extensively studied and applied, related work in the quantum domain is still in its infancy. In this paper, an improved quantum neural networks… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  44. arXiv:2406.02939  [pdf, ps, other

    math.OC cs.DC cs.LG

    Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes

    Authors: Yan Huang, Xiang Li, Yipeng Shen, Niao He, Jinming Xu

    Abstract: In this paper, we show that applying adaptive methods directly to distributed minimax problems can result in non-convergence due to inconsistency in locally computed adaptive stepsizes. To address this challenge, we propose D-AdaST, a Distributed Adaptive minimax method with Stepsize Tracking. The key strategy is to employ an adaptive stepsize tracking protocol involving the transmission of two ex… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  45. arXiv:2406.02931  [pdf, other

    hep-ex

    Measurements of the branching fractions of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^-π^0/η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Based on $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events, we investigate four hadronic decay modes of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^- π^0/η$ ($h=π$ or $K$) via the process $ψ(3686) \to π^{0}h_c$ at BESIII. The $h_c \to π^+ π^- π^0$ decay is observed with a significance of 9.6$σ$ after taking into account systematic uncertainties. Evidences for… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 9 pages, 7 figures

  46. arXiv:2406.02833  [pdf, other

    cs.CV

    DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images

    Authors: Yimian Dai, Minrui Zou, Yuxuan Li, Xiang Li, Kang Ni, Jian Yang

    Abstract: Synthetic Aperture Radar (SAR) target detection has long been impeded by inherent speckle noise and the prevalence of diminutive, ambiguous targets. While deep neural networks have advanced SAR target detection, their intrinsic low-frequency bias and static post-training weights falter with coherent noise and preserving subtle details across heterogeneous terrains. Motivated by traditional SAR ima… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  47. arXiv:2406.02635  [pdf, other

    cs.LG cs.AI

    Evidentially Calibrated Source-Free Time-Series Domain Adaptation with Temporal Imputation

    Authors: Mohamed Ragab, Peiliang Gong, Emadeldeen Eldele, Wenyu Zhang, Min Wu, Chuan-Sheng Foo, Daoqiang Zhang, Xiaoli Li, Zhenghua Chen

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a model pre-trained on a labeled source domain to an unlabeled target domain without access to source data, preserving the source domain's privacy. While SFDA is prevalent in computer vision, it remains largely unexplored in time series analysis. Existing SFDA methods, designed for visual data, struggle to capture the inherent temporal dynamics of… ▽ More

    Submitted 12 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  48. arXiv:2406.02430  [pdf, other

    eess.AS cs.SD

    Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

    Authors: Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, Yuanyuan Huo, Dongya Jia, Chumin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu , et al. (21 additional authors not shown)

    Abstract: We introduce Seed-TTS, a family of large-scale autoregressive text-to-speech (TTS) models capable of generating speech that is virtually indistinguishable from human speech. Seed-TTS serves as a foundation model for speech generation and excels in speech in-context learning, achieving performance in speaker similarity and naturalness that matches ground truth human speech in both objective and sub… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  49. arXiv:2406.02395  [pdf, other

    cs.LG cs.CV

    GrootVL: Tree Topology is All You Need in State Space Model

    Authors: Yicheng Xiao, Lin Song, Shaoli Huang, Jiangshan Wang, Siyu Song, Yixiao Ge, Xiu Li, Ying Shan

    Abstract: The state space models, employing recursively propagated features, demonstrate strong representation capabilities comparable to Transformer models and superior efficiency. However, constrained by the inherent geometric constraints of sequences, it still falls short in modeling long-range dependencies. To address this issue, we propose the GrootVL network, which first dynamically generates a tree t… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: The code is available at https://github.com/EasonXiao-888/GrootVL

  50. arXiv:2406.02273  [pdf, ps, other

    math.OC cs.LG

    A KL-based Analysis Framework with Applications to Non-Descent Optimization Methods

    Authors: Junwen Qiu, Bohao Ma, Xiao Li, Andre Milzarek

    Abstract: We propose a novel analysis framework for non-descent-type optimization methodologies in nonconvex scenarios based on the Kurdyka-Lojasiewicz property. Our framework allows covering a broad class of algorithms, including those commonly employed in stochastic and distributed optimization. Specifically, it enables the analysis of first-order methods that lack a sufficient descent property and do not… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 29 pages

    MSC Class: 90C06; 90C26; 90C30