Skip to main content

Showing 1–50 of 667 results for author: Tan, T

  1. arXiv:2407.11536  [pdf, other

    cs.CL cs.AI

    Fine-Tuning Medical Language Models for Enhanced Long-Contextual Understanding and Domain Expertise

    Authors: Qimin Yang, Rongsheng Wang, Jiexin Chen, Runqi Su, Tao Tan

    Abstract: Large Language Models (LLMs) have been widely applied in various professional fields. By fine-tuning the models using domain specific question and answer datasets, the professional domain knowledge and Q\&A abilities of these models have significantly improved, for example, medical professional LLMs that use fine-tuning of doctor-patient Q\&A data exhibit extraordinary disease diagnostic abilities… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 5 pages, 1 figure. Accepted by the Workshop on Long-Context Foundation Models (LCFM) at ICML 2024

  2. arXiv:2407.10767  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Magnetic and nematic order of Bose-Fermi mixtures in moiré superlattices of 2D semiconductors

    Authors: Feng-Ren Fan, Tixuan Tan, Chengxin Xiao, Wang Yao

    Abstract: We investigate the magnetic orders in a mixture of Boson (exciton) and Fermion (electron or hole) trapped in transition-metal dichalcogenides moiré superlattices. A sizable antiferromagnetic exchange interaction is found between a carrier and an interlayer exciton trapped at different high symmetry points of the moiré supercell. This interaction at a distance much shorter than the carrier-carrier… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures

  3. arXiv:2407.07666  [pdf

    cs.CL cs.AI

    A Proposed S.C.O.R.E. Evaluation Framework for Large Language Models : Safety, Consensus, Objectivity, Reproducibility and Explainability

    Authors: Ting Fang Tan, Kabilan Elangovan, Jasmine Ong, Nigam Shah, Joseph Sung, Tien Yin Wong, Lan Xue, Nan Liu, Haibo Wang, Chang Fu Kuo, Simon Chesterman, Zee Kin Yeong, Daniel SW Ting

    Abstract: A comprehensive qualitative evaluation framework for large language models (LLM) in healthcare that expands beyond traditional accuracy and quantitative metrics needed. We propose 5 key aspects for evaluation of LLMs: Safety, Consensus, Objectivity, Reproducibility and Explainability (S.C.O.R.E.). We suggest that S.C.O.R.E. may form the basis for an evaluation framework for future LLM-based models… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  4. arXiv:2407.06857  [pdf, other

    eess.SY

    Enhanced Battery Degradation-Aware Scheduling for Distribution Network with Electric Vehicle Load

    Authors: Vijay Babu Pamshetti, Wei Zhang, Andy Man-Fai Ng, Qingyu Yan, Kuan Tak Tan

    Abstract: Batteries play a key role in today's power grid. In this paper, we investigate the impact of battery degradation on the distribution network. We formulate a multi-objective framework for optimizing battery scheduling with the goals of minimizing monetary costs and improving network performance. Our framework incorporates energy purchase and battery degradation into the costs and measures the netwo… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 3 figures

  5. arXiv:2407.04675  [pdf, other

    eess.AS cs.SD

    Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

    Authors: Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li , et al. (30 additional authors not shown)

    Abstract: Modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc) given the specific contextual information in various application scenarios. Classic end-to-end models fused with extra language models perform well, but mainly in data matching scenarios and are gradually approaching a bottleneck. In this wor… ▽ More

    Submitted 10 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  6. arXiv:2407.02911  [pdf, other

    eess.IV cs.CV

    Non-Adversarial Learning: Vector-Quantized Common Latent Space for Multi-Sequence MRI

    Authors: Luyi Han, Tao Tan, Tianyu Zhang, Xin Wang, Yuan Gao, Chunyao Lu, Xinglong Liang, Haoran Dou, Yunzhi Huang, Ritse Mann

    Abstract: Adversarial learning helps generative models translate MRI from source to target sequence when lacking paired samples. However, implementing MRI synthesis with adversarial learning in clinical settings is challenging due to training instability and mode collapse. To address this issue, we leverage intermediate sequences to estimate the common latent space among multi-sequence MRI, enabling the rec… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  7. arXiv:2407.00993  [pdf, other

    cs.AI cs.CL

    Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents

    Authors: Shihan Deng, Weikai Xu, Hongda Sun, Wei Liu, Tao Tan, Jianfeng Liu, Ang Li, Jian Luan, Bin Wang, Rui Yan, Shuo Shang

    Abstract: With the remarkable advancements of large language models (LLMs), LLM-based agents have become a research hotspot in human-computer interaction. However, there is a scarcity of benchmarks available for LLM-based mobile agents. Benchmarking these agents generally faces three main challenges: (1) The inefficiency of UI-only operations imposes limitations to task evaluation. (2) Specific instructions… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  8. Artificial Immune System of Secure Face Recognition Against Adversarial Attacks

    Authors: Min Ren, Yunlong Wang, Yuhao Zhu, Yongzhen Huang, Zhenan Sun, Qi Li, Tieniu Tan

    Abstract: Insect production for food and feed presents a promising supplement to ensure food safety and address the adverse impacts of agriculture on climate and environment in the future. However, optimisation is required for insect production to realise its full potential. This can be by targeted improvement of traits of interest through selective breeding, an approach which has so far been underexplored… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Journal ref: International Journal of Computer Vision (IJCV), 2024

  9. arXiv:2406.15704  [pdf, other

    cs.CV

    video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

    Authors: Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang

    Abstract: Speech understanding as an element of the more generic video understanding using audio-visual large language models (av-LLMs) is a crucial yet understudied aspect. This paper proposes video-SALMONN, a single end-to-end av-LLM for video processing, which can understand not only visual frame sequences, audio events and music, but speech as well. To obtain fine-grained temporal information required b… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. arXiv admin note: substantial text overlap with arXiv:2310.05863

  10. arXiv:2406.12996  [pdf, other

    astro-ph.EP

    TOI-2374 b and TOI-3071 b: two metal-rich sub-Saturns well within the Neptunian desert

    Authors: Alejandro Hacker, Rodrigo F. Díaz, David J. Armstrong, Jorge Fernández Fernández, Simon Müller, Elisa Delgado-Mena, Sérgio G. Sousa, Vardan Adibekyan, Keivan G. Stassun, Karen A. Collins, Samuel W. Yee, Daniel Bayliss, Allyson Bieryla, François Bouchy, R. Paul Butler, Jeffrey D. Crane, Xavier Dumusque, Joel D. Hartman, Ravit Helled, Jon Jenkins, Marcelo Aron F. Keniger, Hannah Lewis, Jorge Lillo-Box, Michael B. Lund, Louise D. Nielsen , et al. (18 additional authors not shown)

    Abstract: We report the discovery of two transiting planets detected by the Transiting Exoplanet Survey Satellite (TESS), TOI-2374 b and TOI-3071 b, orbiting a K5V and an F8V star, respectively, with periods of 4.31 and 1.27 days, respectively. We confirm and characterize these two planets with a variety of ground-based and follow-up observations, including photometry, precise radial velocity monitoring and… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 24 pages, 22 figures, 10 tables, accepted for publication in MNRAS

  11. arXiv:2406.12447  [pdf, other

    eess.AS

    Text-aware Speech Separation for Multi-talker Keyword Spotting

    Authors: Haoyu Li, Baochen Yang, Yu Xi, Linfeng Yu, Tian Tan, Hao Li, Kai Yu

    Abstract: For noisy environments, ensuring the robustness of keyword spotting (KWS) systems is essential. While much research has focused on noisy KWS, less attention has been paid to multi-talker mixed speech scenarios. Unlike the usual cocktail party problem where multi-talker speech is separated using speaker clues, the key challenge here is to extract the target speech for KWS based on text clues. To ad… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH2024

  12. arXiv:2406.11369  [pdf, other

    cs.CG cs.DS

    Approximation Algorithms for Smallest Intersecting Balls

    Authors: Jiaqi Zheng, Tiow-Seng Tan

    Abstract: We study a general smallest intersecting ball problem and its soft-margin variant in high-dimensional Euclidean spaces, which only require the input objects to be compact and convex. These two problems link and unify a series of fundamental problems in computational geometry and machine learning, including smallest enclosing ball, polytope distance, intersection radius, $\ell_1$-loss support vecto… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  13. arXiv:2406.08481  [pdf, other

    cs.CV

    Enhancing End-to-End Autonomous Driving with Latent World Model

    Authors: Yingyan Li, Lue Fan, Jiawei He, Yuqi Wang, Yuntao Chen, Zhaoxiang Zhang, Tieniu Tan

    Abstract: End-to-end autonomous driving has garnered widespread attention. Current end-to-end approaches largely rely on the supervision from perception tasks such as detection, tracking, and map segmentation to aid in learning scene representations. However, these methods require extensive annotations, hindering the data scalability. To address this challenge, we propose a novel self-supervised method to e… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  14. arXiv:2406.07914  [pdf, other

    cs.SD eess.AS

    Can Large Language Models Understand Spatial Audio?

    Authors: Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Jun Zhang, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang

    Abstract: This paper explores enabling large language models (LLMs) to understand spatial information from multichannel audio, a skill currently lacking in auditory LLMs. By leveraging LLMs' advanced cognitive and inferential abilities, the aim is to enhance understanding of 3D environments via audio. We study 3 spatial audio tasks: sound source localization (SSL), far-field speech recognition (FSR), and lo… ▽ More

    Submitted 14 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  15. arXiv:2406.06278  [pdf, other

    astro-ph.EP

    Three super-Earths and a possible water world from TESS and ESPRESSO

    Authors: M. J. Hobson, F. Bouchy, B. Lavie, C. Lovis, V. Adibekyan, C. Allende Prieto, Y. Alibert, S. C. C. Barros, A. Castro-González, S. Cristiani, V. D'Odorico, M. Damasso, P. Di Marcantonio, X. Dumusque, D. Ehrenreich, P. Figueira, R. Génova Santos, J. I. González Hernández, J. Lillo-Box, G. Lo Curto, C. J. A. P. Martins, A. Mehner, G. Micela, P. Molaro, N. J. Nunes , et al. (29 additional authors not shown)

    Abstract: Since 2018, the ESPRESSO spectrograph at the VLT has been hunting for planets in the Southern skies via the RV method. One of its goals is to follow up candidate planets from transit surveys such as the TESS mission, particularly small planets. We analyzed photometry from TESS and ground-based facilities, high-resolution imaging, and RVs from ESPRESSO, HARPS, and HIRES, to confirm and characterize… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 61 pages (of which pp. 24-61 are appendices), 20 figures (main text). Accepted for publication in A&A

  16. arXiv:2406.01154  [pdf, other

    cs.CV

    UniUSNet: A Promptable Framework for Universal Ultrasound Disease Prediction and Tissue Segmentation

    Authors: Zehui Lin, Zhuoneng Zhang, Xindi Hu, Zhifan Gao, Xin Yang, Yue Sun, Dong Ni, Tao Tan

    Abstract: Ultrasound is a widely used imaging modality in clinical practice due to its low cost, portability, and safety. Current research in general AI for healthcare focuses on large language models and general segmentation models, with insufficient attention to solutions addressing both disease prediction and tissue segmentation. In this study, we propose a novel universal framework for ultrasound, namel… ▽ More

    Submitted 20 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  17. arXiv:2405.17921  [pdf

    cs.AI cs.CY

    Towards Clinical AI Fairness: Filling Gaps in the Puzzle

    Authors: Mingxuan Liu, Yilin Ning, Salinelat Teixayavong, Xiaoxuan Liu, Mayli Mertens, Yuqing Shang, Xin Li, Di Miao, Jie Xu, Daniel Shu Wei Ting, Lionel Tim-Ee Cheng, Jasmine Chiat Ling Ong, Zhen Ling Teo, Ting Fang Tan, Narrendar RaviChandran, Fei Wang, Leo Anthony Celi, Marcus Eng Hock Ong, Nan Liu

    Abstract: The ethical integration of Artificial Intelligence (AI) in healthcare necessitates addressing fairness-a concept that is highly context-specific across medical fields. Extensive studies have been conducted to expand the technical components of AI fairness, while tremendous calls for AI fairness have been raised from healthcare. Despite this, a significant disconnect persists between technical adva… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  18. arXiv:2405.15237  [pdf, other

    quant-ph

    Benchmarking bosonic modes for quantum information with randomized displacements

    Authors: Christophe H. Valahu, Tomas Navickas, Michael J. Biercuk, Ting Rei Tan

    Abstract: Bosonic modes are prevalent in all aspects of quantum information processing. However, existing tools for characterizing the quality, stability, and noise properties of bosonic modes are limited, especially in a driven setting. Here, we propose, demonstrate, and analyze a bosonic randomized benchmarking (BRB) protocol that uses randomized displacements of the bosonic modes in phase space to determ… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 13 pages, 6 figures

  19. arXiv:2405.14988  [pdf, other

    astro-ph.CO

    CMB lensing and Lyα forest cross bispectrum from DESI's first-year quasar sample

    Authors: N. G. Karaçaylı, P. Martini, D. H. Weinberg, S. Ferraro, R. de Belsunce, J. Aguilar, S. Ahlen, E. Armengaud, D. Brooks, T. Claybaugh, A. de la Macorra, B. Dey, P. Doel, K. Fanning, J. E. Forero-Romero, S. Gontcho A Gontcho, A. X. Gonzalez-Morales, G. Gutierrez, J. Guy, K. Honscheid, D. Kirkby, T. Kisner, A. Kremin, A. Lambert, M. Landriau , et al. (28 additional authors not shown)

    Abstract: The squeezed cross-bispectrum \bispeconed\ between the gravitational lensing in the Cosmic Microwave Background and the 1D \lya\ forest power spectrum can constrain bias parameters and break degeneracies between $σ_8$ and other cosmological parameters. We detect \bispeconed\ with $4.8σ$ significance at an effective redshift $z_\mathrm{eff}=2.4$ using Planck PR3 lensing map and over 280,000 quasar… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 13 pages excluding references, 8 figures

  20. arXiv:2405.14646  [pdf, other

    cs.CL

    Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models

    Authors: Yiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li

    Abstract: The automatic evaluation of natural language generation (NLG) systems presents a long-lasting challenge. Recent studies have highlighted various neural metrics that align well with human evaluations. Yet, the robustness of these evaluators against adversarial perturbations remains largely under-explored due to the unique challenges in obtaining adversarial data for different NLG evaluation tasks.… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: ACL24 Finding

  21. arXiv:2405.10379  [pdf, other

    astro-ph.EP astro-ph.SR

    Wide Binary Orbits are Preferentially Aligned with the Orbits of Small Planets, but Probably Not Hot Jupiters

    Authors: Sam Christian, Andrew Vanderburg, Juliette Becker, Adam L. Kraus, Logan Pearce, Karen A. Collins, Malena Rice, Eric L. N. Jensen, David Baker, Paul Benni, Allyson Bieryla, Abraham Binnenfeld, Kevin I. Collins, Dennis M. Conti, Phil Evans, Eric Girardin, Joao Gregorio, Tsevi Mazeh, Felipe Murgas, Aviad Panahi, Francisco J. Pozuelos, Howard M. Relles, Fabian Rodriguez Frustaglia, Richard P. Schwarz, Gregor Srdoc , et al. (6 additional authors not shown)

    Abstract: Studying the relative orientations of the orbits of exoplanets and wide-orbiting binary companions (semimajor axis greater than 100 AU) can shed light on how planets form and evolve in binary systems. Previous observations by multiple groups discovered a possible alignment between the orbits of visual binaries and the exoplanets that reside in them. In this study, using data from \textit{Gaia} DR3… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 16 pages, 10 figures, submitted to AJ. Email corresponding author for data files

  22. arXiv:2405.09737  [pdf, other

    astro-ph.CO

    Validation of the DESI 2024 Lyman Alpha Forest BAL Masking Strategy

    Authors: Paul Martini, A. Cuceu, L. Ennesser, A. Brodzeller, J. Aguilar, S. Ahlen, D. Brooks, T. Claybaugh, R. de Belsunce, A. de la Macorra, Arjun Dey, P. Doel, J. E. Forero-Romero, E. Gaztañaga, S. Gontcho A Gontcho, J. Guy, H. K. Herrera-Alcantar, K. Honscheid, N. G. Karaçaylı, T. Kisner, A. Kremin, A. Lambert, L. Le Guillou, M. Manera, A. Meisner , et al. (22 additional authors not shown)

    Abstract: Broad absorption line quasars (BALs) exhibit blueshifted absorption relative to a number of their prominent broad emission features. These absorption features can contribute to quasar redshift errors and add absorption to the Lyman-alpha (LyA) forest that is unrelated to large-scale structure. We present a detailed analysis of the impact of BALs on the Baryon Acoustic Oscillation (BAO) results wit… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 32 pages, 8 figures, submitted to JCAP

  23. arXiv:2405.09157  [pdf, other

    math.OC cs.CG cs.DC cs.DS

    A Primal-Dual Framework for Symmetric Cone Programming

    Authors: Jiaqi Zheng, Antonios Varvitsiotis, Tiow-Seng Tan, Wayne Lin

    Abstract: In this paper, we introduce a primal-dual algorithmic framework for solving Symmetric Cone Programs (SCPs), a versatile optimization model that unifies and extends Linear, Second-Order Cone (SOCP), and Semidefinite Programming (SDP). Our work generalizes the primal-dual framework for SDPs introduced by Arora and Kale, leveraging a recent extension of the Multiplicative Weights Update method (MWU)… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  24. arXiv:2405.00075  [pdf, ps, other

    eess.IV

    Charting the Path Forward: CT Image Quality Assessment -- An In-Depth Review

    Authors: Siyi Xun, Qiaoyu Li, Xiaohong Liu, Guangtao Zhai, Mingxiang Wu, Tao Tan

    Abstract: Computed Tomography (CT) is a frequently utilized imaging technology that is employed in the clinical diagnosis of many disorders. However, clinical diagnosis, data storage, and management are posed huge challenges by a huge volume of non-homogeneous CT data in terms of imaging quality. As a result, the quality assessment of CT images is a crucial problem that demands consideration. The history, a… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  25. arXiv:2404.18373  [pdf, other

    cs.NI

    6G comprehensive intelligence: network operations and optimization based on Large Language Models

    Authors: Sifan Long, Fengxiao Tang, Yangfan Li, Tiao Tan, Zhengjie Jin, Ming Zhao, Nei Kato

    Abstract: The sixth generation mobile communication standard (6G) can promote the development of Industrial Internet and Internet of Things (IoT). To achieve comprehensive intelligent development of the network and provide customers with higher quality personalized services. This paper proposes a network performance optimization and intelligent operation network architecture based on Large Language Model (L… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 8 pages, 5 figures, 15 preferences

  26. arXiv:2404.18151  [pdf, other

    cs.LO

    Decidability of Graph Neural Networks via Logical Characterizations

    Authors: Michael Benedikt, Chia-Hsuan Lu, Boris Motik, Tony Tan

    Abstract: We present results concerning the expressiveness and decidability of a popular graph learning formalism, graph neural networks (GNNs), exploiting connections with logic. We use a family of recently-discovered decidable logics involving "Presburger quantifiers". We show how to use these logics to measure the expressiveness of classes of GNNs, in some cases getting exact correspondences between the… ▽ More

    Submitted 23 May, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

  27. arXiv:2404.17747  [pdf, other

    cs.CV

    MMA-UNet: A Multi-Modal Asymmetric UNet Architecture for Infrared and Visible Image Fusion

    Authors: Jingxue Huang, Xilai Li, Tianshu Tan, Xiaosong Li, Tao Ye

    Abstract: Multi-modal image fusion (MMIF) maps useful information from various modalities into the same representation space, thereby producing an informative fused image. However, the existing fusion algorithms tend to symmetrically fuse the multi-modal images, causing the loss of shallow information or bias towards a single modality in certain regions of the fusion results. In this study, we analyzed the… ▽ More

    Submitted 11 July, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

  28. arXiv:2404.12311  [pdf, other

    physics.atom-ph nucl-ex

    Laser excitation of the $^{229}$Th nuclear isomeric transition in a solid-state host

    Authors: R. Elwell, Christian Schneider, Justin Jeet, J. E. S. Terhune, H. W. T. Morgan, A. N. Alexandrova, H. B. Tran Tan, Andrei Derevianko, Eric R. Hudson

    Abstract: LiSrAlF$_6$ crystals doped with $^{229}$Th are used in a laser-based search for the nuclear isomeric transition. Two spectroscopic features near the nuclear transition energy are observed. The first is a broad excitation feature that produces red-shifted fluorescence that decays with a timescale of a few seconds. The second is a narrow, laser-linewidth-limited spectral feature at… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Submitted to Physical Review Letters on March 25, 2024

  29. arXiv:2404.09498  [pdf, other

    cs.CV

    FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba

    Authors: Xinyu Xie, Yawen Cui, Chio-In Ieong, Tao Tan, Xiaozhi Zhang, Xubin Zheng, Zitong Yu

    Abstract: Multi-modal image fusion aims to combine information from different modes to create a single image with comprehensive information and detailed textures. However, fusion models based on convolutional neural networks encounter limitations in capturing global image features due to their focus on local convolution operations. Transformer-based models, while excelling in global feature modeling, confro… ▽ More

    Submitted 20 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  30. arXiv:2404.09187  [pdf

    physics.acc-ph

    Stable Acceleration of a LHe-Free Nb3Sn demo SRF e-linac Based on Conduction Cooling

    Authors: Ziqin Yang, Yuan He, Tiancai Jiang, Feng Bai, Fengfeng Wang, Weilong Chen, Guangze Jiang, Yimeng Chu, Hangxu Li, Bo Zhao, Guozhen Sun, Zongheng Xue, Yugang Zhao, Zheng Gao, Yaguang Li, Pingran Xiong, Hao Guo, Liepeng Sun, Guirong Huang, Zhijun Wang, Junhui Zhang, Teng Tan, Hongwei Zhao, Wenlong Zhan

    Abstract: The design, construction, and commissioning of a conduction-cooled Nb3Sn demonstration superconducting radio frequency (SRF) electron accelerator at the Institute of Modern Physics of the Chinese Academy of Sciences (IMP, CAS) will be presented. In the context of engineering application planning for Nb3Sn thin-film SRF cavities within the CiADS project, a 650MHz 5-cell elliptical cavity was coated… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  31. arXiv:2404.07905  [pdf, other

    math-ph quant-ph

    Poincaré disk as a model of squeezed states of a harmonic oscillator

    Authors: Ian Chi, Martin Fraas, Tina Tan

    Abstract: Single-mode squeezed states exhibit a direct correspondence with points on the Poincaré disk. In this study, we delve into this correspondence and describe the motions of the disk generated by a quadratic Hamiltonian. This provides a geometric representation of squeezed states and their evolution. We discuss applications in bang-bang and adiabatic control problems involving squeezed states.

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 16 pages, 5 figures

  32. arXiv:2404.03004  [pdf, other

    astro-ph.CO

    Validation of the DESI 2024 Ly$α$ forest BAO analysis using synthetic datasets

    Authors: Andrei Cuceu, Hiram K. Herrera-Alcantar, Calum Gordon, Paul Martini, Julien Guy, Andreu Font-Ribera, Alma X. Gonzalez-Morales, M. Abdul Karim, J. Aguilar, S. Ahlen, E. Armengaud, A. Bault, D. Brooks, T. Claybaugh, A. de la Macorra, P. Doel, K. Fanning, S. Ferraro, J. E. Forero-Romero, E. Gaztañaga, S. Gontcho A Gontcho, G. Gutierrez, K. Honscheid, C. Howlett, N. G. Karaçaylı , et al. (34 additional authors not shown)

    Abstract: The first year of data from the Dark Energy Spectroscopic Instrument (DESI) contains the largest set of Lyman-$α$ (Ly$α$) forest spectra ever observed. This data, collected in the DESI Data Release 1 (DR1) sample, has been used to measure the Baryon Acoustic Oscillation (BAO) feature at redshift $z=2.33$. In this work, we use a set of 150 synthetic realizations of DESI DR1 to validate the DESI 202… ▽ More

    Submitted 5 May, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Supporting publication of DESI 2024 IV: Baryon Acoustic Oscillations from the Lyman Alpha Forest

  33. arXiv:2404.03003  [pdf, other

    astro-ph.CO

    Characterization of contaminants in the Lyman-alpha forest auto-correlation with DESI

    Authors: J. Guy, S. Gontcho A Gontcho, E. Armengaud, A. Brodzeller, A. Cuceu, A. Font-Ribera, H. K. Herrera-Alcantar, N. G. Karaçaylı, A. Muñoz-Gutiérrez, M. Pieri, I. Pérez-Ràfols, C. Ramírez-Pérez, C. Ravoux, J. Rich, M. Walther, M. Abdul Karim, J. Aguilar, S. Ahlen, A. Bault, D. Brooks, T. Claybaugh, R. de la Cruz, A. de la Macorra, P. Doel, K. Fanning , et al. (39 additional authors not shown)

    Abstract: Baryon Acoustic Oscillations can be measured with sub-percent precision above redshift two with the Lyman-alpha forest auto-correlation and its cross-correlation with quasar positions. This is one of the key goals of the Dark Energy Spectroscopic Instrument (DESI) which started its main survey in May 2021. We present in this paper a study of the contaminants to the lyman-alpha forest which are mai… ▽ More

    Submitted 9 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: 30 pages, 12 figures

  34. arXiv:2404.03002  [pdf, other

    astro-ph.CO

    DESI 2024 VI: Cosmological Constraints from the Measurements of Baryon Acoustic Oscillations

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, B. Bahr-Kalus, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, A. Bera, F. Beutler, D. Bianchi, C. Blake, R. Blum , et al. (178 additional authors not shown)

    Abstract: We present cosmological results from the measurement of baryon acoustic oscillations (BAO) in galaxy, quasar and Lyman-$α$ forest tracers from the first year of observations from the Dark Energy Spectroscopic Instrument (DESI), to be released in the DESI Data Release 1. DESI BAO provide robust measurements of the transverse comoving distance and Hubble rate, or their combination, relative to the s… ▽ More

    Submitted 24 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers). Typos corrected and a new figure and discussion added to Appendix A

  35. arXiv:2404.03001  [pdf, other

    astro-ph.CO

    DESI 2024 IV: Baryon Acoustic Oscillations from the Lyman Alpha Forest

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Bautista, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden , et al. (174 additional authors not shown)

    Abstract: We present the measurement of Baryon Acoustic Oscillations (BAO) from the Lyman-$α$ (Ly$α$) forest of high-redshift quasars with the first-year dataset of the Dark Energy Spectroscopic Instrument (DESI). Our analysis uses over $420\,000$ Ly$α$ forest spectra and their correlation with the spatial distribution of more than $700\,000$ quasars. An essential facet of this work is the development of a… ▽ More

    Submitted 12 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers)

  36. arXiv:2404.03000  [pdf, other

    astro-ph.CO

    DESI 2024 III: Baryon Acoustic Oscillations from Galaxies and Quasars

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden, A. Brodzeller , et al. (171 additional authors not shown)

    Abstract: We present the DESI 2024 galaxy and quasar baryon acoustic oscillations (BAO) measurements using over 5.7 million unique galaxy and quasar redshifts in the range 0.1<z<2.1. Divided by tracer type, we utilize 300,017 galaxies from the magnitude-limited Bright Galaxy Survey with 0.1<z<0.4, 2,138,600 Luminous Red Galaxies with 0.4<z<1.1, 2,432,022 Emission Line Galaxies with 0.8<z<1.6, and 856,652 qu… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers)

  37. arXiv:2404.00838  [pdf, other

    cs.CV

    3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching

    Authors: Yibin Ye, Xichao Teng, Shuo Chen, Yijie Bian, Tao Tan, Zhang Li

    Abstract: Optical-SAR image matching is a fundamental task for image fusion and visual navigation. However, all large-scale open SAR dataset for methods development are collected from single platform, resulting in limited satellite types and spatial resolutions. Since images captured by different sensors vary significantly in both geometric and radiometric appearance, existing methods may fail to match corr… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 20pages 17 figures

  38. arXiv:2403.19278  [pdf, other

    cs.CV

    CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection

    Authors: Mikhail Kennerley, Jian-Gang Wang, Bharadwaj Veeravalli, Robby T. Tan

    Abstract: Domain adaptive object detection aims to adapt detection models to domains where annotated data is unavailable. Existing methods have been proposed to address the domain gap using the semi-supervised student-teacher framework. However, a fundamental issue arises from the class imbalance in the labelled training set, which can result in inaccurate pseudo-labels. The relationship between classes, es… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted into CVPR 2024

  39. arXiv:2403.15851  [pdf, other

    hep-th cond-mat.str-el quant-ph

    Local operator quench induced by two-dimensional inhomogeneous and homogeneous CFT Hamiltonians

    Authors: Weibo Mao, Masahiro Nozaki, Kotaro Tamaoka, Mao Tian Tan

    Abstract: We explore non-equilibrium processes in two-dimensional conformal field theories (2d CFTs) due to the growth of operators induced by inhomogeneous and homogeneous Hamiltonians by investigating the time dependence of the partition function, energy density, and entanglement entropy. The non-equilibrium processes considered in this paper are constructed out of the Lorentzian and Euclidean time evolut… ▽ More

    Submitted 2 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: 37 pages+appendices, 6 figures. v2: references added

    Report number: RIKEN-iTHEMS-Report-24

  40. arXiv:2403.11172  [pdf, other

    cs.CV

    Artifact Feature Purification for Cross-domain Detection of AI-generated Images

    Authors: Zheling Meng, Bo Peng, Jing Dong, Tieniu Tan

    Abstract: In the era of AIGC, the fast development of visual content generation technologies, such as diffusion models, bring potential security risks to our society. Existing generated image detection methods suffer from performance drop when faced with out-of-domain generators and image scenes. To relieve this problem, we propose Artifact Purification Network (APN) to facilitate the artifact extraction fr… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: This work is under consideration at Computer Vision and Image Understanding

  41. arXiv:2403.07408  [pdf, other

    cs.CV

    NightHaze: Nighttime Image Dehazing via Self-Prior Learning

    Authors: Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Robby T. Tan

    Abstract: Masked autoencoder (MAE) shows that severe augmentation during training produces robust representations for high-level tasks. This paper brings the MAE-like framework to nighttime image enhancement, demonstrating that severe augmentation during training produces strong network priors that are resilient to real-world night haze degradations. We propose a novel nighttime image dehazing method with s… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  42. arXiv:2403.07350  [pdf, other

    cs.CL cs.AI cs.CV

    VLKEB: A Large Vision-Language Model Knowledge Editing Benchmark

    Authors: Han Huang, Haitian Zhong, Tao Yu, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

    Abstract: Recently, knowledge editing on large language models (LLMs) has received considerable attention. Compared to this, editing Large Vision-Language Models (LVLMs) faces extra challenges from diverse data modalities and complicated model components, and data for LVLMs editing are limited. The existing LVLM editing benchmark, which comprises three metrics (Reliability, Locality, and Generality), falls… ▽ More

    Submitted 13 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 9+11 pages (main+appendix), 7 figures, 13 tables. $\href{https://github.com/VLKEB/VLKEB}{\text{get code and data}}$

  43. arXiv:2403.05262  [pdf, other

    cs.CV

    Debiasing Multimodal Large Language Models

    Authors: Yi-Fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

    Abstract: In the realms of computer vision and natural language processing, Large Vision-Language Models (LVLMs) have become indispensable tools, proficient in generating textual descriptions based on visual inputs. Despite their advancements, our investigation reveals a noteworthy bias in the generated content, where the output is primarily influenced by the underlying Large Language Models (LLMs) prior ra… ▽ More

    Submitted 27 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 38 pages, 17 figures

  44. arXiv:2403.04196  [pdf, other

    cond-mat.mes-hall cond-mat.str-el

    Parent Berry curvature and the ideal anomalous Hall crystal

    Authors: Tixuan Tan, Trithep Devakul

    Abstract: We study a model of electrons moving in a parent band of uniform Berry curvature. At sufficiently high parent Berry curvature, we show that strong repulsive interactions generically lead to the formation of an anomalous Hall crystal: a topological state with spontaneously broken continuous translation symmetry. Our results are established via a mapping to a problem of Wigner crystallization in a r… ▽ More

    Submitted 8 July, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  45. arXiv:2403.02593  [pdf, ps, other

    math.CO

    The Ramsey numbers for trees of order $n$ with maximum degree at least $n-5$ versus the wheel graph of order nine

    Authors: Zhi Yee Chng, Thomas Britz, Ta Sheng Tan, Kok Bin Wong

    Abstract: The Ramsey numbers $R(T_n,W_8)$ are determined for each tree graph $T_n$ of order $n\geq 7$ and maximum degree $Δ(T_n)$ equal to either $n-4$ or $n-5$. These numbers indicate strong support for the conjecture, due to Chen, Zhang and Zhang and to Hafidh and Baskoro, that $R(T_n,W_m) = 2n-1$ for each tree graph $T_n$ of order $n\geq m-1$ with $Δ(T_n)\leq n-m+2$ when $m\geq 4$ is even.

    Submitted 4 March, 2024; originally announced March 2024.

    MSC Class: 05C55; 05D10

  46. arXiv:2402.18009  [pdf, other

    astro-ph.CO

    Impact of Systematic Redshift Errors on the Cross-correlation of the Lyman-$α$ Forest with Quasars at Small Scales Using DESI Early Data

    Authors: Abby Bault, David Kirkby, Julien Guy, Allyson Brodzeller, J. Aguilar, S. Ahlen, S. Bailey, D. Brooks, L. Cabayol-Garcia, J. Chaves-Montero, T. Claybaugh, A. Cuceu, K. Dawson, R. de la Cruz, A. de la Macorra, A. Dey, P. Doel, S. Filbert, A. Font-Ribera, J. E. Forero-Romero, E. Gaztañaga, S. Gontcho A Gontcho, C. Gordon, H. K. Herrera-Alcantar, K. Honscheid , et al. (37 additional authors not shown)

    Abstract: The Dark Energy Spectroscopic Instrument (DESI) will measure millions of quasar spectra by the end of its 5 year survey. Quasar redshift errors impact the shape of the Lyman-$α$ forest correlation functions, which can affect cosmological analyses and therefore cosmological interpretations. Using data from the DESI Early Data Release and the first two months of the main survey, we measure the syste… ▽ More

    Submitted 12 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 29 pages, 9 figures, 5 tables

  47. arXiv:2402.11622  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models

    Authors: Junfei Wu, Qiang Liu, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang, Tieniu Tan

    Abstract: Object hallucination has been an Achilles' heel which hinders the broader applications of large vision-language models (LVLMs). Object hallucination refers to the phenomenon that the LVLMs claim non-existent objects in the image. To mitigate the object hallucinations, instruction tuning and external model-based detection methods have been proposed, which either require large-scare computational re… ▽ More

    Submitted 28 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Accept to ACL 2024; 19 Pages, 15 Figures, 6 Tables

  48. arXiv:2402.10551  [pdf, other

    cs.LG q-bio.QM

    Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Information

    Authors: Aishwarya Jayagopal, Hansheng Xue, Ziyang He, Robert J. Walsh, Krishna Kumar Hariprasannan, David Shao Peng Tan, Tuan Zea Tan, Jason J. Pitt, Anand D. Jeyasekharan, Vaibhav Rajan

    Abstract: Cancer remains a global challenge due to its growing clinical and economic burden. Its uniquely personal manifestation, which makes treatment difficult, has fuelled the quest for personalized treatment strategies. Thus, genomic profiling is increasingly becoming part of clinical diagnostic panels. Effective use of such panels requires accurate drug response prediction (DRP) models, which are chall… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  49. arXiv:2402.10083  [pdf

    cs.AI

    Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots in Ophthalmology and LLM-based evaluation using GPT-4

    Authors: Ting Fang Tan, Kabilan Elangovan, Liyuan Jin, Yao Jie, Li Yong, Joshua Lim, Stanley Poh, Wei Yan Ng, Daniel Lim, Yuhe Ke, Nan Liu, Daniel Shu Wei Ting

    Abstract: Purpose: To assess the alignment of GPT-4-based evaluation to human clinician experts, for the evaluation of responses to ophthalmology-related patient queries generated by fine-tuned LLM chatbots. Methods: 400 ophthalmology questions and paired answers were created by ophthalmologists to represent commonly asked patient questions, divided into fine-tuning (368; 92%), and testing (40; 8%). We find… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 13 Pages, 1 Figure, 8 Tables

  50. arXiv:2402.09214  [pdf, ps, other

    math.NT

    A Schmidt's subspace theorem for moving hyeprplane targets over function fields

    Authors: Le Giang, Tran Van Tan, Nguyen Van Thin

    Abstract: In this paper, we establish a Schmidt's subspace theorem for moving hyeprplane targets in projective spaces over function fields.

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 12 pages, Accepted in Acta Mathematica Vietnamica