Skip to main content

Showing 1–50 of 508 results for author: Lai, C

  1. arXiv:2407.09953  [pdf, other

    physics.ins-det

    Photocathode characterisation for robust PICOSEC Micromegas precise-timing detectors

    Authors: M. Lisowska, R. Aleksan, Y. Angelis, S. Aune, J. Bortfeldt, F. Brunbauer, M. Brunoldi, E. Chatzianagnostou, J. Datta, K. Dehmelt, G. Fanourakis, S. Ferry, D. Fiorina, K. J. Floethner, M. Gallinaro, F. Garcia, I. Giomataris, K. Gnanvo, F. J. Iguaz, D. Janssens, A. Kallitsopoulou, M. Kovacic, B. Kross, C. C. Lai, P. Legou , et al. (33 additional authors not shown)

    Abstract: The PICOSEC Micromegas detector is a precise-timing gaseous detector based on a Cherenkov radiator coupled with a semi-transparent photocathode and a Micromegas amplifying structure, targeting a time resolution of tens of picoseconds for minimum ionising particles. Initial single-pad prototypes have demonstrated a time resolution below 25 ps, prompting ongoing developments to adapt the concept for… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  2. arXiv:2407.07329  [pdf, other

    cs.CL

    Probability of Differentiation Reveals Brittleness of Homogeneity Bias in Large Language Models

    Authors: Messi H. J. Lee, Calvin K. Lai

    Abstract: Homogeneity bias in Large Language Models (LLMs) refers to their tendency to homogenize the representations of some groups compared to others. Previous studies documenting this bias have predominantly used encoder models, which may have inadvertently introduced biases. To address this limitation, we prompted GPT-4 to generate single word/expression completions associated with 18 situation cues - s… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2407.07001  [pdf, ps, other

    math.AP math-ph

    Applications of the Green tensor estimates of the nonstationary Stokes system in the half space

    Authors: Kyungkeun Kang, Baishun Lai, Chen-Chih Lai, Tai-Peng Tsai

    Abstract: In this paper, we present a series of applications of the pointwise estimates of the (unrestricted) Green tensor of the nonstationary Stokes system in the half space, established in our previous work [CMP 2023]. First, we show the $L^1$-$L^q$ estimates for the Stokes flow with possibly non-solenoidal $L^1$ initial data, generalizing the results of Giga-Matsui-Shimizu [Math. Z. 1999] and Desch-Hieb… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  4. arXiv:2407.06194  [pdf, other

    cs.CV cs.AI cs.CL

    More Distinctively Black and Feminine Faces Lead to Increased Stereotyping in Vision-Language Models

    Authors: Messi H. J. Lee, Jacob M. Montgomery, Calvin K. Lai

    Abstract: Vision Language Models (VLMs), exemplified by GPT-4V, adeptly integrate text and vision modalities. This integration enhances Large Language Models' ability to mimic human perception, allowing them to process image inputs. Despite VLMs' advanced capabilities, however, there is a concern that VLMs inherit biases of both modalities in ways that make biases more pervasive and difficult to mitigate. O… ▽ More

    Submitted 21 May, 2024; originally announced July 2024.

  5. arXiv:2407.01432  [pdf, ps, other

    quant-ph physics.app-ph physics.optics

    $\mathcal{PT}$-Symmetry induced Bi-Stability in Non-Hermitian Cavity Magnomechanics

    Authors: Chaoyi Lai, Shah Fahad, Kashif Ammar Yasir

    Abstract: We study the steady-state non-Hermitian magnomechanical system driven by a transverse magnetic field directly interacting with YIG sphere and excites cavity magnons and photons. To make the system non-Hermitian, we use a traveling field directly interacting with magnons generating gain to the system. We start by illustrating PT-configuration of the system, which contains two PT broken region aroun… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 10 pages, 6 figures

  6. arXiv:2407.00607  [pdf, other

    quant-ph

    Reducing Quantum Error Correction Overhead with Versatile Flag-Sharing Syndrome Extraction Circuits

    Authors: Pei-Hao Liou, Ching-Yi Lai

    Abstract: Given that quantum error correction processes are unreliable, an efficient error syndrome extraction circuit should use fewer ancillary qubits, quantum gates, and measurements, while maintaining low circuit depth, to minimizing the circuit area, roughly defined as the product of circuit depth and the number of physical qubits. We propose to design parallel flagged syndrome extraction with shared f… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 19 pages, 22 figures

  7. arXiv:2407.00450  [pdf, other

    quant-ph

    Hybrid Quantum-Classical Clustering for Preparing a Prior Distribution of Eigenspectrum

    Authors: Mengzhen Ren, Yu-Cheng Chen, Ching-Jui Lai, Min-Hsiu Hsieh, Alice Hu

    Abstract: Determining the energy gap in a quantum many-body system is critical to understanding its behavior and is important in quantum chemistry and condensed matter physics. The challenge of determining the energy gap requires identifying both the excited and ground states of a system. In this work, we consider preparing the prior distribution and circuits for the eigenspectrum of time-independent Hamilt… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  8. arXiv:2406.18556  [pdf

    eess.IV cs.CV cs.LG

    Renal digital pathology visual knowledge search platform based on language large model and book knowledge

    Authors: Xiaomin Lv, Chong Lai, Liya Ding, Maode Lai, Qingrong Sun

    Abstract: Large models have become mainstream, yet their applications in digital pathology still require exploration. Meanwhile renal pathology images play an important role in the diagnosis of renal diseases. We conducted image segmentation and paired corresponding text descriptions based on 60 books for renal pathology, clustering analysis for all image and text description features based on large models,… ▽ More

    Submitted 26 May, 2024; originally announced June 2024.

    Comments: 9 pages, 6 figures

  9. arXiv:2406.09270  [pdf, other

    astro-ph.HE

    Discovery and Extensive Follow-Up of SN 2024ggi, a nearby type IIP supernova in NGC 3621

    Authors: Ting-Wan Chen, Sheng Yang, Shubham Srivastav, Takashi J. Moriya, Stephen J. Smartt, Sofia Rest, Armin Rest, Hsing Wen Lin, Hao-Yu Miao, Yu-Chi Cheng, Amar Aryan, Chia-Yu Cheng, Morgan Fraser, Li-Ching Huang, Meng-Han Lee, Cheng-Han Lai, Yu Hsuan Liu, Aiswarya Sankar. K, Ken W. Smith, Heloise F. Stevance, Ze-Ning Wang, Joseph P. Anderson, Charlotte R. Angus, Thomas de Boer, Kenneth Chambers , et al. (23 additional authors not shown)

    Abstract: We present the discovery and early observations of the nearby Type II supernova (SN) 2024ggi in NGC 3621 at 6.64 +/- 0.3 Mpc. The SN was caught 5.8 (+1.9 -2.9) hours after its explosion by the ATLAS survey. Early-phase, high-cadence, and multi-band photometric follow-up was performed by the Kinder (Kilonova Finder) project, collecting over 1000 photometric data points within a week. The combined o… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures in manuscript, 6 pages in appendix, submitted to ApJL

  10. arXiv:2406.08353  [pdf, other

    eess.AS cs.CL cs.MM cs.SD

    Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques

    Authors: Yuanchao Li, Peter Bell, Catherine Lai

    Abstract: Text data is commonly utilized as a primary input to enhance Speech Emotion Recognition (SER) performance and reliability. However, the reliance on human-transcribed text in most studies impedes the development of practical SER systems, creating a gap between in-lab research and real-world scenarios where Automatic Speech Recognition (ASR) serves as the text source. Hence, this study benchmarks SE… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  11. arXiv:2406.08102  [pdf, other

    cs.CV

    Adversarial Patch for 3D Local Feature Extractor

    Authors: Yu Wen Pao, Li Chang Lai, Hong-Yi Lin

    Abstract: Local feature extractors are the cornerstone of many computer vision tasks. However, their vulnerability to adversarial attacks can significantly compromise their effectiveness. This paper discusses approaches to attack sophisticated local feature extraction algorithms and models to achieve two distinct goals: (1) forcing a match between originally non-matching image regions, and (2) preventing a… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  12. arXiv:2406.04553  [pdf, other

    cs.IR cs.AI

    Better Late Than Never: Formulating and Benchmarking Recommendation Editing

    Authors: Chengyu Lai, Sheng Zhou, Zhimeng Jiang, Qiaoyu Tan, Yuanchen Bei, Jiawei Chen, Ningyu Zhang, Jiajun Bu

    Abstract: Recommendation systems play a pivotal role in suggesting items to users based on their preferences. However, in online platforms, these systems inevitably offer unsuitable recommendations due to limited model capacity, poor data quality, or evolving user interests. Enhancing user experience necessitates efficiently rectify such unsuitable recommendation behaviors. This paper introduces a novel and… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  13. arXiv:2406.00317  [pdf, other

    stat.ML cs.LG stat.ME

    Combining Experimental and Historical Data for Policy Evaluation

    Authors: Ting Li, Chengchun Shi, Qianglin Wen, Yang Sui, Yongli Qin, Chunbo Lai, Hongtu Zhu

    Abstract: This paper studies policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data integration methods that linearly integrate base policy value estimators constructed based on the experimental and historical data, with weights optimized to min… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  14. arXiv:2405.20064  [pdf, other

    eess.AS cs.SD

    1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem

    Authors: Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain

    Abstract: Speech emotion recognition is a challenging classification task with natural emotional speech, especially when the distribution of emotion types is imbalanced in the training and test data. In this case, it is more difficult for a model to learn to separate minority classes, resulting in those sometimes being ignored or frequently misclassified. Previous work has utilised class weighted loss for t… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  15. arXiv:2405.18503  [pdf, other

    cs.SD cs.LG eess.AS

    SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation

    Authors: Koichi Saito, Dongjun Kim, Takashi Shibuya, Chieh-Hsin Lai, Zhi Zhong, Yuhta Takida, Yuki Mitsufuji

    Abstract: Sound content is an indispensable element for multimedia works such as video games, music, and films. Recent high-quality diffusion-based sound generation models can serve as valuable tools for the creators. However, despite producing high-quality sounds, these models often suffer from slow inference speeds. This drawback burdens creators, who typically refine their sounds through trial and error… ▽ More

    Submitted 10 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Audio samples: https://koichi-saito-sony.github.io/soundctm/. Codes: https://github.com/sony/soundctm. Checkpoints: https://huggingface.co/Sony/soundctm

  16. arXiv:2405.17768  [pdf, other

    cs.LG cs.SI

    Revisiting the Message Passing in Heterophilous Graph Neural Networks

    Authors: Zhuonan Zheng, Yuanchen Bei, Sheng Zhou, Yao Ma, Ming Gu, HongJia XU, Chengyu Lai, Jiawei Chen, Jiajun Bu

    Abstract: Graph Neural Networks (GNNs) have demonstrated strong performance in graph mining tasks due to their message-passing mechanism, which is aligned with the homophily assumption that adjacent nodes exhibit similar behaviors. However, in many real-world graphs, connected nodes may display contrasting behaviors, termed as heterophilous patterns, which has attracted increased interest in heterophilous G… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  17. arXiv:2405.17251  [pdf, other

    cs.CV

    GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping

    Authors: Junyoung Seo, Kazumi Fukuda, Takashi Shibuya, Takuya Narihira, Naoki Murata, Shoukang Hu, Chieh-Hsin Lai, Seungryong Kim, Yuki Mitsufuji

    Abstract: Generating novel views from a single image remains a challenging task due to the complexity of 3D scenes and the limited diversity in the existing multi-view datasets to train a model on. Recent research combining large-scale text-to-image (T2I) models with monocular depth estimation (MDE) has shown promise in handling in-the-wild images. In these methods, an input view is geometrically warped to… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Project page: https://GenWarp-NVS.github.io

  18. arXiv:2405.16677  [pdf, other

    eess.AS cs.CL cs.SD

    Crossmodal ASR Error Correction with Discrete Speech Units

    Authors: Yuanchao Li, Pinzhen Chen, Peter Bell, Catherine Lai

    Abstract: ASR remains unsatisfactory in scenarios where the speaking style diverges from that used to train ASR systems, resulting in erroneous transcripts. To address this, ASR Error Correction (AEC), a post-ASR processing approach, is required. In this work, we tackle an understudied issue: the Low-Resource Out-of-Domain (LROOD) problem, by investigating crossmodal AEC on very limited downstream data with… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  19. arXiv:2405.16194  [pdf, other

    cs.LG cs.AI cs.RO

    Diffusion-Reward Adversarial Imitation Learning

    Authors: Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh, Yu-Chiang Frank Wang, Min-Hung Chen, Shao-Hua Sun

    Abstract: Imitation learning aims to learn a policy from observing expert demonstrations without access to reward signals from environments. Generative adversarial imitation learning (GAIL) formulates imitation learning as adversarial learning, employing a generator policy learning to imitate expert behaviors and discriminator learning to distinguish the expert demonstrations from agent trajectories. Despit… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  20. arXiv:2405.14822  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher

    Authors: Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon

    Abstract: To accelerate sampling, diffusion models (DMs) are often distilled into generators that directly map noise to data in a single step. In this approach, the resolution of the generator is fundamentally limited by that of the teacher DM. To overcome this limitation, we propose Progressive Growing of Diffusion Autoencoder (PaGoDA), a technique to progressively grow the resolution of the generator beyo… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  21. arXiv:2404.19228  [pdf, other

    cs.LG

    Understanding Multimodal Contrastive Learning Through Pointwise Mutual Information

    Authors: Toshimitsu Uesaka, Taiji Suzuki, Yuhta Takida, Chieh-Hsin Lai, Naoki Murata, Yuki Mitsufuji

    Abstract: Multimodal representation learning to integrate different modalities, such as text, vision, and audio is important for real-world applications. The symmetric InfoNCE loss proposed in CLIP is a key concept in multimodal representation learning. In this work, we provide a theoretical understanding of the symmetric InfoNCE loss through the lens of the pointwise mutual information and show that encode… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  22. arXiv:2404.18586   

    quant-ph

    How to surpass no-go limits in Gaussian quantum error correction and entangled Gaussian state distillation?

    Authors: En-Jui Chang, Ching-Yi Lai

    Abstract: Gaussian quantum information processing with continuous-variable (CV) quantum information carriers holds significant promise for applications in quantum communication and quantum internet. However, applying Gaussian state distillation and quantum error correction (QEC) faces limitations imposed by no-go results concerning local Gaussian unitary operations and classical communications. This paper i… ▽ More

    Submitted 7 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Lemma 3 and Lemma 4 are incorrect

  23. arXiv:2404.13227  [pdf, other

    physics.ao-ph nlin.CD physics.comp-ph physics.flu-dyn physics.geo-ph

    Machine learning for climate physics and simulations

    Authors: Ching-Yao Lai, Pedram Hassanzadeh, Aditi Sheshadri, Maike Sonnewald, Raffaele Ferrari, Venkatramani Balaji

    Abstract: We discuss the emerging advances and opportunities at the intersection of machine learning (ML) and climate physics, highlighting the use of ML techniques, including supervised, unsupervised, and equation discovery, to accelerate climate knowledge discoveries and simulations. We delineate two distinct yet complementary aspects: (1) ML for climate physics and (2) ML for climate simulations. While p… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  24. arXiv:2404.09385  [pdf, other

    eess.AS cs.CL eess.SP

    A Large-Scale Evaluation of Speech Foundation Models

    Authors: Shu-wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee

    Abstract: The foundation model paradigm leverages a shared foundation model to achieve state-of-the-art (SOTA) performance for various tasks, requiring minimal downstream-specific modeling and data annotation. This approach has proven crucial in the field of Natural Language Processing (NLP). However, the speech processing community lacks a similar setup to explore the paradigm systematically. In this work,… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: The extended journal version for SUPERB and SUPERB-SG. Published in IEEE/ACM TASLP. The Arxiv version is preferred

  25. arXiv:2404.03268  [pdf, other

    quant-ph

    Efficient Ground State Estimation Using Generalized Hund's Rule

    Authors: Leo Chiang, Ching-Jui Lai

    Abstract: Quantum computers offer a promising approach to simulate the ground state of molecules, which is crucial for understanding molecular properties and chemical reactions. However, the limited number of available qubits on current devices poses a challenge for simulation. This paper investigates the feasibility of reducing the qubit usage of molecular simulation by examining specific fermionic states… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  26. arXiv:2404.02846  [pdf, ps, other

    math.RT math.AG

    On the Springer correspondence for wreath products

    Authors: You-Hung Hsu, Chun-Ju Lai

    Abstract: We first show that the wreath product $Σ_m\wr Σ_d$ between two symmetric groups appears as the generalized Weyl group of an Iwahori's generalized Tits system. We then introduce a certain subvariety of the flag variety of type A, and then give a geometric proof of its Bruhat decomposition indexed by $Σ_m\wr Σ_d$, via the Bialynicki-Birula decomposition. Furthermore, we realize the group algebra… ▽ More

    Submitted 18 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: 17 pages. v2: exposition improved

  27. arXiv:2403.11211  [pdf

    cs.CV

    RCdpia: A Renal Carcinoma Digital Pathology Image Annotation dataset based on pathologists

    Authors: Qingrong Sun, Weixiang Zhong, Jie Zhou, Chong Lai, Xiaodong Teng, Maode Lai

    Abstract: The annotation of digital pathological slide data for renal cell carcinoma is of paramount importance for correct diagnosis of artificial intelligence models due to the heterogeneous nature of the tumor. This process not only facilitates a deeper understanding of renal cell cancer heterogeneity but also aims to minimize noise in the data for more accurate studies. To enhance the applicability of t… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures, 1 table

  28. arXiv:2402.19383  [pdf, other

    quant-ph

    Harnessing Coding Theory for Reliable Network Quantum Communication

    Authors: Ching-Yi Lai, Kao-Yueh Kuo

    Abstract: This article explores the application of coding techniques for fault-tolerant quantum computation and extends their usage to fault-tolerant quantum communication. We review repeater-based quantum networks, emphasizing the roles of coding theory and fault-tolerant quantum operations, particularly in the context of quantum teleportation. We highlight that fault-tolerant implementation of the Bell me… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 7 pages, 5 figures

  29. arXiv:2402.15630  [pdf

    physics.ins-det nucl-ex

    In-beam test results of an RPC-based module for position-sensitive neutron detectors with timing readout

    Authors: G. Canezin, L. M. S. Margato, A. Morozov, A. Blanco, J. Saraiva, L. Lopes, P. Fonte, Chung Chuan Lai, Per-Olof Svensson, G. Markaj, Florian M. Piegsa

    Abstract: Recently we have proposed a new concept of a thermal neutron detector based on resistive plate chambers and 10B4C solid neutron converters, enabling to readout with high resolution in both the 3D position of neutron capture and the neutron time of flight (ToF). In this paper, we report the results of the first beam tests conducted with a new neutron RPC detection module, coupled to the position re… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  30. arXiv:2402.14905  [pdf, other

    cs.LG cs.AI cs.CL

    MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

    Authors: Zechun Liu, Changsheng Zhao, Forrest Iandola, Chen Lai, Yuandong Tian, Igor Fedorov, Yunyang Xiong, Ernie Chang, Yangyang Shi, Raghuraman Krishnamoorthi, Liangzhen Lai, Vikas Chandra

    Abstract: This paper addresses the growing need for efficient large language models (LLMs) on mobile devices, driven by increasing cloud costs and latency concerns. We focus on designing top-quality LLMs with fewer than a billion parameters, a practical choice for mobile deployment. Contrary to prevailing belief emphasizing the pivotal role of data and parameter quantity in determining model quality, our in… ▽ More

    Submitted 26 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: ICML 2024. Code is available at https://github.com/facebookresearch/MobileLLM

  31. arXiv:2402.14677  [pdf, other

    cond-mat.quant-gas

    Influence of thermal effects on atomic Bloch oscillation

    Authors: Guoling Yin, Chi-Kin Lai, Nana Chang, Yi Liang, Dekai Mao, Xiaoji Zhou

    Abstract: Advancements in the experimental toolbox of cold atoms have enabled the meticulous control of atomic Bloch oscillation within optical lattices, thereby enhancing the capabilities of gravity interferometers. This work delves into the impact of thermal effects on Bloch oscillation in 1D accelerated optical lattices aligned with gravity by varying the system's initial temperature. Through the applica… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 8 pages, 7 figures

  32. arXiv:2402.08643  [pdf, other

    cs.CV cs.LG

    Learned Image Compression with Text Quality Enhancement

    Authors: Chih-Yu Lai, Dung Tran, Kazuhito Koishida

    Abstract: Learned image compression has gained widespread popularity for their efficiency in achieving ultra-low bit-rates. Yet, images containing substantial textual content, particularly screen-content images (SCI), often suffers from text distortion at such compressed levels. To address this, we propose to minimize a novel text logit loss designed to quantify the disparity in text between the original an… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Submitted to ICIP 2024

  33. Multi-Blade detector with VMM3a-ASIC-based readout: installation and commissioning at the reflectometer Amor at PSI

    Authors: F. Piscitelli, F. Ghazi Moradi, F. S. Alves, M. J. Christensen, J. Hrivnak, A. Johansson, K. Fissum, C. C. Lai, A. Monera Martinez, D. Pfeiffer, E. Shahu, J. Stahn, P. O. Svensson

    Abstract: The Multi-Blade (MB) Boron-10-based neutron detector is the chosen technology for three instruments at the European Spallation Source (ESS): the two ESS reflectometers, ESTIA and FREIA, and the Test Beam Line. A fourth MB detector has been built, installed and commissioned for the user operation of the reflectometer Amor at PSI (Switzerland). Amor can be considered a downscaled version of the ESS… ▽ More

    Submitted 18 March, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 16 pages, 12 figures

    Journal ref: 2024 JINST 19 P05010

  34. arXiv:2402.02617  [pdf, other

    cs.CL cs.SD eess.AS

    Layer-Wise Analysis of Self-Supervised Acoustic Word Embeddings: A Study on Speech Emotion Recognition

    Authors: Alexandra Saliba, Yuanchao Li, Ramon Sanabria, Catherine Lai

    Abstract: The efficacy of self-supervised speech models has been validated, yet the optimal utilization of their representations remains challenging across diverse tasks. In this study, we delve into Acoustic Word Embeddings (AWEs), a fixed-length feature derived from continuous representations, to explore their advantages in specific tasks. AWEs have previously shown utility in capturing acoustic discrimin… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted to ICASSP2024 Self-supervision in Audio, Speech and Beyond (SASB) workshop. First two authors contributed equally

  35. arXiv:2401.10711  [pdf, other

    cs.CV cs.AI cs.CL

    Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering

    Authors: Haibo Wang, Chenghang Lai, Yixuan Sun, Weifeng Ge

    Abstract: Video Question Answering (VideoQA) aims to answer natural language questions based on the information observed in videos. Despite the recent success of Large Multimodal Models (LMMs) in image-language understanding and reasoning, they deal with VideoQA insufficiently, by simply taking uniformly sampled frames as visual inputs, which ignores question-relevant visual clues. Moreover, there are no hu… ▽ More

    Submitted 26 April, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

  36. arXiv:2401.09695  [pdf

    cs.HC cs.AI

    Should ChatGPT Write Your Breakup Text? Exploring the Role of AI in Relationship Dissolution

    Authors: Yue Fu, Yixin Chen, Zelia Gomes Da Costa Lai, Alexis Hiniker

    Abstract: Relationships are essential to our happiness and wellbeing. The dissolution of a relationship, the final stage of relationship's lifecycle and one of the most stressful events in an individual's life, can have profound and long-lasting impacts on people. With the breakup process increasingly facilitated by computer-mediated communication (CMC), and the likely future influence of AI-mediated commun… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  37. Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans

    Authors: Messi H. J. Lee, Jacob M. Montgomery, Calvin K. Lai

    Abstract: Large language models (LLMs) are becoming pervasive in everyday life, yet their propensity to reproduce biases inherited from training data remains a pressing concern. Prior investigations into bias in LLMs have focused on the association of social groups with stereotypical attributes. However, this is only one form of human bias such systems may reproduce. We investigate a new form of bias in LLM… ▽ More

    Submitted 25 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Forthcoming at ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2024

  38. arXiv:2401.01329  [pdf, other

    eess.SP cs.NI

    Self-Supervised Millimeter Wave Indoor Localization using Tiny Neural Networks

    Authors: Anish Shastri, Steve Blandino, Camillo Gentile, Chiehping Lai, Paolo Casari

    Abstract: The quasi-optical propagation of millimeter-wave signals enables high-accuracy localization algorithms that employ geometric approaches or machine learning models. However, most algorithms require information on the indoor environment, may entail the collection of large training datasets, or bear an infeasible computational burden for commercial off-the-shelf (COTS) devices. In this work, we propo… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 13 pages, 11 figures

  39. arXiv:2401.00365  [pdf, other

    cs.LG cs.AI cs.CV

    HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes

    Authors: Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji

    Abstract: Vector quantization (VQ) is a technique to deterministically learn features with discrete codebook representations. It is commonly performed with a variational autoencoding model, VQ-VAE, which can be further extended to hierarchical structures for making high-fidelity reconstructions. However, such hierarchical extensions of VQ-VAE often suffer from the codebook/layer collapse issue, where the co… ▽ More

    Submitted 28 March, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 34 pages with 17 figures, accepted for TMLR

  40. arXiv:2312.13594  [pdf, other

    cs.CL cs.AI cs.CV

    Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA

    Authors: Chengen Lai, Shengli Song, Shiqi Meng, Jingyang Li, Sitong Yan, Guangneng Hu

    Abstract: Natural language explanation in visual question answer (VQA-NLE) aims to explain the decision-making process of models by generating natural language sentences to increase users' trust in the black-box systems. Existing post-hoc methods have achieved significant progress in obtaining a plausible explanation. However, such post-hoc explanations are not always aligned with human logical inference, s… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  41. arXiv:2312.01319  [pdf, ps, other

    math.CA math.MG

    Erdős similarity problem via bi-Lipschitz embedding

    Authors: De-jun Feng, Chun-Kit Lai, Ying Xiong

    Abstract: The Erdős similarity conjecture asserted that an infinite set of real numbers cannot be affinely embedded into every measurable set of positive Lebesgue measure. The problem is still open, in particular for all fast decaying sequences. In this paper, we relax the problem to the bi-Lipschitz embedding and obtain some sharp criteria about the bi-Lipschitz Erdős similarity problem for strictly decrea… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    MSC Class: 28A78; 28A05; 30L05; 11K55

  42. arXiv:2311.16424  [pdf, other

    cs.LG cs.AI cs.CV

    Manifold Preserving Guided Diffusion

    Authors: Yutong He, Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Dongjun Kim, Wei-Hsiang Liao, Yuki Mitsufuji, J. Zico Kolter, Ruslan Salakhutdinov, Stefano Ermon

    Abstract: Despite the recent advancements, conditional image generation still faces challenges of cost, generalizability, and the need for task-specific training. In this paper, we propose Manifold Preserving Guided Diffusion (MPGD), a training-free conditional generation framework that leverages pretrained diffusion models and off-the-shelf neural networks with minimal additional inference cost for a broad… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  43. arXiv:2311.07111  [pdf, ps, other

    quant-ph cs.IT

    Semidefinite programming bounds on the size of entanglement-assisted codeword stabilized quantum codes

    Authors: Ching-Yi Lai, Pin-Chieh Tseng, Wei-Hsuan Yu

    Abstract: In this paper, we explore the application of semidefinite programming to the realm of quantum codes, specifically focusing on codeword stabilized (CWS) codes with entanglement assistance. Notably, we utilize the isotropic subgroup of the CWS group and the set of word operators of a CWS-type quantum code to derive an upper bound on the minimum distance. Furthermore, this characterization can be inc… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 20 pages, 1 table

  44. arXiv:2311.04149  [pdf, other

    cs.SI cs.LG

    HyperS2V: A Framework for Structural Representation of Nodes in Hyper Networks

    Authors: Shu Liu, Cameron Lai, Fujio Toriumi

    Abstract: In contrast to regular (simple) networks, hyper networks possess the ability to depict more complex relationships among nodes and store extensive information. Such networks are commonly found in real-world applications, such as in social interactions. Learning embedded representations for nodes involves a process that translates network structures into more simplified spaces, thereby enabling the… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  45. arXiv:2310.15416  [pdf, other

    cs.LG cs.AI

    Nominality Score Conditioned Time Series Anomaly Detection by Point/Sequential Reconstruction

    Authors: Chih-Yu Lai, Fan-Keng Sun, Zhengqi Gao, Jeffrey H. Lang, Duane S. Boning

    Abstract: Time series anomaly detection is challenging due to the complexity and variety of patterns that can occur. One major difficulty arises from modeling time-dependent relationships to find contextual anomalies while maintaining detection accuracy for point anomalies. In this paper, we propose a framework for unsupervised time series anomaly detection that utilizes point-based and sequence-based recon… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 (https://neurips.cc/virtual/2023/poster/70582)

  46. arXiv:2310.13267  [pdf, other

    cs.CL cs.CV cs.LG cs.SD eess.AS

    On the Language Encoder of Contrastive Cross-modal Models

    Authors: Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji

    Abstract: Contrastive cross-modal models such as CLIP and CLAP aid various vision-language (VL) and audio-language (AL) tasks. However, there has been limited investigation of and improvement in their language encoder, which is the central component of encoding natural language descriptions of image/audio into vector representations. We extensively evaluate how unsupervised and supervised sentence embedding… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  47. arXiv:2310.12682  [pdf, other

    quant-ph cs.IT

    Correcting phenomenological quantum noise via belief propagation

    Authors: Kao-Yueh Kuo, Ching-Yi Lai

    Abstract: Quantum stabilizer codes often face the challenge of syndrome errors due to error-prone measurements. To address this issue, multiple rounds of syndrome extraction are typically employed to obtain reliable error syndromes. In this paper, we consider phenomenological decoding problems, where data qubit errors may occur between two syndrome extractions, and each syndrome measurement can be faulty. T… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 14 pages, 9 figures, 1 table

  48. Distributed Indexing Schemes for k-Dominant Skyline Analytics on Uncertain Edge-IoT Data

    Authors: Chuan-Chi Lai, Hsuan-Yu Lin, Chuan-Ming Liu

    Abstract: Skyline queries typically search a Pareto-optimal set from a given data set to solve the corresponding multiobjective optimization problem. As the number of criteria increases, the skyline presumes excessive data items, which yield a meaningless result. To address this curse of dimensionality, we proposed a k-dominant skyline in which the number of skyline members was reduced by relaxing the restr… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 13 pages, 8 figures, 12 tables, to appear in IEEE Transactions on Emerging Topics in Computing

  49. arXiv:2310.11839  [pdf

    cond-mat.mtrl-sci

    Neel tensor torque at the ferromagnet/antiferromagnet interface

    Authors: Chao-Yao Yang, Sheng-Huai Chen, Chih-Hsiang Tseng, Chang-Yang Kuo, Hsiu-Hau Lin, Chih-Huang Lai

    Abstract: Antiferromagnets (AFMs) exhibit spin arrangements with no net magnetization, positioning them as promising candidates for spintronics applications. While electrical manipulation of the single-crystal AFMs, composed of periodic spin configurations, is achieved recently, it remains a daunting challenge to characterize and to manipulate polycrystalline AFMs. Utilizing statistical analysis in data sci… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: main text 18 pages, supplementary information 10 pages

  50. arXiv:2310.07654  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Audio-Visual Neural Syntax Acquisition

    Authors: Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David Cox, David Harwath, Yang Zhang, Karen Livescu, James Glass

    Abstract: We study phrase structure induction from visually-grounded speech. The core idea is to first segment the speech waveform into sequences of word segments, and subsequently induce phrase structure using the inferred segment-level continuous representations. We present the Audio-Visual Neural Syntax Learner (AV-NSL) that learns phrase structure by listening to audio and looking at images, without eve… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.