Showing 1–50 of 467 results for author: Ma, D

  1. arXiv:2407.09829  [pdf, other]

    cs.RO

    VLMPC: Vision-Language Model Predictive Control for Robotic Manipulation

    Authors: Wentao Zhao, Jiaming Chen, Ziyu Meng, Donghui Mao, Ran Song, Wei Zhang

    Abstract: Although Model Predictive Control (MPC) can effectively predict the future states of a system and thus is widely used in robotic manipulation tasks, it does not have the capability of environmental perception, leading to the failure in some complex scenarios. To address this issue, we introduce Vision-Language Model Predictive Control (VLMPC), a robotic manipulation framework which takes advantage…

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Accepted by RSS2024

  2. arXiv:2407.06901  [pdf, other]

    cs.HC cs.SD eess.AS

    RespEar: Earable-Based Robust Respiratory Rate Monitoring

    Authors: Yang Liu, Kayla-Jade Butkow, Jake Stuchbury-Wass, Adam Pullin, Dong Ma, Cecilia Mascolo

    Abstract: Respiratory rate (RR) monitoring is integral to understanding physical and mental health and tracking fitness. Existing studies have demonstrated the feasibility of RR monitoring under specific user conditions (e.g., while remaining still, or while breathing heavily). Yet, performing accurate, continuous and non-obtrusive RR monitoring across diverse daily routines and activities remains challengi…

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2407.05391  [pdf, other]

    eess.SP

    Interference Management in MIMO-ISAC Systems: A Transceiver Design Approach

    Authors: Yangyang Niu, Zhiqing Wei, Dingyou Ma, Xiaoyu Yang, Huici Wu, Zhiyong Feng, Jianhua Yuan

    Abstract: The integrated sensing and communication (ISAC) system under multi-input multi-output (MIMO) architecture achieves dual functionalities of sensing and communication on the same platform by utilizing spatial gain, which provides a feasible paradigm facing spectrum congestion. However, the dual functionalities of sensing and communication operating simultaneously in the same platform bring severe in…

    Submitted 7 July, 2024; originally announced July 2024.

  4. arXiv:2407.05250  [pdf, other]

    cs.CL

    CLIMB: A Benchmark of Clinical Bias in Large Language Models

    Authors: Yubo Zhang, Shudi Hou, Mingyu Derek Ma, Wei Wang, Muhao Chen, Jieyu Zhao

    Abstract: Large language models (LLMs) are increasingly applied to clinical decision-making. However, their potential to exhibit bias poses significant risks to clinical equity. Currently, there is a lack of benchmarks that systematically evaluate such clinical bias in LLMs. While in downstream tasks, some biases of LLMs can be avoided such as by instructing the model to answer "I'm not sure...", the intern…

    Submitted 6 July, 2024; originally announced July 2024.

  5. arXiv:2407.03906  [pdf]

    physics.med-ph

    Color-map recommendation for MR relaxometry maps

    Authors: Miha Fuderer, Barbara Wichtmann, Fabio Crameri, Nandita M. deSouza, Bettina Baeßler, Vikas Gulani, Meiyun Wang, Dirk Poot, Ruud de Boer, Matt Cashmore, Wolter de Graaf, Kathryn E. Keenan, Dan Ma, Carolin Pirkl, Nico Sollmann, Sebastian Weingärtner, Stefano Mandija, Xavier Golay

    Abstract: Purpose: To harmonize the use of color for MR relaxometry maps and therefore recommend the use of specific color-maps for representing T1 and T2 maps. Methods: Perceptually linearized color-maps were chosen to have similar color settings as those proposed by Griswold et al. in 2018. A Delphi process, polling the opinion of a panel of 81 experts, was used to generate consensus on the suitability of…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 22 pages; embedded are 5 figures and 5 tables; contact the first author for supplementary material. Submitted to Magnetic Resonance in Medicine

  6. arXiv:2407.03688  [pdf, other]

    physics.optics

    Adaptive sampling strategy for tolerance analysis of freeform optical surfaces based on critical ray aiming

    Authors: Rundong Fan, Shili Wei, Zhuang Qian, Huiru Ji, Hao Tan, Yan Mo, Donglin Ma

    Abstract: The tolerance analysis of freeform surfaces plays a crucial role in the development of advanced imaging systems. However, the intricate relationship between surface error and imaging quality poses significant challenges, necessitating dense sampling of featured rays during the computation process to ensure an accurate tolerance for different fields of view (FOVs). Here, we propose an adaptive samp…

    Submitted 4 July, 2024; originally announced July 2024.

  7. arXiv:2407.01231  [pdf, other]

    cs.CL cs.AI

    MIRAI: Evaluating LLM Agents for Event Forecasting

    Authors: Chenchen Ye, Ziniu Hu, Yihe Deng, Zijie Huang, Mingyu Derek Ma, Yanqiao Zhu, Wei Wang

    Abstract: Recent advancements in Large Language Models (LLMs) have empowered LLM agents to autonomously collect world information, over which to conduct reasoning to solve complex problems. Given this capability, increasing interests have been put into employing LLM agents for predicting international events, which can influence decision-making and shape policy development on an international scale. Despite…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 66 pages, 8 figures, 6 tables; Website: https://mirai-llm.github.io/

  8. arXiv:2406.18147  [pdf, ps, other]

    math.DS

    Correlation entropy of free semigroup actions

    Authors: Xiaojiang Ye, Yanjie Tang, Dongkui Ma

    Abstract: This paper introduces the concepts of correlation entropy and local correlation entropy for free semigroup actions on compact metric space, and explores their fundamental properties. Thereafter, we generalize some classical results on correlation entropy and local correlation entropy to apply to free semigroup actions. Finally, we establish the relationship between topological entropy, measure-the…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 35 pages

  9. arXiv:2406.16847  [pdf, other]

    cond-mat.quant-gas physics.atom-ph quant-ph

    Realizing a spatially correlated lattice interferometer

    Authors: Peng Peng, Dekai Mao, Yi Liang, Guoling Yin, Hongmian Shui, Bo Song, Xiaoji Zhou

    Abstract: Atom interferometers provide a powerful tool for measuring physical constants and testifying fundamental physics with unprecedented precision. Conventional atom interferometry focuses on the phase difference between two paths and utilizes matter waves with fixed coherence. Here, we report on realizing a Ramsey-Bordé interferometer of coherent matter waves dressed by a moving optical lattice in the…

    Submitted 24 June, 2024; originally announced June 2024.

  10. arXiv:2406.13448  [pdf, other]

    physics.acc-ph physics.plasm-ph

    Demonstration of High-Efficiency Microwave Heating Producing Record Highly Charged Xenon Ion Beams with Superconducting ECR Ion Sources

    Authors: X. Wang, J. B. Li, V. Mironov, J. W. Guo, X. Z. Zhang, O. Tarvainen, Y. C. Feng, L. X. Li, J. D. Ma, Z. H. Zhang, W. Lu, S. Bogomolov, L. Sun, H. W. Zhao

    Abstract: Intense highly charged ion beam production is essential for high-power heavy ion accelerators. A novel movable Vlasov launcher for superconducting high charge state Electron Cyclotron Resonance (ECR) ion source has been devised that can affect the microwave power effectiveness by a factor of about 4 in terms of highly charged ion beam production. This approach based on a dedicated microwave launch…

    Submitted 14 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

  11. arXiv:2406.12323  [pdf, other]

    eess.SP

    Hybrid Beamforming Design for Near-Field ISAC with Modular XL-MIMO

    Authors: Chunwei Meng, Dingyou Ma, Zhaolin Wang, Yuanwei Liu, Zhiqing Wei, Zhiyong Feng

    Abstract: A novel modular extremely large-scale multiple-input-multiple-output (XL-MIMO) integrated sensing and communication (ISAC) framework is proposed in this paper. We consider a downlink ISAC scenario and exploit the modular array architecture to enhance the communication spectral efficiency and sensing resolution while reducing the channel modeling complexity by employing the hybrid spherical and pla…

    Submitted 18 June, 2024; originally announced June 2024.

  12. arXiv:2406.11816  [pdf, other]

    cs.CV

    VideoLLM-online: Online Video Large Language Model for Streaming Video

    Authors: Joya Chen, Zhaoyang Lv, Shiwei Wu, Kevin Qinghong Lin, Chenan Song, Difei Gao, Jia-Wei Liu, Ziteng Gao, Dongxing Mao, Mike Zheng Shou

    Abstract: Recent Large Language Models have been enhanced with vision capabilities, enabling them to comprehend images, videos, and interleaved vision-language content. However, the learning methods of these large multimodal models typically treat videos as predetermined clips, making them less effective and efficient at handling streaming video inputs. In this paper, we propose a novel Learning-In-Video-St…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: CVPR 2024. This arxiv version is upgraded with Llama-3

  13. arXiv:2406.09923  [pdf, other]

    cs.CL cs.AI cs.LG

    CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions

    Authors: Mingyu Derek Ma, Chenchen Ye, Yu Yan, Xiaoxuan Wang, Peipei Ping, Timothy S Chang, Wei Wang

    Abstract: The integration of Artificial Intelligence (AI), especially Large Language Models (LLMs), into the clinical diagnosis process offers significant potential to improve the efficiency and accessibility of medical care. While LLMs have shown some promise in the medical domain, their application in clinical diagnosis remains underexplored, especially in real-world clinical practice, where highly sophis…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Project page: https://clibench.github.io

  14. arXiv:2406.09411  [pdf, other]

    cs.CV cs.AI cs.CL

    MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

    Authors: Fei Wang, Xingyu Fu, James Y. Huang, Zekun Li, Qin Liu, Xiaogeng Liu, Mingyu Derek Ma, Nan Xu, Wenxuan Zhou, Kai Zhang, Tianyi Lorena Yan, Wenjie Jacky Mo, Hsiang-Hui Liu, Pan Lu, Chunyuan Li, Chaowei Xiao, Kai-Wei Chang, Dan Roth, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: We introduce MuirBench, a comprehensive benchmark that focuses on robust multi-image understanding capabilities of multimodal LLMs. MuirBench consists of 12 diverse multi-image tasks (e.g., scene understanding, ordering) that involve 10 categories of multi-image relations (e.g., multiview, temporal relations). Comprising 11,264 images and 2,600 multiple-choice questions, MuirBench is created in a…

    Submitted 1 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: typos corrected, references added, Project Page: https://muirbench.github.io/

  15. arXiv:2406.07552  [pdf, ps, other]

    math.RA

    Cohomology of a restricted Lie algebra with a restricted derivation in characteristic 2

    Authors: Dan Mao, Liangyun Chen

    Abstract: This paper mainly studies the ResLieDer pair in characteristic 2, that is, a restricted Lie algebra with a restricted derivation. We define the restricted representation of a ResLieDer pair and the corresponding cohomology complex. We show that a ResLieDer pair is rigid if the second cohomology group is trivial and a deformation of order $n$ is extensible if and only if its obstruction class is tr…

    Submitted 12 February, 2024; originally announced June 2024.

    Comments: 26 pages

  16. arXiv:2406.06962  [pdf, other]

    cs.CL cs.AI

    Evolving Subnetwork Training for Large Language Models

    Authors: Hanqi Li, Lu Chen, Da Ma, Zijian Wu, Su Zhu, Kai Yu

    Abstract: Large language models have ushered in a new era of artificial intelligence research. However, their substantial training costs hinder further development and widespread adoption. In this paper, inspired by the redundancy in the parameters of large language models, we propose a novel training paradigm: Evolving Subnetwork Training (EST). EST samples subnetworks from the layers of the large language…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  17. arXiv:2406.05357  [pdf, other]

    astro-ph.HE

    Classification of Fermi Gamma-Ray Bursts Based on Machine Learning

    Authors: Si-Yuan Zhu, Wan-Peng Sun, Da-Ling Ma, Fu-Wen Zhang

    Abstract: Gamma-ray bursts (GRBs) are typically classified into long and short GRBs based on their durations. However, there is a significant overlapping in the duration distributions of these two categories. In this paper, we apply the unsupervised dimensionality reduction algorithm called t-SNE and UMAP to classify 2061 Fermi GRBs based on four observed quantities: duration, peak energy, fluence, and peak…

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures, revised version submitted to MNRAS

    Report number: https://doi.org/10.1093/mnras/stae1594

    Journal ref: MNRAS, 2024, 532, 1434-1443

  18. arXiv:2406.01392  [pdf, other]

    cs.CL

    Sparsity-Accelerated Training for Large Language Models

    Authors: Da Ma, Lu Chen, Pengyu Wang, Hongshen Xu, Hanqi Li, Liangtai Sun, Su Zhu, Shuai Fan, Kai Yu

    Abstract: Large language models (LLMs) have demonstrated proficiency across various natural language processing (NLP) tasks but often require additional training, such as continual pre-training and supervised fine-tuning. However, the costs associated with this, primarily due to their large parameter count, remain high. This paper proposes leveraging \emph{sparsity} in pre-trained LLMs to expedite this trai…

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  19. arXiv:2405.19338  [pdf, other]

    eess.SP cs.AI cs.CV

    Accurate Patient Alignment without Unnecessary Imaging Dose via Synthesizing Patient-specific 3D CT Images from 2D kV Images

    Authors: Yuzhen Ding, Jason M. Holmes, Hongying Feng, Baoxin Li, Lisa A. McGee, Jean-Claude M. Rwigema, Sujay A. Vora, Daniel J. Ma, Robert L. Foote, Samir H. Patel, Wei Liu

    Abstract: In radiotherapy, 2D orthogonally projected kV images are used for patient alignment when 3D-on-board imaging(OBI) unavailable. But tumor visibility is constrained due to the projection of patient's anatomy onto a 2D plane, potentially leading to substantial setup errors. In treatment room with 3D-OBI such as cone beam CT(CBCT), the field of view(FOV) of CBCT is limited with unnecessarily high imag…

    Submitted 1 April, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures and tables

  20. arXiv:2405.09116  [pdf, other]

    quant-ph

    Atomic transport dynamics in crossed optical dipole trap

    Authors: Peng Peng, Zhengxi Zhang, Yaoyuan Fan, Guoling Yin, Dekai Mao, Xuzong Chen, Wei Xiong, Xiaoji Zhou

    Abstract: We study the dynamical evolution of cold atoms in crossed optical dipole trap theoretically and experimentally. The atomic transport process is accompanied by two competitive kinds of physical mechanics, atomic loading and atomic loss. The loading process normally is negligible in the evaporative cooling experiment on the ground, while it is significant in the preparation of ultra-cold atoms in th…

    Submitted 15 May, 2024; originally announced May 2024.

  21. Multi-Objective Optimization-based Transmit Beamforming for Multi-Target and Multi-User MIMO-ISAC Systems

    Authors: Chunwei Meng, Zhiqing Wei, Dingyou Ma, Wanli Ni, Liyan Su, Zhiyong Feng

    Abstract: Integrated sensing and communication (ISAC) is an enabling technology for the sixth-generation mobile communications, which equips the wireless communication networks with sensing capabilities. In this paper, we investigate transmit beamforming design for multiple-input and multiple-output (MIMO)-ISAC systems in scenarios with multiple radar targets and communication users. A general form of multi…

    Submitted 14 May, 2024; originally announced May 2024.

  22. arXiv:2405.06909  [pdf, ps, other]

    cs.LG cs.AI cs.CY

    Fairness in Reinforcement Learning: A Survey

    Authors: Anka Reuel, Devin Ma

    Abstract: While our understanding of fairness in machine learning has significantly progressed, our understanding of fairness in reinforcement learning (RL) remains nascent. Most of the attention has been on fairness in one-shot classification tasks; however, real-world, RL-enabled systems (e.g., autonomous vehicles) are much more complicated in that agents operate in dynamic environments over a long period…

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 10 pages

    ACM Class: A.1; I.2

  23. arXiv:2405.05983  [pdf]

    cs.CV cs.AI cs.LG

    Real-Time Pill Identification for the Visually Impaired Using Deep Learning

    Authors: Bo Dang, Wenchao Zhao, Yufeng Li, Danqing Ma, Qixuan Yu, Elly Yijun Zhu

    Abstract: The prevalence of mobile technology offers unique opportunities for addressing healthcare challenges, especially for individuals with visual impairments. This paper explores the development and implementation of a deep learning-based mobile application designed to assist blind and visually impaired individuals in real-time pill identification. Utilizing the YOLO framework, the application aims to…

    Submitted 7 May, 2024; originally announced May 2024.

  24. arXiv:2405.04768  [pdf, other]

    cond-mat.mtrl-sci

    Circularly polarized light irradiated ferromagnetic MnBi$_2$Te$_4$: the long-sought ideal Weyl semimetal

    Authors: Shuai Fan, Shengpu Huang, Zhuo Chen, Fangyang Zhan, Xian-Yong Ding, Da-Shuai Ma, Rui Wang

    Abstract: The interaction between light and non-trivial energy band topology allows for the precise manipulation of topological quantum states, which has attracted intensive interest in condensed matter physics. In this work, using first-principles calculations, we studied the topological transition of ferromagnetic (FM) MnBi$_2$Te$_4$ upon irradiation with circularly polarized light (CPL). We revealed that…

    Submitted 7 May, 2024; originally announced May 2024.

  25. arXiv:2405.00513  [pdf]

    q-bio.QM

    3D MR Fingerprinting for Dynamic Contrast-Enhanced Imaging of Whole Mouse Brain

    Authors: Yuran Zhu, Guanhua Wang, Yuning Gu, Walter Zhao, Jiahao Lu, Junqing Zhu, Christina J. MacAskill, Andrew Dupuis, Mark A. Griswold, Dan Ma, Chris A. Flask, Xin Yu

    Abstract: Quantitative MRI enables direct quantification of contrast agent concentrations in contrast-enhanced scans. However, the lengthy scan times required by conventional methods are inadequate for tracking contrast agent transport dynamically in mouse brain. We developed a 3D MR fingerprinting (MRF) method for simultaneous T1 and T2 mapping across the whole mouse brain with 4.3-min temporal resolution.…

    Submitted 1 May, 2024; originally announced May 2024.

  26. arXiv:2404.12634  [pdf]

    cs.CV cs.AI cs.LG

    Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment

    Authors: Danqing Ma, Meng Wang, Ao Xiang, Zongqing Qi, Qin Yang

    Abstract: This study proposes a multi-modal fusion framework Multitrans based on the Transformer architecture and self-attention mechanism. This architecture combines the study of non-contrast computed tomography (NCCT) images and discharge diagnosis reports of patients undergoing stroke treatment, using a variety of methods based on Transformer architecture approach to predicting functional outcomes of str…

    Submitted 19 April, 2024; originally announced April 2024.

  27. arXiv:2404.07506  [pdf, other]

    cond-mat.supr-con

    Flexible Control of Chiral Superconductivity in Optically Driven Nodal Point Superconductors with Antiferromagnetism

    Authors: Zhen Ning, Junjie Zeng, Da-Shuai Ma, Dong-Hui Xu, Rui Wang

    Abstract: Recent studies have attracted widespread attention on magnet-superconductor hybrid systems with emergent topological superconductivity. Here, we present the Floquet engineering of realistic two-dimensional topological nodal-point superconductors that are composed of antiferromagnetic monolayers in proximity to an s-wave superconductor. We show that Floquet chiral topological superconductivity aris…

    Submitted 11 April, 2024; originally announced April 2024.

  28. Cramer-Rao Bounds for Near-Field Sensing: A Generic Modular Architecture

    Authors: Chunwei Meng, Dingyou Ma, Xu Chen, Zhiyong Feng, Yuanwei Liu

    Abstract: A generic modular array architecture is proposed, featuring uniform/non-uniform subarray layouts that allows for flexible deployment. The bistatic near-field sensing system is considered, where the target is located in the near-field of the whole modular array and the far-field of each subarray. Then, the closed-form expressions of Cramer-Rao bounds (CRBs) for range and angle estimations are deriv…

    Submitted 11 April, 2024; originally announced April 2024.

  29. arXiv:2403.19081  [pdf]

    physics.optics

    Surface variation analysis of freeform optical systems over surface frequency bands for prescribed wavefront errors

    Authors: Rundong Fan, Shili Wei, Huiru JI, Zhuang Qian, Hao Tan, Yan Mo, Donglin MA

    Abstract: The surface errors of freeform surfaces reflect the manufacturing complexities and significantly impact the feasibility of processing designed optical systems. With multiple degrees of freedom, freeform surfaces pose challenges in surface tolerance analysis in the field. Nevertheless, current research has neglected the influence of surface slopes on the directions of ray propagation. A sudden alte…

    Submitted 27 March, 2024; originally announced March 2024.

  30. arXiv:2403.18349  [pdf, other]

    cs.CL

    Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback

    Authors: Hongshen Xu, Zichen Zhu, Situo Zhang, Da Ma, Shuai Fan, Lu Chen, Kai Yu

    Abstract: Large Language Models (LLMs) often generate erroneous outputs, known as hallucinations, due to their limitations in discerning questions beyond their knowledge scope. While addressing hallucination has been a focal point in research, previous efforts primarily concentrate on enhancing correctness without giving due consideration to the significance of rejection mechanisms. In this paper, we conduc…

    Submitted 7 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  31. arXiv:2403.17421  [pdf, other]

    cs.IR cs.AI

    MA4DIV: Multi-Agent Reinforcement Learning for Search Result Diversification

    Authors: Yiqun Chen, Jiaxin Mao, Yi Zhang, Dehong Ma, Long Xia, Jun Fan, Daiting Shi, Zhicong Cheng, Simiu Gu, Dawei Yin

    Abstract: The objective of search result diversification (SRD) is to ensure that selected documents cover as many different subtopics as possible. Existing methods primarily utilize a paradigm of "greedy selection", i.e., selecting one document with the highest diversity score at a time. These approaches tend to be inefficient and are easily trapped in a suboptimal state. In addition, some other methods aim…

    Submitted 27 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  32. Utilizing the LightGBM Algorithm for Operator User Credit Assessment Research

    Authors: Shaojie Li, Xinqi Dong, Danqing Ma, Bo Dang, Hengyi Zang, Yulu Gong

    Abstract: Mobile Internet user credit assessment is an important way for communication operators to establish decisions and formulate measures, and it is also a guarantee for operators to obtain expected benefits. However, credit evaluation methods have long been monopolized by financial industries such as banks and credit. As supporters and providers of platform network technology and network resources, co…

    Submitted 21 March, 2024; originally announced March 2024.

    Journal ref: ACE (2024) Vol. 75: 36-47

  33. arXiv:2403.13703  [pdf]

    cs.CV cs.AI

    Fostc3net:A Lightweight YOLOv5 Based On the Network Structure Optimization

    Authors: Danqing Ma, Shaojie Li, Bo Dang, Hengyi Zang, Xinqi Dong

    Abstract: Transmission line detection technology is crucial for automatic monitoring and ensuring the safety of electrical facilities. The YOLOv5 series is currently one of the most advanced and widely used methods for object detection. However, it faces inherent challenges, such as high computational load on devices and insufficient detection accuracy. To address these concerns, this paper presents an enha…

    Submitted 20 March, 2024; originally announced March 2024.

  34. arXiv:2403.12574  [pdf, other]

    cs.CV cs.AI cs.NE

    EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks

    Authors: Ziming Wang, Ziling Wang, Huaning Li, Lang Qin, Runhao Jiang, De Ma, Huajin Tang

    Abstract: Event cameras, with their high dynamic range and temporal resolution, are ideally suited for object detection, especially under scenarios with motion blur and challenging lighting conditions. However, while most existing approaches prioritize optimizing spatiotemporal representations with advanced detection backbones and early aggregation functions, the crucial issue of adaptive event sampling rem…

    Submitted 19 March, 2024; originally announced March 2024.

  35. arXiv:2403.12332  [pdf, other]

    stat.ME

    A maximum penalised likelihood approach for semiparametric accelerated failure time models with time-varying covariates and partly interval censoring

    Authors: Aishwarya Bhaskaran, Ding Ma, Benoit Liquet, Angela Hong, Serigne N Lo, Stephane Heritier, Jun Ma

    Abstract: Accelerated failure time (AFT) models are frequently used for modelling survival data. This approach is attractive as it quantifies the direct relationship between the time until an event occurs and various covariates. It asserts that the failure times experience either acceleration or deceleration through a multiplicative factor when these covariates are present. While existing literature provide…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 31 pages, 5 figures, 4 tables

  36. arXiv:2403.09035  [pdf, other]

    cs.LG

    DiTMoS: Delving into Diverse Tiny-Model Selection on Microcontrollers

    Authors: Xiao Ma, Shengfeng He, Hezhe Qiao, Dong Ma

    Abstract: Enabling efficient and accurate deep neural network (DNN) inference on microcontrollers is non-trivial due to the constrained on-chip resources. Current methodologies primarily focus on compressing larger models yet at the expense of model accuracy. In this paper, we rethink the problem from the inverse perspective by constructing small/weak models directly and improving their accuracy. Thus, we i…

    Submitted 13 March, 2024; originally announced March 2024.

  37. arXiv:2403.08511  [pdf]

    cs.CV

    A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product

    Authors: Ao Xiang, Zongqing Qi, Han Wang, Qin Yang, Danqing Ma

    Abstract: This paper introduces a new multi-modal model based on the Transformer architecture and tensor product fusion strategy, combining BERT's text vectors and ViT's image vectors to classify students' psychological conditions, with an accuracy of 93.65%. The purpose of the study is to accurately analyze the mental health status of students from various data sources. This paper discusses modal fusion me…

    Submitted 19 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  38. arXiv:2403.08499  [pdf]

    cs.CV

    Improved YOLOv5 Based on Attention Mechanism and FasterNet for Foreign Object Detection on Railway and Airway tracks

    Authors: Zongqing Qi, Danqing Ma, Jingyu Xu, Ao Xiang, Hedi Qu

    Abstract: In recent years, there have been frequent incidents of foreign objects intruding into railway and Airport runways. These objects can include pedestrians, vehicles, animals, and debris. This paper introduces an improved YOLOv5 architecture incorporating FasterNet and attention mechanisms to enhance the detection of foreign objects on railways and Airport runways. This study proposes a new dataset,…

    Submitted 13 March, 2024; originally announced March 2024.

  39. arXiv:2403.02586  [pdf, other]

    cs.CL

    Improving Event Definition Following For Zero-Shot Event Detection

    Authors: Zefan Cai, Po-Nien Kung, Ashima Suvarna, Mingyu Derek Ma, Hritik Bansal, Baobao Chang, P. Jeffrey Brantingham, Wei Wang, Nanyun Peng

    Abstract: Existing approaches on zero-shot event detection usually train models on datasets annotated with known event types, and prompt them with unseen event definitions. These approaches yield sporadic successes, yet generally fall short of expectations. In this work, we aim to improve zero-shot event detection by training models to better follow event definitions. We hypothesize that a diverse set of ev…

    Submitted 4 March, 2024; originally announced March 2024.

  40. arXiv:2402.18262  [pdf, other]

    cs.CL cs.CV

    Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

    Authors: Hongshen Xu, Lu Chen, Zihan Zhao, Da Ma, Ruisheng Cao, Zichen Zhu, Kai Yu

    Abstract: The growing prevalence of visually rich documents, such as webpages and scanned/digital-born documents (images, PDFs, etc.), has led to increased interest in automatic document understanding and information extraction across academia and industry. Although various document modalities, including image, text, layout, and structure, facilitate human information retrieval, the interconnected nature of…

    Submitted 28 February, 2024; originally announced February 2024.

  41. arXiv:2402.15725  [pdf, other]

    eess.AS

    Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks

    Authors: Duo Ma, Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li

    Abstract: Human language can be expressed in either written or spoken form, i.e. text or speech. Humans can acquire knowledge from text to improve speaking and listening. However, the quest for speech pre-trained models to leverage unpaired text has just started. In this paper, we investigate a new way to pre-train such a joint speech-text model to learn enhanced speech representations and benefit various s…

    Submitted 28 February, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: 5 pages, 1 figure, 5 tables; submitted to IEEE Signal Processing Letters (SPL)

  42. arXiv:2402.14774  [pdf]

    cond-mat.str-el cond-mat.mes-hall

    Dominant 1/3-filling Correlated Insulator States and Orbital Geometric Frustration in Twisted Bilayer Graphene

    Authors: Haidong Tian, Emilio Codecido, Dan Mao, Kevin Zhang, Shi Che, Kenji Watanabe, Takashi Taniguchi, Dmitry Smirnov, Eun-Ah Kim, Marc Bockrath, Chun Ning Lau

    Abstract: Geometric frustration is a phenomenon in a lattice system where not all interactions can be satisfied, the simplest example being antiferromagnetically coupled spins on a triangular lattice. Frustrated systems are characterized by their many nearly degenerate ground states, leading to non-trivial phases such as spin ice and spin liquids. To date most studies are on geometric frustration of spins;…

    Submitted 22 February, 2024; originally announced February 2024.

  43. arXiv:2402.14677  [pdf, other]

    cond-mat.quant-gas

    Influence of thermal effects on atomic Bloch oscillation

    Authors: Guoling Yin, Chi-Kin Lai, Nana Chang, Yi Liang, Dekai Mao, Xiaoji Zhou

    Abstract: Advancements in the experimental toolbox of cold atoms have enabled the meticulous control of atomic Bloch oscillation within optical lattices, thereby enhancing the capabilities of gravity interferometers. This work delves into the impact of thermal effects on Bloch oscillation in 1D accelerated optical lattices aligned with gravity by varying the system's initial temperature. Through the applica…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 8 pages, 7 figures

  44. arXiv:2402.09264  [pdf, other]

    cs.LG cs.HC

    UR2M: Uncertainty and Resource-Aware Event Detection on Microcontrollers

    Authors: Hong Jia, Young D. Kwon, Dong Ma, Nhat Pham, Lorena Qendro, Tam Vu, Cecilia Mascolo

    Abstract: Traditional machine learning techniques are prone to generating inaccurate predictions when confronted with shifts in the distribution of data between the training and testing phases. This vulnerability can lead to severe consequences, especially in applications such as mobile healthcare. Uncertainty estimation has the potential to mitigate this issue by assessing the reliability of a model's outp…

    Submitted 12 March, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  45. arXiv:2402.07971  [pdf, other]

    cond-mat.str-el cond-mat.dis-nn

    Quasicrystalline Spin Liquid

    Authors: Sunghoon Kim, Mohammad Saad, Dan Mao, Adhip Agarwala, Debanjan Chowdhury

    Abstract: The interplay of electronic interactions and frustration in crystalline systems leads to a panoply of correlated phases, including exotic Mott insulators with non-trivial patterns of entanglement. Disorder introduces additional quantum interference effects that can drive localization phenomena. Quasicrystals, which are neither disordered nor perfectly crystalline, are interesting playgrounds for s…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 5 pages, 3 figures. Supplementary material: 5 pages, 7 figures

  46. arXiv:2402.06177  [pdf, other]

    math.CO

    Hamiltonicity of Sparse Pseudorandom Graphs

    Authors: Asaf Ferber, Jie Han, Dingjia Mao, Roman Vershynin

    Abstract: We show that every $(n,d,λ)$-graph contains a Hamilton cycle for sufficiently large $n$, assuming that $d\geq \log^{10}n$ and $λ\leq cd$, where $c=\frac{1}{9000}$. This significantly improves a recent result of Glock, Correia and Sudakov, who obtain a similar result for $d$ that grows polynomially with $n$. The proof is based on the absorption technique combined with a new result regarding the sec…

    Submitted 8 February, 2024; originally announced February 2024.

  47. arXiv:2402.04245  [pdf, other]

    quant-ph

    Direct evidence for cosmic-ray-induced correlated errors in superconducting qubit array

    Authors: Xue-Gang Li, Jun-Hua Wang, Yao-Yao Jiang, Guang-Ming Xue, Xiao-Xia Cai, Jun Zhou, Ming Gong, Zhao-Feng Liu, Shuang-Yu Zheng, Deng-Ke Ma, Mo Chen, Wei-Jie Sun, Shuang Yang, Fei Yan, Yi-Rong Jin, Xue-Feng Ding, Hai-Feng Yu

    Abstract: Correlated errors can significantly impact the quantum error correction, which challenges the assumption that errors occur in different qubits independently in both space and time. Superconducting qubits have been found to suffer correlated errors across multiple qubits, which could be attributable to ionizing radiations and cosmic rays. Nevertheless, the direct evidence and a quantitative underst…

    Submitted 23 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 7 pages and 5 figures for the main text, 20 pages and 20 figures for the supplementary materials

  48. arXiv:2402.03557  [pdf, other]

    cs.CV

    Robust Analysis of Multi-Task Learning Efficiency: New Benchmarks on Light-Weighed Backbones and Effective Measurement of Multi-Task Learning Challenges by Feature Disentanglement

    Authors: Dayou Mao, Yuhao Chen, Yifan Wu, Maximilian Gilles, Alexander Wong

    Abstract: One of the main motivations of MTL is to develop neural networks capable of inferring multiple tasks simultaneously. While countless methods have been proposed in the past decade investigating robust model architectures and efficient training algorithms, there is still a lack of understanding of these methods when applied on smaller feature extraction backbones, the generalizability of the commonly…

    Submitted 16 April, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  49. 1002 km Twin-Field Quantum Key Distribution with Finite-Key Analysis

    Authors: Yang Liu, Wei-Jun Zhang, Cong Jiang, Jiu-Peng Chen, Di Ma, Chi Zhang, Wen-Xin Pan, Hao Dong, Jia-Min Xiong, Cheng-Jun Zhang, Hao Li, Rui-Chun Wang, Chao-Yang Lu, Jun Wu, Teng-Yun Chen, Lixing You, Xiang-Bin Wang, Qiang Zhang, Jian-Wei Pan

    Abstract: Quantum key distribution (QKD) holds the potential to establish secure keys over long distances. The distance of point-to-point QKD secure key distribution is primarily impeded by the transmission loss inherent to the channel. In the quest to realize a large-scale quantum network, increasing the QKD distance under current technology is of great research interest. Here we adopt the 3-intensity send…

    Submitted 1 December, 2023; originally announced February 2024.

    Comments: 18 pages, 3 figures

    Journal ref: Quantum Front 2, 16 (2023)

  50. arXiv:2401.14818  [pdf, other]

    cs.CL cs.DL

    ChemDFM: Dialogue Foundation Model for Chemistry

    Authors: Zihan Zhao, Da Ma, Lu Chen, Liangtai Sun, Zihao Li, Hongshen Xu, Zichen Zhu, Su Zhu, Shuai Fan, Guodong Shen, Xin Chen, Kai Yu

    Abstract: Large language models (LLMs) have established great success in the general domain of natural language processing. Their emerging task generalization and free-form dialogue capabilities can greatly help to design Chemical General Intelligence (CGI) to assist real-world research in chemistry. However, the existence of specialized language and knowledge in the field of chemistry, such as the highly i…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 10 pages, 12 figures, 13 tables. Under Review