Showing 201–250 of 1,363 results for author: Gao, Z

  1. arXiv:2312.04934  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Gate-controlled neuromorphic functional transition in an electrochemical graphene transistor

    Authors: Chenglin Yu, Shaorui Li, Zhoujie Pan, Yanming Liu, Yongchao Wang, Siyi Zhou, Zhiting Gao, He Tian, Kaili Jiang, Yayu Wang, Jinsong Zhang

    Abstract: Neuromorphic devices have gained significant attention as potential building blocks for the next generation of computing technologies owing to their ability to emulate the functionalities of biological nervous systems. The essential components in artificial neural networks, such as synapses and neurons, are predominantly implemented by dedicated devices with specific functionalities. In this work, we…

    Submitted 31 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: 22 pages, 4 figures

    Journal ref: Nano Lett. 2024, 24, 5, 1620-1628

  2. arXiv:2312.04019  [pdf, other]

    q-bio.BM cs.AI

    Efficiently Predicting Protein Stability Changes Upon Single-point Mutation with Large Language Models

    Authors: Yijie Zhang, Zhangyang Gao, Cheng Tan, Stan Z. Li

    Abstract: Predicting protein stability changes induced by single-point mutations has been a persistent challenge over the years, attracting immense interest from numerous researchers. The ability to precisely predict protein thermostability is pivotal for various subfields and applications in biochemistry, including drug development, protein evolution analysis, and enzyme synthesis. Despite the proposition…

    Submitted 6 December, 2023; originally announced December 2023.

  3. arXiv:2312.02586  [pdf]

    cs.HC cs.IT

    Mapping the Information Journey: Unveiling the Documentation Experience of Software Developers in China

    Authors: Zhijun Gao, Jiangying Wang, Meina Wang

    Abstract: This research delves into understanding the behaviors and characteristics of Chinese developers in relation to their use of technical documentation, which is crucial for creating high-quality developer documentation. We conducted interviews with 25 software developers and surveyed 177 participants, using the preliminary interview findings to inform the survey design. Our approach encompassed tradi…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 27 pages

  4. arXiv:2312.02372  [pdf, other]

    eess.SP cs.LG

    On the Trade-Off between Stability and Representational Capacity in Graph Neural Networks

    Authors: Zhan Gao, Amanda Prorok, Elvin Isufi

    Abstract: Analyzing the stability of graph neural networks (GNNs) under topological perturbations is key to understanding their transferability and the role of each architecture component. However, stability has been investigated only for particular architectures, questioning whether it holds for a broader spectrum of GNNs or only for a few instances. To answer this question, we study the stability of EdgeN…

    Submitted 4 December, 2023; originally announced December 2023.

  5. arXiv:2312.01987  [pdf, other]

    cs.CV

    Bootstrapping SparseFormers from Vision Foundation Models

    Authors: Ziteng Gao, Zhan Tong, Kevin Qinghong Lin, Joya Chen, Mike Zheng Shou

    Abstract: The recently proposed SparseFormer architecture provides an alternative approach to visual understanding by utilizing a significantly lower number of visual tokens via adjusting RoIs, greatly reducing computational costs while still achieving promising performance. However, training SparseFormers from scratch is still expensive, and scaling up the number of parameters can be challenging. In this p…

    Submitted 4 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  6. arXiv:2312.01573  [pdf]

    eess.IV cs.CV

    Survey on deep learning in multimodal medical imaging for cancer detection

    Authors: Yan Tian, Zhaocheng Xu, Yujun Ma, Weiping Ding, Ruili Wang, Zhihong Gao, Guohua Cheng, Linyang He, Xuran Zhao

    Abstract: The task of multimodal cancer detection is to determine the locations and categories of lesions by using different imaging techniques, which is one of the key research methods for cancer diagnosis. Recently, deep learning-based object detection has made significant developments due to its strength in semantic feature extraction and nonlinear function fitting. However, multimodal cancer detection r…

    Submitted 3 December, 2023; originally announced December 2023.

    Journal ref: Neural Computing and Applications. 2023 Nov 29:1-6

  7. arXiv:2311.18173  [pdf]

    eess.IV cs.CE cs.CV

    Quantification of cardiac capillarization in single-immunostained myocardial slices using weakly supervised instance segmentation

    Authors: Zhao Zhang, Xiwen Chen, William Richardson, Bruce Z. Gao, Abolfazl Razi, Tong Ye

    Abstract: Decreased myocardial capillary density has been reported as an important histopathological feature associated with various heart disorders. Quantitative assessment of cardiac capillarization typically involves double immunostaining of cardiomyocytes (CMs) and capillaries in myocardial slices. In contrast, single immunostaining of basement membrane components is a straightforward approach to simult…

    Submitted 29 November, 2023; originally announced November 2023.

  8. arXiv:2311.16027  [pdf]

    cs.HC cs.AI

    An HCAI Methodological Framework: Putting It Into Action to Enable Human-Centered AI

    Authors: Wei Xu, Zaifeng Gao, Marvin Dainoff

    Abstract: Human-centered AI (HCAI), as a design philosophy, advocates prioritizing humans in designing, developing, and deploying intelligent systems, aiming to maximize the benefits of AI technology to humans and avoid its potential adverse effects. While HCAI has gained momentum, the lack of guidance on methodology in its implementation makes its adoption challenging. After assessing the needs for a metho…

    Submitted 30 November, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

  9. A Deep-learning Real-time Bias Correction Method for Significant Wave Height Forecasts in the Western North Pacific

    Authors: Wei Zhang, Yu Sun, Yapeng Wu, Junyu Dong, Xiaojiang Song, Zhiyi Gao, Renbo Pang, Boyu Guoan

    Abstract: Significant wave height is one of the most important parameters characterizing ocean waves, and accurate numerical ocean wave forecasting is crucial for coastal protection and shipping. However, due to the randomness and nonlinearity of the wind fields that generate ocean waves and the complex interaction between wave and wind fields, current forecasts of numerical ocean waves have biases. In this…

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: 21 pages

    Journal ref: Ocean Modelling, Volume 187, February 2024, 102289

  10. arXiv:2311.14109  [pdf, other]

    cs.AI

    Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training

    Authors: Cheng Tan, Jingxuan Wei, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Ruifeng Guo, Bihui Yu, Stan Z. Li

    Abstract: Multimodal reasoning is a challenging task that requires models to reason across multiple modalities to answer questions. Existing approaches have made progress by incorporating language and visual modalities into a two-stage reasoning framework, separating rationale generation from answer inference. However, these approaches often fall short due to the inadequate quality of the generated rational…

    Submitted 2 July, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: Accepted by ECCV 2024

  11. arXiv:2311.12420  [pdf, other]

    cs.AI cs.CL cs.CR

    How Far Have We Gone in Vulnerability Detection Using Large Language Models

    Authors: Zeyu Gao, Hao Wang, Yuchen Zhou, Wenyu Zhu, Chao Zhang

    Abstract: As software becomes increasingly complex and prone to vulnerabilities, automated vulnerability detection is critically important, yet challenging. Given the significant successes of large language models (LLMs) in various tasks, there is growing anticipation of their efficacy in vulnerability detection. However, a quantitative understanding of their potential in vulnerability detection is still mi…

    Submitted 22 December, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

  12. arXiv:2311.11519  [pdf, other]

    physics.atom-ph physics.optics

    Opportunities for Gas-Phase Science at Short-Wavelength Free-Electron Lasers with Undulator-Based Polarization Control

    Authors: Markus Ilchen, Enrico Allaria, Primož Rebernik Ribič, Heinz-Dieter Nuhn, Alberto Lutman, Evgeny Schneidmiller, Markus Tischer, Mikail Yurkov, Marco Calvi, Eduard Prat, Sven Reiche, Thomas Schmidt, Gianluca Aldo Geloni, Suren Karabekyan, Jiawei Yan, Svitozar Serkez, Zhangfeng Gao, Bangjie Deng, Chao Feng, Haixiao Deng, Wolfram Helml, Lars Funke, Mats Larsson, Vitali Zhaunerchyk, et al. (22 additional authors not shown)

    Abstract: Free-electron lasers (FELs) are the world's most brilliant light sources with rapidly evolving technological capabilities in terms of ultrabright and ultrashort pulses over a large range of accessible photon energies. Their revolutionary and innovative developments have opened new fields of science regarding nonlinear light-matter interaction, the investigation of ultrafast processes from specific…

    Submitted 19 November, 2023; originally announced November 2023.

  13. arXiv:2311.11120  [pdf]

    cs.AI

    An Improved Neural Network Model Based On CNN Using For Fruit Sugar Degree Detection

    Authors: Boyang Deng, Xin Wen, Zhan Gao

    Abstract: Artificial Intelligence (AI) is widely applied in image classification and recognition, text understanding, and natural language processing, where it has made great progress. In this paper, we introduce AI into the fruit quality detection field. We designed a fruit sugar degree regression model using an Artificial Neural Network based on spectra of fruits within the visible/near-infrared (V/NIR) range. After…

    Submitted 18 November, 2023; originally announced November 2023.

  14. arXiv:2311.09737  [pdf, other]

    cs.CV

    Gradient-Map-Guided Adaptive Domain Generalization for Cross Modality MRI Segmentation

    Authors: Bingnan Li, Zhitong Gao, Xuming He

    Abstract: Cross-modal MRI segmentation is of great value for computer-aided medical diagnosis, enabling flexible data acquisition and model generalization. However, most existing methods have difficulty in handling local variations in domain shift and typically require a significant amount of data for training, which hinders their usage in practice. To address these problems, we propose a novel adaptive dom…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 9 pages, Machine Learning for Health (ML4H) 2023

  15. arXiv:2311.08897  [pdf, other]

    nucl-th

    Role of the isospin diffusion on cluster transfer in $^{12,14}$C + $^{209}$Bi reactions

    Authors: Zepeng Gao, Yinu Zhang, Long Zhu, Zehong Liao, Yu Yang, Chenchen Guo, Jun Su

    Abstract: Heavy-ion collisions at near-barrier energies provide a crucial pathway for investigating nucleon correlations and clustering structures. Recent experimental results showed that the valence neutrons in light projectiles clearly enhance the $\alpha$ transfer. This finding is both puzzling and fascinating because it unexpectedly violates the ground-state $Q$-value systematics. In this work, the…

    Submitted 16 November, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

  16. arXiv:2311.06770  [pdf, other]

    cs.IT eess.SP

    Compressive Sensing-Based Grant-Free Massive Access for 6G Massive Communication

    Authors: Zhen Gao, Malong Ke, Yikun Mei, Li Qiao, Sheng Chen, Derrick Wing Kwan Ng, H. Vincent Poor

    Abstract: The advent of the sixth-generation (6G) of wireless communications has given rise to the necessity to connect vast quantities of heterogeneous wireless devices, which requires advanced system capabilities far beyond existing network architectures. In particular, such massive communication has been recognized as a prime driver that can empower the 6G vision of future ubiquitous connectivity, suppor…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted by IEEE IoT Journal

  17. arXiv:2311.06703  [pdf]

    cs.AI cs.CY cs.SE

    Enabling Human-Centered AI: A Methodological Perspective

    Authors: Wei Xu, Zaifeng Gao

    Abstract: Human-centered AI (HCAI) is a design philosophy that advocates prioritizing humans in designing, developing, and deploying intelligent systems, aiming to maximize the benefits of AI to humans and avoid potential adverse impacts. While HCAI continues to gain influence, the lack of guidance on methodology in practice makes its adoption challenging. This paper proposes a comprehensive HCAI framework based…

    Submitted 14 November, 2023; v1 submitted 11 November, 2023; originally announced November 2023.

  18. arXiv:2311.04418  [pdf, other]

    cond-mat.mtrl-sci cs.AI physics.comp-ph

    AI-accelerated Discovery of Altermagnetic Materials

    Authors: Ze-Feng Gao, Shuai Qu, Bocheng Zeng, Yang Liu, Ji-Rong Wen, Hao Sun, Peng-Jie Guo, Zhong-Yi Lu

    Abstract: Altermagnetism, a new magnetic phase, has been theoretically proposed and experimentally verified to be distinct from ferromagnetism and antiferromagnetism. Although altermagnets have been found to possess many exotic physical properties, the very limited availability of known altermagnetic materials (e.g., 14 confirmed materials) hinders the study of such properties. Hence, discovering more types…

    Submitted 12 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 38 pages; 22 figures; 3 tables

  19. arXiv:2310.20157  [pdf, other]

    physics.optics

    Electrically empowered microcomb laser

    Authors: Jingwei Ling, Zhengdong Gao, Shixin Xue, Qili Hu, Mingxiao Li, Kaibo Zhang, Usman A. Javid, Raymond Lopez-Rios, Jeremy Staffa, Qiang Lin

    Abstract: The optical frequency comb underpins a wide range of applications, from communication and metrology to sensing. Its development on a chip-scale platform, the so-called soliton microcomb, provides a promising path towards system miniaturization and functionality integration via photonic integrated circuit (PIC) technology. Although extensively explored in recent years, challenges remain in key aspects of…

    Submitted 30 October, 2023; originally announced October 2023.

  20. arXiv:2310.19535  [pdf, other]

    cs.CV

    Revitalizing Legacy Video Content: Deinterlacing with Bidirectional Information Propagation

    Authors: Zhaowei Gao, Mingyang Song, Christopher Schroers, Yang Zhang

    Abstract: Due to old CRT display technology and limited transmission bandwidth, early film and TV broadcasts commonly used interlaced scanning. This meant each field contained only half of the information. Since modern displays require full frames, this has spurred research into deinterlacing, i.e. restoring the missing information in legacy video content. In this paper, we present a deep-learning-based met…

    Submitted 5 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  21. arXiv:2310.19167  [pdf, other]

    cs.LG cs.AI stat.ML

    Rare Event Probability Learning by Normalizing Flows

    Authors: Zhenggqi Gao, Dinghuai Zhang, Luca Daniel, Duane S. Boning

    Abstract: A rare event is defined by a low probability of occurrence. Accurate estimation of such small probabilities is of utmost importance across diverse domains. Conventional Monte Carlo methods are inefficient, demanding an exorbitant number of samples to achieve reliable estimates. Inspired by the exact sampling capabilities of normalizing flows, we revisit this challenge and propose normalizing flow…

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: 16 pages, 5 figures, 2 tables

  22. arXiv:2310.18180  [pdf, other]

    cs.IT eess.SP

    DPSS-based Codebook Design for Near-Field XL-MIMO Channel Estimation

    Authors: Shicong Liu, Xianghao Yu, Zhen Gao, Derrick Wing Kwan Ng

    Abstract: Future sixth-generation (6G) systems are expected to leverage extremely large-scale multiple-input multiple-output (XL-MIMO) technology, which significantly expands the range of the near-field region. While accurate channel estimation is essential for beamforming and data detection, the unique characteristics of near-field channels pose additional challenges to the effective acquisition of channel…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 6 pages, 5 figures

  23. arXiv:2310.17844  [pdf, other]

    math.NA stat.CO stat.ML

    Adaptive operator learning for infinite-dimensional Bayesian inverse problems

    Authors: Zhiwei Gao, Liang Yan, Tao Zhou

    Abstract: The fundamental computational issues in Bayesian inverse problems (BIP) governed by partial differential equations (PDEs) stem from the requirement of repeated forward model evaluations. A popular strategy to reduce such costs is to replace expensive model simulations with computationally efficient approximations using operator learning, motivated by recent progress in deep learning. However, usin…

    Submitted 4 March, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  24. arXiv:2310.17796  [pdf, other]

    cs.CV cs.MM

    ControlLLM: Augment Language Models with Tools by Searching on Graphs

    Authors: Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Ziheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang

    Abstract: We present ControlLLM, a novel framework that enables large language models (LLMs) to utilize multi-modal tools for solving complex real-world tasks. Despite the remarkable performance of LLMs, they still struggle with tool invocation due to ambiguous user prompts, inaccurate tool selection and parameterization, and inefficient tool scheduling. To overcome these challenges, our framework comprises…

    Submitted 18 December, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 24 pages, 9 figures, 12 tables

  25. arXiv:2310.17570  [pdf, other]

    cs.CL

    DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation

    Authors: Yongxin Zhu, Zhujin Gao, Xinyuan Zhou, Zhongyi Ye, Linli Xu

    Abstract: While Diffusion Generative Models have achieved great success on image generation tasks, how to efficiently and effectively incorporate them into speech generation, especially translation tasks, remains a non-trivial problem. Specifically, due to the low information density of speech data, the transformed discrete speech unit sequence is much longer than the corresponding text transcription, posing…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted in EMNLP2023 main conference

  26. arXiv:2310.16861  [pdf, other]

    cs.LG cs.CV

    General Point Model with Autoencoding and Autoregressive

    Authors: Zhe Li, Zhangyang Gao, Cheng Tan, Stan Z. Li, Laurence T. Yang

    Abstract: The pre-training architectures of large language models encompass various types, including autoencoding models, autoregressive models, and encoder-decoder models. We posit that any modality can potentially benefit from a large language model, as long as it undergoes vector quantization to become discrete tokens. Inspired by GLM, we propose a General Point Model (GPM) which seamlessly integrates au…

    Submitted 25 October, 2023; originally announced October 2023.

  27. arXiv:2310.16421  [pdf, other]

    cs.AI

    Graph Agent: Explicit Reasoning Agent for Graphs

    Authors: Qinyong Wang, Zhenxiang Gao, Rong Xu

    Abstract: Graph embedding methods such as Graph Neural Networks (GNNs) and Graph Transformers have contributed to the development of graph reasoning algorithms for various tasks on knowledge graphs. However, the lack of interpretability and explainability of graph embedding methods has limited their applicability in scenarios requiring explicit reasoning. In this paper, we introduce the Graph Agent (GA), an…

    Submitted 25 October, 2023; originally announced October 2023.

  28. arXiv:2310.15872  [pdf, other]

    cs.LG cs.AI cs.AR

    KirchhoffNet: A Scalable Ultra Fast Analog Neural Network

    Authors: Zhengqi Gao, Fan-Keng Sun, Ron Rohrer, Duane S. Boning

    Abstract: In this paper, we leverage a foundational principle of analog electronic circuitry, Kirchhoff's current and voltage laws, to introduce a distinctive class of neural network models termed KirchhoffNet. Essentially, KirchhoffNet is an analog circuit that can function as a neural network, utilizing its initial node voltages as the neural network input and the node voltages at a specific time point as…

    Submitted 6 May, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 9 pages, 10 figures

  29. arXiv:2310.15416  [pdf, other]

    cs.LG cs.AI

    Nominality Score Conditioned Time Series Anomaly Detection by Point/Sequential Reconstruction

    Authors: Chih-Yu Lai, Fan-Keng Sun, Zhengqi Gao, Jeffrey H. Lang, Duane S. Boning

    Abstract: Time series anomaly detection is challenging due to the complexity and variety of patterns that can occur. One major difficulty arises from modeling time-dependent relationships to find contextual anomalies while maintaining detection accuracy for point anomalies. In this paper, we propose a framework for unsupervised time series anomaly detection that utilizes point-based and sequence-based recon…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 (https://neurips.cc/virtual/2023/poster/70582)

  30. arXiv:2310.14860  [pdf, other]

    cs.RO

    Adaptive Tuning of Robotic Polishing Skills based on Force Feedback Model

    Authors: Yu Wang, Zhouyi Zheng, Chen Chen, Zezheng Wang, Zhitao Gao, Fangyu Peng, Xiaowei Tang, Rong Yan

    Abstract: Acquiring human skills offers an efficient approach to tackle complex task planning challenges. When performing a learned skill model for a continuous contact task, such as robot polishing in an uncertain environment, the robot needs to be able to adaptively modify the skill model to suit the environment and perform the desired task. The environmental perturbation of the polishing task is mainly r…

    Submitted 22 November, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted by The 2023 IEEE International Conference on Robotics and Biomimetics (IEEE ROBIO 2023)

  31. VR PreM+ : An Immersive Pre-learning Branching Visualization System for Museum Tours

    Authors: Ze Gao, Xiang Li, Changkun Liu, Xian Wang, Anqi Wang, Liang Yang, Yuyang Wang, Pan Hui, Tristan Braud

    Abstract: We present VR PreM+, an innovative VR system designed to enhance web exploration beyond traditional computer screens. Unlike static 2D displays, VR PreM+ leverages 3D environments to create an immersive pre-learning experience. Using keyword-based information retrieval allows users to manage and connect various content sources in a dynamic 3D space, improving communication and data comparison. We…

    Submitted 1 November, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted for publication at The Eleventh International Symposium of Chinese CHI (Chinese CHI 2023), Bali

    MSC Class: 14J60 (Primary) 14F05; 14J26 (Secondary) ACM Class: F.2.2; I.2.7

  32. arXiv:2310.13039  [pdf, other]

    cs.CV

    Human Pose-based Estimation, Tracking and Action Recognition with Deep Learning: A Survey

    Authors: Lijuan Zhou, Xiang Meng, Zhihuan Liu, Mengqi Wu, Zhimin Gao, Pichao Wang

    Abstract: Human pose analysis has garnered significant attention within both the research community and practical applications, owing to its expanding array of uses, including gaming, video surveillance, sports performance analysis, and human-computer interactions, among others. The advent of deep learning has significantly improved the accuracy of pose capture, making pose-based applications increasingly p…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 47 pages

  33. arXiv:2310.12792  [pdf, other]

    cs.CG

    Almost Optimal Locality Sensitive Orderings in Euclidean Space

    Authors: Zhimeng Gao, Sariel Har-Peled

    Submitted 21 February, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: To appear in SoCG 2024

  34. arXiv:2310.12349  [pdf, other]

    cs.CE stat.AP

    Developing 3D Virtual Safety Risk Terrain for UAS Operations in Complex Urban Environments

    Authors: Zhenyu Gao, John-Paul Clarke, Javid Mardanov, Karen Marais

    Abstract: Unmanned Aerial Systems (UAS), an integral part of the Advanced Air Mobility (AAM) vision, are capable of performing a wide spectrum of tasks in urban environments. The societal integration of UAS is a pivotal challenge, as these systems must operate harmoniously within the constraints imposed by regulations and societal concerns. In complex urban environments, UAS safety has been a perennial obst…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 33 pages, 19 figures

  35. arXiv:2310.11870  [pdf, other]

    cs.CL cs.AI

    AI Nushu: An Exploration of Language Emergence in Sisterhood -Through the Lens of Computational Linguistics

    Authors: Yuqian Sun, Yuying Tang, Ze Gao, Zhijun Pan, Chuyan Xu, Yurou Chen, Kejiang Qian, Zhigang Wang, Tristan Braud, Chang Hee Lee, Ali Asadipour

    Abstract: This paper presents "AI Nushu," an emerging language system inspired by Nushu (women's scripts), the unique language created and used exclusively by ancient Chinese women who were thought to be illiterate under a patriarchal society. In this interactive installation, two artificial intelligence (AI) agents are trained in the Chinese dictionary and the Nushu corpus. By continually observing their e…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted for publication at SIGGRAPH Asia 2023

    MSC Class: 14J60 (Primary) 14F05; 14J26 (Secondary) ACM Class: F.2.2; I.2.7

  36. arXiv:2310.11466  [pdf, other]

    cs.LG cs.AI q-bio.QM

    Protein 3D Graph Structure Learning for Robust Structure-based Protein Property Prediction

    Authors: Yufei Huang, Siyuan Li, Jin Su, Lirong Wu, Odin Zhang, Haitao Lin, Jingqi Qi, Zihan Liu, Zhangyang Gao, Yuyang Liu, Jiangbin Zheng, Stan. ZQ. Li

    Abstract: Protein structure-based property prediction has emerged as a promising approach for various biological tasks, such as protein function prediction and sub-cellular location estimation. The existing methods highly rely on experimental protein structure data and fail in scenarios where these data are unavailable. Predicted protein structures from AI tools (e.g., AlphaFold2) were utilized as alternati…

    Submitted 19 October, 2023; v1 submitted 14 October, 2023; originally announced October 2023.

  37. arXiv:2310.10060  [pdf, other]

    cs.LG

    Data Augmentation for Time-Series Classification: An Extensive Empirical Study and Comprehensive Survey

    Authors: Zijun Gao, Lingbo Li

    Abstract: Data Augmentation (DA) has emerged as an indispensable strategy in Time Series Classification (TSC), primarily due to its capacity to amplify training samples, thereby bolstering model robustness, diversifying datasets, and curtailing overfitting. However, the current landscape of DA in TSC is plagued with fragmented literature reviews, nebulous methodological taxonomies, inadequate evaluative mea…

    Submitted 9 April, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  38. Staged Depthwise Correlation and Feature Fusion for Siamese Object Tracking

    Authors: Dianbo Ma, Jianqiang Xiao, Ziyan Gao, Satoshi Yamane

    Abstract: In this work, we propose a novel staged depthwise correlation and feature fusion network, named DCFFNet, to further optimize the feature extraction for visual tracking. We build our deep tracker upon a siamese network architecture, which is offline trained from scratch on multiple large-scale datasets in an end-to-end manner. The model contains a core component, that is, depthwise correlation and…

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Accepted in 2023 International Joint Conference on Neural Networks (IJCNN)

  39. arXiv:2310.08825  [pdf, other]

    cs.CV

    From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

    Authors: Dongsheng Jiang, Yuchen Liu, Songlin Liu, Jin'e Zhao, Hao Zhang, Zhen Gao, Xiaopeng Zhang, Jin Li, Hongkai Xiong

    Abstract: Multi-modal Large Language Models (MLLMs) have made significant strides in expanding the capabilities of Large Language Models (LLMs) through the incorporation of visual perception interfaces. Despite the emergence of exciting applications and the availability of diverse instruction tuning data, existing approaches often rely on CLIP or its variants as the visual branch, and merely extract feature…

    Submitted 7 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  40. arXiv:2310.06357  [pdf, other]

    stat.ME stat.AP

    Adaptive Storey's null proportion estimator

    Authors: Zijun Gao

    Abstract: False discovery rate (FDR) is a commonly used criterion in multiple testing and the Benjamini-Hochberg (BH) procedure is arguably the most popular approach with FDR guarantee. To improve power, the adaptive BH procedure has been proposed by incorporating various null proportion estimators, among which Storey's estimator has gained substantial popularity. The performance of Storey's estimator hinge…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 17 pages, 4 figures, 1 table

  41. arXiv:2310.06024  [pdf, other]

    cond-mat.supr-con cond-mat.mes-hall cond-mat.str-el

    Pair-breaking scattering interference as a mechanism for superconducting gap modulation

    Authors: Zhi-Qiang Gao, Yu-Ping Lin, Dung-Hai Lee

    Abstract: We propose the "pair-breaking scattering interference" as a general source of coherence peak modulations in superconductors. Assuming this mechanism, we present a simple physical picture for the coherence peak modulations in overdoped cuprate Bi$_2$Sr$_2$Ca$_2$Cu$_3$O$_{10+δ}$ (Bi-2223), ferromagnetic iron pnictide EuRbFe$_4$As$_4$ (Eu-1144), and kagome metals $A$V$_3$Sb$_5$ ($A=$ K, Rb, and Cs).…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 5+6 pages, 3+2 figures

  42. arXiv:2310.05829  [pdf, other]

    cs.CV

    Revisiting the Temporal Modeling in Spatio-Temporal Predictive Learning under A Unified View

    Authors: Cheng Tan, Jue Wang, Zhangyang Gao, Siyuan Li, Lirong Wu, Jun Xia, Stan Z. Li

    Abstract: Spatio-temporal predictive learning plays a crucial role in self-supervised learning, with wide-ranging applications across a diverse range of fields. Previous approaches for temporal modeling fall into two categories: recurrent-based and recurrent-free methods. The former, while meticulously processing frames one by one, neglect short-term spatio-temporal information redundancies, leading to inef…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Under review

  43. arXiv:2310.04985  [pdf, other]

    cs.CE

    VQPL: Vector Quantized Protein Language

    Authors: Zhangyang Gao, Cheng Tan, Stan Z. Li

    Abstract: Is there a foreign language describing protein sequences and structures simultaneously? Protein structures, represented by continuous 3D points, have long posed a challenge due to the contrasting modeling paradigms of discrete sequences. To represent protein sequence-structure as discrete symbols, we propose a VQProteinformer to project residue types and structures into a discrete space, supervise…

    Submitted 7 October, 2023; originally announced October 2023.

  44. arXiv:2310.04771  [pdf, other]

    cs.HC cs.MM

    Embodied Cognition Guides Virtual-Real Interaction Design to Help Yicheng Flower Drum Intangible Cultural Heritage Dissemination

    Authors: Yuhan Ma, Weiran Zhao, Xiaolin Zhang, Ze Gao

    Abstract: In order to make the non-heritage culture of Yicheng Flower Drum more relevant to the trend of the digital era and promote its dissemination and inheritance, the design and application of gesture recognition and virtual reality technologies guided by embodied cognition theory in the process of non-heritage culture dissemination is studied. At the same time, it will enhance the interaction between…

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 7 pages, 4 figures

    MSC Class: 14J60 (Primary) 14F05; 14J26 (Secondary) ACM Class: F.2.2; I.2.7

  45. arXiv:2310.04700  [pdf, other]

    nucl-th nucl-ex

    Importance of physical information on the prediction of heavy-ion fusion cross section with machine learning

    Authors: Zhilong Li, Zepeng Gao, Ling Liu, Yongjia Wang, Long Zhu, Qingfeng Li

    Abstract: In this work, the Light Gradient Boosting Machine (LightGBM), which is a modern decision tree based machine-learning algorithm, is used to study the fusion cross section (CS) of heavy-ion reaction. Several basic quantities (e.g., mass number and proton number of projectile and target) and the CS obtained from phenomenological formula are fed into the LightGBM algorithm to predict the CS. It is fou…

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 12 pages, 7 figures

  46. arXiv:2310.04673  [pdf, other]

    cs.SD cs.AI cs.LG cs.MM eess.AS

    LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT

    Authors: Zhihao Du, Jiaming Wang, Qian Chen, Yunfei Chu, Zhifu Gao, Zerui Li, Kai Hu, Xiaohuan Zhou, Jin Xu, Ziyang Ma, Wen Wang, Siqi Zheng, Chang Zhou, Zhijie Yan, Shiliang Zhang

    Abstract: Generative Pre-trained Transformer (GPT) models have achieved remarkable performance on various natural language processing tasks, and have shown great potential as backbones for audio-and-text large language models (LLMs). Previous mainstream audio-and-text LLMs use discrete audio tokens to represent both input and output audio; however, they suffer from performance degradation on tasks such as a…

    Submitted 2 July, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 10 pages, work in progress

  47. arXiv:2310.04282  [pdf, other]

    nucl-ex nucl-th

    Multi-alpha Boson Gas state in Fusion Evaporation Reaction and Three-body Force

    Authors: Taofeng Wang, Ziming Li, R. B. Wiringa, Minliang Liu, Jiansong Wang, Yanyun Yang, Qinghua He, Zhiyu Sun, Chengjian Lin, M. Assié, Y. Ayyad, D. Beaumel, Zhen Bai, Fangfang Duan, Zhihao Gao, Song Guo, Yue Hu, Wei Jiang, F. Kobayashi, Chengui Lu, Junbing Ma, Peng Ma, P. Napolitani, G. Verde, Jianguo Wang, et al. (11 additional authors not shown)

    Abstract: The experimental evidence for the $α$ Boson gas state in the $^{11}$C+$^{12}$C$\rightarrow$$^{23}$Mg$^{\ast}$ fusion evaporation reaction is presented. By measuring the $α$ emission spectrum with multiplicity 2 and 3, we provide insight into the existence of a three-body force among $α$ particles. The observed spectrum exhibited distinct tails corresponding to $α$ particles emitted in pairs and tr…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 7 pages, 6 figures

  48. arXiv:2310.04274  [pdf, other]

    nucl-ex nucl-th

    Aspect of Clusters Correlation at Light Nuclei Excited State

    Authors: Ziming Li, Jie Zhu, Taofeng Wang, Minliang Liu, Jiansong Wang, Yanyun Yang, Chengjian Lin, Zhiyu Sun, Qinghua He, M. Assié, Y. Ayyad, D. Beaumel, Zhen Bai, Fangfang Duan, Zhihao Gao, Song Guo, Yue Hu, Wei Jiang, F. Kobayashi, Chengui Lu, Junbing Ma, Peng Ma, P. Napolitani, G. Verde, Jianguo Wang, et al. (11 additional authors not shown)

    Abstract: The correlation of $αα$ was probed via measuring the transverse momentum $p_{T}$ and width $δp_{T}$ of one $α$, for the first time, which represents the spatial and dynamical essentialities of the initial coupling state in $^{8}$Be nucleus. The weighted interaction vertex of 3$α$ reflected by the magnitudes of their relative momentums and relative emission angles proves the isosceles triangle conf…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 8 pages, 9 figures

  49. arXiv:2310.04261  [pdf, other]

    nucl-ex nucl-th

    Variation of Tensor Force due to Nuclear Medium Effect

    Authors: Ziming Li, Jie Zhu, Taofeng Wang, Minliang Liu, Jiansong Wang, Yanyun Yang, Chengjian Lin, Zhiyu Sun, Qinghua He, M. Assié, Y. Ayyad, D. Beaumel, Zhen Bai, Fangfang Duan, Zhihao Gao, Song Guo, Yue Hu, Wei Jiang, F. Kobayashi, Chengui Lu, Junbing Ma, Peng Ma, P. Napolitani, G. Verde, Jianguo Wang, et al. (11 additional authors not shown)

    Abstract: The enhancement of $J^π(T)$=3$^{+}$(0) state with isospin $T=0$ excited by the tensor force in the free $^{6}$Li nucleus has been observed, for the first time, relative to a shrinkable excitation in the $^{6}$Li cluster component inside its host nucleus. Comparatively, the excitation of $J^π(T)$=0$^{+}$(1) state with isospin $T=1$ for these two $^{6}$Li formations take on an approximately equal ex…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 6 pages, 4 figures

  50. arXiv:2310.00135  [pdf, other]

    math.OC

    Alpha-Fair Routing in Urban Air Mobility with Risk-Aware Constraints

    Authors: Yue Yu, Zhenyu Gao, Sarah H. Q. Li, Qinshuang Wei, John-Paul Clarke, Ufuk Topcu

    Abstract: In the vision of urban air mobility, air transport systems serve the demands of urban communities by routing flight traffic in networks formed by vertiports and flight corridors. We develop a routing algorithm to ensure that the air traffic flow fairly serves the demand of multiple communities subject to stochastic network capacity constraints. This algorithm guarantees that the flight traffic vol…

    Submitted 29 September, 2023; originally announced October 2023.