Skip to main content

Showing 1–50 of 227 results for author: Yan, P

  1. arXiv:2407.05810  [pdf, other

    cs.AI cs.HC

    Integrating AI in College Education: Positive yet Mixed Experiences with ChatGPT

    Authors: Xinrui Song, Jiajin Zhang, Pingkun Yan, Juergen Hahn, Uwe Kruger, Hisham Mohamed, Ge Wang

    Abstract: The integration of artificial intelligence (AI) chatbots into higher education marks a shift towards a new generation of pedagogical tools, mirroring the arrival of milestones like the internet. With the launch of ChatGPT-4 Turbo in November 2023, we developed a ChatGPT-based teaching application (https://chat.openai.com/g/g-1imx1py4K-chatge-medical-imaging) and integrated it into our undergraduat… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2407.03959  [pdf, other

    cond-mat.mtrl-sci

    Skyrmion Hall effect in altermagnets

    Authors: Zhejunyu Jin, Zhaozhuo Zeng, Yunshan Cao, Peng Yan

    Abstract: It is widely believed that the skyrmion Hall effect is absent in antiferromagnets because of the vanishing topological charge. However, the Aharonov-Casher theory indicates the possibility of topological effects for neutral particles. In this work, we predict the skyrmion Hall effect in emerging altermagnets with zero net magnetization and zero skyrmion charge. We first show that the neutral skyrm… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 6 pages and 5 figures

  3. arXiv:2407.03658  [pdf, other

    cs.CL

    GPT-4 vs. Human Translators: A Comprehensive Evaluation of Translation Quality Across Languages, Domains, and Expertise Levels

    Authors: Jianhao Yan, Pingchuan Yan, Yulong Chen, Judy Li, Xianchao Zhu, Yue Zhang

    Abstract: This study comprehensively evaluates the translation quality of Large Language Models (LLMs), specifically GPT-4, against human translators of varying expertise levels across multiple language pairs and domains. Through carefully designed annotation rounds, we find that GPT-4 performs comparably to junior translators in terms of total errors made but lags behind medium and senior translators. We a… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  4. arXiv:2407.00557  [pdf, other

    cs.CV

    Explaining Chest X-ray Pathology Models using Textual Concepts

    Authors: Vijay Sadashivaiah, Mannudeep K. Kalra, Pingkun Yan, James A. Hendler

    Abstract: Deep learning models have revolutionized medical imaging and diagnostics, yet their opaque nature poses challenges for clinical adoption and trust. Amongst approaches to improve model interpretability, concept-based explanations aim to provide concise and human understandable explanations of any arbitrary classifier. However, such methods usually require a large amount of manually collected data w… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  5. arXiv:2407.00541  [pdf

    cs.CL cs.AI cs.IR

    Answering real-world clinical questions using large language model based systems

    Authors: Yen Sia Low, Michael L. Jackson, Rebecca J. Hyde, Robert E. Brown, Neil M. Sanghavi, Julian D. Baldwin, C. William Pike, Jananee Muralidharan, Gavin Hui, Natasha Alexander, Hadeel Hassan, Rahul V. Nene, Morgan Pike, Courtney J. Pokrzywa, Shivam Vedak, Adam Paul Yan, Dong-han Yao, Amy R. Zipursky, Christina Dinh, Philip Ballentine, Dan C. Derieg, Vladimir Polony, Rehan N. Chawdry, Jordan Davies, Brigham B. Hyde , et al. (2 additional authors not shown)

    Abstract: Evidence to guide healthcare decisions is often limited by a lack of relevant and trustworthy literature as well as difficulty in contextualizing existing research for a specific patient. Large language models (LLMs) could potentially address both challenges by either summarizing published literature or generating new studies based on real-world data (RWD). We evaluated the ability of five LLM-bas… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 28 pages (2 figures, 3 tables) inclusive of 8 pages of supplemental materials (4 supplemental figures and 4 supplemental tables)

  6. arXiv:2407.00514  [pdf, ps, other

    cs.PL

    Combining Classical and Probabilistic Independence Reasoning to Verify the Security of Oblivious Algorithms (Extended Version)

    Authors: Pengbo Yan, Toby Murray, Olga Ohrimenko, Van-Thuan Pham, Robert Sison

    Abstract: We consider the problem of how to verify the security of probabilistic oblivious algorithms formally and systematically. Unfortunately, prior program logics fail to support a number of complexities that feature in the semantics and invariant needed to verify the security of many practical probabilistic oblivious algorithms. We propose an approach based on reasoning over perfectly oblivious approxi… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  7. arXiv:2406.19631  [pdf, other

    cs.LG cs.DC

    Personalized Interpretation on Federated Learning: A Virtual Concepts approach

    Authors: Peng Yan, Guodong Long, Jing Jiang, Michael Blumenstein

    Abstract: Tackling non-IID data is an open challenge in federated learning research. Existing FL methods, including robust FL and personalized FL, are designed to improve model performance without consideration of interpreting non-IID across clients. This paper aims to design a novel FL method to robust and interpret the non-IID data across clients. Specifically, we interpret each client's dataset as a mixt… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  8. arXiv:2406.13953  [pdf, other

    cond-mat.mes-hall

    Peculiar corner states in magnetic fractals

    Authors: Zhixiong Li, Peng Yan

    Abstract: Topological excitations in periodic magnetic crystals have received significant recent attention. However, it is an open question on their fate once the lattice periodicity is broken. In this work, we theoretically study the topological properties embedded in the collective dynamics of magnetic texture array arranged into a Sierpiński carpet structure with effective Hausdorff dimensionality… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 5 figures

  9. arXiv:2406.09298  [pdf, other

    cond-mat.mes-hall

    Magnon spin transport through atomic ferrimagnetic domain walls

    Authors: Zhaozhuo Zeng, Peng Yan

    Abstract: It is a well-established notion that the spin of a magnon should be flipped when it passes through a $180^{\circ}$ domain wall (DW) in both ferromagnets and antiferromagnets, while the magnon spin transport through ferrimagnetic DW is still elusive. In this work, we report that the magnon preserves its spin after the transmission through an atomically sharp DW in ferrimagnets, due to the intriguin… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  10. arXiv:2406.00258  [pdf, other

    cs.CV cs.AI

    Artemis: Towards Referential Understanding in Complex Videos

    Authors: Jihao Qiu, Yuan Zhang, Xi Tang, Lingxi Xie, Tianren Ma, Pengyu Yan, David Doermann, Qixiang Ye, Yunjie Tian

    Abstract: Videos carry rich visual information including object description, action, interaction, etc., but the existing multimodal large language models (MLLMs) fell short in referential understanding scenarios such as video-based referring. In this paper, we present Artemis, an MLLM that pushes video-based referential understanding to a finer level. Given a video, Artemis receives a natural-language quest… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 19 pages, 14 figures. Code and data are available at https://github.com/qiujihao19/Artemis

  11. arXiv:2405.18533  [pdf, other

    eess.IV cs.CV

    Cardiovascular Disease Detection from Multi-View Chest X-rays with BI-Mamba

    Authors: Zefan Yang, Jiajin Zhang, Ge Wang, Mannudeep K. Kalra, Pingkun Yan

    Abstract: Accurate prediction of Cardiovascular disease (CVD) risk in medical imaging is central to effective patient health management. Previous studies have demonstrated that imaging features in computed tomography (CT) can help predict CVD risk. However, CT entails notable radiation exposure, which may result in adverse health effects for patients. In contrast, chest X-ray emits significantly lower level… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Early accepted paper for MICCAI 2024

  12. arXiv:2405.15728  [pdf, other

    cs.CV

    Disease-informed Adaptation of Vision-Language Models

    Authors: Jiajin Zhang, Ge Wang, Mannudeep K. Kalra, Pingkun Yan

    Abstract: In medical image analysis, the expertise scarcity and the high cost of data annotation limits the development of large artificial intelligence models. This paper investigates the potential of transfer learning with pre-trained vision-language models (VLMs) in this domain. Currently, VLMs still struggle to transfer to the underrepresented diseases with minimal presence and new diseases entirely abs… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Early Accepted by MICCAI 2024

  13. arXiv:2405.14643  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Circuit realization of topological physics

    Authors: Huanhuan Yang, Lingling Song, Yunshan Cao, Peng Yan

    Abstract: Recently, topolectrical circuits (TECs) boom in studying the topological states of matter. The resemblance between circuit Laplacians and tight-binding models in condensed matter physics allows for the exploration of exotic topological phases on the circuit platform. In this review, we begin by presenting the basic equations for the circuit elements and units, along with the fundamentals and exper… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  14. arXiv:2405.13467  [pdf, other

    cs.CV

    AdaFedFR: Federated Face Recognition with Adaptive Inter-Class Representation Learning

    Authors: Di Qiu, Xinyang Lin, Kaiye Wang, Xiangxiang Chu, Pengfei Yan

    Abstract: With the growing attention on data privacy and communication security in face recognition applications, federated learning has been introduced to learn a face recognition model with decentralized datasets in a privacy-preserving manner. However, existing works still face challenges such as unsatisfying performance and additional communication costs, limiting their applicability in real-world scena… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  15. arXiv:2405.11344  [pdf

    cs.LG cs.AI

    LiPost: Improved Content Understanding With Effective Use of Multi-task Contrastive Learning

    Authors: Akanksha Bindal, Sudarshan Ramanujam, Dave Golland, TJ Hazen, Tina Jiang, Fengyu Zhang, Peng Yan

    Abstract: In enhancing LinkedIn core content recommendation models, a significant challenge lies in improving their semantic understanding capabilities. This paper addresses the problem by leveraging multi-task learning, a method that has shown promise in various domains. We fine-tune a pre-trained, transformer-based LLM using multi-task contrastive learning with data from a diverse set of semantic labeling… ▽ More

    Submitted 13 July, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

  16. arXiv:2405.01962  [pdf

    physics.optics physics.app-ph

    Optical skyrmions from metafibers

    Authors: Tiantian He, Yuan Meng, Lele Wang, Hongkun Zhong, Nilo Mata-Cervera, Dan Li, Ping Yan, Qiang Liu, Yijie Shen, Qirong Xiao

    Abstract: Optical skyrmions are an emerging class of structured light with sophisticated particle-like topologies with great potential for revolutionizing modern informatics. However, the current generation of optical skyrmions involves complex or bulky systems, hindering their development of practical applications. Here, exploiting the emergent "lab-on-fiber" technology, we demonstrate the design of a meta… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  17. arXiv:2404.17743  [pdf, ps, other

    math.NT math.RT

    Fourier Coefficients and Algebraic Cusp Forms on $\mathrm{U}(2,n)$

    Authors: Anton Hilado, Finn McGlade, Pan Yan

    Abstract: We establish a theory of scalar Fourier coefficients for a class of non-holomorphic, automorphic forms on the quaternionic real Lie group $\mathrm{U}(2,n)$. By studying the theta lifts of holomorphic modular forms from $\mathrm{U}(1,1)$, we apply this theory to obtain examples of non-holomorphic cusp forms on $\mathrm{U}(2,n)$ whose Fourier coefficients are algebraic numbers.

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 27 pages, comments welcome

    MSC Class: 11F30

  18. arXiv:2404.09235  [pdf, other

    astro-ph.GA

    PDRs4All IX. Sulfur elemental abundance in the Orion Bar

    Authors: Asunción Fuente, Evelyne Roueff, Franck Le Petit, Jacques Le Bourlot, Emeric Bron, Mark G. Wolfire, James F. Babb, Pei-Gen Yan, Takashi Onaka, John H. Black, Ilane Schroetter, Dries Van De Putte, Ameek Sidhu, Amélie Canin, Boris Trahin, Felipe Alarcón, Ryan Chown, Olga Kannavou, Olivier Berné, Emilie Habart, Els Peeters, Javier R. Goicoechea, Marion Zannese, Raphael Meshaka, Yoko Okada , et al. (9 additional authors not shown)

    Abstract: One of the main problems in astrochemistry is determining the amount of sulfur in volatiles and refractories in the interstellar medium. The detection of the main sulfur reservoirs (icy H$_2$S and atomic gas) has been challenging, and estimates are based on the reliability of models to account for the abundances of species containing less than 1% of the total sulfur. The high sensitivity of the Ja… ▽ More

    Submitted 4 June, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: 16 pages, 6 figures. Accepted for publication in Astronomy and Astrophysics

  19. arXiv:2404.08450  [pdf, other

    cs.CV

    Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues

    Authors: Xianhua He, Dashuang Liang, Song Yang, Zhanlong Hao, Hui Ma, Binjie Mao, Xi Li, Yao Wang, Pengfei Yan, Ajian Liu

    Abstract: Face recognition systems are frequently subjected to a variety of physical and digital attacks of different types. Previous methods have achieved satisfactory performance in scenarios that address physical attacks and digital attacks, respectively. However, few methods are considered to integrate a model that simultaneously addresses both physical and digital attacks, implying the necessity to dev… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 10 pages with 6 figures, Accepted by CVPRW 2024

  20. arXiv:2404.08361  [pdf, other

    cs.IR cs.AI

    Large-Scale Multi-Domain Recommendation: an Automatic Domain Feature Extraction and Personalized Integration Framework

    Authors: Dongbo Xi, Zhen Chen, Yuexian Wang, He Cui, Chong Peng, Fuzhen Zhuang, Peng Yan

    Abstract: Feed recommendation is currently the mainstream mode for many real-world applications (e.g., TikTok, Dianping), it is usually necessary to model and predict user interests in multiple scenarios (domains) within and even outside the application. Multi-domain learning is a typical solution in this regard. While considerable efforts have been made in this regard, there are still two long-standing cha… ▽ More

    Submitted 14 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 8 pages

  21. arXiv:2404.03181  [pdf, other

    cs.CV

    MonoCD: Monocular 3D Object Detection with Complementary Depths

    Authors: Longfei Yan, Pei Yan, Shengzhou Xiong, Xuanyu Xiang, Yihua Tan

    Abstract: Monocular 3D object detection has attracted widespread attention due to its potential to accurately obtain object 3D localization from a single image at a low cost. Depth estimation is an essential but challenging subtask of monocular 3D object detection due to the ill-posedness of 2D to 3D mapping. Many methods explore multiple local depth clues such as object heights and keypoints and then formu… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  22. arXiv:2404.02655  [pdf, other

    cs.CL

    Calibrating the Confidence of Large Language Models by Eliciting Fidelity

    Authors: Mozhi Zhang, Mianqiu Huang, Rundong Shi, Linsen Guo, Chong Peng, Peng Yan, Yaqian Zhou, Xipeng Qiu

    Abstract: Large language models optimized with techniques like RLHF have achieved good alignment in being helpful and harmless. However, post-alignment, these language models often exhibit overconfidence, where the expressed confidence does not accurately calibrate with their correctness rate. In this paper, we decompose the language model confidence into the \textit{Uncertainty} about the question and the… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 17 pages, 13 figures

  23. arXiv:2404.00561  [pdf, ps, other

    math.NT math.RT

    Epsilon dichotomy for twisted linear models

    Authors: Hang Xue, Pan Yan

    Abstract: Let $E/F$ be a quadratic extension of local nonarchimedean fields of characteristic zero and let $D$ be a quaternion algebra over $F$ containing $E$. In this paper, we study a relation between the existence of twisted linear models on $\mathrm{GL}_n(D)$ and the local root numbers.

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 38 pages

    MSC Class: 11F70; 22E50

  24. arXiv:2403.19499  [pdf, other

    cs.LG

    Client-supervised Federated Learning: Towards One-model-for-all Personalization

    Authors: Peng Yan, Guodong Long

    Abstract: Personalized Federated Learning (PerFL) is a new machine learning paradigm that delivers personalized models for diverse clients under federated learning settings. Most PerFL methods require extra learning processes on a client to adapt a globally shared model to the client-specific personalized model using its own local data. However, the model adaptation process in PerFL is still an open challen… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  25. arXiv:2403.18154  [pdf, ps, other

    math.NT

    Cohomology classes, periods, and special values of Rankin-Selberg $L$-functions

    Authors: Yubo Jin, Pan Yan

    Abstract: In this article, we give a cohomological interpretation of (a special case of) the integrals constructed by the second named author and Q. Zhang \cite{YanZhang2023} which represent the product of Rankin-Selberg $L$-functions of $\mathrm{GL}_n\times\mathrm{GL}_m$ and $\mathrm{GL}_n\times\mathrm{GL}_{n-m-1}$ for $m<n$. As an application, we prove an algebraicity result for the special values of cert… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 21 pages

    MSC Class: 11F67; 11F70; 11F75; 22E55

  26. arXiv:2403.00274  [pdf, other

    cs.CV cs.SD eess.AS

    CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation

    Authors: Xi Liu, Ying Guo, Cheng Zhen, Tong Li, Yingying Ao, Pengfei Yan

    Abstract: Listening head generation aims to synthesize a non-verbal responsive listener head by modeling the correlation between the speaker and the listener in dynamic conversion.The applications of listener agent generation in virtual interaction have promoted many works achieving the diverse and fine-grained motion generation. However, they can only manipulate motions through simple emotional labels, but… ▽ More

    Submitted 29 March, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  27. arXiv:2403.00209  [pdf, other

    cs.CV

    ChartReformer: Natural Language-Driven Chart Image Editing

    Authors: Pengyu Yan, Mahesh Bhosale, Jay Lal, Bikhyat Adhikari, David Doermann

    Abstract: Chart visualizations are essential for data interpretation and communication; however, most charts are only accessible in image format and lack the corresponding data tables and supplementary information, making it difficult to alter their appearance for different application scenarios. To eliminate the need for original underlying data and information to perform chart editing, we propose ChartRef… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: Published in ICDAR 2024. Code and model are available at https://github.com/pengyu965/ChartReformer

  28. arXiv:2402.15687  [pdf, other

    cs.CV cs.AI

    General Purpose Image Encoder DINOv2 for Medical Image Registration

    Authors: Xinrui Song, Xuanang Xu, Pingkun Yan

    Abstract: Existing medical image registration algorithms rely on either dataset specific training or local texture-based features to align images. The former cannot be reliably implemented without large modality-specific training datasets, while the latter lacks global semantics thus could be easily trapped at local minima. In this paper, we present a training-free deformable image registration method, DINO… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  29. arXiv:2402.00137  [pdf, other

    cs.LG cs.CV

    Multimodal Neurodegenerative Disease Subtyping Explained by ChatGPT

    Authors: Diego Machado Reyes, Hanqing Chao, Juergen Hahn, Li Shen, Pingkun Yan

    Abstract: Alzheimer's disease (AD) is the most prevalent neurodegenerative disease; yet its currently available treatments are limited to stopping disease progression. Moreover, effectiveness of these treatments is not guaranteed due to the heterogenetiy of the disease. Therefore, it is essential to be able to identify the disease subtypes at a very early stage. Current data driven approaches are able to cl… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  30. arXiv:2401.08407  [pdf, other

    cs.CV

    Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining

    Authors: Jiahao Nie, Yun Xing, Gongjie Zhang, Pei Yan, Aoran Xiao, Yap-Peng Tan, Alex C. Kot, Shijian Lu

    Abstract: Cross-Domain Few-Shot Segmentation (CD-FSS) poses the challenge of segmenting novel categories from a distinct domain using only limited exemplars. In this paper, we undertake a comprehensive study of CD-FSS and uncover two crucial insights: (i) the necessity of a fine-tuning stage to effectively transfer the learned meta-knowledge across domains, and (ii) the overfitting risk during the naïve fin… ▽ More

    Submitted 13 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted by CVPR 2024

  31. arXiv:2312.12484  [pdf, other

    cs.CR cs.DC cs.LG

    SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks

    Authors: Peishen Yan, Hao Wang, Tao Song, Yang Hua, Ruhui Ma, Ningxin Hu, Mohammad R. Haghighat, Haibing Guan

    Abstract: Federated Learning (FL) is becoming a popular paradigm for leveraging distributed data and preserving data privacy. However, due to the distributed characteristic, FL systems are vulnerable to Byzantine attacks that compromised clients attack the global model by uploading malicious model updates. With the development of layer-level and parameter-level fine-grained attacks, the attacks' stealthines… ▽ More

    Submitted 18 July, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted by ECCV2024

  32. arXiv:2312.12027  [pdf, other

    physics.atom-ph

    A continuous cold rubidium atomic beam with enhanced flux and tunable velocity

    Authors: Shengzhe Wang, Zhixin Meng, and Peiqiang Yan, Yuanxing Liu, Yanying Feng

    Abstract: We present a cold atomic beam source based on a two-dimensional (2D)+ magneto-optical trap (MOT), capable of generating a continuous cold beam of 87Rb atoms with a flux up to 4.3*10^9 atoms/s, a mean velocity of 10.96(2.20) m/s, and a transverse temperature of 16.90(1.56) uK. Investigating the influence of high cooling laser intensity, we observe a significant population loss of atoms to hyperfine… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  33. arXiv:2312.11927  [pdf, other

    cs.LG cs.SI stat.ME

    Empowering Dual-Level Graph Self-Supervised Pretraining with Motif Discovery

    Authors: Pengwei Yan, Kaisong Song, Zhuoren Jiang, Yangyang Kang, Tianqianjin Lin, Changlong Sun, Xiaozhong Liu

    Abstract: While self-supervised graph pretraining techniques have shown promising results in various domains, their application still experiences challenges of limited topology learning, human knowledge dependency, and incompetent multi-level interactions. To address these issues, we propose a novel solution, Dual-level Graph self-supervised Pretraining with Motif discovery (DGPM), which introduces a unique… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 14 pages, 6 figures, accepted by AAAI'24

  34. arXiv:2312.08317  [pdf, other

    cs.CR cs.AI

    Prompt Engineering-assisted Malware Dynamic Analysis Using GPT-4

    Authors: Pei Yan, Shunquan Tan, Miaohui Wang, Jiwu Huang

    Abstract: Dynamic analysis methods effectively identify shelled, wrapped, or obfuscated malware, thereby preventing them from invading computers. As a significant representation of dynamic malware behavior, the API (Application Programming Interface) sequence, comprised of consecutive API calls, has progressively become the dominant feature of dynamic analysis methods. Though there have been numerous deep l… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  35. arXiv:2312.06462  [pdf, other

    cs.CV cs.AI cs.SD eess.AS

    Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation

    Authors: Qi Yang, Xing Nie, Tong Li, Pengfei Gao, Ying Guo, Cheng Zhen, Pengfei Yan, Shiming Xiang

    Abstract: Recently, an audio-visual segmentation (AVS) task has been introduced, aiming to group pixels with sounding objects within a given video. This task necessitates a first-ever audio-driven pixel-level understanding of the scene, posing significant challenges. In this paper, we propose an innovative audio-visual transformer framework, termed COMBO, an acronym for COoperation of Multi-order Bilateral… ▽ More

    Submitted 7 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 Highlight. 13 pages, 10 figures

  36. arXiv:2311.16557  [pdf, other

    physics.atom-ph

    A Continuous Dual-Axis Atomic Interferometric Inertial Sensor

    Authors: Pei-Qiang Yan, Wei-Chen Jia, Sheng-Zhe Wang, Yan-Ying Feng

    Abstract: We present an interferometric inertial sensor that utilizes two counter-propagating atomic beams with transverse two-dimensional cooling. By employing three parallel and spatially aligned Raman laser beams for Doppler-sensitive Raman transitions, we successfully generate inertia-sensitive Mach-Zehnder interference fringes with an interrogation length of $2L=54\,\rm{cm}$. The measured rotation and… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 8 pages, 4 figures

  37. arXiv:2311.03679  [pdf, other

    cs.CV eess.IV

    Unsupervised convolutional neural network fusion approach for change detection in remote sensing images

    Authors: Weidong Yan, Pei Yan, Li Cao

    Abstract: With the rapid development of deep learning, a variety of change detection methods based on deep learning have emerged in recent years. However, these methods usually require a large number of training samples to train the network model, so it is very expensive. In this paper, we introduce a completely unsupervised shallow convolutional neural network (USCNN) fusion approach for change detection.… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  38. arXiv:2311.00353  [pdf, other

    cs.CV

    LatentWarp: Consistent Diffusion Latents for Zero-Shot Video-to-Video Translation

    Authors: Yuxiang Bao, Di Qiu, Guoliang Kang, Baochang Zhang, Bo Jin, Kaiye Wang, Pengfei Yan

    Abstract: Leveraging the generative ability of image diffusion models offers great potential for zero-shot video-to-video translation. The key lies in how to maintain temporal consistency across generated video frames by image diffusion models. Previous methods typically adopt cross-frame attention, \emph{i.e.,} sharing the \textit{key} and \textit{value} tokens across attentions of different frames, to enc… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  39. arXiv:2311.00266  [pdf, other

    cond-mat.supr-con

    Constructing the Fulde-Ferrell-Larkin-Ovchinnikov state in antiferromagnetic insulator CrOCl

    Authors: Yifan Ding, Jiadian He, Shihao Zhang, Huakun Zuo, Pingfan Gu, Jiliang Cai, Xiaohui Zeng, Pu Yan, Kecheng Cao, Kenji Watanabe, Takashi Taniguchi, Peng Dong, Yiwen Zhang, Yueshen Wu, Xiang Zhou, Jinghui Wang, Yulin Chen, Yu Ye, Jianpeng Liu, Jun Li

    Abstract: Time reversal symmetry breaking in superconductors, resulting from external magnetic fields or spontaneous magnetization, often leads to unconventional superconducting properties. In this way, a conventional Fulde-Ferrell-Larkin-Ovchinnikov (FFLO) state, characterized by the Cooper pairs with nonzero total momentum, may be realized by the Zeeman effect caused from external magnetic fields. Here, w… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  40. arXiv:2310.17144  [pdf, ps, other

    math.NT

    Some remarks on strong multiplicity one for paramodular forms

    Authors: Xiyuan Wang, Zhining Wei, Pan Yan, Shaoyun Yi

    Abstract: We establish several refined strong multiplicity one results for paramodular cusp forms by using the spinor and standard $L$-functions with the combination of the methods from both of automorphic side and Galois side.

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 28 pages

    MSC Class: Primary 11F46; 11F60; 11F66; Secondary 11F30; 11F70; 11F80

  41. arXiv:2309.15699  [pdf, other

    stat.ME math.ST

    STRAW: Structure-Adaptive Weighting Procedure for Large-Scale Spatial Multiple Testing

    Authors: Pengfei Wang, Pengyu Yan, Canhui Li

    Abstract: The problem of large-scale spatial multiple testing is often encountered in various scientific research fields, where the signals are usually enriched on some regions while sparse on others. To integrate spatial structure information from nearby locations, we propose a novel approach, called {\bf STR}ucture-{\bf A}daptive {\bf W}eighting (STRAW) procedure, for large-scale spatial multiple testing.… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  42. arXiv:2309.10445  [pdf, other

    math.RT math.NT

    Product of Rankin-Selberg convolutions and a new proof of Jacquet's local converse conjecture

    Authors: Pan Yan, Qing Zhang

    Abstract: In this article, we construct a family of integrals which represent the product of Rankin-Selberg $L$-functions of $\mathrm{GL}_{l}\times \mathrm{GL}_m$ and of $\mathrm{GL}_{l}\times \mathrm{GL}_n $ when $m+n<l$. When $n=0$, these integrals are those defined by Jacquet--Piatetski-Shapiro--Shalika up to a shift. In this sense, these new integrals generalize Jacquet--Piatetski-Shapiro--Shalika's Ran… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    MSC Class: 11F70; 22E50

  43. arXiv:2309.09475  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci physics.app-ph

    Terahertz magnon frequency comb

    Authors: Xianglong Yao, Zhejunyu Jin, Zhenyu Wang, Zhaozhuo Zeng, Peng Yan

    Abstract: Magnon frequency comb (MFC), the spin-wave spectra composing of equidistant coherent peaks, is attracting much attention in magnonics. A terahertz (THz) MFC, combining the advantages of the THz and MFC technologies, is highly desired because it would significantly advance the MFC applications in ultrafast magnonic metrology, sensing, and communications. Here, we show that the THz MFC can be genera… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 6 pages, 6 figures

  44. arXiv:2309.01207  [pdf, other

    eess.IV cs.CV cs.LG

    Spectral Adversarial MixUp for Few-Shot Unsupervised Domain Adaptation

    Authors: Jiajin Zhang, Hanqing Chao, Amit Dhurandhar, Pin-Yu Chen, Ali Tajer, Yangyang Xu, Pingkun Yan

    Abstract: Domain shift is a common problem in clinical applications, where the training images (source domain) and the test images (target domain) are under different distributions. Unsupervised Domain Adaptation (UDA) techniques have been proposed to adapt models trained in the source domain to the target domain. However, those methods require a large number of images from the target domain for model train… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: Accepted by MICCAI 2023

  45. arXiv:2308.01971  [pdf, other

    cs.CV cs.AI

    SpaDen : Sparse and Dense Keypoint Estimation for Real-World Chart Understanding

    Authors: Saleem Ahmed, Pengyu Yan, David Doermann, Srirangaraj Setlur, Venu Govindaraju

    Abstract: We introduce a novel bottom-up approach for the extraction of chart data. Our model utilizes images of charts as inputs and learns to detect keypoints (KP), which are used to reconstruct the components within the plot area. Our novelty lies in detecting a fusion of continuous and discrete KP as predicted heatmaps. A combination of sparse and dense per-pixel objectives coupled with a uni-modal self… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted ORAL at ICDAR 23

  46. arXiv:2307.16748  [pdf, ps, other

    gr-qc

    The ring-shaped shadow of rotating naked singularity with a complete photon sphere

    Authors: Mingzhi Wang, Guanghai Guo, Pengfei Yan, Songbai Chen, Jiliang Jing

    Abstract: We investigate the shadows of Konoplya-Zhidenko naked singularity. In the spacetime of Konoplya-Zhidenko naked singularity, not only can unstable retrograde light ring (LR) exist, but also unstable prograde LR, leading to the formation of a complete photon sphere (PS). Due to the absence of an event horizon, a dark disc-shaped shadow does not appear; instead, a ring-shaped shadow is observed. The… ▽ More

    Submitted 5 June, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: 14 pages, 11 figures. It is to be published in Chinese Physics C

  47. arXiv:2307.14634  [pdf, other

    cs.AI cs.CR cs.CV cs.LG eess.IV

    Fact-Checking of AI-Generated Reports

    Authors: Razi Mahmood, Ge Wang, Mannudeep Kalra, Pingkun Yan

    Abstract: With advances in generative artificial intelligence (AI), it is now possible to produce realistic-looking automated reports for preliminary reads of radiology images. This can expedite clinical workflows, improve accuracy and reduce overall costs. However, it is also well-known that such models often hallucinate, leading to false findings in the generated reports. In this paper, we propose a new m… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 10 pages, 3 figures, 3 tables

  48. arXiv:2307.14039  [pdf, other

    cs.CV

    Controllable Guide-Space for Generalizable Face Forgery Detection

    Authors: Ying Guo, Cheng Zhen, Pengfei Yan

    Abstract: Recent studies on face forgery detection have shown satisfactory performance for methods involved in training datasets, but are not ideal enough for unknown domains. This motivates many works to improve the generalization, but forgery-irrelevant information, such as image background and identity, still exists in different domain features and causes unexpected clustering, limiting the generalizatio… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023

  49. arXiv:2307.13693  [pdf, other

    cs.CL

    Evaluating Large Language Models for Radiology Natural Language Processing

    Authors: Zhengliang Liu, Tianyang Zhong, Yiwei Li, Yutong Zhang, Yi Pan, Zihao Zhao, Peixin Dong, Chao Cao, Yuxiao Liu, Peng Shu, Yaonai Wei, Zihao Wu, Chong Ma, Jiaqi Wang, Sheng Wang, Mengyue Zhou, Zuowei Jiang, Chunlin Li, Jason Holmes, Shaochen Xu, Lu Zhang, Haixing Dai, Kai Zhang, Lin Zhao, Yuanhao Chen , et al. (20 additional authors not shown)

    Abstract: The rise of large language models (LLMs) has marked a pivotal shift in the field of natural language processing (NLP). LLMs have revolutionized a multitude of domains, and they have made a significant impact in the medical field. Large language models are now more abundant than ever, and many of these models exhibit bilingual capabilities, proficient in both English and Chinese. However, a compreh… ▽ More

    Submitted 27 July, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  50. arXiv:2307.10954  [pdf, other

    cs.RO cs.CV

    Soft-tissue Driven Craniomaxillofacial Surgical Planning

    Authors: Xi Fang, Daeseung Kim, Xuanang Xu, Tianshu Kuang, Nathan Lampen, Jungwook Lee, Hannah H. Deng, Jaime Gateno, Michael A. K. Liebschner, James J. Xia, Pingkun Yan

    Abstract: In CMF surgery, the planning of bony movement to achieve a desired facial outcome is a challenging task. Current bone driven approaches focus on normalizing the bone with the expectation that the facial appearance will be corrected accordingly. However, due to the complex non-linear relationship between bony structure and facial soft-tissue, such bone-driven methods are insufficient to correct fac… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Early accepted by MICCAI 2023