Skip to main content

Showing 1–50 of 752 results for author: He, Q

  1. arXiv:2407.11183  [pdf, other

    cs.LG

    Differentiable Neural-Integrated Meshfree Method for Forward and Inverse Modeling of Finite Strain Hyperelasticity

    Authors: Honghui Du, Binyao Guo, QiZhi He

    Abstract: The present study aims to extend the novel physics-informed machine learning approach, specifically the neural-integrated meshfree (NIM) method, to model finite-strain problems characterized by nonlinear elasticity and large deformations. To this end, the hyperelastic material models are integrated into the loss function of the NIM method by employing a consistent local variational formulation. Th… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.10485  [pdf, other

    cs.CV

    Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss

    Authors: Mufeng Yao, Jinlong Peng, Qingdong He, Bo Peng, Hao Chen, Mingmin Chi, Chao Liu, Jon Atli Benediktsson

    Abstract: Multiple object tracking (MOT) from unmanned aerial vehicle (UAV) platforms requires efficient motion modeling. This is because UAV-MOT faces tracking difficulties caused by large and irregular motion, and insufficient training due to the motion long-tailed distribution of current UAV-MOT datasets. Previous UAV-MOT methods either extract motion and detection features redundantly or supervise motio… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2308.07207

  3. arXiv:2407.05368  [pdf, other

    cs.SD cs.AI cs.IR eess.AS

    Music Era Recognition Using Supervised Contrastive Learning and Artist Information

    Authors: Qiqi He, Xuchen Song, Weituo Hao, Ju-Chiang Wang, Wei-Tsung Lu, Wei Li

    Abstract: Does popular music from the 60s sound different than that of the 90s? Prior study has shown that there would exist some variations of patterns and regularities related to instrumentation changes and growing loudness across multi-decadal trends. This indicates that perceiving the era of a song from musical features such as audio and artist information is possible. Music era information can be an im… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  4. arXiv:2407.04498  [pdf, ps, other

    math.AP

    Global dynamics for the generalized chemotaxis-Navier-Stokes system in $\mathbb{R}^3$

    Authors: Qingyou He, Ling-Yun Shou, Leyun Wu

    Abstract: We consider the Cauchy problem of the three-dimensional generalized chemotaxis-Navier-Stokes system \begin{eqnarray*} \begin{cases} \partial_t n+u\cdot \nabla n=Δn- \nabla \cdot (χ(c)n \nabla c),\\ \partial_t c+u \cdot \nabla c=Δc-nf(c),\\ \partial_t u +u \cdot \nabla u+\nabla P=-(-Δ)^αu-n\nabla φ,\\ \nabla \cdot u=0. \end{cases} \end{eqnarray*} First, we study the time extensibility criter… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 39 pages

  5. arXiv:2407.03612  [pdf, other

    quant-ph

    Quantum phase transition in a quantum Rabi square with next-nearest-neighbor hopping

    Authors: Yilun Xu, Feng-Xao Sun, Qiongyi He, Han Pu, Wei Zhang

    Abstract: We propose a quantum Rabi square model where both the nearest-neighbor and the next-nearest-neighbor photon hopping are allowed among four quantum Rabi systems located at the vertices of a square. By tuning the next-nearest hopping strength, we realize a first-order phase transition between the antiferromagnetic superradiant phase and the frustrated superradiant phase, as well as a second-order ph… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  6. arXiv:2407.03547  [pdf, ps, other

    math.AP

    Large Time Behavior of Solutions to Cauchy Problem for 1-D Compressible Isentropic Navier-Stokes/Allen-Cahn System

    Authors: Yazhou Chen, Qiaolin He, Xiaoding Shi

    Abstract: This paper is concerned with the large time behavior of the solutions to the Cauchy problem for the one-dimensional compressible Navier-Stokes/Allen-Cahn system with the immiscible two-phase flow initially located near the phase separation state. Under the assumptions that the initial data is a small perturbation of the constant state, we prove the global existence and uniqueness of the solutions… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 26 pages

    MSC Class: 35Q35; 35B65; 76N10; 35M10; 35B40; 35C20; 76T30

  7. arXiv:2407.02784  [pdf

    quant-ph

    Breeding the Cat Through Superposition of Two Schrodinger Kittens Based on Coupled Waveguides

    Authors: Nuo Wang, Xinchen Zhang, Qi Liu, Fengxiao Sun, Qiongyi He, Ying Gu

    Abstract: Optical Schrodinger's cat (SC) is highly anticipated because of the potential of realizing fault-tolerant quantum computing, but the practical merit is only shown when the amplitude is larger than 2. However, such high-amplitude cats have not been prepared due to the limitations rooted in the existing method. Here, we demonstrate a principle that a large SC-like state can be generated by the super… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  8. arXiv:2407.02761  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el cond-mat.supr-con

    Inducing superconductivity in quantum anomalous Hall regime

    Authors: Yu Huang, Yu Fu, Peng Zhang, Kang L. Wang, Qing Lin He

    Abstract: Interfacing the quantum anomalous Hall insulator with a conventional superconductor is known to be a promising manner for realizing a topological superconductor, which has been continuously pursued for years. Such a proximity route depends to a great extent on the control of the delicate interfacial coupling of the two constituents. However, a recent experiment reported the failure to reproduce su… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 17 pages, 4 figures

    Journal ref: 2024 J. Phys.: Condens. Matter 36 37LT01

  9. arXiv:2407.00294  [pdf, other

    math.NA cs.LG physics.comp-ph

    Deep Neural Networks with Symplectic Preservation Properties

    Authors: Qing He, Wei Cai

    Abstract: We propose a deep neural network architecture designed such that its output forms an invertible symplectomorphism of the input. This design draws an analogy to the real-valued non-volume-preserving (real NVP) method used in normalizing flow techniques. Utilizing this neural network type allows for learning tasks on unknown Hamiltonian systems without breaking the inherent symplectic structure of t… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    MSC Class: 37J11; 70H15; 68T07

  10. arXiv:2406.19859  [pdf, other

    cs.AI cs.HC cs.MM

    MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis

    Authors: Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, Jingdong Sun, Qi He, Wangmeng Xiang, Hanyuan Chen, Jin-Peng Lan, Xianhui Lin, Kang Zhu, Bin Luo, Yifeng Geng, Xuansong Xie, Alexander G. Hauptmann

    Abstract: MetaDesigner revolutionizes artistic typography synthesis by leveraging the strengths of Large Language Models (LLMs) to drive a design paradigm centered around user engagement. At the core of this framework lies a multi-agent system comprising the Pipeline, Glyph, and Texture agents, which collectively enable the creation of customized WordArt, ranging from semantic enhancements to the imposition… ▽ More

    Submitted 4 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: 18 pages, 16 figures, Project: https://modelscope.cn/studios/WordArt/WordArt

  11. arXiv:2406.15879  [pdf

    physics.optics cond-mat.mtrl-sci

    Robust Ptychographic Reconstruction with an Out-of-Focus Electron Probe

    Authors: Shoucong Ning, Wenhui Xu, Pengju Sheng, Leyi Loh, Stephen Pennycook, Fucai Zhang, Michel Bosman, Qian He

    Abstract: As a burgeoning technique, out-of-focus electron ptychography offers the potential for rapidly imaging atomic-scale large fields of view (FoV) using a single diffraction dataset. However, achieving robust out-of-focus ptychographic reconstruction poses a significant challenge due to the inherent scan instabilities of electron microscopes, compounded by the presence of unknown aberrations in the pr… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 22 pages, 6 figures

  12. arXiv:2406.15005  [pdf, other

    physics.optics cond-mat.mes-hall quant-ph

    Manipulating Spectral Windings and Skin Modes through Nonconservative Couplings

    Authors: Ningxin Kong, Chenghe Yu, Yilun Xu, Matteo Fadel, Xinyao Huang, Qiongyi He

    Abstract: The discovery of the non-Hermitian skin effect (NHSE) has revolutionized our understanding of wave propagation in non-Hermitian systems, highlighting unexpected localization effects beyond conventional theories. Here, we discover that NHSE, accompanied by multi-type spectral phases, can be induced by manipulating nonconservative couplings. By characterizing the spectrum through the windings of the… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 6 pages, 4 figures

  13. arXiv:2406.13577  [pdf, other

    quant-ph

    Genuine Multipartite Entanglement induced by a Thermal Acoustic Reservoir

    Authors: Qing-Yang Qiu, Zhi-Guang Lu, Qiongyi He, Ying Wu, Xin-You Lü

    Abstract: Genuine multipartite entanglement (GME) is not only fundamental interesting for the study of quantum-to-classical transition, but also is essential for realizing universal quantum computing and quantum networks. Here we investigate the multipartite entanglement (ME) dynamics in a linear chain of N LC resonators interacting optomechanically with a common thermal acoustic reservoir. By presenting th… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 25 pages, 9 figures

  14. arXiv:2406.13171  [pdf

    physics.optics

    Super-resolution 3D tomography of vector near-fields in dielectric resonators

    Authors: Bingbing Zhu, Qingnan Cai, Yaxin Liu, Sheng Zhang, Weifeng Liu, Qiong He, Lei Zhou, Zhensheng Tao

    Abstract: All-dielectric optical resonators, exhibiting exotic near-field distributions upon excitations, have emerged as low-loss, versatile and highly adaptable components in nanophotonic structures for manipulating electromagnetic waves and enhancing light-matter interactions. However, achieving experimental full three-dimensional characterization of near-fields within dielectric materials poses signific… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 26 pages, 4 figures

  15. arXiv:2406.11789  [pdf, other

    quant-ph

    Quantum metrology with a squeezed Kerr oscillator

    Authors: Jiajie Guo, Qiongyi He, Matteo Fadel

    Abstract: We study the squeezing dynamics in a Kerr-nonlinear oscillator, and quantify the metrological usefulness of the resulting states. Even if the nonlinearity limits the attainable squeezing by making the evolution non-Gaussian, the states obtained still have a high quantum Fisher information for sensing displacements. However, contrary to the Gaussian case, the amplitude of the displacement cannot be… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  16. arXiv:2406.10902  [pdf, other

    cs.CV cs.CL

    Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models

    Authors: Yikai Zhang, Qianyu He, Xintao Wang, Siyu Yuan, Jiaqing Liang, Yanghua Xiao

    Abstract: Multi-Modal Knowledge Graphs (MMKGs) have proven valuable for various downstream tasks. However, scaling them up is challenging because building large-scale MMKGs often introduces mismatched images (i.e., noise). Most entities in KGs belong to the long tail, meaning there are few images of them available online. This scarcity makes it difficult to determine whether a found image matches the entity… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  17. arXiv:2406.10715  [pdf, other

    physics.optics quant-ph

    Chip-scale generation of 60-mode continuous-variable cluster states

    Authors: Ze Wang, Kangkang Li, Yue Wang, Xin Zhou, Yinke Cheng, Boxuan Jing, Fengxiao Sun, Jincheng Li, Zhilin Li, Qihuang Gong, Qiongyi He, Bei-Bei Li, Qi-Fan Yang

    Abstract: Increasing the number of entangled entities is crucial for achieving exponential computational speedups and secure quantum networks. Despite recent progress in generating large-scale entanglement through continuous-variable (CV) cluster states, translating these technologies to photonic chips has been hindered by decoherence, limiting the number of entangled entities to 8. Here, we demonstrate 60-… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  18. arXiv:2406.10517  [pdf, other

    cs.IR cs.AI cs.LG

    ADSNet: Cross-Domain LTV Prediction with an Adaptive Siamese Network in Advertising

    Authors: Ruize Wang, Hui Xu, Ying Cheng, Qi He, Xing Zhou, Rui Feng, Wei Xu, Lei Huang, Jie Jiang

    Abstract: Advertising platforms have evolved in estimating Lifetime Value (LTV) to better align with advertisers' true performance metric. However, the sparsity of real-world LTV data presents a significant challenge to LTV predictive model(i.e., pLTV), severely limiting the their capabilities. Therefore, we propose to utilize external data, in addition to the internal data of advertising platform, to expan… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Accepted to KDD 2024

  19. arXiv:2406.09422  [pdf, other

    cs.DC cs.AI cs.CE cs.CR

    LooPIN: A PinFi protocol for decentralized computing

    Authors: Yunwei Mao, Qi He, Ju Li

    Abstract: Networked computing power is a critical utility in the era of artificial intelligence. This paper presents a novel Physical Infrastructure Finance (PinFi) protocol designed to facilitate the distribution of computing power within networks in a decentralized manner. Addressing the core challenges of coordination, pricing, and liquidity in decentralized physical infrastructure networks (DePIN), the… ▽ More

    Submitted 29 March, 2024; originally announced June 2024.

  20. arXiv:2406.08122  [pdf

    eess.AS cs.SD

    Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor

    Authors: Yongjie Si, Yanxiong Li, Jialong Li, Jiaxin Tan, Qianhua He

    Abstract: It's assumed that training data is sufficient in base session of few-shot class-incremental audio classification. However, it's difficult to collect abundant samples for model training in base session in some practical scenarios due to the data scarcity of some classes. This paper explores a new problem of fully few-shot class-incremental audio classification with few training samples in all sessi… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted for publication on Interspeech 2024. 5 pages, 3 figures, 5 tables

  21. arXiv:2406.08119  [pdf

    eess.AS cs.SD

    Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network

    Authors: Yanxiong Li, Jiaxin Tan, Guoqing Chen, Jialong Li, Yongjie Si, Qianhua He

    Abstract: This work is an improved system that we submitted to task 1 of DCASE2023 challenge. We propose a method of low-complexity acoustic scene classification by a parallel attention-convolution network which consists of four modules, including pre-processing, fusion, global and local contextual information extraction. The proposed network is computationally efficient to capture global and local contextu… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted for publication on Interspeech 2024. 5 pages, 4 figures, 3 tables

  22. arXiv:2406.06464  [pdf, other

    cs.AI cs.CL

    Transforming Wearable Data into Health Insights using Large Language Model Agents

    Authors: Mike A. Merrill, Akshay Paruchuri, Naghmeh Rezaei, Geza Kovacs, Javier Perez, Yun Liu, Erik Schenck, Nova Hammerquist, Jake Sunshine, Shyam Tailor, Kumar Ayush, Hao-Wei Su, Qian He, Cory Y. McLean, Mark Malhotra, Shwetak Patel, Jiening Zhan, Tim Althoff, Daniel McDuff, Xin Liu

    Abstract: Despite the proliferation of wearable health trackers and the importance of sleep and exercise to health, deriving actionable personalized insights from wearable data remains a challenge because doing so requires non-trivial open-ended analysis of these data. The recent rise of large language model (LLM) agents, which can use tools to reason about and interact with the world, presents a promising… ▽ More

    Submitted 11 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 38 pages

  23. arXiv:2406.03262  [pdf, other

    cs.CV

    ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

    Authors: Jiangning Zhang, Haoyang He, Zhenye Gan, Qingdong He, Yuxuan Cai, Zhucun Xue, Yabiao Wang, Chengjie Wang, Lei Xie, Yong Liu

    Abstract: Visual anomaly detection aims to identify anomalous regions in images through unsupervised learning paradigms, with increasing application demand and value in fields such as industrial inspection and medical lesion detection. Despite significant progress in recent years, there is a lack of comprehensive benchmarks to adequately evaluate the performance of various mainstream methods across differen… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  24. arXiv:2406.01902  [pdf, other

    math.AP

    Large Time Behavior and Sharp Interface Limit of Compressible Navier-Stokes/Allen-Cahn System for Interacting Shock Waves

    Authors: Yazhou Chen, Qiaolin He, Xiaoding Shi, Xiaoping Wang

    Abstract: In this paper, we study the large time behavior and sharp interface limit of the Cauchy problem for compressible Navier-Stokes/Allen-Cahn system with interaction shock waves in the same family. This system is an important mathematical model for describing the motion of immiscible two-phase flow. The results show that, if the initial density and velocity are near the superposition of two shock wave… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 41pages, 2 figures

    MSC Class: 35Q35; 35B65; 76N10; 35M10; 35B40; 35C20; 76T30

  25. arXiv:2406.01103  [pdf, other

    cs.AI cs.HC cs.LG

    Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment

    Authors: Chen Zhang, Qiang He, Zhou Yuan, Elvis S. Liu, Hong Wang, Jian Zhao, Yang Wang

    Abstract: Deep Reinforcement Learning (DRL) agents have demonstrated impressive success in a wide range of game genres. However, existing research primarily focuses on optimizing DRL competence rather than addressing the challenge of prolonged player interaction. In this paper, we propose a practical DRL agent system for fighting games named Shūkai, which has been successfully deployed to Naruto Mobile, a p… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accept at ICML 2024

  26. arXiv:2405.20081  [pdf, other

    cs.CV cs.AI

    NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models

    Authors: Kai Wu, Boyuan Jiang, Zhengkai Jiang, Qingdong He, Donghao Luo, Shengzhi Wang, Qingwen Liu, Chengjie Wang

    Abstract: Multimodal large language models (MLLMs) contribute a powerful mechanism to understanding visual information building on large language models. However, MLLMs are notorious for suffering from hallucinations, especially when generating lengthy, detailed descriptions for images. Our analysis reveals that hallucinations stem from the inherent summarization mechanism of large language models, leading… ▽ More

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 14 pages, 5 figures with supplementary material

  27. arXiv:2405.19664  [pdf, other

    quant-ph

    Quantum Zeno Effect on Genuine Tripartite Nonlocality and Entanglement in Quantum Dissipative System

    Authors: Zi-Yu Xiong, Yong-Jun Xiao, Ye-Qi Zhang, Qi-Liang He

    Abstract: As a precious global resource in quantum information, genuine tripartite nonlocality(GTN) can be quantified by violating Svetlichny inequality. However, there is still no analytical expression for the general three-qubit states due to the difficulty of theoretical calculations. In this paper, we achieve highly accurate quantization of GTN for arbitrary three-qubit quantum states numerically. As an… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 7 pages, 7 figures

  28. arXiv:2405.19633  [pdf, other

    quant-ph

    Phase transition and multistability in Dicke dimer

    Authors: Yilun Xu, Feng-Xiao Sun, Wei Zhang, Qiongyi He, Han Pu

    Abstract: The exotic phase transitions and multistabilities in atom-cavity coupled systems have attracted tremendous interests recently. In this work, we investigate the effect of photon hopping between two Dicke cavities, which induces rich quantum phases for steady states and dynamic process. Starting from a generic dimer system where the two cavities are not necessarily identical, we analytically prove a… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  29. arXiv:2405.17741  [pdf, other

    cs.AI

    LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design

    Authors: Rui Kong, Qiyang Li, Xinyu Fang, Qingtian Feng, Qingfeng He, Yazhu Dong, Weijun Wang, Yuanchun Li, Linghe Kong, Yunxin Liu

    Abstract: Recent literature has found that an effective method to customize or further improve large language models (LLMs) is to add dynamic adapters, such as low-rank adapters (LoRA) with Mixture-of-Experts (MoE) structures. Though such dynamic adapters incur modest computational complexity, they surprisingly lead to huge inference latency overhead, slowing down the decoding speed by 2.5+ times. In this p… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  30. arXiv:2405.17718  [pdf, other

    cs.CV cs.LG

    AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval

    Authors: Sihe Zhang, Qingdong He, Jinlong Peng, Yuxi Li, Zhengkai Jiang, Jiafu Wu, Mingmin Chi, Yabiao Wang, Chengjie Wang

    Abstract: Image retrieval aims to identify visually similar images within a database using a given query image. Traditional methods typically employ both global and local features extracted from images for matching, and may also apply re-ranking techniques to enhance accuracy. However, these methods often fail to account for the noise present in query images, which can stem from natural or human-induced fac… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  31. arXiv:2405.16265  [pdf, other

    cs.LG

    MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time

    Authors: Jikun Kang, Xin Zhe Li, Xi Chen, Amirreza Kazemi, Qianyi Sun, Boxing Chen, Dong Li, Xu He, Quan He, Feng Wen, Jianye Hao, Jun Yao

    Abstract: Although Large Language Models (LLMs) achieve remarkable performance across various tasks, they often struggle with complex reasoning tasks, such as answering mathematical questions. Recent efforts to address this issue have primarily focused on leveraging mathematical datasets through supervised fine-tuning or self-improvement techniques. However, these methods often depend on high-quality datase… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  32. arXiv:2405.15580  [pdf, other

    cs.CV

    Open-Vocabulary SAM3D: Understand Any 3D Scene

    Authors: Hanchen Tai, Qingdong He, Jiangning Zhang, Yijie Qian, Zhenyu Zhang, Xiaobin Hu, Yabiao Wang, Yong Liu

    Abstract: Open-vocabulary 3D scene understanding presents a significant challenge in the field. Recent advancements have sought to transfer knowledge embedded in vision language models from the 2D domain to 3D domain. However, these approaches often require learning prior knowledge from specific 3D scene datasets, which limits their applicability in open-world scenarios. The Segment Anything Model (SAM) has… ▽ More

    Submitted 21 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Project page: https://hithqd.github.io/projects/OV-SAM3D

  33. arXiv:2405.15214  [pdf, other

    cs.CV

    PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning

    Authors: Qingdong He, Jiangning Zhang, Jinlong Peng, Haoyang He, Yabiao Wang, Chengjie Wang

    Abstract: Transformers have revolutionized the point cloud learning task, but the quadratic complexity hinders its extension to long sequence and makes a burden on limited computational resources. The recent advent of RWKV, a fresh breed of deep sequence models, has shown immense potential for sequence modeling in NLP tasks. In this paper, we present PointRWKV, a model of linear complexity derived from the… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  34. arXiv:2405.14210  [pdf, other

    cs.CV eess.IV

    Eidos: Efficient, Imperceptible Adversarial 3D Point Clouds

    Authors: Hanwei Zhang, Luo Cheng, Qisong He, Wei Huang, Renjue Li, Ronan Sicre, Xiaowei Huang, Holger Hermanns, Lijun Zhang

    Abstract: Classification of 3D point clouds is a challenging machine learning (ML) task with important real-world applications in a spectrum from autonomous driving and robot-assisted surgery to earth observation from low orbit. As with other ML tasks, classification models are notoriously brittle in the presence of adversarial attacks. These are rooted in imperceptible changes to inputs with the effect tha… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Preprint

  35. arXiv:2405.14165  [pdf, other

    cond-mat.supr-con

    Spatial topological insulator

    Authors: Qinghua He, Wenlong Gao, Feng Liu

    Abstract: Traditional topological insulators often rely on band inversions driven by nonuniform hopping textures and spin-orbit coupling, as exemplified in the Su-Schrieffer-Heeger and Kane-Mele models. We present a novel approach utilizing the spatial nature of sublattice symmetry to induce nontrivial topological insulating properties characterized by second-order corner states without band inversion. To s… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 3 pages, 3 figures

  36. arXiv:2405.13902  [pdf, other

    cs.LG cs.AI

    LOGIN: A Large Language Model Consulted Graph Neural Network Training Framework

    Authors: Yiran Qiao, Xiang Ao, Yang Liu, Jiarong Xu, Xiaoqian Sun, Qing He

    Abstract: Recent prevailing works on graph machine learning typically follow a similar methodology that involves designing advanced variants of graph neural networks (GNNs) to maintain the superior performance of GNNs on different graphs. In this paper, we aim to streamline the GNN design process and leverage the advantages of Large Language Models (LLMs) to improve the performance of GNNs on downstream tas… ▽ More

    Submitted 6 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  37. arXiv:2405.12490  [pdf, other

    cs.CV

    Customize Your Own Paired Data via Few-shot Way

    Authors: Jinshu Chen, Bingchuan Li, Miao Hua, Panpan Xu, Qian He

    Abstract: Existing solutions to image editing tasks suffer from several issues. Though achieving remarkably satisfying generated results, some supervised methods require huge amounts of paired training data, which greatly limits their usages. The other unsupervised methods take full advantage of large-scale pre-trained priors, thus being strictly restricted to the domains where the priors are trained on and… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted by AI4CC CVPR2024 WorkShop

  38. arXiv:2405.04828  [pdf, other

    cs.CL

    ChuXin: 1.6B Technical Report

    Authors: Xiaomin Zhuang, Yufan Jiang, Qiaozhi He, Zhihua Wu

    Abstract: In this report, we present ChuXin, an entirely open-source language model with a size of 1.6 billion parameters. Unlike the majority of works that only open-sourced the model weights and architecture, we have made everything needed to train a model available, including the training data, the training process, and the evaluation code. Our goal is to empower and strengthen the open research communit… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Technical Report

  39. arXiv:2405.04451  [pdf, ps, other

    math-ph cond-mat.stat-mech math.PR

    Analyticity for classical hard-core gases via recursion

    Authors: Qidong He

    Abstract: In the recent work of [Michelen, Perkins, Comm. Math. Phys. 399:1 (2023)], a new lower bound of $eC_φ(β)^{-1}$ is obtained for the positive activity up to which the pressure of a classical system of particles with repulsive pair interactions is analytic. In this paper, we extend their method to the class of radially symmetric, locally stable, and tempered pair potentials. Our main result is that t… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 19 pages

  40. arXiv:2405.03349  [pdf, other

    cs.CV

    Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement

    Authors: Jiesong Bai, Yuhao Yin, Qiyuan He, Yuanxian Li, Xiaofeng Zhang

    Abstract: In the field of low-light image enhancement, both traditional Retinex methods and advanced deep learning techniques such as Retinexformer have shown distinct advantages and limitations. Traditional Retinex methods, designed to mimic the human eye's perception of brightness and color, decompose images into illumination and reflection components but struggle with noise management and detail preserva… ▽ More

    Submitted 19 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  41. arXiv:2405.03261  [pdf, other

    quant-ph

    A nonlinear criterion for characterizing high-dimensional multipartite entanglement

    Authors: Shuheng Liu, Qiongyi He, Marcus Huber, Giuseppe Vitagliano

    Abstract: Understanding entanglement of potentially high-dimensional multipartite quantum systems is crucial across different disciplines in quantum sciences. We take inspiration from covariance matrix based techniques to derive a nonlinear criterion that can be used to lower bound the dimensionality vector of mixed quantum states, revealing both the level of multipartiteness and the dimensionality of the e… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  42. arXiv:2405.02593  [pdf

    q-bio.PE

    An Interdisciplinary Perspective of the Built-Environment Microbiome

    Authors: John S. McAlister, Michael J. Blum, Yana Bromberg, Nina H. Fefferman, Qiang He, Eric Lofgren, Debra L. Miller, Courtney Schreiner, K. Selcuk Candan, Heather Szabo-Rogers, J. Michael Reed

    Abstract: The built environment provides an excellent setting for interdisciplinary research on the dynamics of microbial communities. The system is simplified compared to many natural settings, and to some extent the entire environment can be manipulated, from architectural design, to materials use, air flow, human traffic, and capacity to disrupt microbial communities through cleaning. Here we provide an… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 23 pages

  43. arXiv:2405.00236  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    STT: Stateful Tracking with Transformers for Autonomous Driving

    Authors: Longlong Jing, Ruichi Yu, Xu Chen, Zhengli Zhao, Shiwei Sheng, Colin Graber, Qi Chen, Qinru Li, Shangxuan Wu, Han Deng, Sangjin Lee, Chris Sweeney, Qiurui He, Wei-Chih Hung, Tong He, Xingyi Zhou, Farshid Moussavi, Zijian Guo, Yin Zhou, Mingxing Tan, Weilong Yang, Congcong Li

    Abstract: Tracking objects in three-dimensional space is critical for autonomous driving. To ensure safety while driving, the tracker must be able to reliably track objects across frames and accurately estimate their states such as velocity and acceleration in the present. Existing works frequently focus on the association task while either neglecting the model performance on state estimation or deploying c… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: ICRA 2024

  44. arXiv:2404.18057  [pdf, other

    cs.CL

    Efficient LLM Inference with Kcache

    Authors: Qiaozhi He, Zhihua Wu

    Abstract: Large Language Models(LLMs) have had a profound impact on AI applications, particularly in the domains of long-text comprehension and generation. KV Cache technology is one of the most widely used techniques in the industry. It ensures efficient sequence generation by caching previously computed KV states. However, it also introduces significant memory overhead. We discovered that KV Cache is not… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Technical Report, 8 pages

  45. arXiv:2404.16353  [pdf, other

    math.AP

    Rigorous derivation of a Hele-Shaw type model and its non-symmetric traveling wave solution

    Authors: Yu Feng, Qingyou He, Jian-Guo Liu, Zhennan Zhou

    Abstract: In this paper, we consider a Hele-Shaw model that describes tumor growth subject to nutrient supply. This model was recently studied in \cite{feng2022tumor} via asymptotic analysis. Our contributions are twofold: Firstly, we provide a rigorous derivation of this Hele-Shaw model by taking the incompressible limit of the porous medium reaction-diffusion equation, which solidifies the mathematical fo… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 23 pages, 2 figures

    MSC Class: 35R35; 76D27; 92C10; 70K50

  46. arXiv:2404.16022  [pdf, other

    cs.CV

    PuLID: Pure and Lightning ID Customization via Contrastive Alignment

    Authors: Zinan Guo, Yanze Wu, Zhuowei Chen, Lang Chen, Qian He

    Abstract: We propose Pure and Lightning ID customization (PuLID), a novel tuning-free ID customization method for text-to-image generation. By incorporating a Lightning T2I branch with a standard diffusion one, PuLID introduces both contrastive alignment loss and accurate ID loss, minimizing disruption to the original model and ensuring high ID fidelity. Experiments show that PuLID achieves superior perform… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Tech Report. Codes and models will be available at https://github.com/ToTheBeginning/PuLID

  47. arXiv:2404.15846  [pdf, other

    cs.CL

    From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models

    Authors: Qianyu He, Jie Zeng, Qianxi He, Jiaqing Liang, Yanghua Xiao

    Abstract: It is imperative for Large language models (LLMs) to follow instructions with elaborate requirements (i.e. Complex Instructions Following). Yet, it remains under-explored how to enhance the ability of LLMs to follow complex instructions with multiple constraints. To bridge the gap, we initially study what training data is effective in enhancing complex constraints following abilities. We found tha… ▽ More

    Submitted 18 June, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  48. arXiv:2404.14705  [pdf, other

    cs.CV

    Think-Program-reCtify: 3D Situated Reasoning with Large Language Models

    Authors: Qingrong He, Kejun Lin, Shizhe Chen, Anwen Hu, Qin Jin

    Abstract: This work addresses the 3D situated reasoning task which aims to answer questions given egocentric observations in a 3D environment. The task remains challenging as it requires comprehensive 3D perception and complex reasoning skills. End-to-end models trained on supervised data for 3D situated reasoning suffer from data scarcity and generalization ability. Inspired by the recent success of levera… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  49. arXiv:2404.12754  [pdf, other

    cs.LG cs.AI

    Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation

    Authors: Qiang He, Tianyi Zhou, Meng Fang, Setareh Maghsudi

    Abstract: Representation rank is an important concept for understanding the role of Neural Networks (NNs) in Deep Reinforcement learning (DRL), which measures the expressive capacity of value networks. Existing studies focus on unboundedly maximizing this rank; nevertheless, that approach would introduce overly complex models in the learning, thus undermining performance. Hence, fine-tuning representation r… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR23; Code: https://github.com/sweetice/BEER-ICLR2024

  50. arXiv:2404.11326  [pdf, other

    cs.CV

    Single-temporal Supervised Remote Change Detection for Domain Generalization

    Authors: Qiangang Du, Jinlong Peng, Xu Chen, Qingdong He, Liren He, Qiang Nie, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

    Abstract: Change detection is widely applied in remote sensing image analysis. Existing methods require training models separately for each dataset, which leads to poor domain generalization. Moreover, these methods rely heavily on large amounts of high-quality pair-labelled data for training, which is expensive and impractical. In this paper, we propose a multimodal contrastive learning (ChangeCLIP) based… ▽ More

    Submitted 23 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.