subscribe to arXiv mailings

Neural Networks Trained by Weight Permutation are Universal Approximators

Authors: Yongqiang Cai, Gaohang Chen, Zhonghua Qiao

Abstract: The universal approximation property is fundamental to the success of neural networks, and has traditionally been achieved by training networks without any constraints on their parameters. However, recent experimental research proposed a novel permutation-based training method, which exhibited a desired classification performance without modifying the exact weight values. In this paper, we provide… ▽ More The universal approximation property is fundamental to the success of neural networks, and has traditionally been achieved by training networks without any constraints on their parameters. However, recent experimental research proposed a novel permutation-based training method, which exhibited a desired classification performance without modifying the exact weight values. In this paper, we provide a theoretical guarantee of this permutation training method by proving its ability to guide a ReLU network to approximate one-dimensional continuous functions. Our numerical results further validate this method's efficiency in regression tasks with various initializations. The notable observations during weight permutation suggest that permutation training can provide an innovative tool for describing network learning behavior. △ Less

Submitted 1 July, 2024; originally announced July 2024.

MSC Class: 41A30; 68T05; 68T07

arXiv:2406.17769 [pdf]

Flat bands and distinct density wave orders in correlated Kagome superconductor CsCr$_3$Sb$_5$

Authors: Shuting Peng, Yulei Han, Yongkai Li, Jianchang Shen, Yu Miao, Yang Luo, Linwei Huai, Zhipeng Ou, Hongyu Li, Ziji Xiang, Zhengtai Liu, Dawei Shen, Makoto Hashimoto, Donghui Lu, Yugui Yao, Zhenhua Qiao, Zhiwei Wang, Junfeng He

Abstract: Kagome metal CsV$_3$Sb$_5$ has attracted much recent attention due to the coexistence of multiple exotic orders and the associated proposals to mimic unconventional high temperature superconductors. Nevertheless, magnetism and strong electronic correlations -- two essential ingredients for unconventional superconductivity, are absent in this V-based Kagome metal. CsCr$_3$Sb$_5$ is a newly discover… ▽ More Kagome metal CsV$_3$Sb$_5$ has attracted much recent attention due to the coexistence of multiple exotic orders and the associated proposals to mimic unconventional high temperature superconductors. Nevertheless, magnetism and strong electronic correlations -- two essential ingredients for unconventional superconductivity, are absent in this V-based Kagome metal. CsCr$_3$Sb$_5$ is a newly discovered Cr-based parallel of CsV$_3$Sb$_5$, in which magnetism appears with charge density wave and superconductivity at different temperature and pressure regions. Enhanced electronic correlations are also suggested by theoretical proposals due to the calculated flat bands. Here, we report angle-resolved photoemission measurements and first-principles calculations on this new material system. Electron energy bands and the associated orbitals are resolved. Flat bands are observed near the Fermi level. Doping dependent measurements on Cs(Cr$_x$V$_{1-x}$)$_3$Sb$_5$ reveal a gradually enhanced band renormalization from CsV$_3$Sb$_5$ to CsCr$_3$Sb$_5$, accompanied by distinct spatial symmetry breaking states in the phase diagram. △ Less

Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.08116 [pdf, other]

Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling

Authors: Zile Qiao, Wei Ye, Yong Jiang, Tong Mo, Pengjun Xie, Weiping Li, Fei Huang, Shikun Zhang

Abstract: Retrieval-augmented language models (RALMs) have recently shown great potential in mitigating the limitations of implicit knowledge in LLMs, such as untimely updating of the latest expertise and unreliable retention of long-tail knowledge. However, since the external knowledge base, as well as the retriever, can not guarantee reliability, potentially leading to the knowledge retrieved not being he… ▽ More Retrieval-augmented language models (RALMs) have recently shown great potential in mitigating the limitations of implicit knowledge in LLMs, such as untimely updating of the latest expertise and unreliable retention of long-tail knowledge. However, since the external knowledge base, as well as the retriever, can not guarantee reliability, potentially leading to the knowledge retrieved not being helpful or even misleading for LLM generation. In this paper, we introduce Supportiveness-based Knowledge Rewriting (SKR), a robust and pluggable knowledge rewriter inherently optimized for LLM generation. Specifically, we introduce the novel concept of "supportiveness"--which represents how effectively a knowledge piece facilitates downstream tasks--by considering the perplexity impact of augmented knowledge on the response text of a white-box LLM. Based on knowledge supportiveness, we first design a training data curation strategy for our rewriter model, effectively identifying and filtering out poor or irrelevant rewrites (e.g., with low supportiveness scores) to improve data efficacy. We then introduce the direct preference optimization (DPO) algorithm to align the generated rewrites to optimal supportiveness, guiding the rewriter model to summarize augmented content that better improves the final response. Comprehensive evaluations across six popular knowledge-intensive tasks and four LLMs have demonstrated the effectiveness and superiority of SKR. With only 7B parameters, SKR has shown better knowledge rewriting capability over GPT-4, the current state-of-the-art general-purpose LLM. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.07413 [pdf, other]

Holistic Memory Diversification for Incremental Learning in Growing Graphs

Authors: Ziyue Qiao, Junren Xiao, Qingqiang Sun, Meng Xiao, Hui Xiong

Abstract: This paper addresses the challenge of incremental learning in growing graphs with increasingly complex tasks. The goal is to continually train a graph model to handle new tasks while retaining its inference ability on previous tasks. Existing methods usually neglect the importance of memory diversity, limiting in effectively selecting high-quality memory from previous tasks and remembering broad p… ▽ More This paper addresses the challenge of incremental learning in growing graphs with increasingly complex tasks. The goal is to continually train a graph model to handle new tasks while retaining its inference ability on previous tasks. Existing methods usually neglect the importance of memory diversity, limiting in effectively selecting high-quality memory from previous tasks and remembering broad previous knowledge within the scarce memory on graphs. To address that, we introduce a novel holistic Diversified Memory Selection and Generation (DMSG) framework for incremental learning in graphs, which first introduces a buffer selection strategy that considers both intra-class and inter-class diversities, employing an efficient greedy algorithm for sampling representative training nodes from graphs into memory buffers after learning each new task. Then, to adequately rememorize the knowledge preserved in the memory buffer when learning new tasks, we propose a diversified memory generation replay method. This method first utilizes a variational layer to generate the distribution of buffer node embeddings and sample synthesized ones for replaying. Furthermore, an adversarial variational embedding learning method and a reconstruction-based decoder are proposed to maintain the integrity and consolidate the generalization of the synthesized node embeddings, respectively. Finally, we evaluate our model on node classification tasks involving increasing class numbers. Extensive experimental results on publicly accessible datasets demonstrate the superiority of DMSG over state-of-the-art methods. △ Less

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2406.07404 [pdf, other]

Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy

Authors: Xiaohan Huang, Dongjie Wang, Zhiyuan Ning, Ziyue Qiao, Qingqing Long, Haowei Zhu, Min Wu, Yuanchun Zhou, Meng Xiao

Abstract: Tabular data optimization methods aim to automatically find an optimal feature transformation process that generates high-value features and improves the performance of downstream machine learning tasks. Current frameworks for automated feature transformation rely on iterative sequence generation tasks, optimizing decision strategies through performance feedback from downstream tasks. However, the… ▽ More Tabular data optimization methods aim to automatically find an optimal feature transformation process that generates high-value features and improves the performance of downstream machine learning tasks. Current frameworks for automated feature transformation rely on iterative sequence generation tasks, optimizing decision strategies through performance feedback from downstream tasks. However, these approaches fail to effectively utilize historical decision-making experiences and overlook potential relationships among generated features, thus limiting the depth of knowledge extraction. Moreover, the granularity of the decision-making process lacks dynamic backtracking capabilities for individual features, leading to insufficient adaptability when encountering inefficient pathways, adversely affecting overall robustness and exploration efficiency. To address the limitations observed in current automatic feature engineering frameworks, we introduce a novel method that utilizes a feature-state transformation graph to effectively preserve the entire feature transformation journey, where each node represents a specific transformation state. During exploration, three cascading agents iteratively select nodes and idea mathematical operations to generate new transformation states. This strategy leverages the inherent properties of the graph structure, allowing for the preservation and reuse of valuable transformations. It also enables backtracking capabilities through graph pruning techniques, which can rectify inefficient transformation paths. To validate the efficacy and flexibility of our approach, we conducted comprehensive experiments and detailed case studies, demonstrating superior performance in diverse scenarios. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 17 pages

arXiv:2406.06272 [pdf, ps, other]

Global-in-time energy stability analysis for the exponential time differencing Runge-Kutta scheme for the phase field crystal equation

Authors: Xiao Li, Zhonghua Qiao, Cheng Wang, Nan Zheng

Abstract: The global-in-time energy estimate is derived for the second-order accurate exponential time differencing Runge-Kutta (ETDRK2) numerical scheme to the phase field crystal (PFC) equation, a sixth-order parabolic equation modeling crystal evolution. To recover the value of stabilization constant, some local-in-time convergence analysis has been reported, and the energy stability becomes available ov… ▽ More The global-in-time energy estimate is derived for the second-order accurate exponential time differencing Runge-Kutta (ETDRK2) numerical scheme to the phase field crystal (PFC) equation, a sixth-order parabolic equation modeling crystal evolution. To recover the value of stabilization constant, some local-in-time convergence analysis has been reported, and the energy stability becomes available over a fixed final time. In this work, we develop a global-in-time energy estimate for the ETDRK2 numerical scheme to the PFC equation by showing the energy dissipation property for any final time. An a priori assumption at the previous time step, combined with a single-step $H^2$ estimate of the numerical solution, is the key point in the analysis. Such an $H^2$ estimate recovers the maximum norm bound of the numerical solution at the next time step, and then the value of the stabilization parameter can be theoretically justified. This justification ensures the energy dissipation at the next time step, so that the mathematical induction can be effectively applied, by then the global-in-time energy estimate is accomplished. This paper represents the first effort to theoretically establish a global-in-time energy stability analysis for a second-order stabilized numerical scheme in terms of the original free energy functional. The presented methodology is expected to be available for many other Runge-Kutta numerical schemes to the gradient flow equations. △ Less

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2406.01037 [pdf, ps, other]

Engineering second-order topological insulators via coupling two first-order topological insulators

Authors: Lizhou Liu, Jiaqi An, Yafei Ren, Yingtao Zhang, Zhenhua Qiao, Qian Niu

Abstract: We theoretically investigate the engineering of two-dimensional second-order topological insulators with corner states by coupling two first-order topological insulators. We find that the interlayer coupling between two topological insulators with opposite topological invariants results in the formation of edge-state gaps, which are essential for the emergence of the corner states. Using the effec… ▽ More We theoretically investigate the engineering of two-dimensional second-order topological insulators with corner states by coupling two first-order topological insulators. We find that the interlayer coupling between two topological insulators with opposite topological invariants results in the formation of edge-state gaps, which are essential for the emergence of the corner states. Using the effective Hamiltonian framework, We elucidate that the formation of topological corner states requires either the preservation of symmetry in the crystal system or effective mass countersigns for neighboring edge states. Our proposed strategy for inducing corner state through interlayer coupling is versatile and applicable to both $\mathbb{Z}_2$ topological insulators and quantum anomalous Hall effects. We demonstrate this approach using several representative models including the seminal Kane-Mele model, the Bernevig-Hughes-Zhang model, and the Rashba graphene model to explicitly exhibit the formation of corner states via interlater coupling. Moreover, we also observe that the stacking of the coupled $\mathbb{Z}_2$ topological insulating systems results in the formation of the time-reversal invariant three-dimensional second-order nodal ring semimetals. Remarkably, the three-dimensional system from the stacking of the Bernevig-Hughes-Zhang model can be transformed into second-order Dirac semimetals, characterized by one-dimensional hinge Fermi arcs. Our strategy of engineering second-order topological phases via simple interlayer coupling promises to advance the exploration of higher-order topological insulators in two-dimensional spinful systems. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.14398 [pdf, other]

SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network

Authors: Weiyu Guo, Ying Sun, Yijie Xu, Ziyue Qiao, Yongkui Yang, Hui Xiong

Abstract: Surface electromyography (sEMG) based gesture recognition offers a natural and intuitive interaction modality for wearable devices. Despite significant advancements in sEMG-based gesture-recognition models, existing methods often suffer from high computational latency and increased energy consumption. Additionally, the inherent instability of sEMG signals, combined with their sensitivity to distri… ▽ More Surface electromyography (sEMG) based gesture recognition offers a natural and intuitive interaction modality for wearable devices. Despite significant advancements in sEMG-based gesture-recognition models, existing methods often suffer from high computational latency and increased energy consumption. Additionally, the inherent instability of sEMG signals, combined with their sensitivity to distribution shifts in real-world settings, compromises model robustness. To tackle these challenges, we propose a novel SpGesture framework based on Spiking Neural Networks, which possesses several unique merits compared with existing methods: (1) Robustness: By utilizing membrane potential as a memory list, we pioneer the introduction of Source-Free Domain Adaptation into SNN for the first time. This enables SpGesture to mitigate the accuracy degradation caused by distribution shifts. (2) High Accuracy: With a novel Spiking Jaccard Attention, SpGesture enhances the SNNs' ability to represent sEMG features, leading to a notable rise in system accuracy. To validate SpGesture's performance, we collected a new sEMG gesture dataset which has different forearm postures, where SpGesture achieved the highest accuracy among the baselines ($89.26\%$). Moreover, the actual deployment on the CPU demonstrated a system latency below 100ms, well within real-time requirements. This impressive performance showcases SpGesture's potential to enhance the applicability of sEMG in real-world scenarios. The code is available at https://anonymous.4open.science/r/SpGesture. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.11249 [pdf, ps, other]

Interlayer Coupling Induced Topological Phase Transition to Higher Order

Authors: Lizhou Liu, Jiaqi An, Yafei Ren, Yingtao Zhang, Zhenhua Qiao, Qian Niu

Abstract: We theoretically find that the second-order topological insulator, i.e., corner states, can be engineered by coupling two copies of two-dimensional $\mathbb{Z}_2$ topological insulators with opposite spin-helicities. As concrete examples, we utilize Kane-Mele models (i.e., graphene with intrinsic spin-orbit coupling) to realize the corner states by setting the respective graphenes to be… ▽ More We theoretically find that the second-order topological insulator, i.e., corner states, can be engineered by coupling two copies of two-dimensional $\mathbb{Z}_2$ topological insulators with opposite spin-helicities. As concrete examples, we utilize Kane-Mele models (i.e., graphene with intrinsic spin-orbit coupling) to realize the corner states by setting the respective graphenes to be $\mathbb{Z}_2$ topological insulators with opposite intrinsic spin-orbit couplings. To exhibit its universality, we generalize our findings to other representative $\mathbb{Z}_2$ topological insulators, e.g., the Bernevig-Hughes-Zhang model. An effective model is presented to reveal the physical origin of corner states. We further show that the corner states can also be designed in other topological systems, e.g., by coupling quantum anomalous Hall systems with opposite Chern numbers. Our work suggests that interlayer coupling can be treated as a simple and efficient strategy to drive lower-order topological insulators to the higher-order ones. △ Less

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2405.03969 [pdf, other]

Speak the Same Language: Global LiDAR Registration on BIM Using Pose Hough Transform

Authors: Zhijian Qiao, Haoming Huang, Chuhao Liu, Shaojie Shen, Fumin Zhang, Huan Yin

Abstract: The construction and robotic sensing data originate from disparate sources and are associated with distinct frames of reference. The primary objective of this study is to align LiDAR point clouds with building information modeling (BIM) using a global point cloud registration approach, aimed at establishing a shared understanding between the two modalities, i.e., ``speak the same language''. To ac… ▽ More The construction and robotic sensing data originate from disparate sources and are associated with distinct frames of reference. The primary objective of this study is to align LiDAR point clouds with building information modeling (BIM) using a global point cloud registration approach, aimed at establishing a shared understanding between the two modalities, i.e., ``speak the same language''. To achieve this, we design a cross-modality registration method, spanning from front end the back end. At the front end, we extract descriptors by identifying walls and capturing the intersected corners. Subsequently, for the back-end pose estimation, we employ the Hough transform for pose estimation and estimate multiple pose candidates. The final pose is verified by wall-pixel correlation. To evaluate the effectiveness of our method, we conducted real-world multi-session experiments in a large-scale university building, involving two different types of LiDAR sensors. We also report our findings and plan to make our collected dataset open-sourced. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 12 pages, 10 figures

arXiv:2405.01054 [pdf, other]

Continual Learning for Robust Gate Detection under Dynamic Lighting in Autonomous Drone Racing

Authors: Zhongzheng Qiao, Xuan Huy Pham, Savitha Ramasamy, Xudong Jiang, Erdal Kayacan, Andriy Sarabakha

Abstract: In autonomous and mobile robotics, a principal challenge is resilient real-time environmental perception, particularly in situations characterized by unknown and dynamic elements, as exemplified in the context of autonomous drone racing. This study introduces a perception technique for detecting drone racing gates under illumination variations, which is common during high-speed drone flights. The… ▽ More In autonomous and mobile robotics, a principal challenge is resilient real-time environmental perception, particularly in situations characterized by unknown and dynamic elements, as exemplified in the context of autonomous drone racing. This study introduces a perception technique for detecting drone racing gates under illumination variations, which is common during high-speed drone flights. The proposed technique relies upon a lightweight neural network backbone augmented with capabilities for continual learning. The envisaged approach amalgamates predictions of the gates' positional coordinates, distance, and orientation, encapsulating them into a cohesive pose tuple. A comprehensive number of tests serve to underscore the efficacy of this approach in confronting diverse and challenging scenarios, specifically those involving variable lighting conditions. The proposed methodology exhibits notable robustness in the face of illumination variations, thereby substantiating its effectiveness. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: 8 pages, 6 figures, in 2024 International Joint Conference on Neural Networks (IJCNN)

arXiv:2404.17086 [pdf, ps, other]

On the equivalence of the semiclassical theory and the response theory

Authors: Jinxiong Jia, Longjun Xiang, Zhenhua Qiao, Jian Wang

Abstract: It is commonly believed that the response theory can give quantum correction (or interband coherent effects) to the semiclassical theory, while both formulations essentially are perturbatively solving the time-dependent Schödinger equation in a periodic potential probed by the electric field within the independent-particle approximation. Herein, by extending the semiclassical theory under an AC un… ▽ More It is commonly believed that the response theory can give quantum correction (or interband coherent effects) to the semiclassical theory, while both formulations essentially are perturbatively solving the time-dependent Schödinger equation in a periodic potential probed by the electric field within the independent-particle approximation. Herein, by extending the semiclassical theory under an AC uniform electric field to the nonlinear regime, we show that up to the second order of the electric field, the AC semiclassical theory is equivalent to the response theory in the absence of relaxation. Remarkably, this equivalence can be inherited when the relaxation is incorporated into the response theory, particularly by taking the semiclassical results with a finite relaxation time obtained by solving the Boltzmann equation as a benchmark. ...... △ Less

Submitted 10 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

Comments: add a section for the nonlinear AC responses and reshape the main conclusions

arXiv:2404.13305 [pdf, ps, other]

Chern Number Tunable Quantum Anomalous Hall Effect in Compensated Antiferromagnets

Authors: Wenhao Liang, Jiaqi An, Zeyu Li, Yafei Ren, Zhenhua Qiao, Qian Niu

Abstract: We propose to realize quantum anomalous Hall effect (QAHE) in two-dimensional antiferromagnetic topological insulators. We consider antiferromagnetic MnBi$_2$Te$_4$ as a concrete example. In contrast to the even-layer A-type antiferromagnetic MnBi$_2$Te$_4$ that has zero Chern number due to the combined parity-time ($\mathcal{PT}$) symmetry, the system can host a nonzero Chern number by breaking t… ▽ More We propose to realize quantum anomalous Hall effect (QAHE) in two-dimensional antiferromagnetic topological insulators. We consider antiferromagnetic MnBi$_2$Te$_4$ as a concrete example. In contrast to the even-layer A-type antiferromagnetic MnBi$_2$Te$_4$ that has zero Chern number due to the combined parity-time ($\mathcal{PT}$) symmetry, the system can host a nonzero Chern number by breaking this symmetry. We show that by controlling the antiferromagnetic spin configuration, for example, down/up/up/down, to break $\mathcal{PT}$ symmetry, tetralayer antiferromagnetic MnBi$_2$Te$_4$ can realize QAHE with Chern number $\mathcal{C}=-1$. Such spin configuration can be stablized by pinning the spin orientations on top and bottom layers. Furthermore, we reveal that the edge states are layer-selective and primarily locate at the boundaries of the bottom and top layers. In addition, via tuning the on-site orbital energy which determines the inverted band gap, we find tunable Chern number from $\mathcal{C}=-1$ to $\mathcal{C}=2$ and then to $\mathcal{C}=-1$. Our work not only proposes a scheme to realize Chern number tunable QAHE in antiferromagnets without net spin magnetization, but also provide a platform for layer-selective dissipationless transport devices. △ Less

Submitted 20 April, 2024; originally announced April 2024.

arXiv:2404.11213 [pdf, other]

Revisiting Noise Resilience Strategies in Gesture Recognition: Short-Term Enhancement in Surface Electromyographic Signal Analysis

Authors: Weiyu Guo, Ziyue Qiao, Ying Sun, Hui Xiong

Abstract: Gesture recognition based on surface electromyography (sEMG) has been gaining importance in many 3D Interactive Scenes. However, sEMG is easily influenced by various forms of noise in real-world environments, leading to challenges in providing long-term stable interactions through sEMG. Existing methods often struggle to enhance model noise resilience through various predefined data augmentation t… ▽ More Gesture recognition based on surface electromyography (sEMG) has been gaining importance in many 3D Interactive Scenes. However, sEMG is easily influenced by various forms of noise in real-world environments, leading to challenges in providing long-term stable interactions through sEMG. Existing methods often struggle to enhance model noise resilience through various predefined data augmentation techniques. In this work, we revisit the problem from a short term enhancement perspective to improve precision and robustness against various common noisy scenarios with learnable denoise using sEMG intrinsic pattern information and sliding-window attention. We propose a Short Term Enhancement Module(STEM) which can be easily integrated with various models. STEM offers several benefits: 1) Learnable denoise, enabling noise reduction without manual data augmentation; 2) Scalability, adaptable to various models; and 3) Cost-effectiveness, achieving short-term enhancement through minimal weight-sharing in an efficient attention mechanism. In particular, we incorporate STEM into a transformer, creating the Short Term Enhanced Transformer (STET). Compared with best-competing approaches, the impact of noise on STET is reduced by more than 20%. We also report promising results on both classification and regression datasets and demonstrate that STEM generalizes across different gesture recognition tasks. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2403.13553 [pdf]

VCounselor: A Psychological Intervention Chat Agent Based on a Knowledge-Enhanced Large Language Model

Authors: H. Zhang, Z. Qiao, H. Wang, B. Duan, J. Yin

Abstract: Conversational artificial intelligence can already independently engage in brief conversations with clients with psychological problems and provide evidence-based psychological interventions. The main objective of this study is to improve the effectiveness and credibility of the large language model in psychological intervention by creating a specialized agent, the VCounselor, to address the limit… ▽ More Conversational artificial intelligence can already independently engage in brief conversations with clients with psychological problems and provide evidence-based psychological interventions. The main objective of this study is to improve the effectiveness and credibility of the large language model in psychological intervention by creating a specialized agent, the VCounselor, to address the limitations observed in popular large language models such as ChatGPT in domain applications. We achieved this goal by proposing a new affective interaction structure and knowledge-enhancement structure. In order to evaluate VCounselor, this study compared the general large language model, the fine-tuned large language model, and VCounselor's knowledge-enhanced large language model. At the same time, the general large language model and the fine-tuned large language model will also be provided with an avatar to compare them as an agent with VCounselor. The comparison results indicated that the affective interaction structure and knowledge-enhancement structure of VCounselor significantly improved the effectiveness and credibility of the psychological intervention, and VCounselor significantly provided positive tendencies for clients' emotions. The conclusion of this study strongly supports that VConselor has a significant advantage in providing psychological support to clients by being able to analyze the patient's problems with relative accuracy and provide professional-level advice that enhances support for clients. △ Less

Submitted 20 March, 2024; originally announced March 2024.

Comments: 24 pages, 6 figures

ACM Class: J.4

arXiv:2403.13550 [pdf]

The Tribal Theater Model: Social Regulation for Dynamic User Adaptation in Virtual Interactive Environments

Authors: H. Zhang, B. Duan, H. Wang, Z. Qiao, J. Yin

Abstract: This paper proposes a social regulation model for dynamic adaptation according to user characteristics in virtual interactive environments, namely the tribal theater model. The model focuses on organizational regulation and builds an interaction scheme with more resilient user performance by improving the subjectivity of the user. This paper discusses the sociological theoretical basis of this mod… ▽ More This paper proposes a social regulation model for dynamic adaptation according to user characteristics in virtual interactive environments, namely the tribal theater model. The model focuses on organizational regulation and builds an interaction scheme with more resilient user performance by improving the subjectivity of the user. This paper discusses the sociological theoretical basis of this model and how it was migrated to an engineering implementation of a virtual interactive environment. The model defines user interactions within a field that are regulated by a matrix through the allocation of resources. To verify the effectiveness of the tribal theater model, we designed an experimental scene using a chatroom as an example. We trained the matrix as an AI model using a temporal transformer and compared it with an interaction field with different levels of control. The experimental results showed that the tribal theater model can improve users' interactive experience, enhance resilient user performance, and effectively complete environmental interaction tasks under rule-based interaction. △ Less

Submitted 20 March, 2024; originally announced March 2024.

Comments: 20 pages, 6 figures

ACM Class: J.4

arXiv:2403.07504 [pdf]

doi 10.1038/s41535-024-00635-5

Two-dimensional phase diagram of the charge density wave in doped CsV$_3$Sb$_5$

Authors: Linwei Huai, Hongyu Li, Yulei Han, Yang Luo, Shuting Peng, Zhiyuan Wei, Jianchang Shen, Bingqian Wang, Yu Miao, Xiupeng Sun, Zhipeng Ou, Bo Liu, Xiaoxiao Yu, Ziji Xiang, Min-Quan Kuang, Zhenhua Qiao, Xianhui Chen, Junfeng He

Abstract: Kagome superconductors AV$_3$Sb$_5$ (A = K, Rb and Cs) have attracted much recent attention due to the coexistence of multiple exotic orders. Among them, the charge density wave (CDW) order has been shown to host various unconventional behaviors. Here, we investigate the CDW order by a combination of both bulk and surface doping methods. While element substitutions in bulk doping change both carri… ▽ More Kagome superconductors AV$_3$Sb$_5$ (A = K, Rb and Cs) have attracted much recent attention due to the coexistence of multiple exotic orders. Among them, the charge density wave (CDW) order has been shown to host various unconventional behaviors. Here, we investigate the CDW order by a combination of both bulk and surface doping methods. While element substitutions in bulk doping change both carriers and the crystal lattice, the surface doping primarily tunes the carrier concentration. As such, our results reveal a two-dimensional phase diagram of the CDW in doped CsV$_3$Sb$_5$. In the lightly bulk doped regime, the existence of CDW order is reversible by tuning the carrier concentration. But excessive bulk doping permanently destroys the CDW, regardless of the carrier doping level. These results provide insights to the origin of the CDW from both electronic and structural degrees of freedom. They also open an avenue for manipulating the exotic CDW order in Kagome superconductors. △ Less

Submitted 12 March, 2024; originally announced March 2024.

Comments: 14 pages, 4 figures

Journal ref: npj Quantum Mater. 9,23(2024)

arXiv:2402.12035 [pdf, other]

Class-incremental Learning for Time Series: Benchmark and Evaluation

Authors: Zhongzheng Qiao, Quang Pham, Zhen Cao, Hoang H Le, P. N. Suganthan, Xudong Jiang, Ramasamy Savitha

Abstract: Real-world environments are inherently non-stationary, frequently introducing new classes over time. This is especially common in time series classification, such as the emergence of new disease classification in healthcare or the addition of new activities in human activity recognition. In such cases, a learning system is required to assimilate novel classes effectively while avoiding catastrophi… ▽ More Real-world environments are inherently non-stationary, frequently introducing new classes over time. This is especially common in time series classification, such as the emergence of new disease classification in healthcare or the addition of new activities in human activity recognition. In such cases, a learning system is required to assimilate novel classes effectively while avoiding catastrophic forgetting of the old ones, which gives rise to the Class-incremental Learning (CIL) problem. However, despite the encouraging progress in the image and language domains, CIL for time series data remains relatively understudied. Existing studies suffer from inconsistent experimental designs, necessitating a comprehensive evaluation and benchmarking of methods across a wide range of datasets. To this end, we first present an overview of the Time Series Class-incremental Learning (TSCIL) problem, highlight its unique challenges, and cover the advanced methodologies. Further, based on standardized settings, we develop a unified experimental framework that supports the rapid development of new algorithms, easy integration of new datasets, and standardization of the evaluation process. Using this framework, we conduct a comprehensive evaluation of various generic and time-series-specific CIL methods in both standard and privacy-sensitive scenarios. Our extensive experiments not only provide a standard baseline to support future research but also shed light on the impact of various design factors such as normalization layers or memory budget thresholds. Codes are available at https://github.com/zqiao11/TSCIL. △ Less

Submitted 19 February, 2024; originally announced February 2024.

Comments: Currently under review for KDD 2024 (ADS track)

arXiv:2402.04555 [pdf, other]

FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models

Authors: Chuhao Liu, Ke Wang, Jieqi Shi, Zhijian Qiao, Shaojie Shen

Abstract: Semantic mapping based on the supervised object detectors is sensitive to image distribution. In real-world environments, the object detection and segmentation performance can lead to a major drop, preventing the use of semantic mapping in a wider domain. On the other hand, the development of vision-language foundation models demonstrates a strong zero-shot transferability across data distribution… ▽ More Semantic mapping based on the supervised object detectors is sensitive to image distribution. In real-world environments, the object detection and segmentation performance can lead to a major drop, preventing the use of semantic mapping in a wider domain. On the other hand, the development of vision-language foundation models demonstrates a strong zero-shot transferability across data distribution. It provides an opportunity to construct generalizable instance-aware semantic maps. Hence, this work explores how to boost instance-aware semantic mapping from object detection generated from foundation models. We propose a probabilistic label fusion method to predict close-set semantic classes from open-set label measurements. An instance refinement module merges the over-segmented instances caused by inconsistent segmentation. We integrate all the modules into a unified semantic mapping system. Reading a sequence of RGB-D input, our work incrementally reconstructs an instance-aware semantic map. We evaluate the zero-shot performance of our method in ScanNet and SceneNN datasets. Our method achieves 40.3 mean average precision (mAP) on the ScanNet semantic instance segmentation task. It outperforms the traditional semantic mapping method significantly. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: Accepted by IEEE RA-L

arXiv:2402.02200 [pdf, other]

Less is More: Physical-enhanced Radar-Inertial Odometry

Authors: Qiucan Huang, Yuchen Liang, Zhijian Qiao, Shaojie Shen, Huan Yin

Abstract: Radar offers the advantage of providing additional physical properties related to observed objects. In this study, we design a physical-enhanced radar-inertial odometry system that capitalizes on the Doppler velocities and radar cross-section information. The filter for static radar points, correspondence estimation, and residual functions are all strengthened by integrating the physical propertie… ▽ More Radar offers the advantage of providing additional physical properties related to observed objects. In this study, we design a physical-enhanced radar-inertial odometry system that capitalizes on the Doppler velocities and radar cross-section information. The filter for static radar points, correspondence estimation, and residual functions are all strengthened by integrating the physical properties. We conduct experiments on both public datasets and our self-collected data, with different mobile platforms and sensor types. Our quantitative results demonstrate that the proposed radar-inertial odometry system outperforms alternative methods using the physical-enhanced components. Our findings also reveal that using the physical properties results in fewer radar points for odometry estimation, but the performance is still guaranteed and even improved, thus aligning with the ``less is more'' principle. △ Less

Submitted 3 February, 2024; originally announced February 2024.

Comments: Accepted by ICRA 2024

arXiv:2401.16011 [pdf, other]

GPS: Graph Contrastive Learning via Multi-scale Augmented Views from Adversarial Pooling

Authors: Wei Ju, Yiyang Gu, Zhengyang Mao, Ziyue Qiao, Yifang Qin, Xiao Luo, Hui Xiong, Ming Zhang

Abstract: Self-supervised graph representation learning has recently shown considerable promise in a range of fields, including bioinformatics and social networks. A large number of graph contrastive learning approaches have shown promising performance for representation learning on graphs, which train models by maximizing agreement between original graphs and their augmented views (i.e., positive views). U… ▽ More Self-supervised graph representation learning has recently shown considerable promise in a range of fields, including bioinformatics and social networks. A large number of graph contrastive learning approaches have shown promising performance for representation learning on graphs, which train models by maximizing agreement between original graphs and their augmented views (i.e., positive views). Unfortunately, these methods usually involve pre-defined augmentation strategies based on the knowledge of human experts. Moreover, these strategies may fail to generate challenging positive views to provide sufficient supervision signals. In this paper, we present a novel approach named Graph Pooling ContraSt (GPS) to address these issues. Motivated by the fact that graph pooling can adaptively coarsen the graph with the removal of redundancy, we rethink graph pooling and leverage it to automatically generate multi-scale positive views with varying emphasis on providing challenging positives and preserving semantics, i.e., strongly-augmented view and weakly-augmented view. Then, we incorporate both views into a joint contrastive learning framework with similarity learning and consistency learning, where our pooling module is adversarially trained with respect to the encoder for adversarial robustness. Experiments on twelve datasets on both graph classification and transfer learning tasks verify the superiority of the proposed method over its counterparts. △ Less

Submitted 29 January, 2024; originally announced January 2024.

Comments: Accepted by SCIENCE CHINA Information Sciences (SCIS 2024)

arXiv:2401.06174 [pdf]

Machine Learning Applications in Spine Biomechanics

Authors: Farshid Ghezelbash, Amir Hossein Eskandari, Xavier Robert-Lachaine, Frank Cao, Mehran Pesteie, Zhuohua Qiao, Aboulfazl Shirazi-Adl, Christian Larivière

Abstract: Spine biomechanics is at a transformation with the advent and integration of machine learning and computer vision technologies. These novel techniques facilitate the estimation of 3D body shapes, anthropometrics, and kinematics from as simple as a single-camera image, making them more accessible and practical for a diverse range of applications. This study introduces a framework that merges these… ▽ More Spine biomechanics is at a transformation with the advent and integration of machine learning and computer vision technologies. These novel techniques facilitate the estimation of 3D body shapes, anthropometrics, and kinematics from as simple as a single-camera image, making them more accessible and practical for a diverse range of applications. This study introduces a framework that merges these methodologies with traditional musculoskeletal modeling, enabling comprehensive analysis of spinal biomechanics during complex activities from a single camera. Additionally, we aim to evaluate their performance and limitations in spine biomechanics applications. The real-world applications explored in this study include assessment in workplace lifting, evaluation of whiplash injuries in car accidents, and biomechanical analysis in professional sports. Our results demonstrate potential and limitations of various algorithms in estimating body shape, kinematics, and conducting in-field biomechanical analyses. In industrial settings, the potential to utilize these new technologies for biomechanical risk assessments offers a pathway for preventive measures against back injuries. In sports activities, the proposed framework provides new opportunities for performance optimization, injury prevention, and rehabilitation. The application in forensic domain further underscores the wide-reaching implications of this technology. While certain limitations were identified, particularly in accuracy of predictions, complex interactions, and external load estimation, this study demonstrates their potential for advancement in spine biomechanics, heralding an optimistic future in both research and practical applications. △ Less

Submitted 9 January, 2024; originally announced January 2024.

arXiv:2401.02295 [pdf]

Tunning the number of chiral edge channels in a fixed quantum anomalous Hall system

Authors: Peng Deng, Yulei Han, Peng Zhang, Su Kong Chong, Zhenhua Qiao, Kang L. Wang

Abstract: Quantum anomalous Hall (QAH) insulators exhibit chiral edge channels characterized by vanishing longitudinal conductance and quantized Hall conductance of Ce2/h, wherein the Chern number C is an integer equal to the number of the parallel chiral edge channels. These chiral edge channels conduct dissipationless transport in QAH insulators, making them pivotal for applications in low-consumption ele… ▽ More Quantum anomalous Hall (QAH) insulators exhibit chiral edge channels characterized by vanishing longitudinal conductance and quantized Hall conductance of Ce2/h, wherein the Chern number C is an integer equal to the number of the parallel chiral edge channels. These chiral edge channels conduct dissipationless transport in QAH insulators, making them pivotal for applications in low-consumption electronics and topological quantum computing. While the QAH effect with multiple chiral edge channels (i.e., C >1) has been demonstrated in multilayers consisting of magnetic topological insulators and normal insulators, the channel number remains fixed for a given sample. Here, we unveil the tunability of the number of chiral edge channels within a single QAH insulator device. By tuning the magnetization of individual layers within the multilayer system, Chern insulating states with different Chern numbers are unveiled. The tunable Chern number was corroborated by our theoretical calculations. Furthermore, we conducted layer-dependent calculations to elucidate the contribution of the Chern number from different layers in the multilayer. Our findings demonstrate an extra degree of freedom in manipulating the chiral edge channels in QAH insulators. This newfound tunability offers extra dimension for the implementation of the QAH-based multi-channel dissipationless transport. △ Less

Submitted 4 January, 2024; originally announced January 2024.

Comments: The findings and content of this manuscript were also presented in a talk at the CPS meeting in December 2022. The video recording of the talk can be accessed at the following link: https://www.koushare.com/video/videodetail/39429

arXiv:2312.11923 [pdf, other]

IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition

Authors: Xiaomeng Yang, Zhi Qiao, Yu Zhou, Weiping Wang

Abstract: Nowadays, scene text recognition has attracted more and more attention due to its diverse applications. Most state-of-the-art methods adopt an encoder-decoder framework with the attention mechanism, autoregressively generating text from left to right. Despite the convincing performance, this sequential decoding strategy constrains inference speed. Conversely, non-autoregressive models provide fast… ▽ More Nowadays, scene text recognition has attracted more and more attention due to its diverse applications. Most state-of-the-art methods adopt an encoder-decoder framework with the attention mechanism, autoregressively generating text from left to right. Despite the convincing performance, this sequential decoding strategy constrains inference speed. Conversely, non-autoregressive models provide faster, simultaneous predictions but often sacrifice accuracy. Although utilizing an explicit language model can improve performance, it burdens the computational load. Besides, separating linguistic knowledge from vision information may harm the final prediction. In this paper, we propose an alternative solution, using a parallel and iterative decoder that adopts an easy-first decoding strategy. Furthermore, we regard text recognition as an image-based conditional text generation task and utilize the discrete diffusion strategy, ensuring exhaustive exploration of bidirectional contextual information. Extensive experiments demonstrate that the proposed approach achieves superior results on the benchmark datasets, including both Chinese and English text images. △ Less

Submitted 19 December, 2023; originally announced December 2023.

arXiv:2312.05842 [pdf, other]

Mutual Enhancement of Large and Small Language Models with Cross-Silo Knowledge Transfer

Authors: Yongheng Deng, Ziqing Qiao, Ju Ren, Yang Liu, Yaoxue Zhang

Abstract: While large language models (LLMs) are empowered with broad knowledge, their task-specific performance is often suboptimal. It necessitates fine-tuning LLMs with task-specific data, but such data may be inaccessible due to privacy concerns. In this paper, we propose a novel approach to enhance LLMs with smaller language models (SLMs) that are trained on clients using their private task-specific da… ▽ More While large language models (LLMs) are empowered with broad knowledge, their task-specific performance is often suboptimal. It necessitates fine-tuning LLMs with task-specific data, but such data may be inaccessible due to privacy concerns. In this paper, we propose a novel approach to enhance LLMs with smaller language models (SLMs) that are trained on clients using their private task-specific data. To enable mutual enhancement between LLMs and SLMs, we propose CrossLM, where the SLMs promote the LLM to generate task-specific high-quality data, and both the LLM and SLMs are enhanced with the generated data. We evaluate CrossLM using publicly accessible language models across a range of benchmark tasks. The results demonstrate that CrossLM significantly enhances the task-specific performance of SLMs on clients and the LLM on the cloud server simultaneously while preserving the LLM's generalization capability. △ Less

Submitted 10 December, 2023; originally announced December 2023.

arXiv:2312.04093 [pdf, ps, other]

Dissipationless gyrotropic magnetic Hall effect

Authors: Longjun Xiang, Jinxiong Jia, Fuming Xu, Zhenhua Qiao, Jian Wang

Abstract: A dissipationless longitudinal current can be generated by a pure magnetic field through the chiral magnetic effect. Herein, we propose that a pure oscillating magnetic field through Zeeman coupling can further drive an AC magnetic Hall current in two-dimensional systems without inversion symmetry. We dub this effect the "gyrotropic magnetic Hall effect" (GMHE), in analogy with the gyrotropic curr… ▽ More A dissipationless longitudinal current can be generated by a pure magnetic field through the chiral magnetic effect. Herein, we propose that a pure oscillating magnetic field through Zeeman coupling can further drive an AC magnetic Hall current in two-dimensional systems without inversion symmetry. We dub this effect the "gyrotropic magnetic Hall effect" (GMHE), in analogy with the gyrotropic current achieved by rectifying the optical fields. Importantly, we find that the GMHE conductivity is a reactive or dissipationless transport coefficient, which is even under time-reversal symmetry. We reveal the "Zeeman Berry curvature" as the quantum origin of the GMHE, whose integral over all states below the Fermi energy gives the GMHE conductivity. Furthermore, by symmetry analysis, we show that the GMHE can appear in a wide range of two-dimensional materials. To demonstrate our proposal, we evaluate the GMHE current in two-dimensional Rashba system and in the surface of topological insulator, where a low-frequency magnetic field with a small amplitude can be converted into a detectable Hall voltage. △ Less

Submitted 7 December, 2023; originally announced December 2023.

arXiv:2312.00951 [pdf, other]

AV4EV: Open-Source Modular Autonomous Electric Vehicle Platform for Making Mobility Research Accessible

Authors: Zhijie Qiao, Mingyan Zhou, Zhijun Zhuang, Tejas Agarwal, Felix Jahncke, Po-Jen Wang, Jason Friedman, Hongyi Lai, Divyanshu Sahu, Tomáš Nagy, Martin Endler, Jason Schlessman, Rahul Mangharam

Abstract: When academic researchers develop and validate autonomous driving algorithms, there is a challenge in balancing high-performance capabilities with the cost and complexity of the vehicle platform. Much of today's research on autonomous vehicles (AV) is limited to experimentation on expensive commercial vehicles that require large skilled teams to retrofit the vehicles and test them in dedicated fac… ▽ More When academic researchers develop and validate autonomous driving algorithms, there is a challenge in balancing high-performance capabilities with the cost and complexity of the vehicle platform. Much of today's research on autonomous vehicles (AV) is limited to experimentation on expensive commercial vehicles that require large skilled teams to retrofit the vehicles and test them in dedicated facilities. On the other hand, 1/10th-1/16th scaled-down vehicle platforms are more affordable but have limited similitude in performance and drivability. To address this issue, we present the design of a one-third-scale autonomous electric go-kart platform with open-source mechatronics design along with fully functional autonomous driving software. The platform's multi-modal driving system is capable of manual, autonomous, and teleoperation driving modes. It also features a flexible sensing suite for the algorithm deployment across perception, localization, planning, and control. This development serves as a bridge between full-scale vehicles and reduced-scale cars while accelerating cost-effective algorithmic advancements. Our experimental results demonstrate the AV4EV platform's capabilities and ease of use for developing new AV algorithms. All materials are available at AV4EV.org to stimulate collaborative efforts within the AV and electric vehicle (EV) communities. △ Less

Submitted 12 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

Comments: 6 pages, 5 figures

arXiv:2310.19663 [pdf, ps, other]

A linear doubly stabilized Crank-Nicolson scheme for the Allen-Cahn equation with a general mobility

Authors: Dianming Hou, Zhonghua Qiao, Lili Ju

Abstract: In this paper, a linear second order numerical scheme is developed and investigated for the Allen-Cahn equation with a general positive mobility. In particular, our fully discrete scheme is mainly constructed based on the Crank-Nicolson formula for temporal discretization and the central finite difference method for spatial approximation, and two extra stabilizing terms are also introduced for the… ▽ More In this paper, a linear second order numerical scheme is developed and investigated for the Allen-Cahn equation with a general positive mobility. In particular, our fully discrete scheme is mainly constructed based on the Crank-Nicolson formula for temporal discretization and the central finite difference method for spatial approximation, and two extra stabilizing terms are also introduced for the purpose of improving numerical stability. The proposed scheme is shown to unconditionally preserve the maximum bound principle (MBP) under mild restrictions on the stabilization parameters, which is of practical importance for achieving good accuracy and stability simultaneously. With the help of uniform boundedness of the numerical solutions due to MBP, we then successfully derive $H^{1}$-norm and $L^{\infty}$-norm error estimates for the Allen-Cahn equation with a constant and a variable mobility, respectively. Moreover, the energy stability of the proposed scheme is also obtained in the sense that the discrete free energy is uniformly bounded by the one at the initial time plus a {\color{black}constant}. Finally, some numerical experiments are carried out to verify the theoretical results and illustrate the performance of the proposed scheme with a time adaptive strategy. △ Less

Submitted 30 October, 2023; originally announced October 2023.

arXiv:2310.15858 [pdf, ps, other]

Topology-aware Debiased Self-supervised Graph Learning for Recommendation

Authors: Lei Han, Hui Yan, Zhicheng Qiao

Abstract: In recommendation, graph-based Collaborative Filtering (CF) methods mitigate the data sparsity by introducing Graph Contrastive Learning (GCL). However, the random negative sampling strategy in these GCL-based CF models neglects the semantic structure of users (items), which not only introduces false negatives (negatives that are similar to anchor user (item)) but also ignores the potential positi… ▽ More In recommendation, graph-based Collaborative Filtering (CF) methods mitigate the data sparsity by introducing Graph Contrastive Learning (GCL). However, the random negative sampling strategy in these GCL-based CF models neglects the semantic structure of users (items), which not only introduces false negatives (negatives that are similar to anchor user (item)) but also ignores the potential positive samples. To tackle the above issues, we propose Topology-aware Debiased Self-supervised Graph Learning (TDSGL) for recommendation, which constructs contrastive pairs according to the semantic similarity between users (items). Specifically, since the original user-item interaction data commendably reflects the purchasing intent of users and certain characteristics of items, we calculate the semantic similarity between users (items) on interaction data. Then, given a user (item), we construct its negative pairs by selecting users (items) which embed different semantic structures to ensure the semantic difference between the given user (item) and its negatives. Moreover, for a user (item), we design a feature extraction module that converts other semantically similar users (items) into an auxiliary positive sample to acquire a more informative representation. Experimental results show that the proposed model outperforms the state-of-the-art models significantly on three public datasets. Our model implementation codes are available at https://github.com/malajikuai/TDSGL. △ Less

Submitted 24 October, 2023; originally announced October 2023.

Comments: 6 pages,8 figures

arXiv:2310.15017 [pdf, other]

Mind the Model, Not the Agent: The Primacy Bias in Model-based RL

Authors: Zhongjian Qiao, Jiafei Lyu, Xiu Li

Abstract: The primacy bias in model-free reinforcement learning (MFRL), which refers to the agent's tendency to overfit early data and lose the ability to learn from new data, can significantly decrease the performance of MFRL algorithms. Previous studies have shown that employing simple techniques, such as resetting the agent's parameters, can substantially alleviate the primacy bias in MFRL. However, the… ▽ More The primacy bias in model-free reinforcement learning (MFRL), which refers to the agent's tendency to overfit early data and lose the ability to learn from new data, can significantly decrease the performance of MFRL algorithms. Previous studies have shown that employing simple techniques, such as resetting the agent's parameters, can substantially alleviate the primacy bias in MFRL. However, the primacy bias in model-based reinforcement learning (MBRL) remains unexplored. In this work, we focus on investigating the primacy bias in MBRL. We begin by observing that resetting the agent's parameters harms its performance in the context of MBRL. We further find that the primacy bias in MBRL is more closely related to the primacy bias of the world model instead of the primacy bias of the agent. Based on this finding, we propose \textit{world model resetting}, a simple yet effective technique to alleviate the primacy bias in MBRL. We apply our method to two different MBRL algorithms, MBPO and DreamerV2. We validate the effectiveness of our method on multiple continuous control tasks on MuJoCo and DeepMind Control Suite, as well as discrete control tasks on Atari 100k benchmark. The experimental results show that \textit{world model resetting} can significantly alleviate the primacy bias in the model-based setting and improve the algorithm's performance. We also give a guide on how to perform \textit{world model resetting} effectively. △ Less

Submitted 7 July, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

Comments: Accepted by European Conference on Artificial Intelligence (ECAI) 2024

arXiv:2310.05411 [pdf, other]

Local Structure-Preserving Relaxation Method for Charged Systems on Unstructured Meshes

Authors: Zhonghua Qiao, Zhenli Xu, Qian Yin, Shenggao Zhou

Abstract: This work considers charged systems described by the modified Poisson--Nernst--Planck (PNP) equations, which incorporate ionic steric effects and the Born solvation energy for dielectric inhomogeneity. Solving the steady-state modified PNP equations poses numerical challenges due to the emergence of sharp boundary layers caused by small Debye lengths, particularly when local ionic concentrations r… ▽ More This work considers charged systems described by the modified Poisson--Nernst--Planck (PNP) equations, which incorporate ionic steric effects and the Born solvation energy for dielectric inhomogeneity. Solving the steady-state modified PNP equations poses numerical challenges due to the emergence of sharp boundary layers caused by small Debye lengths, particularly when local ionic concentrations reach saturation. To address this, we first reformulate the steady-state problem as a constraint optimization, where the ionic concentrations on unstructured Delaunay nodes are treated as fractional particles moving along edges between nodes. The electric fields are then updated to minimize the objective free energy while satisfying the discrete Gauss's law. We develop a local relaxation method on unstructured meshes that inherently respects the discrete Gauss's law, ensuring curl-free electric fields. Numerical analysis demonstrates that the optimal mass of the moving fractional particles guarantees the positivity of both ionic and solvent concentrations. Additionally, the free energy of the charged system consistently decreases during successive updates of ionic concentrations and electric fields. We conduct numerical tests to validate the expected numerical accuracy, positivity, free-energy dissipation, and robustness of our method in simulating charged systems with sharp boundary layers. △ Less

Submitted 9 October, 2023; originally announced October 2023.

arXiv:2310.00824 [pdf, ps, other]

Energy-dissipative spectral renormalization exponential integrator method for gradient flow problems

Authors: Dianming Hou, Lili Ju, Zhonghua Qiao

Abstract: In this paper, we present a novel spectral renormalization exponential integrator method for solving gradient flow problems. Our method is specifically designed to simultaneously satisfy discrete analogues of the energy dissipation laws and achieve high-order accuracy in time. To accomplish this, our method first incorporates the energy dissipation law into the target gradient flow equation by int… ▽ More In this paper, we present a novel spectral renormalization exponential integrator method for solving gradient flow problems. Our method is specifically designed to simultaneously satisfy discrete analogues of the energy dissipation laws and achieve high-order accuracy in time. To accomplish this, our method first incorporates the energy dissipation law into the target gradient flow equation by introducing a time-dependent spectral renormalization (TDSR) factor. Then, the coupled equations are discretized using the spectral approximation in space and the exponential time differencing (ETD) in time. Finally, the resulting fully discrete nonlinear system is decoupled and solved using the Picard iteration at each time step. Furthermore, we introduce an extra enforcing term into the system for updating the TDSR factor, which greatly relaxes the time step size restriction of the proposed method and enhances its computational efficiency. Extensive numerical tests with various gradient flows are also presented to demonstrate the accuracy and effectiveness of our method as well as its high efficiency when combined with an adaptive time-stepping strategy for long-term simulations. △ Less

Submitted 1 October, 2023; originally announced October 2023.

Comments: 24 pages, 12 figures

arXiv:2309.10773 [pdf, other]

Semi-supervised Domain Adaptation in Graph Transfer Learning

Authors: Ziyue Qiao, Xiao Luo, Meng Xiao, Hao Dong, Yuanchun Zhou, Hui Xiong

Abstract: As a specific case of graph transfer learning, unsupervised domain adaptation on graphs aims for knowledge transfer from label-rich source graphs to unlabeled target graphs. However, graphs with topology and attributes usually have considerable cross-domain disparity and there are numerous real-world scenarios where merely a subset of nodes are labeled in the source graph. This imposes critical ch… ▽ More As a specific case of graph transfer learning, unsupervised domain adaptation on graphs aims for knowledge transfer from label-rich source graphs to unlabeled target graphs. However, graphs with topology and attributes usually have considerable cross-domain disparity and there are numerous real-world scenarios where merely a subset of nodes are labeled in the source graph. This imposes critical challenges on graph transfer learning due to serious domain shifts and label scarcity. To address these challenges, we propose a method named Semi-supervised Graph Domain Adaptation (SGDA). To deal with the domain shift, we add adaptive shift parameters to each of the source nodes, which are trained in an adversarial manner to align the cross-domain distributions of node embedding, thus the node classifier trained on labeled source nodes can be transferred to the target nodes. Moreover, to address the label scarcity, we propose pseudo-labeling on unlabeled nodes, which improves classification on the target graph via measuring the posterior influence of nodes based on their relative position to the class centroids. Finally, extensive experiments on a range of publicly accessible datasets validate the effectiveness of our proposed SGDA in different experimental settings. △ Less

Submitted 19 September, 2023; originally announced September 2023.

arXiv:2309.08464 [pdf, ps, other]

Differentially Private Average Consensus with Improved Accuracy-Privacy Trade-off

Authors: Lei Wang, Weijia Liu, Fanghong Guo, Zixin Qiao, Zhengguang Wu

Abstract: This paper studies the average consensus problem with differential privacy of initial states, for which it is widely recognized that there is a trade-off between the mean-square computation accuracy and privacy level. Considering the trade-off gap between the average consensus algorithm and the centralized averaging approach with differential privacy, we propose a distributed shuffling mechanism b… ▽ More This paper studies the average consensus problem with differential privacy of initial states, for which it is widely recognized that there is a trade-off between the mean-square computation accuracy and privacy level. Considering the trade-off gap between the average consensus algorithm and the centralized averaging approach with differential privacy, we propose a distributed shuffling mechanism based on the Paillier cryptosystem to generate correlated zero-sum randomness. By randomizing each local privacy-sensitive initial state with an i.i.d. Gaussian noise and the output of the mechanism using Gaussian noises, it is shown that the resulting average consensus algorithm can eliminate the gap in the sense that the accuracy-privacy trade-off of the centralized averaging approach with differential privacy can be almost recovered by appropriately designing the variances of the added noises. We also extend such a design framework with Gaussian noises to the one using Laplace noises, and show that the improved privacy-accuracy trade-off is preserved. △ Less

Submitted 5 May, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

arXiv:2309.02657 [pdf, other]

Energy stable and maximum bound principle preserving schemes for the Q-tensor flow of liquid crystals

Authors: Dianming Hou, Xiaoli Li, Zhonghua Qiao, Nan Zheng

Abstract: In this paper, we propose two efficient fully-discrete schemes for Q-tensor flow of liquid crystals by using the first- and second-order stabilized exponential scalar auxiliary variable (sESAV) approach in time and the finite difference method for spatial discretization. The modified discrete energy dissipation laws are unconditionally satisfied for both two constructed schemes. A particular featu… ▽ More In this paper, we propose two efficient fully-discrete schemes for Q-tensor flow of liquid crystals by using the first- and second-order stabilized exponential scalar auxiliary variable (sESAV) approach in time and the finite difference method for spatial discretization. The modified discrete energy dissipation laws are unconditionally satisfied for both two constructed schemes. A particular feature is that, for two-dimensional (2D) and a kind of three-dimensional (3D) Q-tensor flows, the unconditional maximum-bound-principle (MBP) preservation of the constructed first-order scheme is successfully established, and the proposed second-order scheme preserves the discrete MBP property with a mild restriction on the time-step sizes. Furthermore, we rigorously derive the corresponding error estimates for the fully-discrete second-order schemes by using the built-in stability results. Finally, various numerical examples validating the theoretical results, such as the orientation of liquid crystal in 2D and 3D, are presented for the constructed schemes. △ Less

Submitted 15 July, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

arXiv:2309.01717 [pdf, other]

Interdisciplinary Fairness in Imbalanced Research Proposal Topic Inference: A Hierarchical Transformer-based Method with Selective Interpolation

Authors: Meng Xiao, Min Wu, Ziyue Qiao, Yanjie Fu, Zhiyuan Ning, Yi Du, Yuanchun Zhou

Abstract: The objective of topic inference in research proposals aims to obtain the most suitable disciplinary division from the discipline system defined by a funding agency. The agency will subsequently find appropriate peer review experts from their database based on this division. Automated topic inference can reduce human errors caused by manual topic filling, bridge the knowledge gap between funding a… ▽ More The objective of topic inference in research proposals aims to obtain the most suitable disciplinary division from the discipline system defined by a funding agency. The agency will subsequently find appropriate peer review experts from their database based on this division. Automated topic inference can reduce human errors caused by manual topic filling, bridge the knowledge gap between funding agencies and project applicants, and improve system efficiency. Existing methods focus on modeling this as a hierarchical multi-label classification problem, using generative models to iteratively infer the most appropriate topic information. However, these methods overlook the gap in scale between interdisciplinary research proposals and non-interdisciplinary ones, leading to an unjust phenomenon where the automated inference system categorizes interdisciplinary proposals as non-interdisciplinary, causing unfairness during the expert assignment. How can we address this data imbalance issue under a complex discipline system and hence resolve this unfairness? In this paper, we implement a topic label inference system based on a Transformer encoder-decoder architecture. Furthermore, we utilize interpolation techniques to create a series of pseudo-interdisciplinary proposals from non-interdisciplinary ones during training based on non-parametric indicators such as cross-topic probabilities and topic occurrence probabilities. This approach aims to reduce the bias of the system during model training. Finally, we conduct extensive experiments on a real-world dataset to verify the effectiveness of the proposed method. The experimental results demonstrate that our training strategy can significantly mitigate the unfairness generated in the topic inference task. △ Less

Submitted 3 June, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

Comments: 21 pages, accepted by ACM Transactions on Knowledge Discovery from Data

arXiv:2308.11573 [pdf, other]

G3Reg: Pyramid Graph-based Global Registration using Gaussian Ellipsoid Model

Authors: Zhijian Qiao, Zehuan Yu, Binqian Jiang, Huan Yin, Shaojie Shen

Abstract: This study introduces a novel framework, G3Reg, for fast and robust global registration of LiDAR point clouds. In contrast to conventional complex keypoints and descriptors, we extract fundamental geometric primitives, including planes, clusters, and lines (PCL) from the raw point cloud to obtain low-level semantic segments. Each segment is represented as a unified Gaussian Ellipsoid Model (GEM),… ▽ More This study introduces a novel framework, G3Reg, for fast and robust global registration of LiDAR point clouds. In contrast to conventional complex keypoints and descriptors, we extract fundamental geometric primitives, including planes, clusters, and lines (PCL) from the raw point cloud to obtain low-level semantic segments. Each segment is represented as a unified Gaussian Ellipsoid Model (GEM), using a probability ellipsoid to ensure the ground truth centers are encompassed with a certain degree of probability. Utilizing these GEMs, we present a distrust-and-verify scheme based on a Pyramid Compatibility Graph for Global Registration (PAGOR). Specifically, we establish an upper bound, which can be traversed based on the confidence level for compatibility testing to construct the pyramid graph. Then, we solve multiple maximum cliques (MAC) for each level of the pyramid graph, thus generating the corresponding transformation candidates. In the verification phase, we adopt a precise and efficient metric for point cloud alignment quality, founded on geometric primitives, to identify the optimal candidate. The algorithm's performance is validated on three publicly available datasets and a self-collected multi-session dataset. Parameter settings remained unchanged during the experiment evaluations. The results exhibit superior robustness and real-time performance of the G3Reg framework compared to state-of-the-art methods. Furthermore, we demonstrate the potential for integrating individual GEM and PAGOR components into other registration frameworks to enhance their efficacy. Code: https://github.com/HKUST-Aerial-Robotics/G3Reg △ Less

Submitted 24 April, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

Comments: Accepted to 2024 IEEE Transactions on Automation Science and Engineering (IEEE TASE)

arXiv:2308.11508 [pdf, other]

Rogue peakon, well-posedness, ill-posedness and blow-up phenomenon for an integrable Camassa-Holm type equation

Authors: Mingxuan Zhu, Zhenteng Zeng, Zaihong Jiang, Baoqiang Xia, Zhijun Qiao

Abstract: In this paper, we study an integrable Camassa-Holm (CH) type equation with quadratic nonlinearity. The CH type equation is shown integrable through a Lax pair, and particularly the equation is found to possess a new kind of peaked soliton (peakon) solution - called {\sf rogue peakon}, that is given in a rational form with some logarithmic function, but not a regular traveling wave. We also provide… ▽ More In this paper, we study an integrable Camassa-Holm (CH) type equation with quadratic nonlinearity. The CH type equation is shown integrable through a Lax pair, and particularly the equation is found to possess a new kind of peaked soliton (peakon) solution - called {\sf rogue peakon}, that is given in a rational form with some logarithmic function, but not a regular traveling wave. We also provide multi-rogue peakon solutions. Furthermore, we discuss the local well-posedness of the solution in the Besov space $B_{p,r}^{s}$ with $1\leq p,r\leq\infty$, $s>\max \left\{1+1/p,3/2\right\}$ or $B_{2,1}^{3/2}$, and then prove the ill-posedness of the solution in $B_{2,\infty}^{3/2}$. Moreover, we establish the global existence and blow-up phenomenon of the solution, which is, if $m_0(x)=u_0-u_{0xx}\geq(\not\equiv) 0$, then the corresponding solution exists globally, meanwhile, if $m_0(x)\leq(\not\equiv) 0$, then the corresponding solution blows up in a finite time. △ Less

Submitted 23 August, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

Comments: 23 pages, 6 figures

MSC Class: 37K10; 35G25; 35L05

arXiv:2308.02256 [pdf, ps, other]

Disorder-Induced Phase Transitions in Three-Dimensional Chiral Second-Order Topological Insulator

Authors: Yedi Shen, Zeyu Li, Qian Niu, Zhenhua Qiao

Abstract: Topological insulators have been extended to higher-order versions that possess topological hinge or corner states in lower dimensions. However, their robustness against disorder is still unclear. Here, we theoretically investigate the phase transitions of three-dimensional (3D) chiral second-order topological insulator (SOTI) in the presence of disorders. Our results show that, by increasing diso… ▽ More Topological insulators have been extended to higher-order versions that possess topological hinge or corner states in lower dimensions. However, their robustness against disorder is still unclear. Here, we theoretically investigate the phase transitions of three-dimensional (3D) chiral second-order topological insulator (SOTI) in the presence of disorders. Our results show that, by increasing disorder strength, the nonzero densities of states of side surface and bulk emerge at critical disorder strengths of $W_{S}$ and $W_{B}$, respectively. The spectral function indicates that the bulk gap is only closed at one of the $R_{4z}\mathcal{T}$-invariant points, i.e., $Γ_{3}$. The closing of side surface gap or bulk gap is ascribed to the significant decrease of the elastic mean free time of quasi-particles. Because of the localization of side surface states, we find that the 3D chiral SOTI is robust at an averaged quantized conductance of $2e^{2}/h$ with disorder strength up to $W_{B}$. When the disorder strength is beyond $W_{B}$, the 3D chiral SOTI is then successively driven into two phases, i.e., diffusive metallic phase and Anderson insulating phase. Furthermore, an averaged conductance plateau of $e^{2}/h$ emerges in the diffusive metallic phase. △ Less

Submitted 5 September, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

arXiv:2307.12116 [pdf, other]

Pyramid Semantic Graph-based Global Point Cloud Registration with Low Overlap

Authors: Zhijian Qiao, Zehuan Yu, Huan Yin, Shaojie Shen

Abstract: Global point cloud registration is essential in many robotics tasks like loop closing and relocalization. Unfortunately, the registration often suffers from the low overlap between point clouds, a frequent occurrence in practical applications due to occlusion and viewpoint change. In this paper, we propose a graph-theoretic framework to address the problem of global point cloud registration with l… ▽ More Global point cloud registration is essential in many robotics tasks like loop closing and relocalization. Unfortunately, the registration often suffers from the low overlap between point clouds, a frequent occurrence in practical applications due to occlusion and viewpoint change. In this paper, we propose a graph-theoretic framework to address the problem of global point cloud registration with low overlap. To this end, we construct a consistency graph to facilitate robust data association and employ graduated non-convexity (GNC) for reliable pose estimation, following the state-of-the-art (SoTA) methods. Unlike previous approaches, we use semantic cues to scale down the dense point clouds, thus reducing the problem size. Moreover, we address the ambiguity arising from the consistency threshold by constructing a pyramid graph with multi-level consistency thresholds. Then we propose a cascaded gradient ascend method to solve the resulting densest clique problem and obtain multiple pose candidates for every consistency threshold. Finally, fast geometric verification is employed to select the optimal estimation from multiple pose candidates. Our experiments, conducted on a self-collected indoor dataset and the public KITTI dataset, demonstrate that our method achieves the highest success rate despite the low overlap of point clouds and low semantic quality. We have open-sourced our code https://github.com/HKUST-Aerial-Robotics/Pagor for this project. △ Less

Submitted 22 July, 2023; originally announced July 2023.

Comments: Accepted by IROS2023

arXiv:2307.11653 [pdf, other]

Online Monocular Lane Mapping Using Catmull-Rom Spline

Authors: Zhijian Qiao, Zehuan Yu, Huan Yin, Shaojie Shen

Abstract: In this study, we introduce an online monocular lane mapping approach that solely relies on a single camera and odometry for generating spline-based maps. Our proposed technique models the lane association process as an assignment issue utilizing a bipartite graph, and assigns weights to the edges by incorporating Chamfer distance, pose uncertainty, and lateral sequence consistency. Furthermore, w… ▽ More In this study, we introduce an online monocular lane mapping approach that solely relies on a single camera and odometry for generating spline-based maps. Our proposed technique models the lane association process as an assignment issue utilizing a bipartite graph, and assigns weights to the edges by incorporating Chamfer distance, pose uncertainty, and lateral sequence consistency. Furthermore, we meticulously design control point initialization, spline parameterization, and optimization to progressively create, expand, and refine splines. In contrast to prior research that assessed performance using self-constructed datasets, our experiments are conducted on the openly accessible OpenLane dataset. The experimental outcomes reveal that our suggested approach enhances lane association and odometry precision, as well as overall lane map quality. We have open-sourced our code1 for this project. △ Less

Submitted 21 July, 2023; originally announced July 2023.

Comments: Accepted by IROS2023

arXiv:2307.07344 [pdf, other]

Inverse Evolution Layers: Physics-informed Regularizers for Deep Neural Networks

Authors: Chaoyu Liu, Zhonghua Qiao, Chao Li, Carola-Bibiane Schönlieb

Abstract: Traditional image processing methods employing partial differential equations (PDEs) offer a multitude of meaningful regularizers, along with valuable theoretical foundations for a wide range of image-related tasks. This makes their integration into neural networks a promising avenue. In this paper, we introduce a novel regularization approach inspired by the reverse process of PDE-based evolution… ▽ More Traditional image processing methods employing partial differential equations (PDEs) offer a multitude of meaningful regularizers, along with valuable theoretical foundations for a wide range of image-related tasks. This makes their integration into neural networks a promising avenue. In this paper, we introduce a novel regularization approach inspired by the reverse process of PDE-based evolution models. Specifically, we propose inverse evolution layers (IELs), which serve as bad property amplifiers to penalize neural networks of which outputs have undesired characteristics. Using IELs, one can achieve specific regularization objectives and endow neural networks' outputs with corresponding properties of the PDE models. Our experiments, focusing on semantic segmentation tasks using heat-diffusion IELs, demonstrate their effectiveness in mitigating noisy label effects. Additionally, we develop curve-motion IELs to enforce convex shape regularization in neural network-based segmentation models for preventing the generation of concave outputs. Theoretical analysis confirms the efficacy of IELs as an effective regularization mechanism, particularly in handling training with label issues. △ Less

Submitted 1 July, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

arXiv:2307.07145 [pdf, ps, other]

Linear displacement current solely driven by the quantum metric

Authors: Longjun Xiang, Bin Wang, Yadong Wei, Zhenhua Qiao, Jian Wang

Abstract: Quantum metric and Berry curvature are the real part and imaginary part of the quantum geometric tensor, respectively. The T-odd (T: time-reversal) nonlinear Hall effect driven by the quantum metric dipole, recently confirmed in Science 381, 181 (2023) and Nature 621, 487 (2023), established the geometric duality to the T-even nonlinear Hall effect that driven by the Berry curvature dipole. Intere… ▽ More Quantum metric and Berry curvature are the real part and imaginary part of the quantum geometric tensor, respectively. The T-odd (T: time-reversal) nonlinear Hall effect driven by the quantum metric dipole, recently confirmed in Science 381, 181 (2023) and Nature 621, 487 (2023), established the geometric duality to the T-even nonlinear Hall effect that driven by the Berry curvature dipole. Interestingly, a similar geometric duality between the quantum metric and the Berry curvature, particularly for the linear response of Bloch electrons, has not been established, although the T-odd linear intrinsic anomalous Hall effect (IAHE) solely driven by the Berry curvature has been known for a long time. Herein, we develop the quantum theory for displacement current under an AC electric field. Particularly, we show that the T-even component of the linear displacement current conductivity (LDCC) is solely determined by the quantum metric, by both the response theory and the semiclassical theory. Notably, with symmetry analysis we find that the T-even LDCC can contribute a Hall current in T-invariant systems but with low symmetry, while its longitudinal component is immune to symmetry. Furthermore, employing the Dirac Hamiltonian, we arrive at a $1/μ$ ($μ$: chemical potential) experimental observable enhancement of the displacement current owing to the divergent behavior of quantum metric near Dirac point, similar to the IAHE at Weyl point. Our work reveals the band geometric origin of the linear displacement current and establishes, together with the IAHE, the geometric duality for the linear response of Bloch electrons. Additionally, our work offers the very first intrinsic Hall effect in T-invariant materials, which can not be envisioned in DC transport in both linear and nonlinear regimes. △ Less

Submitted 6 February, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

Comments: 3 figures

arXiv:2307.07126 [pdf, other]

Multi-Session, Localization-oriented and Lightweight LiDAR Mapping Using Semantic Lines and Planes

Authors: Zehuan Yu, Zhijian Qiao, Liuyang Qiu, Huan Yin, Shaojie Shen

Abstract: In this paper, we present a centralized framework for multi-session LiDAR mapping in urban environments, by utilizing lightweight line and plane map representations instead of widely used point clouds. The proposed framework achieves consistent mapping in a coarse-to-fine manner. Global place recognition is achieved by associating lines and planes on the Grassmannian manifold, followed by an outli… ▽ More In this paper, we present a centralized framework for multi-session LiDAR mapping in urban environments, by utilizing lightweight line and plane map representations instead of widely used point clouds. The proposed framework achieves consistent mapping in a coarse-to-fine manner. Global place recognition is achieved by associating lines and planes on the Grassmannian manifold, followed by an outlier rejection-aided pose graph optimization for map merging. Then a novel bundle adjustment is also designed to improve the local consistency of lines and planes. In the experimental section, both public and self-collected datasets are used to demonstrate efficiency and effectiveness. Extensive results validate that our LiDAR mapping framework could merge multi-session maps globally, optimize maps incrementally, and is applicable for lightweight robot localization. △ Less

Submitted 13 July, 2023; originally announced July 2023.

Comments: Accepted by IROS2023

arXiv:2307.02861 [pdf]

doi 10.1364/OE.499781

High-speed 4 ${\times}$ 4 silicon photonic electro-optic switch, operating at the 2 μm waveband

Authors: Jiawei Wang, Jia Xu Brian Sia, Xiang Li, Xin Guo, Wanjun Wang, Zhongliang Qiao, Callum G. Littlejohns. Chongyang Liu, Graham T. Reed, Rusli, Hong Wang

Abstract: The escalating need for expansive data bandwidth, and the resulting capacity constraints of the single mode fiber (SMF) have positioned the 2-$μ$m waveband as a prospective window for emerging applications in optical communication. This has initiated an ecosystem of silicon photonic components in the region driven by CMOS compatibility, low cost, high efficiency and potential for large-scale integ… ▽ More The escalating need for expansive data bandwidth, and the resulting capacity constraints of the single mode fiber (SMF) have positioned the 2-$μ$m waveband as a prospective window for emerging applications in optical communication. This has initiated an ecosystem of silicon photonic components in the region driven by CMOS compatibility, low cost, high efficiency and potential for large-scale integration. In this study, we demonstrate a plasma dispersive, 4 ${\times}$ 4 electro-optic switch operating at the 2-$μ$m waveband with the shortest switching times. The demonstrated switch operates across a 45-nm bandwidth, with 10-90% rise and 90-10% fall time of 1.78 ns and 3.02 ns respectively. In a 4 ${\times}$ 4 implementation, crosstalk below -15 dB and power consumption below 19.15 mW across all 16 ports are indicated. The result brings high-speed optical switching to the portfolio of devices at the promising waveband. △ Less

Submitted 6 July, 2023; originally announced July 2023.

Journal ref: Opt. Express 31(20), 33548-33564 (2023)

arXiv:2307.01042 [pdf]

doi 10.1038/s41467-023-39500-7

A unique van Hove singularity in kagome superconductor CsV$_{3-x}$Ta$_x$Sb$_5$ with enhanced superconductivity

Authors: Yang Luo, Yulei Han, Jinjin Liu, Hui Chen, Zihao Huang, Linwei Huai, Hongyu Li, Bingqian Wang, Jianchang Shen, Shuhan Ding, Zeyu Li, Shuting Peng, Zhiyuan Wei, Yu Miao, Xiupeng Sun, Zhipeng Ou, Ziji Xiang, Makoto Hashimoto, Donghui Lu, Yugui Yao, Haitao Yang, Xianhui Chen, Hong-Jun Gao, Zhenhua Qiao, Zhiwei Wang , et al. (1 additional authors not shown)

Abstract: Van Hove singularity (VHS) has been considered as a driving source for unconventional superconductivity. A VHS in two-dimensional (2D) materials consists of a saddle point connecting electron-like and hole-like bands. In a rare case, when a VHS appears at Fermi level, both electron-like and hole-like conduction can coexist, giving rise to an enhanced density of states as well as an attractive comp… ▽ More Van Hove singularity (VHS) has been considered as a driving source for unconventional superconductivity. A VHS in two-dimensional (2D) materials consists of a saddle point connecting electron-like and hole-like bands. In a rare case, when a VHS appears at Fermi level, both electron-like and hole-like conduction can coexist, giving rise to an enhanced density of states as well as an attractive component of Coulomb interaction for unconventional electronic pairing. However, this van Hove scenario is often destroyed by an incorrect chemical potential or competing instabilities. Here, by using angle-resolved photoemission measurements, we report the observation of a VHS perfectly aligned with the Fermi level in a kagome superconductor CsV$_{3-x}$Ta$_x$Sb$_5$ (x~0.4), in which a record-high superconducting transition temperature is achieved among all the current variants of AV$_3$Sb$_5$ (A=Cs, Rb, K) at ambient pressure. Doping dependent measurements reveal the important role of van Hove scenario in boosting superconductivity, and spectroscopic-imaging scanning tunneling microscopy measurements indicate a distinct superconducting state in this system. △ Less

Submitted 3 July, 2023; originally announced July 2023.

Comments: 20 pages, 4 figures

Journal ref: Nature Communications 14, 3819 (2023)

arXiv:2306.02998 [pdf, other]

doi 10.1103/PhysRevLett.131.256901

Berry-Curvature Engineering for Nonreciprocal Directional Dichroism in Two-Dimensional Antiferromagnets

Authors: Wenhao Liang, Junjie Zeng, Zhenhua Qiao, Yang Gao, Qian Niu

Abstract: In two-dimensional antiferromagnets, we identify the mixed Berry curvature as the geometrical origin of the nonreciprocal directional dichroism (NDD), which refers to the difference in light absorption with the propagation direction flipped. Such a Berry curvature is strongly tied to the uniaxial strain in accordance with the symmetry constraint, leading to a highly tunable NDD, whose sign and mag… ▽ More In two-dimensional antiferromagnets, we identify the mixed Berry curvature as the geometrical origin of the nonreciprocal directional dichroism (NDD), which refers to the difference in light absorption with the propagation direction flipped. Such a Berry curvature is strongly tied to the uniaxial strain in accordance with the symmetry constraint, leading to a highly tunable NDD, whose sign and magnitude can be manipulated via the strain direction. As a concrete example, we demonstrate such a phenomenon in a lattice model of MnBi2Te4. The coupling between the mixed Berry curvature and strain also suggests the magnetic quadrupole of the Bloch wave packet as the macroscopic order parameter probed by the NDD in two dimensions, distinct from the multiferroic order P times M or the spin toroidal and quadrupole order within a unit cell in previous studies. Our work paves the way of the Berry-curvature engineering for optical nonreciprocity in two-dimensional antiferromagnets. △ Less

Submitted 5 June, 2023; originally announced June 2023.

Journal ref: Phys. Rev. Lett. 131, 256901 (2023)

arXiv:2305.16172 [pdf, other]

Masked and Permuted Implicit Context Learning for Scene Text Recognition

Authors: Xiaomeng Yang, Zhi Qiao, Jin Wei, Dongbao Yang, Yu Zhou

Abstract: Scene Text Recognition (STR) is difficult because of the variations in text styles, shapes, and backgrounds. Though the integration of linguistic information enhances models' performance, existing methods based on either permuted language modeling (PLM) or masked language modeling (MLM) have their pitfalls. PLM's autoregressive decoding lacks foresight into subsequent characters, while MLM overloo… ▽ More Scene Text Recognition (STR) is difficult because of the variations in text styles, shapes, and backgrounds. Though the integration of linguistic information enhances models' performance, existing methods based on either permuted language modeling (PLM) or masked language modeling (MLM) have their pitfalls. PLM's autoregressive decoding lacks foresight into subsequent characters, while MLM overlooks inter-character dependencies. Addressing these problems, we propose a masked and permuted implicit context learning network for STR, which unifies PLM and MLM within a single decoder, inheriting the advantages of both approaches. We utilize the training procedure of PLM, and to integrate MLM, we incorporate word length information into the decoding process and replace the undetermined characters with mask tokens. Besides, perturbation training is employed to train a more robust model against potential length prediction errors. Our empirical evaluations demonstrate the performance of our model. It not only achieves superior performance on the common benchmarks but also achieves a substantial improvement of $9.1\%$ on the more challenging Union14M-Benchmark. △ Less

Submitted 20 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

arXiv:2305.09540 [pdf, other]

Sensing orbital hybridization of graphene-diamond interface with a single spin

Authors: Yucheng Hao, Zhiping Yang, Zeyu Li, Xi Kong, Wenna Tang, Tianyu Xie, Shaoyi Xu, Xiangyu Ye, Pei Yu, Pengfei Wang, Ya Wang, Zhenhua Qiao, Libo Gao, Jian-Hua Jiang, Fazhan Shi, Jiangfeng Du

Abstract: Interfacial interactions are crucial in a variety of fields and can greatly affect the electric, magnetic, and chemical properties of materials. Among them, interface orbital hybridization plays a fundamental role in the properties of surface electrons such as dispersion, interaction, and ground states. Conventional measurements of electronic states at interfaces such as scanning tunneling microsc… ▽ More Interfacial interactions are crucial in a variety of fields and can greatly affect the electric, magnetic, and chemical properties of materials. Among them, interface orbital hybridization plays a fundamental role in the properties of surface electrons such as dispersion, interaction, and ground states. Conventional measurements of electronic states at interfaces such as scanning tunneling microscopes are all based on electric interactions which, however, suffer from strong perturbation on these electrons. Here we unveil a new experimental detection of interface electrons based on the weak magnetic interactions between them and the nitrogen-vacancy (NV) center in diamond. With negligible perturbation on the interface electrons, their physical properties can be revealed by the NV spin coherence time. In our system, the interface interaction leads to significant decreases in both the density and coherence time of the electron spins at the diamond-graphene interface. Furthermore, together with electron spin resonance spectra and first-principle calculations, we can retrieve the effect of interface electron orbital hybridization. Our study opens a new pathway toward the microscopic probing of interfacial electronic states with weak magnetic interactions and provides a new avenue for future research on material interfaces. △ Less

Submitted 16 May, 2023; originally announced May 2023.

arXiv:2304.12604 [pdf, other]

Adaptive Path-Memory Network for Temporal Knowledge Graph Reasoning

Authors: Hao Dong, Zhiyuan Ning, Pengyang Wang, Ziyue Qiao, Pengfei Wang, Yuanchun Zhou, Yanjie Fu

Abstract: Temporal knowledge graph (TKG) reasoning aims to predict the future missing facts based on historical information and has gained increasing research interest recently. Lots of works have been made to model the historical structural and temporal characteristics for the reasoning task. Most existing works model the graph structure mainly depending on entity representation. However, the magnitude of… ▽ More Temporal knowledge graph (TKG) reasoning aims to predict the future missing facts based on historical information and has gained increasing research interest recently. Lots of works have been made to model the historical structural and temporal characteristics for the reasoning task. Most existing works model the graph structure mainly depending on entity representation. However, the magnitude of TKG entities in real-world scenarios is considerable, and an increasing number of new entities will arise as time goes on. Therefore, we propose a novel architecture modeling with relation feature of TKG, namely aDAptivE path-MemOry Network (DaeMon), which adaptively models the temporal path information between query subject and each object candidate across history time. It models the historical information without depending on entity representation. Specifically, DaeMon uses path memory to record the temporal path information derived from path aggregation unit across timeline considering the memory passing strategy between adjacent timestamps. Extensive experiments conducted on four real-world TKG datasets demonstrate that our proposed model obtains substantial performance improvement and outperforms the state-of-the-art up to 4.8% absolute in MRR. △ Less

Submitted 25 April, 2023; originally announced April 2023.

Comments: Accepted to IJCAI 2023

Showing 1–50 of 271 results for author: Qiao, Z