Exact local distribution of the absolutely continuous spectral measure

Authors: Xianzhe Li, Jiangong You, Qi Zhou

Abstract: It is well-established that the spectral measure for one-frequency Schrödinger operators with Diophantine frequencies exhibits optimal $1/2$-Hölder continuity within the absolutely continuous spectrum. This study extends these findings by precisely characterizing the local distribution of the spectral measure for dense small potentials, including a notable result for any subcritical almost Mathieu operators. Additionally, we investigate the stratified Hölder continuity of the spectral measure at subcritical energies.

Submitted 12 July, 2024; originally announced July 2024.

Comments: 49 pages

arXiv:2407.08353 [pdf]

One-dimensional flat bands in phosphorene nanoribbons with pentagonal nature

Authors: Shuo Sun, Jing-Yang You, Zhihao Cai, Jie Su, Tong Yang, Xinnan Peng, Yihe Wang, Daiyu Geng, Jian Gou, Yuli Huang, Sisheng Duan, Lan Chen, Kehui Wu, Andrew T. S. Wee, Yuan Ping Feng, Jia Lin Zhang, Jiong Lu, Baojie Feng, Wei Chen

Abstract: Materials with topological flat bands can serve as a promising platform to investigate strongly interacting phenomena. However, experimental realization of ideal flat bands is mostly limited to artificial lattices or moiré systems. Here we report a general way to construct one-dimensional (1D) flat bands in phosphorene nanoribbons (PNRs) with pentagonal nature: penta-hexa-PNRs and penta-dodeca-PNRs, wherein the corresponding flat bands are directly verified by using angle-resolved photoemission spectroscopy. We confirm that the observed 1D flat bands originate from the electronic 1D sawtooth and Lieb lattices, respectively, as revealed by the combination of bond-resolved scanning tunneling microscopy, scanning tunneling spectroscopy, tight-binding models, and first-principles calculations. Our study demonstrates a general way to construct 1D flat bands in 1D solid material systems, which provides a robust platform to explore strongly interacting phases of matter.

Submitted 11 July, 2024; originally announced July 2024.

Comments: 13 pages, 4 figures

arXiv:2407.07715 [pdf, other]

Multi-User Localization and Tracking with Spatiotemporal Correlation in Multi-RIS-Assisted Systems

Authors: Ronghua Peng, Peng Gao, Jing You, Lixiang Lian

Abstract: As a promising technique, reconfigurable intelligent surfaces (RISs) exhibit tremendous potential for high-accuracy positioning. In this paper, we investigate the multi-user localization and tracking problem in multi-RIS-assisted systems. In particular, we incorporate the statistical spatiotemporal correlation of multi-user locations and develop a general spatiotemporal Markov random field model (ST-+MRF) to capture multi-user dynamic motion states. To achieve superior performance, a novel multi-user tracking algorithm is proposed based on Bayesian inference to effectively utilize the correlation among users. Besides that, considering the necessity of RIS configuration for tracking performance, we further propose a predictive RIS beamforming optimization scheme via semidefinite relaxation (SDR). Finally, in comparison with other pioneering work, we confirm that the proposed strategy, which alternates between the tracking algorithm and RIS optimization, can achieve significant performance gains over benchmark schemes.

Submitted 14 June, 2024; originally announced July 2024.

arXiv:2407.05490 [pdf, ps, other]

Structured quantitative almost reducibility and its applications

Authors: Lingrui Ge, Jiangong You, Qi Zhou

Abstract: We establish \textit{structured quantitative almost reducibility}, tailored for analytic quasiperiodic $SL(2,\mathbb{R})$-cocycles, which effectively addresses the challenge of infinitely many \textit{normal frequency} resonances. This method paves the way for optimal arithmetic reducibility results for such cocycles, thereby resolving Jitomirskaya's conjecture. From a spectral perspective, it leads to optimal arithmetic Anderson localization for a class of quasiperiodic long-range operators on higher-dimensional lattices. In particular, using structured quantitative almost reducibility, we establish a sharp quantitative version of Aubry duality, enabling us to uncover new spectral insights for almost Mathieu operators with Diophantine frequencies. For example, we precisely determine the exponential decay rate of spectral gaps in non-critical cases, thus addressing a question raised by Goldstein. Additionally, we reveal the optimal asymptotic growth of extended eigenfunctions for subcritical almost Mathieu operators.

Submitted 7 July, 2024; originally announced July 2024.

Comments: 55 pages

arXiv:2407.02485 [pdf, other]

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

Authors: Yue Yu, Wei Ping, Zihan Liu, Boxin Wang, Jiaxuan You, Chao Zhang, Mohammad Shoeybi, Bryan Catanzaro

Abstract: Large language models (LLMs) typically utilize the top-k contexts from a retriever in retrieval-augmented generation (RAG). In this work, we propose a novel instruction fine-tuning framework RankRAG, which instruction-tunes a single LLM for the dual purpose of context ranking and answer generation in RAG. In particular, the instruction-tuned LLMs work surprisingly well when a small fraction of ranking data is added to the training blend, and outperform existing expert ranking models, including the same LLM exclusively fine-tuned on a large amount of ranking data. For generation, we compare our model with many strong baselines, including GPT-4-0613, GPT-4-turbo-2024-0409, and ChatQA-1.5, an open-sourced model with state-of-the-art performance on RAG benchmarks. Specifically, our Llama3-RankRAG significantly outperforms Llama3-ChatQA-1.5 and GPT-4 models on nine knowledge-intensive benchmarks. In addition, it also performs comparably to GPT-4 on five RAG benchmarks in the biomedical domain without instruction fine-tuning on biomedical data, demonstrating its superb capability for generalization to new domains.

Submitted 2 July, 2024; originally announced July 2024.

arXiv:2406.13871 [pdf, other]

Robust Time Series Forecasting with Non-Heavy-Tailed Gaussian Loss-Weighted Sampler

Authors: Jiang You, Arben Cela, René Natowicz, Jacob Ouanounou, Patrick Siarry

Abstract: Forecasting multivariate time series is a computationally intensive task challenged by extreme or redundant samples. Recent resampling methods aim to increase training efficiency by reweighting samples based on their running losses. However, these methods do not solve the problems caused by heavy-tailed loss distributions, such as overfitting to outliers. To tackle these issues, we introduce a novel approach: a Gaussian loss-weighted sampler that multiplies the samples' running losses by a Gaussian distribution weight. It reduces the probability of selecting samples with very low or very high losses while favoring those close to average losses. As it creates a weighted loss distribution that is theoretically not heavy-tailed, there are several advantages to highlight compared to existing methods: 1) it relieves the inefficiency in learning redundant easy samples and overfitting to outliers, 2) it improves training efficiency by preferentially learning samples close to the average loss. Applications to real-world time series forecasting datasets demonstrate improvements in prediction quality of 1%-4% using mean square error measurements in channel-independent settings. The code will be available online after the review.

Submitted 19 June, 2024; originally announced June 2024.

Comments: 8 pages
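
The abstract above describes the sampler only in words, so the following is a minimal sketch of one way such a weighting could look, not the authors' implementation. It assumes that the Gaussian weight is centered on the mean of the running losses with their variance as its spread, and that losses are non-negative; the function names `gaussian_loss_weights` and `sample_indices` are hypothetical.

```python
import math
import random


def gaussian_loss_weights(losses):
    """Return sampling probabilities proportional to loss * Gaussian(loss).

    Assumption (not stated in the abstract): the Gaussian is centered on the
    mean of the running losses and uses their population variance.
    """
    n = len(losses)
    mean = sum(losses) / n
    var = sum((loss - mean) ** 2 for loss in losses) / n
    if var == 0.0:
        # All losses identical: fall back to uniform sampling.
        return [1.0 / n] * n
    weights = [
        loss * math.exp(-((loss - mean) ** 2) / (2.0 * var)) for loss in losses
    ]
    total = sum(weights)
    if total == 0.0:
        # Degenerate case (e.g. every weight underflows): uniform sampling.
        return [1.0 / n] * n
    return [w / total for w in weights]


def sample_indices(losses, k, seed=0):
    """Draw k sample indices (with replacement) using the weights above."""
    rng = random.Random(seed)
    probs = gaussian_loss_weights(losses)
    return rng.choices(range(len(losses)), weights=probs, k=k)


# Hypothetical running losses: one very easy sample, one hard sample,
# and three samples near the average.
losses = [0.1, 0.9, 1.0, 1.1, 1.9]
probs = gaussian_loss_weights(losses)
batch = sample_indices(losses, k=3)
```

With these illustrative losses, the samples near the average receive more probability mass than either extreme. With a single very large outlier the data-driven variance grows, so a practical variant might fix or clip the spread; the abstract does not say how this is handled.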

arXiv:2406.11941 [pdf, other]

Crossfusor: A Cross-Attention Transformer Enhanced Conditional Diffusion Model for Car-Following Trajectory Prediction

Authors: Junwei You, Haotian Shi, Keshu Wu, Keke Long, Sicheng Fu, Sikai Chen, Bin Ran

Abstract: Vehicle trajectory prediction is crucial for advancing autonomous driving and advanced driver assistance systems (ADAS), enhancing road safety and traffic efficiency. While traditional methods have laid foundational work, modern deep learning techniques, particularly transformer-based models and generative approaches, have significantly improved prediction accuracy by capturing complex and non-linear patterns in vehicle motion and traffic interactions. However, these models often overlook the detailed car-following behaviors and inter-vehicle interactions essential for real-world driving scenarios. This study introduces a Cross-Attention Transformer Enhanced Conditional Diffusion Model (Crossfusor) specifically designed for car-following trajectory prediction. Crossfusor integrates detailed inter-vehicular interactions and car-following dynamics into a robust diffusion framework, improving both the accuracy and realism of predicted trajectories. The model leverages a novel temporal feature encoding framework combining GRU, location-based attention mechanisms, and Fourier embedding to capture historical vehicle dynamics. It employs noise scaled by these encoded historical features in the forward diffusion process, and uses a cross-attention transformer to model intricate inter-vehicle dependencies in the reverse denoising process. Experimental results on the NGSIM dataset demonstrate that Crossfusor outperforms state-of-the-art models, particularly in long-term predictions, showcasing its potential for enhancing the predictive capabilities of autonomous driving systems.

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.11704 [pdf, other]

Nemotron-4 340B Technical Report

Authors: Nvidia, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek, et al. (58 additional authors not shown)

Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and their outputs. These models perform competitively with open access models on a wide range of evaluation benchmarks, and were sized to fit on a single DGX H100 with 8 GPUs when deployed in FP8 precision. We believe that the community can benefit from these models in various research studies and commercial applications, especially for generating synthetic data to train smaller language models. Notably, over 98% of data used in our model alignment process is synthetically generated, showcasing the effectiveness of these models in generating synthetic data. To further support open research and facilitate model development, we are also open-sourcing the synthetic data generation pipeline used in our model alignment process.

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.11291 [pdf, ps, other]

Simulation of chiral motion of excitation within the ground-state manifolds of neutral atoms

Authors: Hao-Yuan Tang, Xiao-Xuan Li, Jia-Bin You, Xiao-Qiang Shao

Abstract: Laser-induced gauge fields in neutral atoms serve as a means of mimicking the effects of a magnetic field, providing researchers with a platform to explore behaviors analogous to those observed in condensed matter systems under real magnetic fields. Here, we propose a method to generate chiral motion in atomic excitations within the neutral atomic ground-state manifolds. This is achieved through the application of polychromatic driving fields coupled to the ground-Rydberg transition, along with unconventional Rydberg pumping. The scheme offers the advantage of arbitrary adjustment of the effective magnetic flux by setting the relative phases between different external laser fields. Additionally, the effective interaction strength between the atomic ground states can be maintained at 10 kHz, surpassing the capabilities of the previous approach utilizing Floquet modulation. Notably, the proposed method can be readily extended to implement a hexagonal neutral atom lattice, serving as the fundamental unit in realizing the Haldane model.

Submitted 17 June, 2024; originally announced June 2024.

Comments: 9 pages, 7 figures, revised manuscript submitted to APL Quantum

arXiv:2406.11092 [pdf, other]

Guaranteed Sampling Flexibility for Low-tubal-rank Tensor Completion

Authors: Bowen Su, Juntao You, HanQin Cai, Longxiu Huang

Abstract: While Bernoulli sampling is extensively studied in tensor completion, t-CUR sampling approximates low-tubal-rank tensors via lateral and horizontal subtensors. However, both methods lack sufficient flexibility for diverse practical applications. To address this, we introduce Tensor Cross-Concentrated Sampling (t-CCS), a novel and straightforward sampling model that advances the matrix cross-concentrated sampling concept within a tensor framework. t-CCS effectively bridges the gap between Bernoulli and t-CUR sampling, offering additional flexibility that can lead to computational savings in various contexts. A key aspect of our work is the comprehensive theoretical analysis provided. We establish a sufficient condition for the successful recovery of a low-rank tensor from its t-CCS samples. In support of this, we also develop a theoretical framework validating the feasibility of t-CUR via uniform random sampling and conduct a detailed theoretical sampling complexity analysis for tensor completion problems utilizing the general Bernoulli sampling model. Moreover, we introduce an efficient non-convex algorithm, the Iterative t-CUR Tensor Completion (ITCURTC) algorithm, specifically designed to tackle the t-CCS-based tensor completion. We have intensively tested and validated the effectiveness of the t-CCS model and the ITCURTC algorithm across both synthetic and real-world datasets.

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2406.10615 [pdf, other]

Leveraging Locality to Boost Sample Efficiency in Robotic Manipulation

Authors: Tong Zhang, Yingdong Hu, Jiacheng You, Yang Gao

Abstract: Given the high cost of collecting robotic data in the real world, sample efficiency is a consistently compelling pursuit in robotics. In this paper, we introduce SGRv2, an imitation learning framework that enhances sample efficiency through improved visual and action representations. Central to the design of SGRv2 is the incorporation of a critical inductive bias, action locality, which posits that a robot's actions are predominantly influenced by the target object and its interactions with the local environment. Extensive experiments in both simulated and real-world settings demonstrate that action locality is essential for boosting sample efficiency. SGRv2 excels in RLBench tasks with keyframe control using merely 5 demonstrations and surpasses the RVT baseline in 23 of 26 tasks. Furthermore, when evaluated on ManiSkill2 and MimicGen using dense control, SGRv2's success rate is 2.54 times that of SGR. In real-world environments, with only eight demonstrations, SGRv2 can perform a variety of tasks at a markedly higher success rate compared to baseline models. Project website: http://sgrv2-robot.github.io

Submitted 15 June, 2024; originally announced June 2024.

Comments: Project website: http://sgrv2-robot.github.io

arXiv:2406.07409 [pdf, other]

Accelerating Ill-conditioned Hankel Matrix Recovery via Structured Newton-like Descent

Authors: HanQin Cai, Longxiu Huang, Xiliang Lu, Juntao You

Abstract: This paper studies the robust Hankel recovery problem, which simultaneously removes the sparse outliers and fills in missing entries from the partial observation. We propose a novel non-convex algorithm, coined Hankel Structured Newton-Like Descent (HSNLD), to tackle the robust Hankel recovery problem. HSNLD is highly efficient with linear convergence, and its convergence rate is independent of the condition number of the underlying Hankel matrix. The recovery guarantee has been established under some mild conditions. Numerical experiments on both synthetic and real datasets show the superior performance of HSNLD against state-of-the-art algorithms.

Submitted 11 June, 2024; originally announced June 2024.

MSC Class: 15A29; 15A83; 47B35; 90C17; 90C26; 90C53

arXiv:2406.07068 [pdf]

Emergent Moiré fringes in direct-grown quasicrystal

Authors: Jingwei Li, Kejie Bao, Honglin Sun, Xingxu Yan, Ting Huang, Qicheng Zhang, Yaoqiang Zhou, Zhenjing Liu, Paul Masih Das, Jiawen You, Jiong Zhao, Jianbin Xu, Xiaoqing Pan, Yongli Mi, Junyi Zhu, Zhaoli Gao

Abstract: Quasicrystals represent a category of rarely structured solids that challenge traditional periodicity in crystal materials. Recent advancements in the synthesis of two-dimensional (2D) van der Waals materials have paved the way for exploring the unique physical properties of these systems. Here, we report on the synthesis of 2D quasicrystals featuring 30° alternating twist angles between multiple graphene layers, using chemical vapor deposition (CVD). Strikingly, we observed periodic Moiré patterns in the quasicrystal, a finding that has not been previously reported in traditional alloy-based quasicrystals. The Moiré periodicity, varying with the parity of the constituent layers, aligns with the theoretical predictions that suggest a stress cancellation mechanism in force. The emergence of Moiré fringes is attributed to the spontaneous mismatched lattice constant in the oriented graphene layers, proving the existence of atomic relaxation. This phenomenon, which has been largely understudied in graphene systems with large twist angles, has now been validated through our use of scanning transmission electron microscopy (STEM). Our CVD-grown Moiré quasicrystal provides an ideal platform for exploring the unusual physical properties that arise from Moiré periodicity within quasicrystals.

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2406.01882 [pdf, other]

HoneyGPT: Breaking the Trilemma in Terminal Honeypots with Large Language Model

Authors: Ziyang Wang, Jianzhou You, Haining Wang, Tianwei Yuan, Shichao Lv, Yang Wang, Limin Sun

Abstract: Honeypots, as a strategic cyber-deception mechanism designed to emulate authentic interactions and bait unauthorized entities, continue to struggle with balancing flexibility, interaction depth, and deceptive capability despite their evolution over decades. Often they also lack the capability of proactively adapting to an attacker's evolving tactics, which restricts the depth of engagement and subsequent information gathering. In this context, the emergent capabilities of large language models, in tandem with pioneering prompt-based engineering techniques, offer a transformative shift in the design and deployment of honeypot technologies. In this paper, we introduce HoneyGPT, a pioneering honeypot architecture based on ChatGPT, heralding a new era of intelligent honeypot solutions characterized by their cost-effectiveness, high adaptability, and enhanced interactivity, coupled with a predisposition for proactive attacker engagement. Furthermore, we present a structured prompt engineering framework that augments long-term interaction memory and robust security analytics. This framework, integrating chain-of-thought tactics attuned to honeypot contexts, enhances interactivity and deception, deepens security analytics, and ensures sustained engagement. The evaluation of HoneyGPT includes two parts: a baseline comparison based on a collected dataset and a field evaluation in real scenarios for four weeks. The baseline comparison demonstrates HoneyGPT's remarkable ability to strike a balance among flexibility, interaction depth, and deceptive capability. The field evaluation further validates HoneyGPT's efficacy, showing its marked superiority in enticing attackers into more profound interactive engagements and capturing a wider array of novel attack vectors in comparison to existing honeypot technologies.

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2406.00715 [pdf, other]

Gyrokinetic simulation of the spontaneous toroidal rotation of plasma in a stochastic magnetic field

Authors: Jinxiang You, Shaojie Wang

Abstract: Since the DIII-D resonant magnetic perturbation experiment [Nucl. Fusion 59, 126010 (2019)] suggests that the neoclassical toroidal viscosity due to the collisional effects associated with the non-resonant magnetic perturbations is not enough to explain the observed toroidal rotation, it is of interest to investigate the toroidal rotation induced by the anomalous diffusion due to the resonant magnetic perturbations. Gyrokinetic simulation of the toroidal rotation of plasma in a stochastic magnetic field is carried out to investigate the effects of resonant magnetic perturbations on toroidal rotation. The simulation results suggest that, in a stochastic magnetic field, resonant magnetic perturbations drive the plasma to toroidally rotate through the ambipolar radial electric field. It is found that this spontaneous flow, driven on a time scale shorter than an ion-ion collision time, is the parallel return flow of the $\bm{E}_r\times\bm{B}_0$ drift, which is due to the ambipolar radial electric field induced by the non-ambipolar radial diffusion in the stochastic magnetic field. The collisional effect changes the plasma toroidal rotation from the return flow to the rigid-body flow after a few ion-ion collision times. The toroidal rotation observed in the DIII-D resonant magnetic perturbation experiment [Nucl. Fusion 59, 126010 (2019)] can be explained by the rigid-body rotation driven by the ambipolar radial electric field generated by the stochastic magnetic field layer.

Submitted 2 June, 2024; originally announced June 2024.

arXiv:2405.14251 [pdf, other]

Efficient Navigation of a Robotic Fish Swimming Across the Vortical Flow Field

Authors: Haodong Feng, Dehan Yuan, Jiale Miao, Jie You, Yue Wang, Yi Zhu, Dixia Fan

Abstract: Navigating efficiently across vortical flow fields presents a significant challenge in various robotic applications. The dynamic and unsteady nature of vortical flows often disturbs the control of underwater robots, complicating their operation in hydrodynamic environments. Conventional control methods, which depend on accurate modeling, fail in these settings due to the complexity of fluid-structure interactions (FSI) caused by unsteady hydrodynamics. This study proposes a deep reinforcement learning (DRL) algorithm, trained in a data-driven manner, to enable efficient navigation of a robotic fish swimming across vortical flows. Our proposed algorithm incorporates the LSTM architecture and uses several recent consecutive observations as the state to address the issue of partial observation, often due to sensor limitations. We present a numerical study of navigation within a Karman vortex street, created by placing a stationary cylinder in a uniform flow, utilizing the immersed boundary-lattice Boltzmann method (IB-LBM). The aim is to train the robotic fish to discover efficient navigation policies, enabling it to reach a designated target point across the Karman vortex street from various initial positions. After training, the fish demonstrates the ability to rapidly reach the target from different initial positions, showcasing the effectiveness and robustness of our proposed algorithm. Analysis of the results reveals that the robotic fish can leverage velocity gains and pressure differences induced by the vortices to reach the target, underscoring the potential of our proposed algorithm in enhancing navigation in complex hydrodynamic environments.

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.11281 [pdf, other]

Cooperative Cognitive Dynamic System in UAV Swarms: Reconfigurable Mechanism and Framework

Authors: Ziye Jia, Jiahao You, Chao Dong, Qihui Wu, Fuhui Zhou, Dusit Niyato, Zhu Han

Abstract: As the demands for immediate and effective responses increase in both civilian and military domains, unmanned aerial vehicle (UAV) swarms emerge as effective solutions, in which multiple cooperative UAVs can work together to achieve specific goals. However, how to manage such complex systems to ensure real-time adaptability lacks sufficient research. Hence, in this paper, we propose the cooperative cognitive dynamic system (CCDS) to optimize the management of UAV swarms. CCDS leverages a hierarchical and cooperative control structure that enables real-time data processing and decision-making. Accordingly, CCDS optimizes UAV swarm management via dynamic reconfigurability and adaptive intelligent optimization. In addition, CCDS can be integrated with the biomimetic mechanism to efficiently allocate tasks for UAV swarms. Further, the distributed coordination of CCDS ensures reliable and resilient control, thus enhancing adaptability and robustness. Finally, the potential challenges and future directions are analyzed to provide insights into managing UAV swarms in dynamic heterogeneous networking.

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2405.10451 [pdf, ps, other]

doi 10.1103/PhysRevA.109.062603

Simulation of a feedback-based algorithm for quantum optimization for a realistic neutral atom system with an optimized small-angle controlled-phase gate

Authors: S. X. Li, W. L. Mu, J. B. You, X. Q. Shao

Abstract: In contrast to the classical optimization process required by the quantum approximate optimization algorithm, FALQON, a feedback-based algorithm for quantum optimization [A. B. Magann et al., Phys. Rev. Lett. 129, 250502 (2022)], enables one to obtain approximate solutions to combinatorial optimization problems without any classical optimization effort. In this study, we leverage the specifications of a recent experimental platform for the neutral atom system [Z. Fu et al., Phys. Rev. A 105, 042430 (2022)] and present a scheme to implement an optimally tuned small-angle controlled-phase gate. By examining the 2- to 4-qubit FALQON algorithms in the Max-Cut problem and considering the spontaneous emission of the neutral atomic system, we have observed that the performance of FALQON implemented with small-angle controlled-phase gates exceeds that of FALQON utilizing CZ gates. This approach has the potential to significantly simplify the logic circuit required to simulate FALQON and effectively address the Max-Cut problem, which may pave a way for the experimental implementation of near-term noisy intermediate-scale quantum algorithms with neutral-atom systems.

Submitted 10 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: typos corrected and figures updated

Journal ref: Phys. Rev. A 109, 062603 (2024)

arXiv:2405.10313 [pdf, other]

How Far Are We From AGI

Authors: Tao Feng, Chuanyang Jin, Jingyu Liu, Kunlun Zhu, Haoqin Tu, Zirui Cheng, Guanyu Lin, Jiaxuan You

Abstract: The evolution of artificial intelligence (AI) has profoundly impacted human society, driving significant advancements in multiple sectors. Yet, the escalating demands on AI have highlighted the limitations of AI's current offerings, catalyzing a movement towards Artificial General Intelligence (AGI). AGI, distinguished by its ability to execute diverse real-world tasks with efficiency and effectiveness comparable to human intelligence, reflects a paramount milestone in AI evolution. While existing works have summarized specific recent advancements of AI, they lack a comprehensive discussion of AGI's definitions, goals, and developmental trajectories. Different from existing survey papers, this paper delves into the pivotal questions of our proximity to AGI and the strategies necessary for its realization through extensive surveys, discussions, and original perspectives. We start by articulating the requisite capability frameworks for AGI, integrating the internal, interface, and system dimensions. As the realization of AGI requires more advanced capabilities and adherence to stringent constraints, we further discuss necessary AGI alignment technologies to harmonize these factors. Notably, we emphasize the importance of approaching AGI responsibly by first defining the key levels of AGI progression, followed by the evaluation framework that situates the status-quo, and finally giving our roadmap of how to reach the pinnacle of AGI.
Moreover, to give tangible insights into the ubiquitous impact of the integration of AI, we outline existing challenges and potential pathways toward AGI in multiple domains. In sum, serving as a pioneering exploration into the current state and future trajectory of AGI, this paper aims to foster a collective comprehension and catalyze broader public discussions among researchers and practitioners on AGI.

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.08684 [pdf, other]

Superconducting and topological properties of compound Lu$_4$H$_7$N

Authors: Zheng-Wei Liao, Xin-Wei Yi, Jing-Yang You, Bo Gu, Gang Su

Abstract: A recent experiment has reported a nitrogen-doped lutetium hydride achieving a remarkable Tc of 294 K at just 1 GPa, significantly reducing the required pressure for obtaining room temperature superconductivity. However, subsequent experimental and theoretical investigations have encountered difficulties in replicating these results, leaving the structure of this Lu-H-N compound shrouded in uncertainty. Here, we propose a stable structure for Lu$_4$H$_7$N employing first-principles calculations. Our calculations reveal that Lu$_4$H$_7$N has a Tc of 1.044 K, which can be substantially enhanced to 11.721 K at 150 GPa, due to the increasing electron-phonon coupling (EPC). Notably, we delve into the nontrivial Z2 band topology of Lu$_4$H$_7$N, featuring discernible surface states near the Fermi level, and we explore its spin Hall conductivity characteristics. Furthermore, we find that the electron doping can enhance the EPC strength and Tc of Lu$_4$H$_7$N, such as the Lu$_4$H$_7$O structure we predict simulating electron doping for Lu$_4$H$_7$N with an impressive Tc of 3.837 K. This work demonstrates the coexistence of superconducting and topological properties in a Lu-H-N system compound, which holds the promise of guiding the search for novel topological superconducting materials.

Submitted 14 May, 2024; originally announced May 2024.

arXiv:2405.06189 [pdf, ps, other]

doi 10.1140/epjqt/s40507-024-00246-w

Holonomic swap and controlled-swap gates of neutral atoms via selective Rydberg pumping

Authors: C. F. Sun, X. Y. Chen, W. L. Mu, G. C. Wang, J. B. You, X. Q. Shao

Abstract: Holonomic quantum computing offers a promising paradigm for quantum computation due to its error resistance and the ability to perform universal quantum computations. Here, we propose a scheme for the rapid implementation of a holonomic swap gate in neutral atomic systems, based on the selective Rydberg pumping mechanism. By employing time-dependent soft control, we effectively mitigate the impact of off-resonant terms even at higher driving intensities compared to time-independent driving. This approach accelerates the synthesis of logic gates and passively reduces the decoherence effects. Furthermore, by introducing an additional atom and applying the appropriate driving field, our scheme can be directly extended to implement a three-qubit controlled-swap gate. This advancement makes it a valuable tool for quantum state preparation, quantum switches, and a variational quantum algorithm in neutral atom systems.

Submitted 9 May, 2024; originally announced May 2024.

Comments: Accepted by EPJ Quantum Technology

Journal ref: EPJ Quantum Technology 11, 34 (2024)

arXiv:2405.04729 [pdf]

Machine learning aided parameter analysis in Perovskite X-ray Detector

Authors: Bobo Zhang, Endai Huang, Xinyi Du, Xiaokang Ma, Lu Zhang, Jiaxue You, Alex K. Y. Jen, Shengzhong, Liu

Abstract: Many factors in perovskite X-ray detectors, such as crystal lattice and carrier dynamics, determine the final device performance (e.g., sensitivity and detection limit). However, the relationship between these factors remains unknown due to the complexity of the material. In this study, we employ machine learning to reveal the relationship between 15 intrinsic properties of halide perovskite materials and their device performance. We construct a database of X-ray detectors for the training of machine learning. The results show that the band gap is mainly influenced by the atomic number of the B-site metal, and the lattice length parameter b has the greatest impact on the carrier mobility-lifetime product (μτ). An (m-F-PEA)2PbI4 X-ray detector was generated in the experiment, and it further verified the accuracy of our ML models. We suggest further study on random forest regression for X-ray detector applications.

Submitted 7 May, 2024; originally announced May 2024.

Comments: 20 pages

arXiv:2405.02299 [pdf, other]

Deep Reinforcement Learning for Modelling Protein Complexes

Authors: Ziqi Gao, Tao Feng, Jiaxuan You, Chenyi Zi, Yan Zhou, Chen Zhang, Jia Li

Abstract: AlphaFold can be used for both single-chain and multi-chain protein structure prediction, while the latter becomes extremely challenging as the number of chains increases. In this work, by taking each chain as a node and assembly actions as edges, we show that an acyclic undirected connected graph can be used to predict the structure of multi-chain protein complexes (a.k.a., protein complex modelling, PCM). However, there are still two challenges: 1) The huge combinatorial optimization space of $N^{N-2}$ ($N$ is the number of chains) for the PCM problem can easily lead to high computational cost. 2) The scales of protein complexes exhibit distribution shift due to variance in chain numbers, which calls for the generalization in modelling complexes of various scales. To address these challenges, we propose GAPN, a Generative Adversarial Policy Network powered by domain-specific rewards and adversarial loss through policy gradient for automatic PCM prediction. Specifically, GAPN learns to efficiently search through the immense assembly space and optimize the direct docking reward through policy gradient. Importantly, we design an adversarial reward function to enhance the receptive field of our model. In this way, GAPN will simultaneously focus on a specific batch of complexes and the global assembly rules learned from complexes with varied chain numbers. Empirically, we have achieved both significant accuracy (measured by RMSD and TM-Score) and efficiency improvements compared to leading PCM software.

Submitted 6 May, 2024; v1 submitted 11 March, 2024; originally announced May 2024.

Comments: International Conference on Learning Representations (ICLR 2024)
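
A note on the $N^{N-2}$ search space quoted in the abstract above: this count matches Cayley's formula for the number of labeled trees on $N$ nodes, which is consistent with the abstract's description of assemblies as acyclic connected graphs over chains. The following is a minimal sketch that checks the formula by brute force for small $N$. The function names are hypothetical and chosen for illustration only; this is not the authors' code and says nothing about how GAPN itself searches the space.

```python
from itertools import combinations


def cayley_count(n: int) -> int:
    """Number of labeled trees on n nodes by Cayley's formula (n >= 2)."""
    return n ** (n - 2)


def brute_force_tree_count(n: int) -> int:
    """Count spanning trees of the complete graph on n nodes by enumeration.

    Any acyclic set of n - 1 edges on n nodes is a spanning tree, so it is
    enough to test each (n - 1)-edge subset for cycles with union-find.
    """
    nodes = range(n)
    all_edges = list(combinations(nodes, 2))
    count = 0
    for edges in combinations(all_edges, n - 1):
        parent = list(nodes)

        def find(u: int) -> int:
            # Follow parent pointers to the root, halving the path as we go.
            while parent[u] != u:
                parent[u] = parent[parent[u]]
                u = parent[u]
            return u

        acyclic = True
        for u, v in edges:
            root_u, root_v = find(u), find(v)
            if root_u == root_v:
                acyclic = False  # this edge would close a cycle
                break
            parent[root_u] = root_v
        if acyclic:
            count += 1
    return count


if __name__ == "__main__":
    for n in range(2, 6):
        print(n, cayley_count(n), brute_force_tree_count(n))
```

The enumeration is only practical for very small $N$, which illustrates why the abstract describes the space as computationally costly: the count grows faster than exponentially in the number of chains.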

arXiv:2404.16898 [pdf, other]

How to Parameterize Asymmetric Quantization Ranges for Quantization-Aware Training

Authors: Jaeseong You, Minseop Park, Kyunggeun Lee, Seokjun An, Chirag Patel, Markus Nage

Abstract: This paper investigates three different parameterizations of asymmetric uniform quantization for quantization-aware training: (1) scale and offset, (2) minimum and maximum, and (3) beta and gamma. We perform a comprehensive comparative analysis of these parameterizations' influence on quantization-aware training, using both controlled experiments and real-world large language models. Our particular focus is on their changing behavior in response to critical training hyperparameters, bit width and learning rate. Based on our investigation, we propose best practices to stabilize and accelerate quantization-aware training with learnable asymmetric quantization ranges.

Submitted 25 April, 2024; originally announced April 2024.
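
To make the first two parameterizations named in the abstract above concrete, here is a minimal sketch of asymmetric uniform quantization written once in terms of scale and offset and once in terms of minimum and maximum. It follows one common textbook convention (unsigned integer grid, offset acting as a zero-point) and is an assumption for illustration, not the paper's formulation. The function names are hypothetical, and the beta and gamma parameterization is left out because the abstract does not define it.

```python
import numpy as np


def quantize_scale_offset(x, scale, offset, bits=8):
    """Fake-quantize x using a (scale, offset) parameterization.

    Assumed convention: q = clip(round(x / scale) + offset, 0, 2**bits - 1),
    and the dequantized value is (q - offset) * scale.
    """
    q_max = 2 ** bits - 1
    q = np.clip(np.round(x / scale) + offset, 0, q_max)
    return (q - offset) * scale


def min_max_to_scale_offset(x_min, x_max, bits=8):
    """Convert a (min, max) range into the equivalent (scale, offset) pair."""
    q_max = 2 ** bits - 1
    scale = (x_max - x_min) / q_max
    offset = np.round(-x_min / scale)  # integer zero-point on the grid
    return scale, offset


def quantize_min_max(x, x_min, x_max, bits=8):
    """Fake-quantize x using a (min, max) parameterization."""
    scale, offset = min_max_to_scale_offset(x_min, x_max, bits)
    return quantize_scale_offset(x, scale, offset, bits)


if __name__ == "__main__":
    x = np.array([-1.0, 0.0, 0.5, 2.0, 5.0])
    print(quantize_min_max(x, x_min=-1.0, x_max=2.0))
```

Both functions describe the same grid; they differ only in which quantities would be treated as the learnable parameters during training, which is the distinction the abstract studies.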

arXiv:2404.15225 [pdf, other]

PHLP: Sole Persistent Homology for Link Prediction -- Interpretable Feature Extraction

Authors: Junwon You, Eunwoo Heo, Jae-Hun Jung

Abstract: Link prediction (LP), inferring the connectivity between nodes, is a significant research area in graph data, where a link represents essential information on relationships between nodes. Although graph neural network (GNN)-based models have achieved high performance in LP, understanding why they perform well is challenging because most comprise complex neural networks. We employ persistent homology (PH), a topological data analysis method that helps analyze the topological information of graphs, to explain the reasons for the high performance. We propose a novel method that employs PH for LP (PHLP) focusing on how the presence or absence of target links influences the overall topology. The PHLP utilizes the angle hop subgraph and new node labeling called degree double radius node labeling (Degree DRNL), distinguishing the information of graphs better than DRNL. Using only a classifier, PHLP performs similarly to state-of-the-art (SOTA) models on most benchmark datasets. Incorporating the outputs calculated using PHLP into the existing GNN-based SOTA models improves performance across all benchmark datasets. To the best of our knowledge, PHLP is the first method of applying PH to LP without GNNs. The proposed approach, employing PH while not relying on neural networks, enables the identification of crucial factors for improving performance.

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.11157 [pdf, ps, other]

Self-Ordered Supersolid in Spinor Condensates with Cavity-Mediated Spin-Momentum-Mixing Interactions

Authors: Jingjun You, Su Yi, Yuangang Deng

Abstract: Ultracold atoms with cavity-mediated long-range interactions offer a promising platform for investigating novel quantum phenomena. Exploiting recent experimental advancements, we propose an experimental scheme to create self-ordered supersolid in spin-$1/2$ condensates confined within an optical cavity. The interplay of cavity and pump fields gives rise to supersolid square and plane wave phases, comprehensively described by the two-component Tavis-Cummings model. We show that the self-ordered supersolid phase exhibits an undamped gapless Goldstone mode over a wide parameter range. This proposal, achievable with current experimental setups utilizing identical laser configurations, is in contrast to the realization of checkerboard supersolidity, which hinges on constructing a $U(1)$ symmetry by utilizing two ${\cal Z}_2$ symmetries with precisely matched atom-cavity coupling in multimode resonators. By employing the superradiant photon-exchange process, we realize for the first time cavity-mediated spin-momentum-mixing interactions between highly correlated spin and momentum modes, analogous to the spin-mixing observed in spin-1 condensates. Our scheme provides a unique platform for realizing spin-momentum squeezing and spatially distributed multipartite entanglement.

Submitted 17 April, 2024; originally announced April 2024.

Comments: 5+9 pages, 4+3 figures

arXiv:2404.10102 [pdf, other]

Chinchilla Scaling: A replication attempt

Authors: Tamay Besiroglu, Ege Erdil, Matthew Barnett, Josh You

Abstract: Hoffmann et al. (2022) propose three methods for estimating a compute-optimal scaling law. We attempt to replicate their third estimation procedure, which involves fitting a parametric loss function to a reconstruction of data from their plots. We find that the reported estimates are inconsistent with their first two estimation methods, fail at fitting the extracted data, and report implausibly narrow confidence intervals--intervals this narrow would require over 600,000 experiments, while they likely only ran fewer than 500. In contrast, our rederivation of the scaling law using the third approach yields results that are compatible with the findings from the first two estimation procedures described by Hoffmann et al.

Submitted 14 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

arXiv:2404.07598 [pdf, other]

Electro-optically Modulated Nonlinear Metasurfaces

Authors: Zhengqing He, Lun Qu, Wei Wu, Jikun Liu, Jingfei You, Weiye Liu, Lu Bai, Chunyan Jin, Chenxiong Wang, Zhidong Gu, Wei Cai, Mengxin Ren, Jingjun Xu

Abstract: Tunable nonlinearity facilitates the creation of reconfigurable nonlinear metasurfaces, enabling innovative applications in signal processing, light switching, and sensing. This paper presents a novel approach to electrically modulate SHG from a lithium niobate (LN) metasurface, exploiting the electro-optical (EO) effect. By fabricating a nanohole array metasurface on a thin LN film and applying an electric field, we demonstrate the alteration of the material's refractive index, resulting in resonance shifts and modulation of SHG intensity at specific wavelengths. Our findings provide valuable insights for the development of electrically tunable nonlinear light sources, quantum optics, dynamic nonlinear holography, and nonlinear information processing.

Submitted 11 April, 2024; originally announced April 2024.

Comments: 4 pages, 4 figures

arXiv:2404.00242 [pdf, other]

DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference

Authors: Jinwei Yao, Kaiqi Chen, Kexun Zhang, Jiaxuan You, Binhang Yuan, Zeke Wang, Tao Lin

Abstract: Given the increasing demand for tree-structured interactions with LLMs, we introduce DeFT (Decoding with Flash Tree-Attention), an IO-aware tree attention algorithm tailored for tree-structured inference. Unlike traditional sequence-based decoding, tree-structured decoding better accommodates modern task requirements, including self-consistency, few-shot prompting, multi-step reasoning, and multi-model/head coordination. However, existing sequence-based inference systems are ill-suited for tree-structured decoding, resulting in redundancy in computation, memory footprints, and memory access, thereby undermining inference efficiency. To address this challenge, DeFT maintains memory-efficient attention calculation with low memory footprints through two key stages: (1) QKV Preparation: We propose a KV-Guided Grouping Strategy with Tree Split to intelligently group QKV, optimizing GPU resource utilization while minimizing memory reads/writes for KV cache between GPU global memory and on-chip shared memory; (2) Attention Calculation: We compute partial attention of each QKV group in a fused kernel and employ a Tree-topology-aware Global Reduction strategy to obtain final attention. By reducing 73-99% KV cache IO and nearly 100% IO for partial results during attention calculation (e.g., Softmax), DeFT achieves up to 2.52/3.82x speedup in the end-to-end/attention latency across three practical tree-based workloads: namely, few-shot prompting, multi-step reasoning, and speculative decoding, over state-of-the-art attention algorithms.

Submitted 29 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

Comments: Update DeFT-v2. DeFT-v1 was accepted by ICLR'24 AGI Workshop ( https://openreview.net/forum?id=HqfLHoX8bR ). Code will be released soon

arXiv:2403.15234 [pdf, other]

Shadow Generation for Composite Image Using Diffusion model

Authors: Qingyang Liu, Junqi You, Jianting Wang, Xinhao Tao, Bo Zhang, Li Niu

Abstract: In the realm of image composition, generating realistic shadow for the inserted foreground remains a formidable challenge. Previous works have developed image-to-image translation models which are trained on paired training data. However, they are struggling to generate shadows with accurate shapes and intensities, hindered by data scarcity and inherent task complexity. In this paper, we resort to foundation model with rich prior knowledge of natural shadow images. Specifically, we first adapt ControlNet to our task and then propose intensity modulation modules to improve the shadow intensity. Moreover, we extend the small-scale DESOBA dataset to DESOBAv2 using a novel data acquisition pipeline. Experimental results on both DESOBA and DESOBAv2 datasets as well as real composite images demonstrate the superior capability of our model for shadow generation task. The dataset, code, and model are released at https://github.com/bcmi/Object-Shadow-Generation-Dataset-DESOBAv2.

Submitted 22 March, 2024; originally announced March 2024.

Comments: accepted by CVPR2024

arXiv:2403.11455 [pdf, other]

Antiferromagnetic Ground State, Charge Density Waves and Oxygen Vacancies Induced Metal-Insulator Transition in Pressurized La$_{3}$Ni$_{2}$O$_{7}$

Authors: Xin-Wei Yi, Ying Meng, Jia-Wen Li, Zheng-Wei Liao, Jing-Yang You, Bo Gu, Gang Su

Abstract: La$_{3}$Ni$_{2}$O$_{7}$ has garnered widespread interest recently due to its high-temperature superconductivity under pressure, accompanied by charge density wave (CDW) ordering and metal-insulator (MI) transitions in the phase diagram. Here, we reveal with comprehensive calculations that La$_{3}$Ni$_{2}$O$_{7}$ possesses an antiferromagnetic ground state under both low and high pressures, with the strong Fermi surface nesting contributed by the flat band that leads to phonon softening and electronic instabilities. Several stable CDW orders with oxygen octahedral distortions are identified, which can trigger the MI transitions. The estimated CDW transition temperature ($\approx$120 K) at ambient pressure agrees nicely with experimental results. In the presence of apical oxygen vacancies, we identify two different phases, say, half distortion and full distortion phases, respectively, and their competition can lead to a pressure-induced MI transition, in good agreement with experimental observations. In addition, we find that the electron-phonon coupling is too small to contribute to superconductivity. These results appear to indicate an unconventional superconducting pairing mechanism mediated by antiferromagnetic fluctuations. A phase diagram that is consistent with the experimental results is given.
The present results not only explain the origins of experimentally observed CDW and MI transitions, but also provide insight for deeply understanding the properties like superconductivity, CDW and the role of oxygen vacancies in pressurized La$_{3}$Ni$_{2}$O$_{7}$.

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.00564 [pdf, other]

EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data

Authors: Shengjie Wang, Shaohuai Liu, Weirui Ye, Jiacheng You, Yang Gao

Abstract: Sample efficiency remains a crucial challenge in applying Reinforcement Learning (RL) to real-world tasks. While recent algorithms have made significant strides in improving sample efficiency, none have achieved consistently superior performance across diverse domains. In this paper, we introduce EfficientZero V2, a general framework designed for sample-efficient RL algorithms. We have expanded the performance of EfficientZero to multiple domains, encompassing both continuous and discrete actions, as well as visual and low-dimensional inputs. With a series of improvements we propose, EfficientZero V2 outperforms the current state-of-the-art (SOTA) by a significant margin in diverse tasks under the limited data setting. EfficientZero V2 exhibits a notable advancement over the prevailing general algorithm, DreamerV3, achieving superior outcomes in 50 of 66 evaluated tasks across diverse benchmarks, such as Atari 100k, Proprio Control, and Vision Control.

Submitted 1 March, 2024; originally announced March 2024.

Comments: 21 pages,10 figures

arXiv:2403.00345 [pdf, other]

Microwave-to-optics conversion using magnetostatic modes and a tunable optical cavity

Authors: Wei-Jiang Wu, Yi-Pu Wang, Jie Li, Gang Li, J. Q. You

Abstract: Quantum computing, quantum communication and quantum networks rely on hybrid quantum systems operating in different frequency ranges. For instance, the superconducting qubits work in the gigahertz range, while the optical photons used in communication are in the range of hundreds of terahertz. Due to the large frequency mismatch, achieving the direct coupling and information exchange between different information carriers is generally difficult. Accordingly, a quantum interface is demanded, which serves as a bridge to establish information linkage between different quantum systems operating at distinct frequencies. Recently, the magnon mode in ferromagnetic spin systems has received significant attention. While the inherent weak optomagnonic coupling strength restricts the microwave-to-optical photon conversion efficiency using magnons, the versatility of the magnon modes, together with their readily achievable strong coupling with other quantum systems, endow them with many distinct advantages. Here, we realize the magnon-based microwave-light interface by adopting an optical cavity with adjustable free spectrum range and different kinds of magnetostatic modes in two microwave cavity configurations. By optimizing the parameters, an internal conversion efficiency of $1.28 \times 10^{-7}$ is achieved. We analyze the impact of various parameters on the microwave-to-optics conversion. The study provides useful guidance and insights for further enhancing the microwave-to-optics conversion efficiency using magnons.

Submitted 4 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

Comments: 11 pages 7 figures

arXiv:2403.00264 [pdf, other]

Generation and optimization of entanglement between giant atoms chirally coupled to spin cavities

Authors: Jia-Bin You, Jian Feng Kong, Davit Aghamalyan, Wai-Keong Mok, Kian Hwee Lim, Jun Ye, Ching Eng Png, Francisco J. García-Vidal

Abstract: We explore a scheme for entanglement generation and optimization in giant atoms by coupling them to finite one-dimensional arrays of spins that behave as cavities. We find that high values for the concurrence can be achieved in small-sized cavities, with a very short generation time. When exciting the system by external means, optimal concurrence is obtained for very weak drivings. We also analyze the effect of disorder in these systems, showing that although the average concurrence decreases with disorder, high concurrences can still be obtained even in scenarios presenting strong disorder. This result leads us to propose an optimization procedure in which by engineering the on-site energies or hoppings in the cavity, concurrences close to 1 can be reached within an extremely short period of time.

Submitted 29 February, 2024; originally announced March 2024.

arXiv:2402.16819 [pdf, other]

Nemotron-4 15B Technical Report

Authors: Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Mostofa Patwary, Sandeep Subramanian, Dan Su, Chen Zhu, Deepak Narayanan, Aastha Jhunjhunwala, Ayush Dattagupta, Vibhu Jawa, Jiwei Liu, Ameya Mahabaleshwarkar, Osvald Nitski, Annika Brundyn, James Maki, Miguel Martinez, Jiaxuan You, John Kamalu, Patrick LeGresley, Denys Fridman, Jared Casper, Ashwath Aithal, Oleksii Kuchaiev, Mohammad Shoeybi, et al. (2 additional authors not shown)

Abstract: We introduce Nemotron-4 15B, a 15-billion-parameter large multilingual language model trained on 8 trillion text tokens. Nemotron-4 15B demonstrates strong performance when assessed on English, multilingual, and coding tasks: it outperforms all existing similarly-sized open models on 4 out of 7 downstream evaluation areas and achieves competitive performance to the leading open models in the remaining ones. Specifically, Nemotron-4 15B exhibits the best multilingual capabilities of all similarly-sized models, even outperforming models over four times larger and those explicitly specialized for multilingual tasks.

Submitted 27 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

arXiv:2402.15090 [pdf, other]

Frustration elimination for effective optical spins in coherent Ising machines

Authors: Zheng-Yang Zhou, Clemens Gneiting, J. Q. You, Franco Nori

Abstract: Frustration, that is, the impossibility to satisfy the energetic preferences between all spin pairs simultaneously, underlies the complexity of many fundamental properties in spin systems, including the computational hardness to determine their ground states. Coherent Ising machines (CIM) have been proposed as a promising analog computational approach to efficiently find different degenerate ground states of large and complex Ising models. However, CIMs also face challenges in solving frustrated Ising models: Frustration not only reduces the probability to find good solutions, but it also prohibits to leverage quantum effects in doing so. To circumvent these detrimental effects of frustration, we show how frustrated Ising models can be mapped to frustration-free CIM configurations by including ancillary modes and modifying the coupling protocol used in current CIM designs. In our proposal, degenerate optical parametric oscillator (DOPO) modes encode the ground state candidates of the studied Ising model, while the ancillary modes enable the autonomous transformation to a frustration-free Ising model that preserves the ground states encoded in the DOPO modes. Such frustration elimination may empower current CIMs to improve precision and to benefit from quantum effects in dealing with frustrated Ising models.

Submitted 22 February, 2024; originally announced February 2024.

Comments: 6 pages, 3 figures

arXiv:2402.14367 [pdf, other]

Representation Learning for Frequent Subgraph Mining

Authors: Rex Ying, Tianyu Fu, Andrew Wang, Jiaxuan You, Yu Wang, Jure Leskovec

Abstract: Identifying frequent subgraphs, also called network motifs, is crucial in analyzing and predicting properties of real-world networks. However, finding large commonly-occurring motifs remains a challenging problem not only due to its NP-hard subroutine of subgraph counting, but also the exponential growth of the number of possible subgraph patterns. Here we present Subgraph Pattern Miner (SPMiner), a novel neural approach for approximately finding frequent subgraphs in a large target graph. SPMiner combines graph neural networks, order embedding space, and an efficient search strategy to identify network subgraph patterns that appear most frequently in the target graph. SPMiner first decomposes the target graph into many overlapping subgraphs and then encodes each subgraph into an order embedding space. SPMiner then uses a monotonic walk in the order embedding space to identify frequent motifs. Compared to existing approaches and possible neural alternatives, SPMiner is more accurate, faster, and more scalable. For 5- and 6-node motifs, we show that SPMiner can almost perfectly identify the most frequent motifs while being 100x faster than exact enumeration methods. In addition, SPMiner can also reliably identify frequent 10-node motifs, which is well beyond the size limit of exact enumeration approaches. And last, we show that SPMiner can find large motifs of up to 20 nodes with 10-100x higher frequency than those found by current approximate methods.

Submitted 22 February, 2024; originally announced February 2024.

Comments: Oral Presentation in The Graph Representation Learning and Beyond (GRL+) Workshop from The 37th International Conference on Machine Learning, 2020

arXiv:2402.00197 [pdf, other]

doi 10.1021/acs.est.3c06447

Determination of Trace Organic Contaminant Concentration via Machine Classification of Surface-Enhanced Raman Spectra

Authors: Vishnu Jayaprakash, Jae Bem You, Chiranjeevi Kanike, Jinfeng Liu, Christopher McCallum, Xuehua Zhang

Abstract: Accurate detection and analysis of traces of persistent organic pollutants in water is important in many areas, including environmental monitoring and food quality control, due to their long environmental stability and potential bioaccumulation. While conventional analysis of organic pollutants requires expensive equipment, surface enhanced Raman spectroscopy (SERS) has demonstrated great potential for accurate detection of these contaminants. However, SERS analytical difficulties, such as spectral preprocessing, denoising, and substrate-based spectral variation, have hindered widespread use of the technique. Here, we demonstrate an approach for predicting the concentration of sample pollutants from messy, unprocessed Raman data using machine learning. Frequency domain transform methods, including the Fourier and Walsh Hadamard transforms, are applied to sets of Raman spectra of three model micropollutants in water (rhodamine 6G, chlorpyrifos, and triclosan), which are then used to train machine learning algorithms. Using standard machine learning models, the concentration of sample pollutants is predicted with more than 80 percent cross-validation accuracy from raw Raman data. A cross-validation accuracy of 85 percent was achieved using deep learning for a moderately sized dataset (100 spectra), and 70 to 80 percent cross-validation accuracy was achieved even for very small datasets (50 spectra). Additionally, standard models were shown to accurately identify characteristic peaks via analysis of their importance scores. The approach shown here has the potential to be applied to facilitate accurate detection and analysis of persistent organic pollutants by surface-enhanced Raman spectroscopy.

Submitted 31 January, 2024; originally announced February 2024.

arXiv:2401.07422 [pdf, other]

Multiperson Detection and Vital-Sign Sensing Empowered by Space-Time-Coding RISs

Authors: Xinyu Li, Jian Wei You, Ze Gu, Qian Ma, Jingyuan Zhang, Long Chen, Tie Jun Cui

Abstract: Passive human sensing using wireless signals has attracted increasing attention due to its superiorities of non-contact and robustness in various lighting conditions. However, when multiple human individuals are present, their reflected signals could be intertwined in the time, frequency and spatial domains, making it challenging to separate them. To address this issue, this paper proposes a novel system for multiperson detection and monitoring of vital signs (i.e., respiration and heartbeat) with the assistance of space-time-coding (STC) reconfigurable intelligent metasurfaces (RISs). Specifically, the proposed system scans the area of interest (AoI) for human detection by using the harmonic beams generated by the STC RIS. Simultaneously, frequency-orthogonal beams are assigned to each detected person for accurate estimation of their respiration rate (RR) and heartbeat rate (HR). Furthermore, to efficiently extract the respiration signal and the much weaker heartbeat signal, we propose an improved variational mode decomposition (VMD) algorithm to accurately decompose the complex reflected signals into a smaller number of intrinsic mode functions (IMFs). We build a prototype to validate the proposed multiperson detection and vital-sign monitoring system. Experimental results demonstrate that the proposed system can simultaneously monitor the vital signs of up to four persons. The errors of RR and HR estimation using the improved VMD algorithm are below 1 RPM (respiration per minute) and 5 BPM (beats per minute), respectively. Further analysis reveals that the flexible beam controlling mechanism empowered by the STC RIS can reduce the noise reflected from other irrelative objects on the physical layer, and improve the signal-to-noise ratio of echoes from the human chest.

Submitted 14 January, 2024; originally announced January 2024.

arXiv:2401.02738 [pdf, other]

Strong coupling between a single photon and a photon pair

Authors: Shuai-Peng Wang, Alberto Mercurio, Alessandro Ridolfo, Yuqing Wang, Mo Chen, Tiefu Li, Franco Nori, Salvatore Savasta, J. Q. You

Abstract: The realization of strong nonlinear coupling between single photons has been a long-standing goal in quantum optics and quantum information science, promising wide impact applications, such as all-optical deterministic quantum logic and single-photon frequency conversion. Here, we report an experimental observation of the strong coupling between a single photon and a photon pair in an ultrastrongly-coupled circuit-QED system. This strong nonlinear interaction is realized by introducing a detuned flux qubit working as an effective coupler between two modes of a superconducting coplanar waveguide resonator. The ultrastrong light--matter interaction breaks the excitation number conservation, and an external flux bias breaks the parity conservation. The combined effect of the two enables the strong one--two-photon coupling. Quantum Rabi-like avoided crossing is resolved when tuning the two-photon resonance frequency of the first mode across the single-photon resonance frequency of the second mode. Within this new photonic regime, we observe the second harmonic generation for a mean photon number below one. Our results represent a key step towards a new regime of quantum nonlinear optics, where individual photons can deterministically and coherently interact with each other in the absence of any stimulating fields.

Submitted 5 January, 2024; originally announced January 2024.

Comments: 13 pages, 7 figures

arXiv:2401.01613 [pdf, ps, other]

Synthetically enhanced sensitivity using higher-order exceptional point and coherent perfect absorption

Authors: Yao-Dong Hu, Yi-Pu Wang, Rui-Chang Shen, Zi-Qi Wang, Wei-Jiang Wu, J. Q. You

Abstract: Sensors play a crucial role in advanced apparatuses and it is persistently pursued to improve their sensitivities. Recently, the singularity of a non-Hermitian system, known as the exceptional point (EP), has drawn much attention for this goal. Response of the eigenfrequency shift to a perturbation $ε$ follows the $ε^{1/n}$-dependence at an $n$th-order EP, leading to significantly enhanced sensitivity via a high-order EP. However, due to the requirement of increasingly complicated systems, great difficulties will occur along the path of increasing the EP order to enhance the sensitivity. Here we report that by utilizing the spectral anomaly of the coherent perfect absorption (CPA), the sensitivity at a third-order EP can be further enhanced owing to the cooperative effects of both CPA and EP. We realize this synthetically enhanced sensor using a pseudo-Hermitian cavity magnonic system composed of two yttrium iron garnet spheres and a microwave cavity. The detectable minimum change of the magnetic field reaches $4.2\times10^{-21}$T. It opens a new avenue to design novel sensors using hybrid non-Hermitian quantum systems.

Submitted 3 January, 2024; originally announced January 2024.

Comments: 18 pages, 5 figures

arXiv:2401.01479 [pdf, other]

Kernel-U-Net: Symmetric and Hierarchical Architecture for Multivariate Time Series Forecasting

Authors: Jiang You, René Natowicz, Arben Cela, Jacob Ouanounou, Patrick Siarry

Abstract: Time series forecasting predicts future trends based on historical information. Transformer-based U-Net architectures, despite their success in medical image segmentation, have limitations in both expressiveness and computation efficiency in time series forecasting as evidenced in YFormer. To tackle these challenges, we introduce Kernel-U-Net, a symmetric and hierarchical U-shape neural network architecture. The Kernel-U-Net encoder gradually compresses the input series into latent vectors, and its symmetric decoder subsequently expands these vectors into output series. Specifically, Kernel-U-Net separates the procedure of partitioning input time series into patches from kernel manipulation, thereby providing the convenience of executing customized kernels. Our method offers two primary advantages: 1) Flexibility in kernel customization to adapt to specific datasets; 2) Enhanced computational efficiency, with the complexity of the Transformer layer reduced to linear. Experiments on seven real-world datasets, considering both multivariate and univariate settings, demonstrate that Kernel-U-Net's performance either exceeds or meets that of the existing state-of-the-art model PatchTST in the majority of cases and outperforms Yformer. The source code for Kernel-U-Net will be made publicly available for further research and application.

Submitted 11 February, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

arXiv:2312.07298 [pdf, ps, other]

doi 10.1016/j.sysconle.2023.105641

Combined Invariant Subspace & Frequency-Domain Subspace Method for Identification of Discrete-Time MIMO Linear Systems

Authors: Jingze You, Chao Huang, Hao Zhang

Abstract: Recently, a novel system identification method based on invariant subspace theory was introduced, aiming to address the identification problem of continuous-time (CT) linear time-invariant (LTI) systems by combining time-domain and frequency-domain methods. Subsequently, the combined Invariant-Subspace and Subspace Identification Method (cISSIM) was introduced, enabling direct estimation of CT LTI systems in state-space forms. It produces consistent estimation that is robust under error-in-variable and slow-sampling conditions, while no pre-filtering operation of the input-output signals is needed. This paper presents the discrete-cISSIM, which extends cISSIM to discrete-time (DT) systems and offers the following improvements: 1) the capability to utilize arbitrary discrete periodic excitations while cISSIM uses multi-sine signals; 2) a faster estimation with reduced computational complexity is proposed; 3) the covariance estimation problem can be addressed concurrently with the system parameter estimation. An implementation of discrete-cISSIM by MATLAB has also been provided.

Submitted 12 December, 2023; originally announced December 2023.

Comments: algorithm implemented via MATLAB: https://github.com/wyqy/dcissim

Journal ref: Systems & Control Letters, vol. 181, p. 105641, Nov. 2023

arXiv:2312.04615 [pdf, other]

Relational Deep Learning: Graph Representation Learning on Relational Databases

Authors: Matthias Fey, Weihua Hu, Kexin Huang, Jan Eric Lenssen, Rishabh Ranjan, Joshua Robinson, Rex Ying, Jiaxuan You, Jure Leskovec

Abstract: Much of the world's most valued data is stored in relational databases and data warehouses, where the data is organized into many tables connected by primary-foreign key relations. However, building machine learning models using this data is both challenging and time consuming. The core problem is that no machine learning method is capable of learning on multiple tables interconnected by primary-foreign key relations. Current methods can only learn from a single table, so the data must first be manually joined and aggregated into a single training table, the process known as feature engineering. Feature engineering is slow, error prone and leads to suboptimal models. Here we introduce an end-to-end deep representation learning approach to directly learn on data laid out across multiple tables. We name our approach Relational Deep Learning (RDL). The core idea is to view relational databases as a temporal, heterogeneous graph, with a node for each row in each table, and edges specified by primary-foreign key links. Message Passing Graph Neural Networks can then automatically learn across the graph to extract representations that leverage all input data, without any manual feature engineering. Relational Deep Learning leads to more accurate models that can be built much faster. To facilitate research in this area, we develop RelBench, a set of benchmark datasets and an implementation of Relational Deep Learning. The data covers a wide spectrum, from discussions on Stack Exchange to book reviews on the Amazon Product Catalog. Overall, we define a new research area that generalizes graph machine learning and broadens its applicability to a wide set of AI use cases.

Submitted 7 December, 2023; originally announced December 2023.

Comments: https://relbench.stanford.edu

arXiv:2311.14936 [pdf, other]

Single-image based deep learning for precise atomic defects identification

Authors: Kangshu Li, Xiaocang Han, Yanhui Hong, Yuan Meng, Xiang Chen, Junxian Li, Jing-Yang You, Lin Yao, Wenchao Hu, Zhiyi Xia, Guolin Ke, Linfeng Zhang, Jin Zhang, Xiaoxu Zhao

Abstract: Defect engineering has been profoundly employed to confer desirable functionality to materials that pristine lattices inherently lack. Although single atomic-resolution scanning transmission electron microscopy (STEM) images are widely accessible for defect engineering, harnessing atomic-scale images containing various defects through traditional image analysis methods is hindered by random noise and human bias. Although the rise of deep learning (DL) offers an alternative approach, its widespread application is primarily restricted by the need for large amounts of training data with labeled ground truth. In this study, we propose a two-stage method to address the problems of high annotation cost and image noise in the detection of atomic defects in monolayer 2D materials. In the first stage, to tackle the issue of data scarcity, we employ a two-state transformation network based on U-GAT-IT for adding realistic noise to simulated images with pre-located ground truth labels, thereby infinitely expanding the training dataset. In the second stage, atomic defects in monolayer 2D materials are effectively detected with high accuracy using U-Net models trained with the data generated in the first stage, avoiding random noise and human bias issues. In both stages, we utilize segmented unit-cell-level images to simplify the model's task and enhance its accuracy. Our results demonstrate that we are able to visualize not only sulfur vacancies but also oxygen dopants in monolayer MoS2, which are usually overwhelmed by random background noise. As the training was based on a few segmented unit-cell-level realistic images, this method can be readily extended to other 2D materials. Therefore, our results outline novel ways to train the model with minimized datasets, offering great opportunities to fully exploit the power of machine learning (ML) applicable to a broad materials science community.

Submitted 25 November, 2023; originally announced November 2023.

arXiv:2311.11283 [pdf, other]

High Curie temperature and high hole mobility in diluted magnetic semiconductors (B, Mn)X (X = N, P, As, Sb)

Authors: Xiang Li, Jia-Wen Li, Jing-Yang You, Gang Su, Bo Gu

Abstract: Doping nonmagnetic semiconductors with magnetic impurities is a feasible way to obtain diluted magnetic semiconductors (DMSs). It is generally accepted that for the most extensively studied DMS, (Ga, Mn)As, its highest Curie temperature T$_{\text{C}}$ was achieved at 200 K with a Mn concentration of approximately 16% in experiments. A recent experiment reported record-breaking high electron and hole mobilities in the semiconductor BAs [Science 377, 437 (2022)]. Since BAs shares the same zinc-blende structure with GaAs, here we predict four DMSs (B, Mn)X (X = N, P, As, Sb) by density functional theory calculations. Our results indicate a significantly higher T$_{\text{C}}$ in the range of 254 K to 300 K for (B, Mn)As with a Mn concentration of around 15.6%, and even higher T$_{\text{C}}$ values above room temperature for (B, Mn)N and (B, Mn)P with a Mn concentration exceeding 12.5%. Furthermore, we have predicted a large hole mobility of 1561 cm$^{\text{2}}$V$^{\text{-1}}$s$^{\text{-1}}$ at 300 K for (B, Mn)As with a Mn concentration of about 3.7%, which is three orders of magnitude larger than the hole mobility of 4 cm$^{\text{2}}$V$^{\text{-1}}$s$^{\text{-1}}$ at 300 K observed in the experiment for (Ga, Mn)As. Our findings predict the emergence of a new family of DMS, (B, Mn)X, and are expected to stimulate both experimental and theoretical studies of the DMS with high T$_{\text{C}}$ and high mobilities.

Submitted 19 November, 2023; originally announced November 2023.

arXiv:2311.09899 [pdf, ps, other]

Spectrum of Hatano-Nelson model with strictly ergodic potentials

Authors: Xueyin Wang, Zhenfu Wang, Jiangong You, Qi Zhou

Abstract: We provide a precise formula for the spectrum of the Hatano-Nelson model with strictly ergodic potentials in terms of its Lyapunov exponent. As applications, one clearly observes the real-complex spectrum transition. Moreover, if the Lyapunov exponent is continuous, the spectrum of the Hatano-Nelson model in $\ell^{2}(\mathbb{Z})$ can be approximated by the spectrum of its finite-interval truncation with periodic boundary conditions. Both of these results are strikingly different from the Hatano-Nelson model with random potentials \cite{Dav01A, Dav01, Dav02}.

Submitted 16 November, 2023; originally announced November 2023.

arXiv:2311.09230 [pdf]

Evaluating and Improving Value Judgments in AI: A Scenario-Based Study on Large Language Models' Depiction of Social Conventions

Authors: Jaeyoun You, Bongwon Suh

Abstract: The adoption of generative AI technologies is swiftly expanding. Services employing both linguistic and multimodal models are evolving, offering users increasingly precise responses. Consequently, human reliance on these technologies is expected to grow rapidly. With the premise that people will be impacted by the output of AI, we explored approaches to help AI output produce better results. Initially, we evaluated how contemporary AI services competitively meet user needs, then examined society's depiction as mirrored by Large Language Models (LLMs). We did a query experiment, querying about social conventions in various countries and eliciting a one-word response. We compared the LLMs' value judgments with public data and suggested a model of decision-making in value-conflicting scenarios which could be adopted for future machine value judgments. This paper advocates for a practical approach to using AI as a tool for investigating other remote worlds. This research has significance in implicitly rejecting the notion of AI making value judgments and instead arguing for a more critical perspective on the environment that defers judgmental capabilities to individuals. We anticipate this study will empower anyone, regardless of their capacity, to receive safe and accurate value judgment-based outputs effectively.

Submitted 4 October, 2023; originally announced November 2023.

Comments: 11 pages, 1 figure, 2 tables, The 18th International AAAI Conference on Web and Social Media (ICWSM 2024) Accepted

arXiv:2311.07873 [pdf, other]

Passive Human Sensing Enhanced by Reconfigurable Intelligent Surface: Opportunities and Challenges

Authors: Xinyu Li, Jian Wei You, Ze Gu, Qian Ma, Long Chen, Jingyuan Zhang, Shi Jin, Tie Jun Cui

Abstract: Reconfigurable intelligent surfaces (RISs) have flexible and exceptional performance in manipulating electromagnetic waves and customizing wireless channels. These capabilities enable them to provide a plethora of valuable activity-related information for promoting wireless human sensing. In this article, we present a comprehensive review of passive human sensing using radio frequency signals with the assistance of RISs. Specifically, we first introduce fundamental principles and physical platform of RISs. Subsequently, based on the specific applications, we categorize the state-of-the-art human sensing techniques into three types, including human imaging, localization, and activity recognition. Meanwhile, we also investigate the benefits that RISs bring to these applications. Furthermore, we explore the application of RISs in human micro-motion sensing, and propose a vital signs monitoring system enhanced by RISs. Experimental results are presented to demonstrate the promising potential of RISs in sensing vital signs for manipulating individuals. Finally, we discuss the technical challenges and opportunities in this field.

Submitted 13 November, 2023; originally announced November 2023.

arXiv:2311.07480 [pdf, ps, other]

doi 10.1145/3649832

Qualifying System F-sub

Authors: Edward Lee, Yaoyu Zhao, James You, Kavin Satheeskumar, Ondřej Lhoták, Jonathan Brachthäuser

Abstract: Type qualifiers offer a lightweight mechanism for enriching existing type systems to enforce additional, desirable, program invariants. They do so by offering a restricted but effective form of subtyping. While the theory of type qualifiers is well understood and present in many programming languages today, polymorphism over type qualifiers is an area that is less examined. We explore how such a polymorphic system could arise by constructing a calculus System F<:Q which combines the higher-rank bounded polymorphism of System F<: with the theory of type qualifiers. We explore how the ideas used to construct System F<:Q can be reused in situations where type qualifiers naturally arise -- in reference immutability, function colouring, and capture checking. Finally, we re-examine other qualifier systems in the literature in light of the observations presented while developing System F<:Q.

Submitted 13 November, 2023; originally announced November 2023.

Comments: 24 pages

Journal ref: Proc. ACM Program. Lang. 8, OOPSLA1, Article 115 (April 2024), 30 pages

Showing 1–50 of 575 results for author: You, J