subscribe to arXiv mailings

Distributed online generalized Nash Equilibrium learning in multi-cluster games: A delay-tolerant algorithm

Authors: Bingqian Liu, Guanghui Wen, Xiao Fang, Tingwen Huang, Guanrong Chen

Abstract: This paper addresses the problem of distributed online generalized Nash equilibrium (GNE) learning for multi-cluster games with delayed feedback information. Specifically, each agent in the game is assumed to be informed a sequence of local cost functions and constraint functions, which are known to the agent with time-varying delays subsequent to decision-making at each round. The objective of ea… ▽ More This paper addresses the problem of distributed online generalized Nash equilibrium (GNE) learning for multi-cluster games with delayed feedback information. Specifically, each agent in the game is assumed to be informed a sequence of local cost functions and constraint functions, which are known to the agent with time-varying delays subsequent to decision-making at each round. The objective of each agent within a cluster is to collaboratively optimize the cluster's cost function, subject to time-varying coupled inequality constraints and local feasible set constraints over time. Additionally, it is assumed that each agent is required to estimate the decisions of all other agents through interactions with its neighbors, rather than directly accessing the decisions of all agents, i.e., each agent needs to make decision under partial-decision information. To solve such a challenging problem, a novel distributed online delay-tolerant GNE learning algorithm is developed based upon the primal-dual algorithm with an aggregation gradient mechanism. The system-wise regret and the constraint violation are formulated to measure the performance of the algorithm, demonstrating sublinear growth with respect to time under certain conditions. Finally, numerical results are presented to verify the effectiveness of the proposed algorithm. △ Less

Submitted 3 July, 2024; originally announced July 2024.

arXiv:2406.14060 [pdf, ps, other]

Distributed Event-Triggered Bandit Convex Optimization with Time-Varying Constraints

Authors: Kunpeng Zhang, Xinlei Yi, Guanghui Wen, Ming Cao, Karl H. Johansson, Tianyou Chai, Tao Yang

Abstract: This paper considers the distributed bandit convex optimization problem with time-varying inequality constraints over a network of agents, where the goal is to minimize network regret and cumulative constraint violation. Existing distributed online algorithms require that each agent broadcasts its decision to its neighbors at each iteration. To better utilize the limited communication resources, w… ▽ More This paper considers the distributed bandit convex optimization problem with time-varying inequality constraints over a network of agents, where the goal is to minimize network regret and cumulative constraint violation. Existing distributed online algorithms require that each agent broadcasts its decision to its neighbors at each iteration. To better utilize the limited communication resources, we propose a distributed event-triggered online primal--dual algorithm with two-point bandit feedback. Under several classes of appropriately chosen decreasing parameter sequences and non-increasing event-triggered threshold sequences, we establish dynamic network regret and network cumulative constraint violation bounds. These bounds are comparable to the results achieved by distributed event-triggered online algorithms with full-information feedback. Finally, a numerical example is provided to verify the theoretical results. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: 34 pages, 4 figures. arXiv admin note: text overlap with arXiv:2311.01957

arXiv:2404.17875 [pdf, other]

Noisy Node Classification by Bi-level Optimization based Multi-teacher Distillation

Authors: Yujing Liu, Zongqian Wu, Zhengyu Lu, Ci Nie, Guoqiu Wen, Ping Hu, Xiaofeng Zhu

Abstract: Previous graph neural networks (GNNs) usually assume that the graph data is with clean labels for representation learning, but it is not true in real applications. In this paper, we propose a new multi-teacher distillation method based on bi-level optimization (namely BO-NNC), to conduct noisy node classification on the graph data. Specifically, we first employ multiple self-supervised learning me… ▽ More Previous graph neural networks (GNNs) usually assume that the graph data is with clean labels for representation learning, but it is not true in real applications. In this paper, we propose a new multi-teacher distillation method based on bi-level optimization (namely BO-NNC), to conduct noisy node classification on the graph data. Specifically, we first employ multiple self-supervised learning methods to train diverse teacher models, and then aggregate their predictions through a teacher weight matrix. Furthermore, we design a new bi-level optimization strategy to dynamically adjust the teacher weight matrix based on the training progress of the student model. Finally, we design a label improvement module to improve the label quality. Extensive experimental results on real datasets show that our method achieves the best results compared to state-of-the-art methods. △ Less

Submitted 8 May, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

arXiv:2404.11354 [pdf, other]

Distributed Fractional Bayesian Learning for Adaptive Optimization

Authors: Yaqun Yang, Jinlong Lei, Guanghui Wen, Yiguang Hong

Abstract: This paper considers a distributed adaptive optimization problem, where all agents only have access to their local cost functions with a common unknown parameter, whereas they mean to collaboratively estimate the true parameter and find the optimal solution over a connected network. A general mathematical framework for such a problem has not been studied yet. We aim to provide valuable insights fo… ▽ More This paper considers a distributed adaptive optimization problem, where all agents only have access to their local cost functions with a common unknown parameter, whereas they mean to collaboratively estimate the true parameter and find the optimal solution over a connected network. A general mathematical framework for such a problem has not been studied yet. We aim to provide valuable insights for addressing parameter uncertainty in distributed optimization problems and simultaneously find the optimal solution. Thus, we propose a novel Prediction while Optimization scheme, which utilizes distributed fractional Bayesian learning through weighted averaging on the log-beliefs to update the beliefs of unknown parameters, and distributed gradient descent for renewing the estimation of the optimal solution. Then under suitable assumptions, we prove that all agents' beliefs and decision variables converge almost surely to the true parameter and the optimal solution under the true parameter, respectively. We further establish a sublinear convergence rate for the belief sequence. Finally, numerical experiments are implemented to corroborate the theoretical analysis. △ Less

Submitted 17 April, 2024; originally announced April 2024.

Comments: 16 pages, 6 figures

arXiv:2402.15069 [pdf, other]

Investigation of profile shifting and subpulse movement in PSR J0344-0901 with FAST

Authors: H. M. Tedila, R. Yuen, N. Wang, D. Li, Z. G. Wen, W. M. Yan, J. P. Yuan, X. H. Han, P. Wang, W. W. Zhu, S. J. Dang, S. Q. Wang, J. T. Xie, Q. D. Wu, Sh. Khasanov, FAST Collaboration

Abstract: We report two phenomena detected in PSR J0344$-$0901 from two observations conducted at frequency centered at 1.25 GHz using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The first phenomenon manifests as shifting in the pulse emission to later longitudinal phases and then gradually returns to its original location. The event lasts for about 216 pulse periods, with an average s… ▽ More We report two phenomena detected in PSR J0344$-$0901 from two observations conducted at frequency centered at 1.25 GHz using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The first phenomenon manifests as shifting in the pulse emission to later longitudinal phases and then gradually returns to its original location. The event lasts for about 216 pulse periods, with an average shift of about $0.7^\circ$ measured at the peak of the integrated profile. Changes in the polarization position angle (PPA) are detected around the trailing edge of the profile, together with an increase in the profile width. The second phenomenon is characterized by the apparent movement of subpulses, which results in different subpulse track patterns across the profile window. For the first time in this pulsar, we identify four emission modes, each with unique subpulse movement, and determine the pattern periods for three of the emission modes. Pulse nulling was not detected. Modeling of the changes in the PPA using the rotating vector model gives an inclination angle of $75.12^\circ \pm 3.80^\circ$ and an impact parameter of $-3.17^\circ \pm 5.32^\circ$ for this pulsar. We speculate that the subpulse movement may be related to the shifting of the pulse emission. △ Less

Submitted 22 February, 2024; originally announced February 2024.

arXiv:2401.03363 [pdf, other]

Data-driven Dynamic Event-triggered Control

Authors: Tao Xu, Zhiyong Sun, Guanghui Wen, Zhisheng Duan

Abstract: This paper revisits the event-triggered control problem from a data-driven perspective, where unknown continuous-time linear systems subject to disturbances are taken into account. Using data information collected off-line instead of accurate system model information, a data-driven dynamic event-triggered control scheme is developed in this paper. The dynamic property is reflected by that the desi… ▽ More This paper revisits the event-triggered control problem from a data-driven perspective, where unknown continuous-time linear systems subject to disturbances are taken into account. Using data information collected off-line instead of accurate system model information, a data-driven dynamic event-triggered control scheme is developed in this paper. The dynamic property is reflected by that the designed event-triggering function embedded in the event-triggering mechanism (ETM) is dynamically updated as a whole. Thanks to this dynamic design, a strictly positive minimum inter-event time (MIET) is guaranteed without sacrificing control performance. Specifically, exponential input-to-state stability (ISS) of the closed-loop system with respect to disturbances is achieved in this paper, which is superior to some existing results that only guarantee a practical exponential ISS property. The dynamic ETM is easy-to-implement in practical operation since all designed parameters are determined only by a simple data-driven linear matrix inequality (LMI), without additional complicated conditions as required in relevant literature. As quantization is the most common signal constraint in practice, the developed control scheme is further extended to the case where state transmission is affected by a uniform or logarithmic quantization effect. Finally, adequate simulations are performed to show the validity and superiority of the proposed control schemes. △ Less

Submitted 6 January, 2024; originally announced January 2024.

arXiv:2401.00283 [pdf, other]

Near-Space Communications: the Last Piece of 6G Space-Air-Ground-Sea Integrated Network Puzzle

Authors: Hongshan Liu, Tong Qin, Zhen Gao, Tianqi Mao, Keke Ying, Ziwei Wan, Li Qiao, Rui Na, Zhongxiang Li, Chun Hu, Yikun Mei, Tuan Li, Guanghui Wen, Lei Chen, Zhonghuai Wu, Ruiqi Liu, Gaojie Chen, Shuo Wang, Dezhi Zheng

Abstract: This article presents a comprehensive study on the emerging near-space communications (NS-COM) within the context of space-air-ground-sea integrated network (SAGSIN). Specifically, we firstly explore the recent technical developments of NS-COM, followed by the discussions about motivations behind integrating NS-COM into SAGSIN. To further demonstrate the necessity of NS-COM, a comparative analysis… ▽ More This article presents a comprehensive study on the emerging near-space communications (NS-COM) within the context of space-air-ground-sea integrated network (SAGSIN). Specifically, we firstly explore the recent technical developments of NS-COM, followed by the discussions about motivations behind integrating NS-COM into SAGSIN. To further demonstrate the necessity of NS-COM, a comparative analysis between the NS-COM network and other counterparts in SAGSIN is conducted, covering aspects of deployment, coverage, channel characteristics and unique problems of NS-COM network. Afterwards, the technical aspects of NS-COM, including channel modeling, random access, channel estimation, array-based beam management and joint network optimization, are examined in detail. Furthermore, we explore the potential applications of NS-COM, such as structural expansion in SAGSIN communication, civil aviation communication, remote and urgent communication, weather monitoring and carbon neutrality. Finally, some promising research avenues are identified, including stratospheric satellite (StratoSat) -to-ground direct links for mobile terminals, reconfigurable multiple-input multiple-output (MIMO) and holographic MIMO, federated learning in NS-COM networks, maritime communication, electromagnetic spectrum sensing and adversarial game, integrated sensing and communications, StratoSat-based radar detection and imaging, NS-COM assisted enhanced global navigation system, NS-COM assisted intelligent unmanned system and free space optical (FSO) communication. Overall, this paper highlights that the NS-COM plays an indispensable role in the SAGSIN puzzle, providing substantial performance and coverage enhancement to the traditional SAGSIN architecture. △ Less

Submitted 4 March, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

Comments: 28 pages, 8 figures, 2 tables

arXiv:2312.10920 [pdf, other]

Domain adaption and physical constrains transfer learning for shale gas production

Authors: Zhaozhong Yang, Liangjie Gou, Chao Min, Duo Yi, Xiaogang Li, Guoquan Wen

Abstract: Effective prediction of shale gas production is crucial for strategic reservoir development. However, in new shale gas blocks, two main challenges are encountered: (1) the occurrence of negative transfer due to insufficient data, and (2) the limited interpretability of deep learning (DL) models. To tackle these problems, we propose a novel transfer learning methodology that utilizes domain adaptat… ▽ More Effective prediction of shale gas production is crucial for strategic reservoir development. However, in new shale gas blocks, two main challenges are encountered: (1) the occurrence of negative transfer due to insufficient data, and (2) the limited interpretability of deep learning (DL) models. To tackle these problems, we propose a novel transfer learning methodology that utilizes domain adaptation and physical constraints. This methodology effectively employs historical data from the source domain to reduce negative transfer from the data distribution perspective, while also using physical constraints to build a robust and reliable prediction model that integrates various types of data. The methodology starts by dividing the production data from the source domain into multiple subdomains, thereby enhancing data diversity. It then uses Maximum Mean Discrepancy (MMD) and global average distance measures to decide on the feasibility of transfer. Through domain adaptation, we integrate all transferable knowledge, resulting in a more comprehensive target model. Lastly, by incorporating drilling, completion, and geological data as physical constraints, we develop a hybrid model. This model, a combination of a multi-layer perceptron (MLP) and a Transformer (Transformer-MLP), is designed to maximize interpretability. Experimental validation in China's southwestern region confirms the method's effectiveness. △ Less

Submitted 17 December, 2023; originally announced December 2023.

arXiv:2312.06255 [pdf, ps, other]

Ensemble Interpretation: A Unified Method for Interpretable Machine Learning

Authors: Chao Min, Guoyong Liao, Guoquan Wen, Yingjun Li, Xing Guo

Abstract: To address the issues of stability and fidelity in interpretable learning, a novel interpretable methodology, ensemble interpretation, is presented in this paper which integrates multi-perspective explanation of various interpretation methods. On one hand, we define a unified paradigm to describe the common mechanism of different interpretation methods, and then integrate the multiple interpretati… ▽ More To address the issues of stability and fidelity in interpretable learning, a novel interpretable methodology, ensemble interpretation, is presented in this paper which integrates multi-perspective explanation of various interpretation methods. On one hand, we define a unified paradigm to describe the common mechanism of different interpretation methods, and then integrate the multiple interpretation results to achieve more stable explanation. On the other hand, a supervised evaluation method based on prior knowledge is proposed to evaluate the explaining performance of an interpretation method. The experiment results show that the ensemble interpretation is more stable and more consistent with human experience and cognition. As an application, we use the ensemble interpretation for feature selection, and then the generalization performance of the corresponding learning model is significantly improved. △ Less

Submitted 11 December, 2023; originally announced December 2023.

arXiv:2311.13371 [pdf, other]

A Novel Dynamic Event-triggered Mechanism for Dynamic Average Consensus

Authors: Tao Xu, Zhisheng Duan, Guanghui Wen, Zhiyong Sun

Abstract: This paper studies a challenging issue introduced in a recent survey, namely designing a distributed event-based scheme to solve the dynamic average consensus (DAC) problem. First, a robust adaptive distributed event-based DAC algorithm is designed without imposing specific initialization criteria to perform estimation task under intermittent communication. Second, a novel adaptive distributed dyn… ▽ More This paper studies a challenging issue introduced in a recent survey, namely designing a distributed event-based scheme to solve the dynamic average consensus (DAC) problem. First, a robust adaptive distributed event-based DAC algorithm is designed without imposing specific initialization criteria to perform estimation task under intermittent communication. Second, a novel adaptive distributed dynamic event-triggered mechanism is proposed to determine the triggering time when neighboring agents broadcast information to each other. Compared to the existing event-triggered mechanisms, the novelty of the proposed dynamic event-triggered mechanism lies in that it guarantees the existence of a positive and uniform minimum inter-event interval without sacrificing any accuracy of the estimation, which is much more practical than only ensuring the exclusion of the Zeno behavior or the boundedness of the estimation error. Third, a composite adaptive law is developed to update the adaptive gain employed in the distributed event-based DAC algorithm and dynamic event-triggered mechanism. Using the composite adaptive update law, the distributed event-based solution proposed in our work is implemented without requiring any global information. Finally, numerical simulations are provided to illustrate the effectiveness of the theoretical results. △ Less

Submitted 22 November, 2023; originally announced November 2023.

Comments: 9 pages, 8 figures

arXiv:2311.06848 [pdf, other]

doi 10.1109/JAS.2023.124089

Fixed-Time Gradient Flows for Solving Constrained Optimization: A Unified Approach

Authors: Xinli Shi, Xiangping Xu, Guanghui Wen, Jinde Cao

Abstract: The accelerated method in solving optimization problems has always been an absorbing topic. Based on the fixed-time (FxT) stability of nonlinear dynamical systems, we provide a unified approach for designing FxT gradient flows (FxTGFs). First, a general class of nonlinear functions in designing FxTGFs is provided. A unified method for designing first-order FxTGFs is shown under PolyakL jasiewicz i… ▽ More The accelerated method in solving optimization problems has always been an absorbing topic. Based on the fixed-time (FxT) stability of nonlinear dynamical systems, we provide a unified approach for designing FxT gradient flows (FxTGFs). First, a general class of nonlinear functions in designing FxTGFs is provided. A unified method for designing first-order FxTGFs is shown under PolyakL jasiewicz inequality assumption, a weaker condition than strong convexity. When there exist both bounded and vanishing disturbances in the gradient flow, a specific class of nonsmooth robust FxTGFs with disturbance rejection is presented. Under the strict convexity assumption, Newton-based FxTGFs is given and further extended to solve time-varying optimization. Besides, the proposed FxTGFs are further used for solving equation-constrained optimization. Moreover, an FxT proximal gradient flow with a wide range of parameters is provided for solving nonsmooth composite optimization. To show the effectiveness of various FxTGFs, the static regret analysis for several typical FxTGFs are also provided in detail. Finally, the proposed FxTGFs are applied to solve two network problems, i.e., the network consensus problem and solving a system linear equations, respectively, from the respective of optimization. Particularly, by choosing component-wisely sign-preserving functions, these problems can be solved in a distributed way, which extends the existing results. The accelerated convergence and robustness of the proposed FxTGFs are validated in several numerical examples stemming from practical applications. △ Less

Submitted 12 November, 2023; originally announced November 2023.

arXiv:2310.18871 [pdf, ps, other]

Compressed Gradient Tracking Algorithms for Distributed Nonconvex Optimization

Authors: Lei Xu, Xinlei Yi, Guanghui Wen, Yang Shi, Karl H. Johansson, Tao Yang

Abstract: In this paper, we study the distributed nonconvex optimization problem, which aims to minimize the average value of the local nonconvex cost functions using local information exchange. To reduce the communication overhead, we introduce three general classes of compressors, i.e., compressors with bounded relative compression error, compressors with globally bounded absolute compression error, and c… ▽ More In this paper, we study the distributed nonconvex optimization problem, which aims to minimize the average value of the local nonconvex cost functions using local information exchange. To reduce the communication overhead, we introduce three general classes of compressors, i.e., compressors with bounded relative compression error, compressors with globally bounded absolute compression error, and compressors with locally bounded absolute compression error. By integrating them with distributed gradient tracking algorithm, we then propose three compressed distributed nonconvex optimization algorithms. For each algorithm, we design a novel Lyapunov function to demonstrate its sublinear convergence to a stationary point if the local cost functions are smooth. Furthermore, when the global cost function satisfies the Polyak--Łojasiewicz (P--Ł) condition, we show that our proposed algorithms linearly converge to a global optimal point. It is worth noting that, for compressors with bounded relative compression error and globally bounded absolute compression error, our proposed algorithms' parameters do not require prior knowledge of the P--Ł constant. The theoretical results are illustrated by numerical examples, which demonstrate the effectiveness of the proposed algorithms in significantly reducing the communication burden while maintaining the convergence performance. Moreover, simulation results show that the proposed algorithms outperform state-of-the-art compressed distributed nonconvex optimization algorithms. △ Less

Submitted 15 July, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

arXiv:2310.00033 [pdf]

OriWheelBot: An origami-wheeled robot

Authors: Jie Liu, Zufeng Pang, Zhiyong Li, Guilin Wen, Zhoucheng Su, Junfeng He, Kaiyue Liu, Dezheng Jiang, Zenan Li, Shouyan Chen, Yang Tian, Yi Min Xie, Zhenpei Wang, Zhuangjian Liu

Abstract: Origami-inspired robots with multiple advantages, such as being lightweight, requiring less assembly, and exhibiting exceptional deformability, have received substantial and sustained attention. However, the existing origami-inspired robots are usually of limited functionalities and developing feature-rich robots is very challenging. Here, we report an origami-wheeled robot (OriWheelBot) with vari… ▽ More Origami-inspired robots with multiple advantages, such as being lightweight, requiring less assembly, and exhibiting exceptional deformability, have received substantial and sustained attention. However, the existing origami-inspired robots are usually of limited functionalities and developing feature-rich robots is very challenging. Here, we report an origami-wheeled robot (OriWheelBot) with variable width and outstanding sand walking versatility. The OriWheelBot's ability to adjust wheel width over obstacles is achieved by origami wheels made of Miura origami. An improved version, called iOriWheelBot, is also developed to automatically judge the width of the obstacles. Three actions, namely direct pass, variable width pass, and direct return, will be carried out depending on the width of the channel between the obstacles. We have identified two motion mechanisms, i.e., sand-digging and sand-pushing, with the latter being more conducive to walking on the sand. We have systematically examined numerous sand walking characteristics, including carrying loads, climbing a slope, walking on a slope, and navigating sand pits, small rocks, and sand traps. The OriWheelBot can change its width by 40%, has a loading-carrying ratio of 66.7% on flat sand and can climb a 17-degree sand incline. The OriWheelBot can be useful for planetary subsurface exploration and disaster area rescue. △ Less

Submitted 29 September, 2023; originally announced October 2023.

Comments: 23 papes, 7 figures

arXiv:2309.12081 [pdf, other]

A Framework on Fully Distributed State Estimation and Cooperative Stabilization of LTI Plants

Authors: Peihu Duan, Yuezu Lv, Guanghui Wen, Maciej Ogorzałek

Abstract: How to realize high-level autonomy of individuals is one of key technical issues to promote swarm intelligence of multi-agent (node) systems with collective tasks, while the fully distributed design is a potential way to achieve this goal. This paper works on the fully distributed state estimation and cooperative stabilization problem of linear time-invariant (LTI) plants with multiple nodes commu… ▽ More How to realize high-level autonomy of individuals is one of key technical issues to promote swarm intelligence of multi-agent (node) systems with collective tasks, while the fully distributed design is a potential way to achieve this goal. This paper works on the fully distributed state estimation and cooperative stabilization problem of linear time-invariant (LTI) plants with multiple nodes communicating over general directed graphs, and is aimed to provide a fully distributed framework for each node to perform cooperative stabilization tasks. First, by incorporating a novel adaptive law, a consensus-based estimator is designed for each node to obtain the plant state based on its local measurement and local interaction with neighbors, without using any global information of the communication topology. Subsequently, a local controller is developed for each node to stabilize the plant collaboratively with performance guaranteed under mild conditions. Specifically, the proposed method only requires that the communication graph be strongly connected, and the plant be collectively controllable and observable. Further, the proposed method can be applied to pure fully distributed state estimation scenarios and modified for noise-bounded LTI plants. Finally, two numerical examples are provided to show the effectiveness of the theoretical results. △ Less

Submitted 14 January, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

arXiv:2308.13743 [pdf, other]

Extended Zero-Gradient-Sum Approach for Constrained Distributed Optimization with Free Initialization

Authors: Xinli Shi, Xinghuo Yu, Guanghui Wen, Xiangping Xu

Abstract: This paper proposes an extended zero-gradient-sum (EZGS) approach for solving constrained distributed optimization (DO) with free initialization. A Newton-based continuous-time algorithm (CTA) is first designed for general constrained optimization and then extended to solve constrained DO based on the EZGS method. It is shown that for typical consensus protocols, the EZGS CTA can achieve the perfo… ▽ More This paper proposes an extended zero-gradient-sum (EZGS) approach for solving constrained distributed optimization (DO) with free initialization. A Newton-based continuous-time algorithm (CTA) is first designed for general constrained optimization and then extended to solve constrained DO based on the EZGS method. It is shown that for typical consensus protocols, the EZGS CTA can achieve the performance with exponential/finite/fixed/prescribed-time convergence. Particularly, the nonlinear consensus protocols for finite-time EZGS algorithms can have heterogeneous power coefficients. The prescribed-time EZGS dynamics is continuous and uniformly bounded, which can achieve the optimal solution in one stage. Moreover, the barrier method is employed to tackle the inequality constraints. Finally, the performance of the proposed algorithms is verified by numerical examples. △ Less

Submitted 2 June, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2307.02126 [pdf, other]

Robust Graph Structure Learning with the Alignment of Features and Adjacency Matrix

Authors: Shaogao Lv, Gang Wen, Shiyu Liu, Linsen Wei, Ming Li

Abstract: To improve the robustness of graph neural networks (GNN), graph structure learning (GSL) has attracted great interest due to the pervasiveness of noise in graph data. Many approaches have been proposed for GSL to jointly learn a clean graph structure and corresponding representations. To extend the previous work, this paper proposes a novel regularized GSL approach, particularly with an alignment… ▽ More To improve the robustness of graph neural networks (GNN), graph structure learning (GSL) has attracted great interest due to the pervasiveness of noise in graph data. Many approaches have been proposed for GSL to jointly learn a clean graph structure and corresponding representations. To extend the previous work, this paper proposes a novel regularized GSL approach, particularly with an alignment of feature information and graph information, which is motivated mainly by our derived lower bound of node-level Rademacher complexity for GNNs. Additionally, our proposed approach incorporates sparse dimensional reduction to leverage low-dimensional node features that are relevant to the graph structure. To evaluate the effectiveness of our approach, we conduct experiments on real-world graphs. The results demonstrate that our proposed GSL method outperforms several competitive baselines, especially in scenarios where the graph structures are heavily affected by noise. Overall, our research highlights the importance of integrating feature and graph information alignment in GSL, as inspired by our derived theoretical result, and showcases the superiority of our approach in handling noisy graph structures through comprehensive experiments on real-world datasets. △ Less

Submitted 5 July, 2023; originally announced July 2023.

arXiv:2306.09648 [pdf, other]

Learning CO$_2$ plume migration in faulted reservoirs with Graph Neural Networks

Authors: Xin Ju, François P. Hamon, Gege Wen, Rayan Kanfar, Mauricio Araya-Polo, Hamdi A. Tchelepi

Abstract: Deep-learning-based surrogate models provide an efficient complement to numerical simulations for subsurface flow problems such as CO$_2$ geological storage. Accurately capturing the impact of faults on CO$_2$ plume migration remains a challenge for many existing deep learning surrogate models based on Convolutional Neural Networks (CNNs) or Neural Operators. We address this challenge with a graph… ▽ More Deep-learning-based surrogate models provide an efficient complement to numerical simulations for subsurface flow problems such as CO$_2$ geological storage. Accurately capturing the impact of faults on CO$_2$ plume migration remains a challenge for many existing deep learning surrogate models based on Convolutional Neural Networks (CNNs) or Neural Operators. We address this challenge with a graph-based neural model leveraging recent developments in the field of Graph Neural Networks (GNNs). Our model combines graph-based convolution Long-Short-Term-Memory (GConvLSTM) with a one-step GNN model, MeshGraphNet (MGN), to operate on complex unstructured meshes and limit temporal error accumulation. We demonstrate that our approach can accurately predict the temporal evolution of gas saturation and pore pressure in a synthetic reservoir with impermeable faults. Our results exhibit a better accuracy and a reduced temporal error accumulation compared to the standard MGN model. We also show the excellent generalizability of our algorithm to mesh configurations, boundary conditions, and heterogeneous permeability fields not included in the training set. This work highlights the potential of GNN-based methods to accurately and rapidly model subsurface flow with complex faults and fractures. △ Less

Submitted 16 June, 2023; originally announced June 2023.

arXiv:2305.15195 [pdf, other]

Cooperative Control of Multi-Channel Linear Systems with Self-Organizing Private Agents

Authors: Peihu Duan, Tao Liu, Yuezu Lv, Guanghui Wen

Abstract: Cooperative behavior design for multi-agent systems with collective tasks is a critical issue in promoting swarm intelligence. This paper investigates cooperative control for a multi-channel system, where each channel is managed by an agent expected to self-organize a controller to stabilize the system collaboratively by communicating with neighbors in a network. Integrating a state decomposition… ▽ More Cooperative behavior design for multi-agent systems with collective tasks is a critical issue in promoting swarm intelligence. This paper investigates cooperative control for a multi-channel system, where each channel is managed by an agent expected to self-organize a controller to stabilize the system collaboratively by communicating with neighbors in a network. Integrating a state decomposition technique and a fusion approach, a fully distributed privacy-preserving mechanism is proposed to shield agents' private information from neighbors' eavesdropping. Moreover, the cost of introducing the privacy-preserving mechanism and the benefit of adding more channels to the system are quantitatively analyzed. Finally, comparative simulation examples are provided to demonstrate the effectiveness of the theoretical results. △ Less

Submitted 10 August, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

arXiv:2304.10643 [pdf, other]

Activity Classification Using Unsupervised Domain Transfer from Body Worn Sensors

Authors: Chaitra Hedge, Gezheng Wen, Layne C. Price

Abstract: Activity classification has become a vital feature of wearable health tracking devices. As innovation in this field grows, wearable devices worn on different parts of the body are emerging. To perform activity classification on a new body location, labeled data corresponding to the new locations are generally required, but this is expensive to acquire. In this work, we present an innovative method… ▽ More Activity classification has become a vital feature of wearable health tracking devices. As innovation in this field grows, wearable devices worn on different parts of the body are emerging. To perform activity classification on a new body location, labeled data corresponding to the new locations are generally required, but this is expensive to acquire. In this work, we present an innovative method to leverage an existing activity classifier, trained on Inertial Measurement Unit (IMU) data from a reference body location (the source domain), in order to perform activity classification on a new body location (the target domain) in an unsupervised way, i.e. without the need for classification labels at the new location. Specifically, given an IMU embedding model trained to perform activity classification at the source domain, we train an embedding model to perform activity classification at the target domain by replicating the embeddings at the source domain. This is achieved using simultaneous IMU measurements at the source and target domains. The replicated embeddings at the target domain are used by a classification model that has previously been trained on the source domain to perform activity classification at the target domain. We have evaluated the proposed methods on three activity classification datasets PAMAP2, MHealth, and Opportunity, yielding high F1 scores of 67.19%, 70.40% and 68.34%, respectively when the source domain is the wrist and the target domain is the torso. △ Less

Submitted 20 April, 2023; originally announced April 2023.

arXiv:2304.09352 [pdf, other]

Optimizing Carbon Storage Operations for Long-Term Safety

Authors: Yizheng Wang, Markus Zechner, Gege Wen, Anthony Louis Corso, John Michael Mern, Mykel J. Kochenderfer, Jef Karel Caers

Abstract: To combat global warming and mitigate the risks associated with climate change, carbon capture and storage (CCS) has emerged as a crucial technology. However, safely sequestering CO2 in geological formations for long-term storage presents several challenges. In this study, we address these issues by modeling the decision-making process for carbon storage operations as a partially observable Markov… ▽ More To combat global warming and mitigate the risks associated with climate change, carbon capture and storage (CCS) has emerged as a crucial technology. However, safely sequestering CO2 in geological formations for long-term storage presents several challenges. In this study, we address these issues by modeling the decision-making process for carbon storage operations as a partially observable Markov decision process (POMDP). We solve the POMDP using belief state planning to optimize injector and monitoring well locations, with the goal of maximizing stored CO2 while maintaining safety. Empirical results in simulation demonstrate that our approach is effective in ensuring safe long-term carbon storage operations. We showcase the flexibility of our approach by introducing three different monitoring strategies and examining their impact on decision quality. Additionally, we introduce a neural network surrogate model for the POMDP decision-making process to handle the complex dynamics of the multi-phase flow. We also investigate the effects of different fidelity levels of the surrogate model on decision qualities. △ Less

Submitted 18 April, 2023; originally announced April 2023.

arXiv:2303.15790 [pdf, other]

doi 10.1007/s11467-023-1333-z

STCF Conceptual Design Report: Volume 1 -- Physics & Detector

Authors: M. Achasov, X. C. Ai, R. Aliberti, L. P. An, Q. An, X. Z. Bai, Y. Bai, O. Bakina, A. Barnyakov, V. Blinov, V. Bobrovnikov, D. Bodrov, A. Bogomyagkov, A. Bondar, I. Boyko, Z. H. Bu, F. M. Cai, H. Cai, J. J. Cao, Q. H. Cao, Z. Cao, Q. Chang, K. T. Chao, D. Y. Chen, H. Chen , et al. (413 additional authors not shown)

Abstract: The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII,… ▽ More The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII, providing a unique platform for exploring the asymmetry of matter-antimatter (charge-parity violation), in-depth studies of the internal structure of hadrons and the nature of non-perturbative strong interactions, as well as searching for exotic hadrons and physics beyond the Standard Model. The STCF project in China is under development with an extensive R\&D program. This document presents the physics opportunities at the STCF, describes conceptual designs of the STCF detector system, and discusses future plans for detector R\&D and physics case studies. △ Less

Submitted 5 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

Journal ref: Front. Phys. 19(1), 14701 (2024)

arXiv:2212.13631

Proceedings of AAAI 2022 Fall Symposium: The Role of AI in Responding to Climate Challenges

Authors: Feras A. Batarseh, Priya L. Donti, Ján Drgoňa, Kristen Fletcher, Pierre-Adrien Hanania, Melissa Hatton, Srinivasan Keshav, Bran Knowles, Raphaela Kotsch, Sean McGinnis, Peetak Mitra, Alex Philp, Jim Spohrer, Frank Stein, Meghna Tare, Svitlana Volkov, Gege Wen

Abstract: Climate change is one of the most pressing challenges of our time, requiring rapid action across society. As artificial intelligence tools (AI) are rapidly deployed, it is therefore crucial to understand how they will impact climate action. On the one hand, AI can support applications in climate change mitigation (reducing or preventing greenhouse gas emissions), adaptation (preparing for the effe… ▽ More Climate change is one of the most pressing challenges of our time, requiring rapid action across society. As artificial intelligence tools (AI) are rapidly deployed, it is therefore crucial to understand how they will impact climate action. On the one hand, AI can support applications in climate change mitigation (reducing or preventing greenhouse gas emissions), adaptation (preparing for the effects of a changing climate), and climate science. These applications have implications in areas ranging as widely as energy, agriculture, and finance. At the same time, AI is used in many ways that hinder climate action (e.g., by accelerating the use of greenhouse gas-emitting fossil fuels). In addition, AI technologies have a carbon and energy footprint themselves. This symposium brought together participants from across academia, industry, government, and civil society to explore these intersections of AI with climate change, as well as how each of these sectors can contribute to solutions. △ Less

Submitted 29 January, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

arXiv:2212.10718 [pdf]

Interpretability and causal discovery of the machine learning models to predict the production of CBM wells after hydraulic fracturing

Authors: Chao Min, Guoquan Wen, Liangjie Gou, Xiaogang Li, Zhaozhong Yang

Abstract: Machine learning approaches are widely studied in the production prediction of CBM wells after hydraulic fracturing, but merely used in practice due to the low generalization ability and the lack of interpretability. A novel methodology is proposed in this article to discover the latent causality from observed data, which is aimed at finding an indirect way to interpret the machine learning result… ▽ More Machine learning approaches are widely studied in the production prediction of CBM wells after hydraulic fracturing, but merely used in practice due to the low generalization ability and the lack of interpretability. A novel methodology is proposed in this article to discover the latent causality from observed data, which is aimed at finding an indirect way to interpret the machine learning results. Based on the theory of causal discovery, a causal graph is derived with explicit input, output, treatment and confounding variables. Then, SHAP is employed to analyze the influence of the factors on the production capability, which indirectly interprets the machine learning models. The proposed method can capture the underlying nonlinear relationship between the factors and the output, which remedies the limitation of the traditional machine learning routines based on the correlation analysis of factors. The experiment on the data of CBM shows that the detected relationship between the production and the geological/engineering factors by the presented method, is coincident with the actual physical mechanism. Meanwhile, compared with traditional methods, the interpretable machine learning models have better performance in forecasting production capability, averaging 20% improvement in accuracy. △ Less

Submitted 20 December, 2022; originally announced December 2022.

arXiv:2212.05193 [pdf, ps, other]

doi 10.1093/mnras/stac3654

Individual pulse emission from the diffuse drifter PSR J1401$-$6357 using the ultrawideband receiver on the Parkes radio telescope

Authors: J. L. Chen, Z. G. Wen, X. F. Duan, D. L. He, N. Wang, H. G. Wang, R. Yuen, J. P. Yuan, W. M. Yan, Z. Wang, C. B. Lv, H. Wang, S. R. Cui

Abstract: In this study, we report on a detailed single pulse analysis of the radio emission from the pulsar J1401$-$6357 (B1358$-$63) based on data observed with the ultrawideband low-frequency receiver on the Parkes radio telescope. In addition to a weak leading component, the integrated pulse profile features a single-humped structure with a slight asymmetry. The frequency evolution of the pulse profile… ▽ More In this study, we report on a detailed single pulse analysis of the radio emission from the pulsar J1401$-$6357 (B1358$-$63) based on data observed with the ultrawideband low-frequency receiver on the Parkes radio telescope. In addition to a weak leading component, the integrated pulse profile features a single-humped structure with a slight asymmetry. The frequency evolution of the pulse profile is studied. Well-defined nulls, with an estimated nulling fraction greater than 2\%, are present across the whole frequency band. No emission is detected with significance above 3$σ$ in the average pulse profile integrated over all null pulses. Using fluctuation spectral analysis, we reveal the existence of temporal-dependent subpulse drifting in this pulsar for the first time. A clear double-peaked feature is present at exactly the alias border across the whole frequency band, which suggests that the apparent drift sense changes during the observation. Our observations provide further confirmation that the phenomena of pulse nulling and subpulse drifting are independent of observing frequency, which suggest that they invoke changes on the global magnetospheric scale. △ Less

Submitted 9 December, 2022; originally announced December 2022.

Comments: 10 pages, 13 figures

arXiv:2211.11424 [pdf, other]

Modeling Hierarchical Structural Distance for Unsupervised Domain Adaptation

Authors: Yingxue Xu, Guihua Wen, Yang Hu, Pei Yang

Abstract: Unsupervised domain adaptation (UDA) aims to estimate a transferable model for unlabeled target domains by exploiting labeled source data. Optimal Transport (OT) based methods have recently been proven to be a promising solution for UDA with a solid theoretical foundation and competitive performance. However, most of these methods solely focus on domain-level OT alignment by leveraging the geometr… ▽ More Unsupervised domain adaptation (UDA) aims to estimate a transferable model for unlabeled target domains by exploiting labeled source data. Optimal Transport (OT) based methods have recently been proven to be a promising solution for UDA with a solid theoretical foundation and competitive performance. However, most of these methods solely focus on domain-level OT alignment by leveraging the geometry of domains for domain-invariant features based on the global embeddings of images. However, global representations of images may destroy image structure, leading to the loss of local details that offer category-discriminative information. This study proposes an end-to-end Deep Hierarchical Optimal Transport method (DeepHOT), which aims to learn both domain-invariant and category-discriminative representations by mining hierarchical structural relations among domains. The main idea is to incorporate a domain-level OT and image-level OT into a unified OT framework, hierarchical optimal transport, to model the underlying geometry in both domain space and image space. In DeepHOT framework, an image-level OT serves as the ground distance metric for the domain-level OT, leading to the hierarchical structural distance. Compared with the ground distance of the conventional domain-level OT, the image-level OT captures structural associations among local regions of images that are beneficial to classification. In this way, DeepHOT, a unified OT framework, not only aligns domains by domain-level OT, but also enhances the discriminative power through image-level OT. Moreover, to overcome the limitation of high computational complexity, we propose a robust and efficient implementation of DeepHOT by approximating origin OT with sliced Wasserstein distance in image-level OT and accomplishing the mini-batch unbalanced domain-level OT. △ Less

Submitted 19 April, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

Comments: accepted by TCVST, code: https://github.com/Innse/DeepHOT

arXiv:2210.17051 [pdf, other]

doi 10.1039/D2EE04204E

Real-time high-resolution CO$_2$ geological storage prediction using nested Fourier neural operators

Authors: Gege Wen, Zongyi Li, Qirui Long, Kamyar Azizzadenesheli, Anima Anandkumar, Sally M. Benson

Abstract: Carbon capture and storage (CCS) plays an essential role in global decarbonization. Scaling up CCS deployment requires accurate and high-resolution modeling of the storage reservoir pressure buildup and the gaseous plume migration. However, such modeling is very challenging at scale due to the high computational costs of existing numerical methods. This challenge leads to significant uncertainties… ▽ More Carbon capture and storage (CCS) plays an essential role in global decarbonization. Scaling up CCS deployment requires accurate and high-resolution modeling of the storage reservoir pressure buildup and the gaseous plume migration. However, such modeling is very challenging at scale due to the high computational costs of existing numerical methods. This challenge leads to significant uncertainties in evaluating storage opportunities, which can delay the pace of large-scale CCS deployment. We introduce Nested Fourier Neural Operator (FNO), a machine-learning framework for high-resolution dynamic 3D CO2 storage modeling at a basin scale. Nested FNO produces forecasts at different refinement levels using a hierarchy of FNOs and speeds up flow prediction nearly 700,000 times compared to existing methods. By learning the solution operator for the family of governing partial differential equations, Nested FNO creates a general-purpose numerical simulator alternative for CO2 storage with diverse reservoir conditions, geological heterogeneity, and injection schemes. Our framework enables unprecedented real-time modeling and probabilistic simulations that can support the scale-up of global CCS deployment. △ Less

Submitted 1 June, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

Journal ref: Energy & Environmental Science, 16(4), 1732-1741 (2023)

arXiv:2210.03947 [pdf, other]

Finite-Time Convergent Algorithms for Time-Varying Distributed Optimization

Authors: Xinli Shi, Guanghui Wen, Xinghuo Yu

Abstract: This paper focuses on finite-time (FT) convergent distributed algorithms for solving time-varying (TV) distributed optimization (TVDO). The objective is to minimize the sum of local TV cost functions subject to the possible TV constraints by the coordination of multiple agents in finite time. Specifically, two classes of TVDO are investigated included unconstrained distributed consensus optimizati… ▽ More This paper focuses on finite-time (FT) convergent distributed algorithms for solving time-varying (TV) distributed optimization (TVDO). The objective is to minimize the sum of local TV cost functions subject to the possible TV constraints by the coordination of multiple agents in finite time. Specifically, two classes of TVDO are investigated included unconstrained distributed consensus optimization and distributed optimal resource allocation problems (DORAP) with both TV cost functions and coupled equation constraints. For the previous one, based on non-smooth analysis, a continuous-time distributed discontinuous dynamics with FT convergence is proposed based on an extended zero-gradient-sum method with a local auxiliary subsystem. Then, an FT convergent distributed dynamics is further obtained for TV-DORAP by dual transformation. Particularly, the inversion of the cost functions' Hessians is not required in the dual variables' dynamics, while another local optimization needs to be solved to obtain the primal variable at each time instant. Finally, two numerical examples are conducted to verify the proposed algorithms. △ Less

Submitted 1 September, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

arXiv:2210.02919 [pdf, ps, other]

Distributed Resource Allocation over Multiple Interacting Coalitions: A Game-Theoretic Approach

Authors: Jialing Zhou, Guanghui Wen, Yuezu Lv, Tao Yang, Guanrong Chen

Abstract: Despite many distributed resource allocation (DRA) algorithms have been reported in literature, it is still unknown how to allocate the resource optimally over multiple interacting coalitions. One major challenge in solving such a problem is that, the relevance of the decision on resource allocation in a coalition to the benefit of others may lead to conflicts of interest among these coalitions. U… ▽ More Despite many distributed resource allocation (DRA) algorithms have been reported in literature, it is still unknown how to allocate the resource optimally over multiple interacting coalitions. One major challenge in solving such a problem is that, the relevance of the decision on resource allocation in a coalition to the benefit of others may lead to conflicts of interest among these coalitions. Under this context, a new type of multi-coalition game is formulated in this paper, termed as resource allocation game, where each coalition contains multiple agents that cooperate to maximize the coalition-level benefit while subject to the resource constraint described by a coupled equality. Inspired by techniques such as variable replacement, gradient tracking and leader-following consensus, two new kinds of DRA algorithms are developed respectively for the scenarios where the individual benefit of each agent explicitly depends on the states of itself and some agents in other coalitions, and on the states of all the game participants. It is shown that the proposed algorithms can converge linearly to the Nash equilibrium (NE) of the multi-coalition game while satisfying the resource constraint during the whole NE-seeking process. Finally, the validity of the present allocation algorithms is verified by numerical simulations. △ Less

Submitted 6 October, 2022; originally announced October 2022.

arXiv:2208.14447 [pdf, ps, other]

A further exploration of deep Multi-Agent Reinforcement Learning with Hybrid Action Space

Authors: Hongzhi Hua, Guixuan Wen, Kaigui Wu

Abstract: The research of extending deep reinforcement learning (drl) to multi-agent field has solved many complicated problems and made great achievements. However, almost all these studies only focus on discrete or continuous action space and there are few works having ever used multi-agent deep reinforcement learning to real-world environment problems which mostly have a hybrid action space. Therefore, i… ▽ More The research of extending deep reinforcement learning (drl) to multi-agent field has solved many complicated problems and made great achievements. However, almost all these studies only focus on discrete or continuous action space and there are few works having ever used multi-agent deep reinforcement learning to real-world environment problems which mostly have a hybrid action space. Therefore, in this paper, we propose two algorithms: deep multi-agent hybrid soft actor-critic (MAHSAC) and multi-agent hybrid deep deterministic policy gradients (MAHDDPG) to fill this gap. This two algorithms follow the centralized training and decentralized execution (CTDE) paradigm and could handle hybrid action space problems. Our experiences are running on multi-agent particle environment which is an easy multi-agent particle world, along with some basic simulated physics. The experimental results show that these algorithms have good performances. △ Less

Submitted 30 August, 2022; originally announced August 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2206.05108

arXiv:2208.12123 [pdf, ps, other]

Distributed Algorithm Over Time-Varying Unbalanced Topologies for Optimization Problem Subject to Multiple Local Constraints

Authors: Hongzhe Liu, Wenwu Yu, Guanghui Wen, Wei Xing Zheng

Abstract: This paper studies the distributed optimization problem with possibly nonidentical local constraints, where its global objective function is composed of $N$ convex functions. The aim is to solve the considered optimization problem in a distributed manner over time-varying unbalanced directed topologies by using only local information and performing only local computations. Towards this end, a new… ▽ More This paper studies the distributed optimization problem with possibly nonidentical local constraints, where its global objective function is composed of $N$ convex functions. The aim is to solve the considered optimization problem in a distributed manner over time-varying unbalanced directed topologies by using only local information and performing only local computations. Towards this end, a new distributed discrete-time algorithm is developed by synthesizing the row stochastic matrices sequence and column stochastic matrices sequence analysis technique. Furthermore, for the developed distributed discrete-time algorithm, its convergence property to the optimal solution as well as its convergence rate are established under some mild assumptions. Numerical simulations are finally presented to verify the theoretical results. △ Less

Submitted 25 August, 2022; originally announced August 2022.

arXiv:2206.05108 [pdf, ps, other]

Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based on Maximum Entropy

Authors: Hongzhi Hua, Kaigui Wu, Guixuan Wen

Abstract: Multi-agent deep reinforcement learning has been applied to address a variety of complex problems with either discrete or continuous action spaces and achieved great success. However, most real-world environments cannot be described by only discrete action spaces or only continuous action spaces. And there are few works having ever utilized deep reinforcement learning (drl) to multi-agent problems… ▽ More Multi-agent deep reinforcement learning has been applied to address a variety of complex problems with either discrete or continuous action spaces and achieved great success. However, most real-world environments cannot be described by only discrete action spaces or only continuous action spaces. And there are few works having ever utilized deep reinforcement learning (drl) to multi-agent problems with hybrid action spaces. Therefore, we propose a novel algorithm: Deep Multi-Agent Hybrid Soft Actor-Critic (MAHSAC) to fill this gap. This algorithm follows the centralized training but decentralized execution (CTDE) paradigm, and extend the Soft Actor-Critic algorithm (SAC) to handle hybrid action space problems in Multi-Agent environments based on maximum entropy. Our experiences are running on an easy multi-agent particle world with a continuous observation and discrete action space, along with some basic simulated physics. The experimental results show that MAHSAC has good performance in training speed, stability, and anti-interference ability. At the same time, it outperforms existing independent deep hybrid learning method in cooperative scenarios and competitive scenarios. △ Less

Submitted 10 June, 2022; originally announced June 2022.

arXiv:2206.03091 [pdf, ps, other]

doi 10.3847/1538-4357/ac75d1

The discovery of a rotating radio transient J1918$-$0449 with intriguing emission properties with the five hundred meter aperture spherical radio telescope

Authors: J. L. Chen, Z. G. Wen, J. P. Yuan, N. Wang, D. Li, H. G. Wang, W. M. Yan, R. Yuen, P. Wang, Z. Wang, W. W. Zhu, J. R. Niu, C. C. Miao, M. Y. Xue, B. P. Gong

Abstract: In this study, we report on a detailed single pulse analysis of the radio emission from a rotating radio transient (RRAT) J1918$-$0449 which is the first RRAT discovered with the five hundred meter aperture spherical radio telescope (FAST). The sensitive observations were carried out on 30 April 2021 using the FAST with a central frequency of 1250 MHz and a short time resolution of 49.152 $μ$s, wh… ▽ More In this study, we report on a detailed single pulse analysis of the radio emission from a rotating radio transient (RRAT) J1918$-$0449 which is the first RRAT discovered with the five hundred meter aperture spherical radio telescope (FAST). The sensitive observations were carried out on 30 April 2021 using the FAST with a central frequency of 1250 MHz and a short time resolution of 49.152 $μ$s, which forms a reliable basis to probe single pulse emission properties in detail. The source was successively observed for around 2 hours. A total of 83 dispersed bursts with significance above 6$σ$ are detected over 1.8 hours. The source's DM and rotational period are determined to be 116.1$\pm$0.4 \pcm \ and 2479.21$\pm$0.03 ms, respectively. The share of registered pulses from the total number of observed period is 3.12\%. No underlying emission is detected in the averaged off pulse profile. For bursts with fluence larger than 10 Jy ms, the pulse energy follows a power-law distribution with an index of $-3.1\pm0.4$, suggesting the existence of bright pulse emission. We find that the distribution of time between subsequent pulses is consistent with a stationary Poisson process and find no evidence of clustering over the 1.8 h observations, giving a mean burst rate of one burst every 66 s. Close inspection of the detected bright pulses reveals that 21 pulses exhibit well-defined quasi-periodicities. The subpulse drifting is present in non-successive rotations with periodicity of $2.51\pm0.06$ periods. Finally, possible physical mechanisms are discussed. △ Less

Submitted 7 June, 2022; originally announced June 2022.

Comments: 11 pages, 11 figures

arXiv:2205.10235 [pdf, other]

An Efficient Methodology to Identify Missing Tags in Large-Scale RFID Systems

Authors: Chu Chu, Rui Xu, Gang Li, Zhenbing Li, Guangjun Wen

Abstract: Radio frequency identification (RFID) has been widely has broad applications. One such application is to use RFID to track inventory in warehouses and retail stores. In this application, timely identifying the missing items is an ongoing engineering problem. A feasible solution to this problem is to map each tag to a time slot and verify the presence of a tag by comparing the status of the predict… ▽ More Radio frequency identification (RFID) has been widely has broad applications. One such application is to use RFID to track inventory in warehouses and retail stores. In this application, timely identifying the missing items is an ongoing engineering problem. A feasible solution to this problem is to map each tag to a time slot and verify the presence of a tag by comparing the status of the predicted time slot and the actual time slot. However, existing works are time inefficient because they only verify tags one by one in singleton slots but ignore the collision slots mapped by multiple tags. To accelerate the identification process, we use bit tracking to verify tags in collision slots and design two protocols accordingly. We first propose the Sequential String based Missing Tag Identification (SSMTI) protocol, which converts all time slots to collision slots and enables tags in each slot to reply to a designed string simultaneously. By using bit tracking to decode the combined string, the reader can verify multiple tags together. To improve the performance of SSMTI when most tags are missing, we further propose the Interactive String based Missing Tag Identification (ISMTI) protocol. ISMTI improves the strategies of designing strings for each collided tag so that the reader can verify more tags using shorter strings than SSMTI.Besides, ISMTI can dynamically adjust the verification mechanism according to the proportion of missing tags to maintain time efficiency. We also provide theoretical analysis for proposed protocols to minimize execution time and evaluate their performance through extensive simulations. Compared with state-of-the-art solutions, the proposed SSMTI and ISMTI can reduce the time cost by as much as 39.74% and 68.87%. △ Less

Submitted 28 April, 2022; originally announced May 2022.

arXiv:2205.01407 [pdf]

doi 10.3847/1538-4357/ac5f42

Emission Variation of a Long-period Pulsar Discovered by the Five-hundred-meter Aperture Spherical Radio Telescope (FAST)

Authors: H. M. Tedila, R. Yuen, N. Wang, J. P. Yuan, Z. G. Wen, W. M. Yan, S. Q. Wang, S. J. Dang, D. Li, P. Wang, W. W. Zhu, J. R. Niu, C. C. Miao, M. Y. Xue, L. Zhang, Z. Y. Tu, R. Rejep, J. T. Xie, FAST Collaboration

Abstract: We report on the variation in the single-pulse emission from PSR J1900+4221 (CRAFTS 19C10) observed at frequency centered at 1.25 GHz using the Five-hundred-meter Aperture Spherical radio Telescope. The integrated pulse profile shows two distinct components, referred to here as the leading and trailing components, with the latter component also containing a third weak component. The single-pulse s… ▽ More We report on the variation in the single-pulse emission from PSR J1900+4221 (CRAFTS 19C10) observed at frequency centered at 1.25 GHz using the Five-hundred-meter Aperture Spherical radio Telescope. The integrated pulse profile shows two distinct components, referred to here as the leading and trailing components, with the latter component also containing a third weak component. The single-pulse sequence reveals different emissions demonstrating as nulling, regular, and bright pulses, each with a particular abundance and duration distribution. There also exists pulses that follow a log-normal distribution suggesting the possibility of another emission, in which the pulsar is radiating weakly. Changes in the profile shape are seen across different emissions. We examine the emission variations in the leading and trailing components collectively and separately, and find moderate correlation between the two components. The inclination angle is estimated to be about 7° based on pulse-width, and we discuss that nulling in this pulsar does not seem to show correlation with age and rotation period. △ Less

Submitted 3 May, 2022; originally announced May 2022.

Journal ref: The Astrophysical Journal, 929:171T (10pp), 2022

arXiv:2204.04456 [pdf, other]

Approximation-free control based on the bioinspired reference model for suspension systems with uncertainty and unknown nonlinearity

Authors: Xiaoyan Hu, Guilin Wen, Shan Yin, Zhao Tan, Zebang Pan

Abstract: Uncertainty and unknown nonlinearity are often inevitable in the suspension systems, which were often solved using fuzzy logic system (FLS) or neural networks (NNs). However, these methods are restricted by the structural complexity of the controller and the huge computing cost. Meanwhile, the estimation error of such approximators is affected by adopted adaptive laws and learning gains. Thus, in… ▽ More Uncertainty and unknown nonlinearity are often inevitable in the suspension systems, which were often solved using fuzzy logic system (FLS) or neural networks (NNs). However, these methods are restricted by the structural complexity of the controller and the huge computing cost. Meanwhile, the estimation error of such approximators is affected by adopted adaptive laws and learning gains. Thus, in view of the above problem, this paper proposes the approximation-free control based on the bioinspired reference model for a class of uncertain suspension systems with unknown nonlinearity. The proposed method integrates the superior vibration suppression of the bioinspired reference model and the structural advantage of the prescribed performance function (PPF) in approximation-free control. Then, the vibration suppression performance is improved, the calculation burden is relieved, and the transient performance is improved, which is analyzed theoretically in this paper. Finally, the simulation results validate the approach, and the comparisons show the advantages of the proposed control method in terms of good vibration suppression, fast convergence, and less calculation burden. △ Less

Submitted 9 April, 2022; originally announced April 2022.

arXiv:2204.00306 [pdf, other]

Building Decision Forest via Deep Reinforcement Learning

Authors: Guixuan Wen, Kaigui Wu

Abstract: Ensemble learning methods whose base classifier is a decision tree usually belong to the bagging or boosting. However, no previous work has ever built the ensemble classifier by maximizing long-term returns to the best of our knowledge. This paper proposes a decision forest building method called MA-H-SAC-DF for binary classification via deep reinforcement learning. First, the building process is… ▽ More Ensemble learning methods whose base classifier is a decision tree usually belong to the bagging or boosting. However, no previous work has ever built the ensemble classifier by maximizing long-term returns to the best of our knowledge. This paper proposes a decision forest building method called MA-H-SAC-DF for binary classification via deep reinforcement learning. First, the building process is modeled as a decentralized partial observable Markov decision process, and a set of cooperative agents jointly constructs all base classifiers. Second, the global state and local observations are defined based on informations of the parent node and the current location. Last, the state-of-the-art deep reinforcement method Hybrid SAC is extended to a multi-agent system under the CTDE architecture to find an optimal decision forest building policy. The experiments indicate that MA-H-SAC-DF has the same performance as random forest, Adaboost, and GBDT on balanced datasets and outperforms them on imbalanced datasets. △ Less

Submitted 1 April, 2022; originally announced April 2022.

arXiv:2203.06882 [pdf, other]

Robust Event Triggering Control for Lateral Dynamics of Intelligent Vehicles with Designable Inter-event Times

Authors: Xing Chu, Zhi Liu, Lei Mao, Xin Jin, Zhaoxia Peng, Guoguang Wen

Abstract: In this brief, an improved event-triggered update mechanism (ETM) for the linear quadratic regulator is proposed to solve the lateral motion control problem of intelligent vehicle under bounded disturbances. Based on a novel event function using a clock-like variable to determine the triggering time, we further introduce two new design parameters to improve control performance. Distinct from exist… ▽ More In this brief, an improved event-triggered update mechanism (ETM) for the linear quadratic regulator is proposed to solve the lateral motion control problem of intelligent vehicle under bounded disturbances. Based on a novel event function using a clock-like variable to determine the triggering time, we further introduce two new design parameters to improve control performance. Distinct from existing event-based control mechanisms, the inter-event times (IETs) derived from the above control framework are designable, meaning that the proposed ETM can be deployed on practical vehicle more easily and effectively. In addition, the improved IETs-designable ETM features a global robust event-separation property that is extremely required for practical lateral motion control of vehicle subject to diverse disturbances. Theoretical analysis proves the feasibility and stability of the proposed control strategy for trajectory tracking under bounded disturbances. Finally, simulation results verify the theoretical results and show the advantages of the proposed control strategy. △ Less

Submitted 14 March, 2022; originally announced March 2022.

Comments: 5pages, 4 figures

arXiv:2201.09453 [pdf, other]

Novel Nussbaum-Type Function based Safe Adaptive Distributed Consensus Control with Arbitrary Unknown Control Direction

Authors: Dan Qiao, Zhaoxia Peng, Guoguang Wen, Tingwen Huang

Abstract: Existing Nussbaum function based methods on the consensus of multi-agent systems require (partial) identical unknown control directions of all agents and cause dangerous dramatic control shocks. This paper develops a novel saturated Nussbaum function to relax such limitations and proposes a Nussbaum function based control scheme for the consensus problem of multi-agent systems with arbitrary non-i… ▽ More Existing Nussbaum function based methods on the consensus of multi-agent systems require (partial) identical unknown control directions of all agents and cause dangerous dramatic control shocks. This paper develops a novel saturated Nussbaum function to relax such limitations and proposes a Nussbaum function based control scheme for the consensus problem of multi-agent systems with arbitrary non-identical unknown control directions and safe control progress. First, a novel type of the Nussbaum function with different frequencies is proposed in the form of saturated time-elongation functions, which provides a more smooth and safer transient performance of the control progress. Furthermore, the novel Nussbaum function is employed to design distributed adaptive control algorithms for linearly parameterized multi-agent systems to achieve average consensus cooperatively without dramatic control shocks. Then, under the undirected connected communication topology, all the signals of the closed-loop systems are proved to be bounded and asymptotically convergent. Finally, two comparative numerical simulation examples are carried out to verify the effectiveness and the superiority of the proposed approach with smaller control shock amplitudes than traditional Nussbaum methods. △ Less

Submitted 23 January, 2022; originally announced January 2022.

arXiv:2201.06778 [pdf, other]

Data-Driven Deep Learning Based Hybrid Beamforming for Aerial Massive MIMO-OFDM Systems with Implicit CSI

Authors: Zhen Gao, Minghui Wu, Chun Hu, Feifei Gao, Guanghui Wen, Dezhi Zheng, Jun Zhang

Abstract: In an aerial hybrid massive multiple-input multiple-output (MIMO) and orthogonal frequency division multiplexing (OFDM) system, how to design a spectral-efficient broadband multi-user hybrid beamforming with a limited pilot and feedback overhead is challenging. To this end, by modeling the key transmission modules as an end-to-end (E2E) neural network, this paper proposes a data-driven deep learni… ▽ More In an aerial hybrid massive multiple-input multiple-output (MIMO) and orthogonal frequency division multiplexing (OFDM) system, how to design a spectral-efficient broadband multi-user hybrid beamforming with a limited pilot and feedback overhead is challenging. To this end, by modeling the key transmission modules as an end-to-end (E2E) neural network, this paper proposes a data-driven deep learning (DL)-based unified hybrid beamforming framework for both the time division duplex (TDD) and frequency division duplex (FDD) systems with implicit channel state information (CSI). For TDD systems, the proposed DL-based approach jointly models the uplink pilot combining and downlink hybrid beamforming modules as an E2E neural network. While for FDD systems, we jointly model the downlink pilot transmission, uplink CSI feedback, and downlink hybrid beamforming modules as an E2E neural network. Different from conventional approaches separately processing different modules, the proposed solution simultaneously optimizes all modules with the sum rate as the optimization object. Therefore, by perceiving the inherent property of air-to-ground massive MIMO-OFDM channel samples, the DL-based E2E neural network can establish the mapping function from the channel to the beamformer, so that the explicit channel reconstruction can be avoided with reduced pilot and feedback overhead. Besides, practical low-resolution phase shifters (PSs) introduce the quantization constraint, leading to the intractable gradient backpropagation when training the neural network. To mitigate the performance loss caused by the phase quantization error, we adopt the transfer learning strategy to further fine-tune the E2E neural network based on a pre-trained network that assumes the ideal infinite-resolution PSs. Numerical results show that our DL-based schemes have considerable advantages over state-of-the-art schemes. △ Less

Submitted 9 September, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

Comments: Accepted by IEEE Journal on Selected Areas in Communications

arXiv:2112.13508 [pdf]

doi 10.1007/s10586-024-04293-x

Duck swarm algorithm: theory, numerical optimization, and applications

Authors: Mengjian Zhang, Guihua Wen

Abstract: A swarm intelligence-based optimization algorithm, named Duck Swarm Algorithm (DSA), is proposed in this study, which is inspired by the searching for food sources and foraging behaviors of the duck swarm. Two rules are modeled from the finding food and foraging of the duck, which corresponds to the exploration and exploitation phases of the proposed DSA, respectively. The performance of the DSA i… ▽ More A swarm intelligence-based optimization algorithm, named Duck Swarm Algorithm (DSA), is proposed in this study, which is inspired by the searching for food sources and foraging behaviors of the duck swarm. Two rules are modeled from the finding food and foraging of the duck, which corresponds to the exploration and exploitation phases of the proposed DSA, respectively. The performance of the DSA is verified by using multiple CEC benchmark functions, where its statistical (best, mean, standard deviation, and average running-time) results are compared with seven well-known algorithms like Particle swarm optimization (PSO), Firefly algorithm (FA), Chicken swarm optimization (CSO), Grey wolf optimizer (GWO), Sine cosine algorithm (SCA), and Marine-predators algorithm (MPA), and Archimedes optimization algorithm (AOA). Moreover, the Wilcoxon rank-sum test, Friedman test, and convergence curves of the comparison results are utilized to prove the superiority of the DSA against other algorithms. The results demonstrate that DSA is a high-performance optimization method in terms of convergence speed and exploration-exploitation balance for solving the numerical optimization problems. Also, DSA is applied for the optimal design of six engineering constrained optimization problems and the node optimization deployment task of the Wireless Sensor Network (WSN). Overall, the comparison results revealed that the DSA is a promising and very competitive algorithm for solving different optimization problems. △ Less

Submitted 1 June, 2024; v1 submitted 26 December, 2021; originally announced December 2021.

Journal ref: Cluster Computing, 2024

arXiv:2112.03113 [pdf, other]

Atmospheric Density Model Optimization and Spacecraft Orbit Prediction Improvements Based on Q-Sat Orbit Data

Authors: Zhaokui Wang, Yulin Zhang, Guangwei Wen, Shunchenqiao Bai, Yingkai Cai, Pu Huang, Dapeng Han, Yunhan He

Abstract: Atmospheric drag calculation error greatly reduce the low-earth orbit spacecraft trajectory prediction fidelity. To solve the issue, the "correction - prediction" strategy is usually employed. In the method, one parameter is fixed and other parameters are revised by inverting spacecraft orbit data. However, based on a single spacecraft data, the strategy usually performs poorly as parameters in dr… ▽ More Atmospheric drag calculation error greatly reduce the low-earth orbit spacecraft trajectory prediction fidelity. To solve the issue, the "correction - prediction" strategy is usually employed. In the method, one parameter is fixed and other parameters are revised by inverting spacecraft orbit data. However, based on a single spacecraft data, the strategy usually performs poorly as parameters in drag force calculation are coupled with each other, which result in convoluted errors. A gravity field recovery and atmospheric density detection satellite, Q-Sat, developed by xxxxx Lab at xxx University, is launched on August 6th, 2020. The satellite is designed to be spherical for a constant drag coefficient regardless of its attitude. An orbit prediction method for low-earth orbit spacecraft with employment of Q-Sat data is proposed in present paper for decoupling atmospheric density and drag coefficient identification. For the first step, by using a dynamic approach-based inversion, several empirical atmospheric density models are revised based on Q-Sat orbit data. Depending on the performs, one of the revised atmospheric density model would be selected for the next step in which the same inversion is employed for drag coefficient identification for a low-earth orbit operating spacecraft whose orbit needs to be predicted. Finally, orbit prediction is conducted by extrapolation with the dynamic parameters in the previous steps. Tests are carried out with the proposed method by using a GOCE satellite 15-day continuous orbit data. Compared with legacy "correction - prediction" method in which only GOCE data is employed, the accuracy of the 24-hour orbit prediction is improved by about 171m the highest for the proposed method. 14-day averaged 24-hour prediction precision is elevated by approximately 70m. △ Less

Submitted 6 December, 2021; originally announced December 2021.

Comments: 12 pages, 7 figures, submitted to nature communication

arXiv:2110.10903 [pdf, ps, other]

doi 10.1093/mnras/stab3063

Nulling and subpulse drifting in PSR J1727-2739

Authors: Rukiye Rejep, N. Wang, W. M. Yan, Z. G. Wen

Abstract: In this paper, we investigate the emission properties of PSR J1727-2739, whose mean pulse profile has two main components, by analysing five single-pulse observations made using the Parkes 64-m radio telescope with a central frequency of 1369 MHz between 2014 April and October. The total observation time is about 6.1 hours which contains 16718 pulses after removal of radio frequency interference (… ▽ More In this paper, we investigate the emission properties of PSR J1727-2739, whose mean pulse profile has two main components, by analysing five single-pulse observations made using the Parkes 64-m radio telescope with a central frequency of 1369 MHz between 2014 April and October. The total observation time is about 6.1 hours which contains 16718 pulses after removal of radio frequency interference (RFI). Previous studies reveal that PSR J1727-2739 exhibits both nulling and subpulse drifting. We estimate the nulling fraction to be 66%, which is consistent with previously published results. In addition to the previously known subpulse drifting in the leading component, we also explore the drifting properties for the trailing component. We observe two distinct drift modes whose vertical drift band separations ($P_{3}$) are consistent with earlier studies. We find that both profile components share the same drift periodicity $P_{3}$ in a certain drift mode, but the measured horizontal separations ($P_{2}$) are quite different for them. That is, PSR J1727-2739 is a pulsar showing both changes of drift periodicity $P_{3}$ between different drift modes and drift rate variations between components in a given drift mode. Pulsars exhibiting nulling along with drift mode changing, such as PSR J1727-2739, give an unique opportunity to investigate the physical mechanism of these phenomena. △ Less

Submitted 21 October, 2021; originally announced October 2021.

Comments: 10 pages, 15 figures

arXiv:2109.03697 [pdf, other]

U-FNO -- An enhanced Fourier neural operator-based deep-learning model for multiphase flow

Authors: Gege Wen, Zongyi Li, Kamyar Azizzadenesheli, Anima Anandkumar, Sally M. Benson

Abstract: Numerical simulation of multiphase flow in porous media is essential for many geoscience applications. Machine learning models trained with numerical simulation data can provide a faster alternative to traditional simulators. Here we present U-FNO, a novel neural network architecture for solving multiphase flow problems with superior accuracy, speed, and data efficiency. U-FNO is designed based on… ▽ More Numerical simulation of multiphase flow in porous media is essential for many geoscience applications. Machine learning models trained with numerical simulation data can provide a faster alternative to traditional simulators. Here we present U-FNO, a novel neural network architecture for solving multiphase flow problems with superior accuracy, speed, and data efficiency. U-FNO is designed based on the newly proposed Fourier neural operator (FNO), which has shown excellent performance in single-phase flows. We extend the FNO-based architecture to a highly complex CO2-water multiphase problem with wide ranges of permeability and porosity heterogeneity, anisotropy, reservoir conditions, injection configurations, flow rates, and multiphase flow properties. The U-FNO architecture is more accurate in gas saturation and pressure buildup predictions than the original FNO and a state-of-the-art convolutional neural network (CNN) benchmark. Meanwhile, it has superior data utilization efficiency, requiring only a third of the training data to achieve the equivalent accuracy as CNN. U-FNO provides superior performance in highly heterogeneous geological formations and critically important applications such as gas saturation and pressure buildup "fronts" determination. The trained model can serve as a general-purpose alternative to routine numerical simulations of 2D-radial CO2 injection problems with significant speed-ups than traditional simulators. △ Less

Submitted 4 May, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

arXiv:2106.11684 [pdf, ps, other]

Solving specified-time distributed optimization problem via sampled-data-based algorithm

Authors: Jialing Zhou, Yuezu Lv, Changyun Wen, Guanghui Wen

Abstract: Despite significant advances on distributed continuous-time optimization of multi-agent networks, there is still lack of an efficient algorithm to achieve the goal of distributed optimization at a pre-specified time. Herein, we design a specified-time distributed optimization algorithm for connected agents with directed topologies to collectively minimize the sum of individual objective functions… ▽ More Despite significant advances on distributed continuous-time optimization of multi-agent networks, there is still lack of an efficient algorithm to achieve the goal of distributed optimization at a pre-specified time. Herein, we design a specified-time distributed optimization algorithm for connected agents with directed topologies to collectively minimize the sum of individual objective functions subject to an equality constraint. With the designed algorithm, the settling time of distributed optimization can be exactly predefined. The specified selection of such a settling time is independent of not only the initial conditions of agents, but also the algorithm parameters and the communication topologies. Furthermore, the proposed algorithm can realize specified-time optimization by exchanging information among neighbours only at discrete sampling instants and thus reduces the communication burden. In addition, the equality constraint is always satisfied during the whole process, which makes the proposed algorithm applicable to online solving distributed optimization problems such as economic dispatch. For the special case of undirected communication topologies, a reduced-order algorithm is also designed. Finally, the effectiveness of the theoretical analysis is justified by numerical simulations. △ Less

Submitted 22 June, 2021; originally announced June 2021.

arXiv:2106.10513 [pdf, other]

Distributed Nash Equilibrium Seeking in Consistency-Constrained Multi-Coalition Games

Authors: Jialing Zhou, Yuezu Lv, Guanghui Wen, Jinhu Lv, Dezhi Zheng

Abstract: Distributed Nash equilibrium (NE) seeking problem for multi-coalition games has attracted increasing attention in recent years, but the research mainly focuses on the case without agreement demand within coalitions. This paper considers a class of networked games among multiple coalitions where each coalition contains multiple agents that cooperate to minimize the sum of their costs, subject to th… ▽ More Distributed Nash equilibrium (NE) seeking problem for multi-coalition games has attracted increasing attention in recent years, but the research mainly focuses on the case without agreement demand within coalitions. This paper considers a class of networked games among multiple coalitions where each coalition contains multiple agents that cooperate to minimize the sum of their costs, subject to the demand of reaching an agreement on their state values. Furthermore, the underlying network topology among the agents does not need to be balanced. To achieve the goal of NE seeking within such a context, two estimates are constructed for each agent, namely, an estimate of partial derivatives of the cost function and an estimate of global state values, based on which, an iterative state updating law is elaborately designed. Linear convergence of the proposed algorithm is demonstrated. It is shown that the consistency-constrained multi-coalition games investigated in this paper put the well-studied networked games among individual players and distributed optimization in a unified framework, and the proposed algorithm can easily degenerate into solutions to these problems. △ Less

Submitted 9 December, 2021; v1 submitted 19 June, 2021; originally announced June 2021.

arXiv:2106.06410 [pdf, other]

What Can Knowledge Bring to Machine Learning? -- A Survey of Low-shot Learning for Structured Data

Authors: Yang Hu, Adriane Chapman, Guihua Wen, Dame Wendy Hall

Abstract: Supervised machine learning has several drawbacks that make it difficult to use in many situations. Drawbacks include: heavy reliance on massive training data, limited generalizability and poor expressiveness of high-level semantics. Low-shot Learning attempts to address these drawbacks. Low-shot learning allows the model to obtain good predictive power with very little or no training data, where… ▽ More Supervised machine learning has several drawbacks that make it difficult to use in many situations. Drawbacks include: heavy reliance on massive training data, limited generalizability and poor expressiveness of high-level semantics. Low-shot Learning attempts to address these drawbacks. Low-shot learning allows the model to obtain good predictive power with very little or no training data, where structured knowledge plays a key role as a high-level semantic representation of human. This article will review the fundamental factors of low-shot learning technologies, with a focus on the operation of structured knowledge under different low-shot conditions. We also introduce other techniques relevant to low-shot learning. Finally, we point out the limitations of low-shot learning, the prospects and gaps of industrial applications, and future research directions. △ Less

Submitted 11 June, 2021; originally announced June 2021.

Comments: 41 pages, 280 references

arXiv:2104.01795 [pdf, other]

doi 10.1016/j.advwatres.2021.104009

CCSNet: a deep learning modeling suite for CO$_2$ storage

Authors: Gege Wen, Catherine Hay, Sally M. Benson

Abstract: Numerical simulation is an essential tool for many applications involving subsurface flow and transport, yet often suffers from computational challenges due to the multi-physics nature, highly non-linear governing equations, inherent parameter uncertainties, and the need for high spatial resolutions to capture multi-scale heterogeneity. We developed CCSNet, a general-purpose deep-learning modeling… ▽ More Numerical simulation is an essential tool for many applications involving subsurface flow and transport, yet often suffers from computational challenges due to the multi-physics nature, highly non-linear governing equations, inherent parameter uncertainties, and the need for high spatial resolutions to capture multi-scale heterogeneity. We developed CCSNet, a general-purpose deep-learning modeling suite that can act as an alternative to conventional numerical simulators for carbon capture and storage (CCS) problems where CO$_2$ is injected into saline aquifers in 2d-radial systems. CCSNet consists of a sequence of deep learning models producing all the outputs that a numerical simulator typically provides, including saturation distributions, pressure buildup, dry-out, fluid densities, mass balance, solubility trapping, and sweep efficiency. The results are 10$^3$ to 10$^4$ times faster than conventional numerical simulators. As an application of CCSNet illustrating the value of its high computational efficiency, we developed rigorous estimation techniques for the sweep efficiency and solubility trapping. △ Less

Submitted 5 April, 2021; originally announced April 2021.

arXiv:2011.05526 [pdf, ps, other]

doi 10.3847/1538-4357/abbfa3

The Mode Switching in Pulsar J1326$-$6700

Authors: Z. G. Wen, W. M. Yan, J. P. Yuan, H. G. Wang, J. L. Chen, M. Mijit, R. Yuen, N. Wang, Z. Y. Tu, S. J. Dang

Abstract: We report on a detailed study of the mode switching in pulsar J1326$-$6700 by analyzing the data acquired from the Parkes 64 m radio telescope at 1369 MHz. During the abnormal mode, the emission at the central and trailing components becomes extremely weak. Meanwhile, the leading emission shifts toward earlier longitude by almost 2°, and remains in this position for typically less than a minute. T… ▽ More We report on a detailed study of the mode switching in pulsar J1326$-$6700 by analyzing the data acquired from the Parkes 64 m radio telescope at 1369 MHz. During the abnormal mode, the emission at the central and trailing components becomes extremely weak. Meanwhile, the leading emission shifts toward earlier longitude by almost 2°, and remains in this position for typically less than a minute. The mean flux density of the normal mode is almost five times that of the abnormal mode. Our data show that, for PSR J1326$-$6700, 85% of the time was spent in the normal mode and 15% was in the abnormal mode. The intrinsic distributions of mode timescales can be well described by Weibull distributions, which present a certain amount of memory in mode switching. Furthermore, a quasiperiodicity has been identified in the mode switching in pulsar J1326$-$6700. The estimated delay emission heights based on the kinematical effects indicate that the abnormal mode may have originated from higher altitude than the normal mode. △ Less

Submitted 10 November, 2020; originally announced November 2020.

Comments: 10 pages, 8 figures

arXiv:2011.00171 [pdf, other]

doi 10.1038/s41586-020-2827-2

Diverse polarization angle swings from a repeating fast radio burst source

Authors: R. Luo, B. J. Wang, Y. P. Men, C. F. Zhang, J. C. Jiang, H. Xu, W. Y. Wang, K. J. Lee, J. L. Han, B. Zhang, R. N. Caballero, M. Z. Chen, X. L. Chen, H. Q. Gan, Y. J. Guo, L. F. Hao, Y. X. Huang, P. Jiang, H. Li, J. Li, Z. X. Li, J. T. Luo, J. Pan, X. Pei, L. Qian , et al. (12 additional authors not shown)

Abstract: Fast radio bursts (FRBs) are millisecond-duration radio transients of unknown origin. Two possible mechanisms that could generate extremely coherent emission from FRBs invoke neutron star magnetospheres or relativistic shocks far from the central energy source. Detailed polarization observations may help us to understand the emission mechanism. However, the available FRB polarization data have bee… ▽ More Fast radio bursts (FRBs) are millisecond-duration radio transients of unknown origin. Two possible mechanisms that could generate extremely coherent emission from FRBs invoke neutron star magnetospheres or relativistic shocks far from the central energy source. Detailed polarization observations may help us to understand the emission mechanism. However, the available FRB polarization data have been perplexing, because they show a host of polarimetric properties, including either a constant polarization angle during each burst for some repeaters or variable polarization angles in some other apparently one-off events. Here we report observations of 15 bursts from FRB 180301 and find various polarization angle swings in seven of them. The diversity of the polarization angle features of these bursts is consistent with a magnetospheric origin of the radio emission, and disfavours the radiation models invoking relativistic shocks. △ Less

Submitted 30 October, 2020; originally announced November 2020.

Comments: Published online in Nature on 29 Oct, 2020

Journal ref: Nature, Volume 586, Pages 693--696 (2020)

arXiv:2009.01490 [pdf, ps, other]

Fixed-Time Cooperative Tracking Control for Double-Integrator Multi-Agent Systems: A Time-Based Generator Approach

Authors: Qiang Chen, Yu Zhao, Guanghui Wen, Guoqing Shi, Xinghuo Yu

Abstract: In this paper, both the fixed-time distributed consensus tracking and the fixed-time distributed average tracking problems for double-integrator-type multi-agent systems with bounded input disturbances are studied, respectively. Firstly, a new practical robust fixed-time sliding mode control method based on the time-based generator is proposed. Secondly, a fixed-time distributed consensus tracking… ▽ More In this paper, both the fixed-time distributed consensus tracking and the fixed-time distributed average tracking problems for double-integrator-type multi-agent systems with bounded input disturbances are studied, respectively. Firstly, a new practical robust fixed-time sliding mode control method based on the time-based generator is proposed. Secondly, a fixed-time distributed consensus tracking observer for double-integrator-type multi-agent systems is designed to estimate the state disagreements between the leader and the followers under undirected and directed communication, respectively. Thirdly, a fixed-time distributed average tracking observer for double-integrator-type multi-agent systems is designed to measure the average value of reference signals under undirected communication. Note that both the observers for the distributed consensus tracking and the distributed average tracking are devised based on time-based generators and can be extended to that of high-order multi-agent systems trivially. Furthermore, by combing the fixed-time sliding mode control with the fixed-time observers, the fixed-time controllers are designed to solve the distributed consensus tracking and the distributed average tracking problems. Finally, a few numerical simulations are shown to verify the results. △ Less

Submitted 3 September, 2020; originally announced September 2020.

Comments: 11 pages, 10 figures

Showing 1–50 of 138 results for author: Wen, G