subscribe to arXiv mailings

Spectral gaps and Fourier decay for self-conformal measures in the plane

Authors: Amir Algom, Federico Rodriguez Hertz, Zhiren Wang

Abstract: We show that every self conformal measure with respect to a $C^ω(\mathbb{C})$ IFS has polynomial Fourier decay under some mild non-linearity and irreducibility conditions. A key step is the proof of a uniform spectral gap for the transfer operator that does not require the cylinder covering of the attractor to be a Markov partition. It is based on a cocycle version of a method of Oh-Winter (2017). We show that every self conformal measure with respect to a $C^ω(\mathbb{C})$ IFS has polynomial Fourier decay under some mild non-linearity and irreducibility conditions. A key step is the proof of a uniform spectral gap for the transfer operator that does not require the cylinder covering of the attractor to be a Markov partition. It is based on a cocycle version of a method of Oh-Winter (2017). △ Less

Submitted 16 July, 2024; originally announced July 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2306.01275

arXiv:2407.10411 [pdf]

doi 10.23977/tracam.2024.040107

A Study on Lampreys Population Based on Sex-Ratio-Related Growth-Balance Model

Authors: Zuhua Ji, Jiarui Chen, Zihang Wang

Abstract: Lampreys are one of the oldest species in the world, living longer than dinosaurs, which is related to the ability to change the sex ratio during their lifespan. In this paper, to understand how sex ratio and food quantity affect the population growth rate of lampreys, the researchers draw inspiration from the logistics model and established a model called EcoSexChange(ESC), which results in a pop… ▽ More Lampreys are one of the oldest species in the world, living longer than dinosaurs, which is related to the ability to change the sex ratio during their lifespan. In this paper, to understand how sex ratio and food quantity affect the population growth rate of lampreys, the researchers draw inspiration from the logistics model and established a model called EcoSexChange(ESC), which results in a population initially increasing and then stabilizing, a reasonable outcome that may apply to other organisms with significant differences in consumption between sexes. Subsequently, this paper develops the Sex Ratio Adaptation Eco Impact (SRAEI) model based on the ESC model using the ABM algorithm to simulate how the population of lampreys, whose lives are divided into seven stages, grows and stabilizes. Then introduces a sudden disaster factor in the middle of the simulation, while also comparing lampreys that cannot adjust their sex ratio. The results of this paper are of great reference significance for people to analyze the population changes of lampreys in different living environments, and they are also easy to apply to other species with large differences between males and females. △ Less

Submitted 14 July, 2024; originally announced July 2024.

Journal ref: Transactions on Computational and Applied Mathematics. 2024 May 6;4(1):48-55

arXiv:2407.09867 [pdf, ps, other]

Stable rank for crossed products by finite group actions with the weak tracial Rokhlin property

Authors: Xiaochun Fang, Zhongli Wang

Abstract: Let $A$ be an infinite-dimensional stably finite simple unital C*-algebra, let $G$ be a finite group, and let $α\colon G\rightarrow \mathrm{Aut}(A)$ be an action of $G$ on $A$ which has the weak tracial Rokhlin property. We prove that if $A$ has property (TM), then the crossed product $A\rtimes_αG$ has property (TM). As a corollary, if $A$ is an infinite-dimensional separable simple unital C*-alge… ▽ More Let $A$ be an infinite-dimensional stably finite simple unital C*-algebra, let $G$ be a finite group, and let $α\colon G\rightarrow \mathrm{Aut}(A)$ be an action of $G$ on $A$ which has the weak tracial Rokhlin property. We prove that if $A$ has property (TM), then the crossed product $A\rtimes_αG$ has property (TM). As a corollary, if $A$ is an infinite-dimensional separable simple unital C*-algebra which has stable rank one and strict comparison, $α\colon G\rightarrow \mathrm{Aut}(A)$ is an action of a finite group $G$ on $A$ with the weak tracial Rokhlin property, then $A\rtimes_αG$ has stable rank one. △ Less

Submitted 13 July, 2024; originally announced July 2024.

Comments: 20 pages

arXiv:2407.08266 [pdf, ps, other]

$N$ -Laplacian and $N/2$-Hessian type equations with exponential reaction term and measure data

Authors: Shiguang Ma, Zijian Wang

Abstract: In this article, we will prove existence results for the equations of the type $-Δ_{N}u=H_{l}(u)+μ$ and $F_{\frac{N}{2}}[-u]=H_{l}(u)+μ$ in a bounded domain $Ω$, with Dirichlet boundary condition, where the source term $H_{l}(r)$ takes the form $e^{r}-\sum_{j=0}^{l-1}\frac{r^{j}}{j!}$ and $μ$ is a nonnegative Radon measure. In this article, we will prove existence results for the equations of the type $-Δ_{N}u=H_{l}(u)+μ$ and $F_{\frac{N}{2}}[-u]=H_{l}(u)+μ$ in a bounded domain $Ω$, with Dirichlet boundary condition, where the source term $H_{l}(r)$ takes the form $e^{r}-\sum_{j=0}^{l-1}\frac{r^{j}}{j!}$ and $μ$ is a nonnegative Radon measure. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 15pages

MSC Class: 35J60; 35B45

arXiv:2407.06905 [pdf, ps, other]

Blowing-up solutions for the Choquard type Brezis-Nirenberg problem in dimension three

Authors: Wenjing Chen, Zexi Wang

Abstract: In this paper, we are interested in the existence of solutions for the following Choquard type Brezis-Nirenberg problem \begin{align*} \left\{ \begin{array}{ll} -Δu=\displaystyle\Big(\int\limits_Ω\frac{u^{6-α}(y)}{|x-y|^α}dy\Big)u^{5-α}+λu, \ \ &\mbox{in}\ Ω, u=0, \ \ &\mbox{on}\ \partial Ω, \end{array} \right. \end{align*} where $Ω$ is a smooth bounded domain in $\mathbb{R}^3$,… ▽ More In this paper, we are interested in the existence of solutions for the following Choquard type Brezis-Nirenberg problem \begin{align*} \left\{ \begin{array}{ll} -Δu=\displaystyle\Big(\int\limits_Ω\frac{u^{6-α}(y)}{|x-y|^α}dy\Big)u^{5-α}+λu, \ \ &\mbox{in}\ Ω, u=0, \ \ &\mbox{on}\ \partial Ω, \end{array} \right. \end{align*} where $Ω$ is a smooth bounded domain in $\mathbb{R}^3$, $α\in (0,3)$, $6-α$ is the upper critical exponent in the sense of the Hardy-Littlewood-Sobolev inequality, and $λ$ is a real positive parameter. By applying the reduction argument, we find and characterize a positive value $λ_0$ such that if $λ-λ_0>0$ is small enough, then the above problem admits a solution, which blows up and concentrates at the critical point of the Robin function as $λ\rightarrow λ_0$. Moreover, we consider the above problem under zero Neumann boundary condition. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.06664 [pdf, other]

PDEformer-1: A Foundation Model for One-Dimensional Partial Differential Equations

Authors: Zhanhong Ye, Xiang Huang, Leheng Chen, Zining Liu, Bingyang Wu, Hongsheng Liu, Zidong Wang, Bin Dong

Abstract: This paper introduces PDEformer-1, a versatile neural solver capable of simultaneously addressing various partial differential equations (PDEs). With the PDE represented as a computational graph, we facilitate the seamless integration of symbolic and numeric information inherent in a PDE. A graph Transformer and an implicit neural representation (INR) are employed subsequently to generate mesh-fre… ▽ More This paper introduces PDEformer-1, a versatile neural solver capable of simultaneously addressing various partial differential equations (PDEs). With the PDE represented as a computational graph, we facilitate the seamless integration of symbolic and numeric information inherent in a PDE. A graph Transformer and an implicit neural representation (INR) are employed subsequently to generate mesh-free predicted solutions. We generated a dataset with up to three million samples involving diverse one-dimensional PDEs to pretrain our model. Compared with baseline models trained specifically on benchmark datasets, our pretrained model achieves comparable accuracy via zero-shot inference, and the advantage expands after finetuning. For PDEs new or unseen in the pretraining stage, our model can adapt quickly by finetuning on a relatively small set of examples from the target equation. Additionally, PDEformer-1 demonstrates promising results in the inverse problem of PDE scalar coefficient recovery and coefficient field recovery. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.05626 [pdf, other]

A Stochastic Interacting Particle-Field Algorithm for a Haptotaxis Advection-Diffusion System Modeling Cancer Cell Invasion

Authors: Boyi Hu, Zhongjian Wang, Jack Xin, Zhiwen Zhang

Abstract: The investigation of tumor invasion and metastasis dynamics is crucial for advancements in cancer biology and treatment. Many mathematical models have been developed to study the invasion of host tissue by tumor cells. In this paper, we develop a novel stochastic interacting particle-field (SIPF) algorithm that accurately simulates the cancer cell invasion process within the haptotaxis advection-d… ▽ More The investigation of tumor invasion and metastasis dynamics is crucial for advancements in cancer biology and treatment. Many mathematical models have been developed to study the invasion of host tissue by tumor cells. In this paper, we develop a novel stochastic interacting particle-field (SIPF) algorithm that accurately simulates the cancer cell invasion process within the haptotaxis advection-diffusion (HAD) system. Our approach approximates solutions using empirical measures of particle interactions, combined with a smoother field variable - the extracellular matrix concentration (ECM) - computed by the spectral method. We derive a one-step time recursion for both the positions of stochastic particles and the field variable using the implicit Euler discretization, which is based on the explicit Green's function of an elliptic operator characterized by the Laplacian minus a positive constant. Our numerical experiments demonstrate the superior performance of the proposed algorithm, especially in computing cancer cell growth with thin free boundaries in three-dimensional (3D) space. Numerical results show that the SIPF algorithm is mesh-free, self-adaptive, and low-cost. Moreover, it is more accurate and efficient than traditional numerical techniques such as the finite difference method (FDM) and spectral methods. △ Less

Submitted 8 July, 2024; originally announced July 2024.

arXiv:2407.01048 [pdf, other]

Self-absorption of Hankel systems on monoids --a seemingly universal property

Authors: Yong Han, Yanqi Qiu, Zipeng Wang

Abstract: Given any cancellative monoid $\mathcal{M}$, we study the Hankel system determined by its multiplication table. We prove that the Hankel system admits self-absorption property provided that the monoid $\mathcal{M}$ has the local algebraic structure: \[ \big(ax = by, cx=dy, az=bw \,\, \text{in $\mathcal{M}$}\big)\Longrightarrow \big(cz=dw \,\, \text{in $\mathcal{M}$}\big). \] Our result holds for a… ▽ More Given any cancellative monoid $\mathcal{M}$, we study the Hankel system determined by its multiplication table. We prove that the Hankel system admits self-absorption property provided that the monoid $\mathcal{M}$ has the local algebraic structure: \[ \big(ax = by, cx=dy, az=bw \,\, \text{in $\mathcal{M}$}\big)\Longrightarrow \big(cz=dw \,\, \text{in $\mathcal{M}$}\big). \] Our result holds for all group-embeddable monoids and goes beyond. In particular, it works for all cancellative Abelian monoids and most common non-Abelian cancellative monoids such as $$ \mathrm{SL}_d(\mathbb{N}): = \big\{[a_{ij}]_{1\le i,j\le d}\in \mathrm{SL}_d(\mathbb{Z})\big| a_{ij} \in \mathbb{N}\big\}. $$ The Hankel system determined by the multiplication table of a monoid is further generalized to that determined by level sets of any abstract two-variable map. We introduce an algebraic notion of lunar maps and establish a stronger hereditary self-absorption property for the corresponding generalized Hankel systems. As a consequence, we prove the self-absorption property for arbitrary spatial compression of the regular representation system $\{λ_G(g)\}_{g\in G}$ of any discrete group $G$, as well as the Hankel system $\{Γ_\ell^Φ\}$ determined by the level sets of any rational map of the form $Φ(x,y)=a x^m + b y^n$ with $a,b,m,n\in \mathbb{Z}^*$: \[ Γ_\ell^Φ(x, y)= \mathbf{1}(a x^m + b y^n= \ell), \quad x, y\in \mathbb{N}^*, \, \ell\in Φ(\mathbb{N}^*\times \mathbb{N}^*). \] The self-absorption property is applied to the study of completely bounded Fourier multipliers between Hardy spaces. Further applications are: i) exact complete bounded norm of the Carleman embedding in any dimension; ii) mixed Fourier-Schur multiplier inequalities with critical exponent $4/3$; iii) failure of hyper-complete-contractivity for the Poisson semigroup. △ Less

Submitted 1 July, 2024; originally announced July 2024.

Comments: 45 pages

arXiv:2407.00353 [pdf, ps, other]

New type of solutions for a critical Grushin-type problem with competing potentials

Authors: Wenjing Chen, Zexi Wang

Abstract: In this paper, we consider a critical Grushin-type problem with double potentials. By applying the reduction argument and local Pohouzaev identities, we construct a new family of solutions to this problem, which are concentrated at points lying on the top and the bottom circles of a cylinder. In this paper, we consider a critical Grushin-type problem with double potentials. By applying the reduction argument and local Pohouzaev identities, we construct a new family of solutions to this problem, which are concentrated at points lying on the top and the bottom circles of a cylinder. △ Less

Submitted 29 June, 2024; originally announced July 2024.

MSC Class: 35J15; 35B09; 35B33

arXiv:2406.17763 [pdf, other]

DiffusionPDE: Generative PDE-Solving Under Partial Observation

Authors: Jiahe Huang, Guandao Yang, Zichen Wang, Jeong Joon Park

Abstract: We introduce a general framework for solving partial differential equations (PDEs) using generative diffusion models. In particular, we focus on the scenarios where we do not have the full knowledge of the scene necessary to apply classical solvers. Most existing forward or inverse PDE approaches perform poorly when the observations on the data or the underlying coefficients are incomplete, which… ▽ More We introduce a general framework for solving partial differential equations (PDEs) using generative diffusion models. In particular, we focus on the scenarios where we do not have the full knowledge of the scene necessary to apply classical solvers. Most existing forward or inverse PDE approaches perform poorly when the observations on the data or the underlying coefficients are incomplete, which is a common assumption for real-world measurements. In this work, we propose DiffusionPDE that can simultaneously fill in the missing information and solve a PDE by modeling the joint distribution of the solution and coefficient spaces. We show that the learned generative priors lead to a versatile framework for accurately solving a wide range of PDEs under partial observation, significantly outperforming the state-of-the-art methods for both forward and inverse directions. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: Project page: https://jhhuangchloe.github.io/Diffusion-PDE/

arXiv:2406.17514 [pdf, ps, other]

Lusztig's Jordan decomposition and a finite field instance of relative Langlands duality

Authors: Zhicheng Wang

Abstract: Lusztig \cite{L5,L6} gave a parametrization for $\rm{Irr}(G^F)$, where $G$ is a reductive algebraic group defined over $\mathbb{F}_q$, with Frobenius map $F$. This parametrization is known as Lusztig's Jordan decomposition or Lusztig correspondence. However, there is not a canonical choice of Lusztig correspondence. In this paper, we consider classical groups. We pick a canonical choice of Lusztig… ▽ More Lusztig \cite{L5,L6} gave a parametrization for $\rm{Irr}(G^F)$, where $G$ is a reductive algebraic group defined over $\mathbb{F}_q$, with Frobenius map $F$. This parametrization is known as Lusztig's Jordan decomposition or Lusztig correspondence. However, there is not a canonical choice of Lusztig correspondence. In this paper, we consider classical groups. We pick a canonical choice of Lusztig correspondence which is compatible with parabolic induction and is compatible with theta correspondence. This result extends Pan's result in \cite{P3}. As an application, we give a refinement of the results of the finite Gan-Gross-Prasad problem in \cite{Wang1} and prove a duality between Theta correspondence and finite Gan-Gross-Prasad problem, which can be regarded as a finite field instance of relative Langlands duality of Ben-Zvi-Sakellaridis-Venkatesh \cite{BZSV}. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.15082 [pdf, other]

The sparse Kaczmarz method with surrogate hyperplane for the regularized basis pursuit problem

Authors: Ze Wang, Jun-Feng Yin, Ji-Chen Zhao

Abstract: The Sparse Kaczmarz method is a famous and widely used iterative method for solving the regularized basis pursuit problem. A general scheme of the surrogate hyperplane sparse Kaczmarz method is proposed. In particular, a class of residual-based surrogate hyperplane sparse Kaczmarz method is introduced and the implementations are well discussed. Their convergence theories are proved and the linear… ▽ More The Sparse Kaczmarz method is a famous and widely used iterative method for solving the regularized basis pursuit problem. A general scheme of the surrogate hyperplane sparse Kaczmarz method is proposed. In particular, a class of residual-based surrogate hyperplane sparse Kaczmarz method is introduced and the implementations are well discussed. Their convergence theories are proved and the linear convergence rates are studied and compared in details. Numerical experiments verify the efficiency of the proposed methods. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.13241 [pdf, ps, other]

Achirality of Sol 3-Manifolds, Stevenhagen Conjecture and Shimizu's L-series

Authors: Ye Tian, Shicheng Wang, Zhongzi Wang

Abstract: A closed orientable manifold is {\em achiral} if it admits an orientation reversing homeomorphism. A commensurable class of closed manifolds is achiral if it contains an achiral element, or equivalently, each manifold in $\CM$ has an achiral finite cover. Each commensurable class containing non-orientable elements must be achiral. It is natural to wonder how many commensurable classes are ac… ▽ More A closed orientable manifold is {\em achiral} if it admits an orientation reversing homeomorphism. A commensurable class of closed manifolds is achiral if it contains an achiral element, or equivalently, each manifold in $\CM$ has an achiral finite cover. Each commensurable class containing non-orientable elements must be achiral. It is natural to wonder how many commensurable classes are achiral and how many achiral classes have non-orientable elements. We study this problem for Sol 3-manifolds. Each commensurable class $\CM$ of Sol 3-manifold has a complete topological invariant $D_{\CM}$, the discriminant of $\CM$. Our main result is: (1) Among all commensurable classes of Sol 3-manifolds, there are infinitely many achiral classes; however ordered by discriminants, the density of achiral commensurable classes is 0. (2) Among all achiral commensurable classes of Sol 3-manifolds, ordered by discriminants, the density of classes containing non-orientable elements is $1-ρ$, where $$ρ:=\prod_{j=1}^\infty \left(1+2^{-j}\right)^{-1} = 0.41942\cdots.$$ △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: 19 pages

arXiv:2406.09439 [pdf, other]

Classification of Cellular Fake Surfaces

Authors: Lucas Fagan, Yang Qiu, Zhenghan Wang

Abstract: Generic polyhedra are interesting mathematical objects to study in their own right. In this paper, we initialize a systematic study of two-dimensional generic polyhedra with an eye towards applications to low-dimensional topology, especially the Andrews-Curtis and Zeeman conjectures. After recalling the basic notions of generic polyhedra and fake surfaces, we derive some interesting properties of… ▽ More Generic polyhedra are interesting mathematical objects to study in their own right. In this paper, we initialize a systematic study of two-dimensional generic polyhedra with an eye towards applications to low-dimensional topology, especially the Andrews-Curtis and Zeeman conjectures. After recalling the basic notions of generic polyhedra and fake surfaces, we derive some interesting properties of fake surfaces. Our main result is a complete classification of acyclic cellular fake surfaces up to complexity 4 and a classification of acyclic cellular fake surfaces without small disks of complexity 5. From this classification, we prove the contractibility conjecture for acyclic cellular fake surfaces of complexity 4, and the embedded disk conjecture up to complexity 5. We provide evidence for the conjectures that the probability of being a spine among fake surfaces is 0 and that every contractible fake surface has an embedded disk. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 21 pages, 8 figures

arXiv:2406.07870 [pdf, ps, other]

Event-Triggered Optimal Tracking Control for Strict-Feedback Nonlinear Systems With Non-Affine Nonlinear Faults

Authors: Ling Wang, Xin Wang, Ziming Wang

Abstract: This article studies the control ideas of the optimal backstepping technique, proposing an event-triggered optimal tracking control scheme for a class of strict-feedback nonlinear systems with non-affine and nonlinear faults. A simplified identifier-critic-actor framework is employed in the reinforcement learning algorithm to achieve optimal control. The identifier estimates the unknown dynamic fu… ▽ More This article studies the control ideas of the optimal backstepping technique, proposing an event-triggered optimal tracking control scheme for a class of strict-feedback nonlinear systems with non-affine and nonlinear faults. A simplified identifier-critic-actor framework is employed in the reinforcement learning algorithm to achieve optimal control. The identifier estimates the unknown dynamic functions, the critic evaluates the system performance, and the actor implements control actions, enabling modeling and control of anonymous systems for achieving optimal control performance. In this paper, a simplified reinforcement learning algorithm is designed by deriving update rules from the negative gradient of a simple positive function related to the Hamilton-Jacobi-Bellman equation, and it also releases the stringent persistent excitation condition. Then, a fault-tolerant control method is developed by applying filtered signals for controller design. Additionally, to address communication resource reduction, an event-triggered mechanism is employed for designing the actual controller. Finally, the proposed scheme's feasibility is validated through theoretical analysis and simulation. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.06956 [pdf, ps, other]

Arbitrarily slow decay in the logarithmically averaged Sarnak conjecture

Authors: Amir Algom, Zhiren Wang

Abstract: In 2017 Tao proposed a variant Sarnak's Möbius disjointness conjecture with logarithmic averaging: For any zero entropy dynamical system $(X,T)$, $\frac{1}{\log N} \sum_{n=1} ^N \frac{f(T^n x) μ(n)}{n}= o(1)$ for every $f\in \mathcal{C}(X)$ and every $x\in X$. We construct examples showing that this $o(1)$ can go to zero arbitrarily slowly. Nonetheless, all of our examples satisfy the conjecture. In 2017 Tao proposed a variant Sarnak's Möbius disjointness conjecture with logarithmic averaging: For any zero entropy dynamical system $(X,T)$, $\frac{1}{\log N} \sum_{n=1} ^N \frac{f(T^n x) μ(n)}{n}= o(1)$ for every $f\in \mathcal{C}(X)$ and every $x\in X$. We construct examples showing that this $o(1)$ can go to zero arbitrarily slowly. Nonetheless, all of our examples satisfy the conjecture. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: Preprint version, 12 pages. To appear in JMAA

arXiv:2406.00648 [pdf, ps, other]

Level proximal subdifferential, variational convexity, and pointwise Lipschitz smoothness

Authors: Honglin Luo, Xianfu Wang, Ziyuan Wang, Xinmin Yang

Abstract: Level proximal subdifferential was introduced by Rockafellar recently as a tool for studying proximal mappings of possibly nonconvex functions. In this paper we give a systematic study of level proximal subdifferntial, characterize variational convexity of the function by locally firm nonexpansiveness of proximal mappings or locally relative monotonicity of level proximal subdifferential, and inve… ▽ More Level proximal subdifferential was introduced by Rockafellar recently as a tool for studying proximal mappings of possibly nonconvex functions. In this paper we give a systematic study of level proximal subdifferntial, characterize variational convexity of the function by locally firm nonexpansiveness of proximal mappings or locally relative monotonicity of level proximal subdifferential, and investigate pointwise Lipschitz smoothness of the function. Integration and single-valuedness of level proximal subdifferential are also examined. △ Less

Submitted 2 June, 2024; originally announced June 2024.

Comments: 25 pages, comments welcomed

MSC Class: Primary 49J53; 49J52; 47H05; Secondary 90C26; 47H09; 26B25

arXiv:2406.00612 [pdf, ps, other]

Policy Iteration for Exploratory Hamilton--Jacobi--Bellman Equations

Authors: Hung Vinh Tran, Zhenhua Wang, Yuming Paul Zhang

Abstract: We study the policy iteration algorithm (PIA) for entropy-regularized stochastic control problems on an infinite time horizon with a large discount rate, focusing on two main scenarios. First, we analyze PIA with bounded coefficients where the controls applied to the diffusion term satisfy a smallness condition. We demonstrate the convergence of PIA based on a uniform $\mathcal{C}^{2,α}$ estimate… ▽ More We study the policy iteration algorithm (PIA) for entropy-regularized stochastic control problems on an infinite time horizon with a large discount rate, focusing on two main scenarios. First, we analyze PIA with bounded coefficients where the controls applied to the diffusion term satisfy a smallness condition. We demonstrate the convergence of PIA based on a uniform $\mathcal{C}^{2,α}$ estimate for the value sequence generated by PIA, and provide a quantitative convergence analysis for this scenario. Second, we investigate PIA with unbounded coefficients but no control over the diffusion term. In this scenario, we first provide the well-posedness of the exploratory Hamilton--Jacobi--Bellman equation with linear growth coefficients and polynomial growth reward function. By such a well-posedess result we achieve PIA's convergence by establishing a quantitative locally uniform $\mathcal{C}^{1,α}$ estimates for the generated value sequence. △ Less

Submitted 2 July, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

Comments: 21 pages

MSC Class: 35F21; 60J60; 68W40; 93E20

arXiv:2405.20763 [pdf, other]

Improving Generalization and Convergence by Enhancing Implicit Regularization

Authors: Mingze Wang, Haotian He, Jinbo Wang, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu

Abstract: In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that I… ▽ More In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that IRE can be practically incorporated with {\em generic base optimizers} without introducing significant computational overload. Experiments show that IRE consistently improves the generalization performance for image classification tasks across a variety of benchmark datasets (CIFAR-10/100, ImageNet) and models (ResNets and ViTs). Surprisingly, IRE also achieves a $2\times$ {\em speed-up} compared to AdamW in the pre-training of Llama models (of sizes ranging from 60M to 229M) on datasets including Wikitext-103, Minipile, and Openwebtext. Moreover, we provide theoretical guarantees, showing that IRE can substantially accelerate the convergence towards flat minima in Sharpness-aware Minimization (SAM). △ Less

Submitted 31 May, 2024; originally announced May 2024.

Comments: 35 pages

arXiv:2405.19852 [pdf, other]

On a problem of Pavlović involving harmonic quasiconformal mappings

Authors: Zhi-Gang Wang, Xiao-Yuan Wang, Antti Rasila, Jia-Le Qiu

Abstract: We obtain a sharp result on order of certain affine and linear invariant families of harmonic quasiconformal mappings with bounded Schwarzian norm. This problem is motivated by the work of Chuaqui, Hernández and Martín [Math. Ann. 367: 1099--1122, 2017]. Firstly, for $K\ge1$, we construct a harmonic $K$-quasiconformal counterpart of the classical Koebe function and use it to formulate the correspo… ▽ More We obtain a sharp result on order of certain affine and linear invariant families of harmonic quasiconformal mappings with bounded Schwarzian norm. This problem is motivated by the work of Chuaqui, Hernández and Martín [Math. Ann. 367: 1099--1122, 2017]. Firstly, for $K\ge1$, we construct a harmonic $K$-quasiconformal counterpart of the classical Koebe function and use it to formulate the corresponding conjectures. Then we consider Hardy spaces $H^p$ of harmonic quasiconformal mappings by applying results for quasiconformal mappings obtained by Astala and Koskela [Pure Appl. Math. Q. 7: 19--50, 2011]. In particular, we determine the optimal order of the family of harmonic quasiconformal mappings with bounded Schwarzian norm to belong to a harmonic Hardy space. This partially solves an open problem posed by Pavlović in 2014. Finally, we derive pre-Schwarzian and Schwarzian norm estimates of certain harmonic mappings. △ Less

Submitted 14 July, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

Comments: 20 pages, 6 figures. Comments are welcome

MSC Class: 30C55; 30C62; 30H10; 31A05

arXiv:2405.19650 [pdf, other]

Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization

Authors: Xi Lin, Yilu Liu, Xiaoyuan Zhang, Fei Liu, Zhenkun Wang, Qingfu Zhang

Abstract: Multi-objective optimization can be found in many real-world applications where some conflicting objectives can not be optimized by a single solution. Existing optimization methods often focus on finding a set of Pareto solutions with different optimal trade-offs among the objectives. However, the required number of solutions to well approximate the whole Pareto optimal set could be exponentially… ▽ More Multi-objective optimization can be found in many real-world applications where some conflicting objectives can not be optimized by a single solution. Existing optimization methods often focus on finding a set of Pareto solutions with different optimal trade-offs among the objectives. However, the required number of solutions to well approximate the whole Pareto optimal set could be exponentially large with respect to the number of objectives, which makes these methods unsuitable for handling many optimization objectives. In this work, instead of finding a dense set of Pareto solutions, we propose a novel Tchebycheff set scalarization method to find a few representative solutions (e.g., 5) to cover a large number of objectives (e.g., $>100$) in a collaborative and complementary manner. In this way, each objective can be well addressed by at least one solution in the small solution set. In addition, we further develop a smooth Tchebycheff set scalarization approach for efficient optimization with good theoretical guarantees. Experimental studies on different problems with many optimization objectives demonstrate the effectiveness of our proposed method. △ Less

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.19003 [pdf, other]

A structure-preserving scheme for computing effective diffusivity and anomalous diffusion phenomena of random flows

Authors: Tan Zhang, Zhongjian Wang, Jack Xin, Zhiwen Zhang

Abstract: This paper aims to investigate the diffusion behavior of particles moving in stochastic flows under a structure-preserving scheme. We compute the effective diffusivity for normal diffusive random flows and establish the power law between spatial and temporal variables for cases with anomalous diffusion phenomena. From a Lagrangian approach, we separate the corresponding stochastic differential equ… ▽ More This paper aims to investigate the diffusion behavior of particles moving in stochastic flows under a structure-preserving scheme. We compute the effective diffusivity for normal diffusive random flows and establish the power law between spatial and temporal variables for cases with anomalous diffusion phenomena. From a Lagrangian approach, we separate the corresponding stochastic differential equations (SDEs) into sub-problems and construct a one-step structure-preserving method to solve them. Then by modified equation systems, the convergence analysis in calculating the effective diffusivity is provided and compared between the structure-preserving scheme and the Euler-Maruyama scheme. Also, we provide the error estimate for the structure-preserving scheme in calculating the power law for a series of super-diffusive random flows. Finally, we calculate the effective diffusivity and anomalous diffusion phenomena for a series of 2D and 3D random fields. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: 39pages, 10 figures, planning to submit for Journal of Scientific Computing or Numerische Mathematik

MSC Class: 37M25; 60J60; 60H35; 65P10; 65M75; 76M50

arXiv:2405.16202 [pdf, ps, other]

Boundary actions by higher-rank lattices: Classification and embedding in low dimensions, local rigidity, smooth factors

Authors: Aaron Brown, Federico Rodriguez Hertz, Zhiren Wang

Abstract: We study actions by lattices in higher-rank (semi)simple Lie groups on compact manifolds. By classifying certain measures invariant under a related higher-rank abelian action (the diagonal action on the suspension space) we deduce a number of new rigidity results related to standard projective actions (i.e. boundary actions) by such groups. Specifically, in low dimensions we show all actions (wi… ▽ More We study actions by lattices in higher-rank (semi)simple Lie groups on compact manifolds. By classifying certain measures invariant under a related higher-rank abelian action (the diagonal action on the suspension space) we deduce a number of new rigidity results related to standard projective actions (i.e. boundary actions) by such groups. Specifically, in low dimensions we show all actions (with infinite image) are conjugate to boundary actions. We also show standard boundary actions (e.g. projective actions on generalized flag varieties) are local rigid and classify all smooth actions that are topological factors of such actions. Finally, for volume-preserving actions in low dimensions (with infinite image) we provide a mechanism to detect the presence of "blow-ups" for the action by studying measures that are $P$-invariant but not $G$-invariant for the suspension action. △ Less

Submitted 2 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

arXiv:2405.16104 [pdf, other]

Global Well-posedness and Convergence Analysis of Score-based Generative Models via Sharp Lipschitz Estimates

Authors: Connor Mooney, Zhongjian Wang, Jack Xin, Yifeng Yu

Abstract: We establish global well-posedness and convergence of the score-based generative models (SGM) under minimal general assumptions of initial data for score estimation. For the smooth case, we start from a Lipschitz bound of the score function with optimal time length. The optimality is validated by an example whose Lipschitz constant of scores is bounded at initial but blows up in finite time. This… ▽ More We establish global well-posedness and convergence of the score-based generative models (SGM) under minimal general assumptions of initial data for score estimation. For the smooth case, we start from a Lipschitz bound of the score function with optimal time length. The optimality is validated by an example whose Lipschitz constant of scores is bounded at initial but blows up in finite time. This necessitates the separation of time scales in conventional bounds for non-log-concave distributions. In contrast, our follow up analysis only relies on a local Lipschitz condition and is valid globally in time. This leads to the convergence of numerical scheme without time separation. For the non-smooth case, we show that the optimal Lipschitz bound is O(1/t) in the point-wise sense for distributions supported on a compact, smooth and low-dimensional manifold with boundary. △ Less

Submitted 25 May, 2024; originally announced May 2024.

arXiv:2405.16095 [pdf, ps, other]

New type of solutions for the critical polyharmonic equation

Authors: Wenjing Chen, Zexi Wang

Abstract: In this paper, we consider the following critical polyharmonic equation \begin{align*}%\label{abs} ( -Δ)^m u+V(|y'|,y'')u=u^{m^*-1},\quad u>0, \quad y=(y',y'')\in \mathbb{R}^3\times \mathbb{R}^{N-3}, \end{align*} where $m^*=\frac{2N}{N-2m}$, $N>4m+1$, $m\in \mathbb{N}^+$, and $V(|y'|,y'')$ is a bounded nonnegative function in $\mathbb{R}^+\times \mathbb{R}^{N-3}$. By using the reduction argument… ▽ More In this paper, we consider the following critical polyharmonic equation \begin{align*}%\label{abs} ( -Δ)^m u+V(|y'|,y'')u=u^{m^*-1},\quad u>0, \quad y=(y',y'')\in \mathbb{R}^3\times \mathbb{R}^{N-3}, \end{align*} where $m^*=\frac{2N}{N-2m}$, $N>4m+1$, $m\in \mathbb{N}^+$, and $V(|y'|,y'')$ is a bounded nonnegative function in $\mathbb{R}^+\times \mathbb{R}^{N-3}$. By using the reduction argument and local Pohouzaev identities, we prove that if $r^{2m}V(r,y'')$ has a stable critical point $(r_0,y_0'')$ with $r_0>0$ and $V(r_0,y_0'')>0$, then the above problem has a new type of solutions, which concentrate at points lying on the top and the bottom circles of a cylinder. △ Less

Submitted 25 May, 2024; originally announced May 2024.

arXiv:2405.10392 [pdf, other]

Transport based particle methods for the Fokker-Planck-Landau equation

Authors: Vasily Ilin, Jingwei Hu, Zhenfu Wang

Abstract: We propose a particle method for numerically solving the Landau equation, inspired by the score-based transport modeling (SBTM) method for the Fokker-Planck equation. This method can preserve some important physical properties of the Landau equation, such as the conservation of mass, momentum, and energy, and decay of estimated entropy. We prove that matching the gradient of the logarithm of the a… ▽ More We propose a particle method for numerically solving the Landau equation, inspired by the score-based transport modeling (SBTM) method for the Fokker-Planck equation. This method can preserve some important physical properties of the Landau equation, such as the conservation of mass, momentum, and energy, and decay of estimated entropy. We prove that matching the gradient of the logarithm of the approximate solution is enough to recover the true solution to the Landau equation with Maxwellian molecules. Several numerical experiments in low and moderately high dimensions are performed, with particular emphasis on comparing the proposed method with the traditional particle or blob method. △ Less

Submitted 16 May, 2024; originally announced May 2024.

Comments: 26 pages, 6 figures, code https://github.com/Vilin97/GradientFlows.jl

MSC Class: 35Q84; 65M75; 49Q22; 68T07

arXiv:2405.09973 [pdf, ps, other]

Adaptive Ensemble Control for Stochastic Systems with Mixed Asymmetric Laplace Noises

Authors: Yajie Yu, Xuehui Ma, Shiliang Zhang, Zhuzhu Wang, Xubing Shi, Yushuai Li, Tingwen Huang

Abstract: This paper presents an adaptive ensemble control for stochastic systems subject to asymmetric noises and outliers. Asymmetric noises skew system observations, and outliers with large amplitude deteriorate the observations even further. Such disturbances induce poor system estimation and degraded stochastic system control. In this work, we model the asymmetric noises and outliers by mixed asymmetri… ▽ More This paper presents an adaptive ensemble control for stochastic systems subject to asymmetric noises and outliers. Asymmetric noises skew system observations, and outliers with large amplitude deteriorate the observations even further. Such disturbances induce poor system estimation and degraded stochastic system control. In this work, we model the asymmetric noises and outliers by mixed asymmetric Laplace distributions (ALDs), and propose an optimal control for stochastic systems with mixed ALD noises. Particularly, we segregate the system disturbed by mixed ALD noises into subsystems, each of which is subject to a specific ALD noise. For each subsystem, we design an iterative quantile filter (IQF) to estimate the system parameters using system observations. With the estimated parameters by IQF, we derive the certainty equivalence (CE) control law for each subsystem. Then we use the Bayesian approach to ensemble the subsystem CE controllers, with each of the controllers weighted by their posterior probability. We finalize our control law as the weighted sum of the control signals by the sub-system CE controllers. To demonstrate our approach, we conduct numerical simulations and Monte Carlo analyses. The results show improved tracking performance by our approach for skew noises and its robustness to outliers, compared with single ALD based and RLS-based control policy. △ Less

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.07160 [pdf, ps, other]

Singular Integrals associated with Reflection Groups on Euclidean Space

Authors: Yongsheng Han, Ji Li, Chaoqiang Tan, Zipeng Wang, Xinfeng Wu

Abstract: In the field of harmonic analysis, geometric considerations are frequently crucial. Specially, group actions such as translations, dilations and rotations on Euclidean space are instrumental. The objective of this paper is to extend the study of singular integrals to include the effects of group reflections on Euclidean space, and to establish the T1 theorem for these singular integrals. In the field of harmonic analysis, geometric considerations are frequently crucial. Specially, group actions such as translations, dilations and rotations on Euclidean space are instrumental. The objective of this paper is to extend the study of singular integrals to include the effects of group reflections on Euclidean space, and to establish the T1 theorem for these singular integrals. △ Less

Submitted 12 May, 2024; originally announced May 2024.

arXiv:2405.00276 [pdf, ps, other]

Structure of Dubrovin-Zhang free energy functions and universal identities

Authors: Sergey Shadrin, Zhe Wang

Abstract: We prove a structural theorem relating the higher genera free energy functions of the Dubrovin-Zhang hierarchies to those of the trivial theory, that is, the Witten-Kontsevich free energy functions. As an important application, for any given genus $g\geq 1$, we construct a set of universal identities valid for the free energy functions of any Dubrovin-Zhang hierarchy. We prove a structural theorem relating the higher genera free energy functions of the Dubrovin-Zhang hierarchies to those of the trivial theory, that is, the Witten-Kontsevich free energy functions. As an important application, for any given genus $g\geq 1$, we construct a set of universal identities valid for the free energy functions of any Dubrovin-Zhang hierarchy. △ Less

Submitted 30 April, 2024; originally announced May 2024.

Comments: 24 pages. Comments are welcome!

arXiv:2404.19736 [pdf, other]

On the derivatives of the Liouville currents

Authors: Xinlong Dong, Dragomir Šarić, Zhe Wang

Abstract: The Liouville map, introduced by Bonahon, assigns to each point in the Teichmüller space a natural Radon measure on the space of geodesics of the base surface. The Liouville map is real analytic and it even extends to a holomorphic map of a neighborhood of the Teichmüller space in the Quasi-Fuchsian space of an arbitrary conformally hyperbolic Riemann surface. The earthquake paths and by their ext… ▽ More The Liouville map, introduced by Bonahon, assigns to each point in the Teichmüller space a natural Radon measure on the space of geodesics of the base surface. The Liouville map is real analytic and it even extends to a holomorphic map of a neighborhood of the Teichmüller space in the Quasi-Fuchsian space of an arbitrary conformally hyperbolic Riemann surface. The earthquake paths and by their extension quake-bends, introduced by Thurston, are particularly nice real-analytic and holomorphic paths in the Teichmüller and the Quasi-Fuchsian space, respectively. We find a geometric expression for the derivative of the Liouville map along earthquake paths. △ Less

Submitted 30 April, 2024; originally announced April 2024.

Comments: 24 pages, 5 figures. arXiv admin note: text overlap with arXiv:2111.07809

arXiv:2404.18969 [pdf, ps, other]

Maximum spread of $K_{s,t}$-minor-free graphs

Authors: William Linz, Linyuan Lu, Zhiyu Wang

Abstract: The spread of a graph $G$ is the difference between the largest and smallest eigenvalue of the adjacency matrix of $G$. In this paper, we consider the family of graphs which contain no $K_{s,t}$-minor. We show that for any $t\geq s \geq 2$, there is an integer $ξ_{t}$ such that the extremal $n$-vertex $K_{s,t}$-minor-free graph attaining the maximum spread is the graph obtained by joining a graph… ▽ More The spread of a graph $G$ is the difference between the largest and smallest eigenvalue of the adjacency matrix of $G$. In this paper, we consider the family of graphs which contain no $K_{s,t}$-minor. We show that for any $t\geq s \geq 2$, there is an integer $ξ_{t}$ such that the extremal $n$-vertex $K_{s,t}$-minor-free graph attaining the maximum spread is the graph obtained by joining a graph $L$ on $(s-1)$ vertices to the disjoint union of $\lfloor \frac{2n+ξ_{t}}{3t}\rfloor$ copies of $K_t$ and $n-s+1 - t\lfloor \frac{2n+ξ_t}{3t}\rfloor$ isolated vertices. Furthermore, we give an explicit formula for $ξ_{t}$ and an explicit description for the graph $L$ for $t \geq \frac32(s-3) +\frac{4}{s-1}$. △ Less

Submitted 29 April, 2024; originally announced April 2024.

Comments: 21 pages. arXiv admin note: text overlap with arXiv:2212.05540

arXiv:2404.18041 [pdf, other]

Variational Optimization for Quantum Problems using Deep Generative Networks

Authors: Lingxia Zhang, Xiaodie Lin, Peidong Wang, Kaiyan Yang, Xiao Zeng, Zhaohui Wei, Zizhu Wang

Abstract: Optimization is one of the keystones of modern science and engineering. Its applications in quantum technology and machine learning helped nurture variational quantum algorithms and generative AI respectively. We propose a general approach to design variational optimization algorithms based on generative models: the Variational Generative Optimization Network (VGON). To demonstrate its broad appli… ▽ More Optimization is one of the keystones of modern science and engineering. Its applications in quantum technology and machine learning helped nurture variational quantum algorithms and generative AI respectively. We propose a general approach to design variational optimization algorithms based on generative models: the Variational Generative Optimization Network (VGON). To demonstrate its broad applicability, we apply VGON to three quantum tasks: finding the best state in an entanglement-detection protocol, finding the ground state of a 1D quantum spin model with variational quantum circuits, and generating degenerate ground states of many-body quantum Hamiltonians. For the first task, VGON greatly reduces the optimization time compared to stochastic gradient descent while generating nearly optimal quantum states. For the second task, VGON alleviates the barren plateau problem in variational quantum circuits. For the final task, VGON can identify the degenerate ground state spaces after a single stage of training and generate a variety of states therein. △ Less

Submitted 27 April, 2024; originally announced April 2024.

Comments: 17 pages, 13 figures, comments welcome

arXiv:2404.13866 [pdf, other]

Plug-and-Play Algorithm Convergence Analysis From The Standpoint of Stochastic Differential Equation

Authors: Zhongqi Wang, Bingnan Wang, Maosheng Xiang

Abstract: The Plug-and-Play (PnP) algorithm is popular for inverse image problem-solving. However, this algorithm lacks theoretical analysis of its convergence with more advanced plug-in denoisers. We demonstrate that discrete PnP iteration can be described by a continuous stochastic differential equation (SDE). We can also achieve this transformation through Markov process formulation of PnP. Then, we can… ▽ More The Plug-and-Play (PnP) algorithm is popular for inverse image problem-solving. However, this algorithm lacks theoretical analysis of its convergence with more advanced plug-in denoisers. We demonstrate that discrete PnP iteration can be described by a continuous stochastic differential equation (SDE). We can also achieve this transformation through Markov process formulation of PnP. Then, we can take a higher standpoint of PnP algorithms from stochastic differential equations, and give a unified framework for the convergence property of PnP according to the solvability condition of its corresponding SDE. We reveal that a much weaker condition, bounded denoiser with Lipschitz continuous measurement function would be enough for its convergence guarantee, instead of previous Lipschitz continuous denoiser condition. △ Less

Submitted 22 April, 2024; originally announced April 2024.

Comments: 17pages, Preprint, Under review

arXiv:2404.13492 [pdf, other]

Discrete non-commutative hungry Toda lattice and its application in matrix computation

Authors: Zheng Wang, Shi-Hao Li, Kang-Ya Lu, Jian-Qing Sun

Abstract: In this paper, we plan to show an eigenvalue algorithm for block Hessenberg matrices by using the idea of non-commutative integrable systems and matrix-valued orthogonal polynomials. We introduce adjacent families of matrix-valued $θ$-deformed bi-orthogonal polynomials, and derive corresponding discrete non-commutative hungry Toda lattice from discrete spectral transformations for polynomials. It… ▽ More In this paper, we plan to show an eigenvalue algorithm for block Hessenberg matrices by using the idea of non-commutative integrable systems and matrix-valued orthogonal polynomials. We introduce adjacent families of matrix-valued $θ$-deformed bi-orthogonal polynomials, and derive corresponding discrete non-commutative hungry Toda lattice from discrete spectral transformations for polynomials. It is shown that this discrete system can be used as a pre-precessing algorithm for block Hessenberg matrices. Besides, some convergence analysis and numerical examples of this algorithm are presented. △ Less

Submitted 20 April, 2024; originally announced April 2024.

Comments: 24 pages, 2 figures. Comments are welcome

arXiv:2404.12312 [pdf, ps, other]

A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimiax Optimization

Authors: Yuchen Zhu, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang, Xiaohong Chen

Abstract: This paper studies minimax optimization problems defined over infinite-dimensional function classes of overparameterized two-layer neural networks. In particular, we consider the minimax optimization problem stemming from estimating linear functional equations defined by conditional expectations, where the objective functions are quadratic in the functional spaces. We address (i) the convergence o… ▽ More This paper studies minimax optimization problems defined over infinite-dimensional function classes of overparameterized two-layer neural networks. In particular, we consider the minimax optimization problem stemming from estimating linear functional equations defined by conditional expectations, where the objective functions are quadratic in the functional spaces. We address (i) the convergence of the stochastic gradient descent-ascent algorithm and (ii) the representation learning of the neural networks. We establish convergence under the mean-field regime by considering the continuous-time and infinite-width limit of the optimization dynamics. Under this regime, the stochastic gradient descent-ascent corresponds to a Wasserstein gradient flow over the space of probability measures defined over the space of neural network parameters. We prove that the Wasserstein gradient flow converges globally to a stationary point of the minimax objective at a $O(T^{-1} + α^{-1})$ sublinear rate, and additionally finds the solution to the functional equation when the regularizer of the minimax objective is strongly convex. Here $T$ denotes the time and $α$ is a scaling parameter of the neural networks. In terms of representation learning, our results show that the feature representation induced by the neural networks is allowed to deviate from the initial one by the magnitude of $O(α^{-1})$, measured in terms of the Wasserstein distance. Finally, we apply our general results to concrete examples including policy evaluation, nonparametric instrumental variable regression, asset pricing, and adversarial Riesz representer estimation. △ Less

Submitted 25 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

Comments: Submitted

arXiv:2404.11890 [pdf, other]

FCNCP: A Coupled Nonnegative CANDECOMP/PARAFAC Decomposition Based on Federated Learning

Authors: Yukai Cai, Hang Liu, Xiulin Wang, Hongjin Li, Ziyi Wang, Chuanshuai Yang, Fengyu Cong

Abstract: In the field of brain science, data sharing across servers is becoming increasingly challenging due to issues such as industry competition, privacy security, and administrative procedure policies and regulations. Therefore, there is an urgent need to develop new methods for data analysis and processing that enable scientific collaboration without data sharing. In view of this, this study proposes… ▽ More In the field of brain science, data sharing across servers is becoming increasingly challenging due to issues such as industry competition, privacy security, and administrative procedure policies and regulations. Therefore, there is an urgent need to develop new methods for data analysis and processing that enable scientific collaboration without data sharing. In view of this, this study proposes to study and develop a series of efficient non-negative coupled tensor decomposition algorithm frameworks based on federated learning called FCNCP for the EEG data arranged on different servers. It combining the good discriminative performance of tensor decomposition in high-dimensional data representation and decomposition, the advantages of coupled tensor decomposition in cross-sample tensor data analysis, and the features of federated learning for joint modelling in distributed servers. The algorithm utilises federation learning to establish coupling constraints for data distributed across different servers. In the experiments, firstly, simulation experiments are carried out using simulated data, and stable and consistent decomposition results are obtained, which verify the effectiveness of the proposed algorithms in this study. Then the FCNCP algorithm was utilised to decompose the fifth-order event-related potential (ERP) tensor data collected by applying proprioceptive stimuli on the left and right hands. It was found that contralateral stimulation induced more symmetrical components in the activation areas of the left and right hemispheres. The conclusions drawn are consistent with the interpretations of related studies in cognitive neuroscience, demonstrating that the method can efficiently process higher-order EEG data and that some key hidden information can be preserved. △ Less

Submitted 18 April, 2024; originally announced April 2024.

arXiv:2404.10672 [pdf, ps, other]

Betti numbers of normal edge rings

Authors: Zexin Wang, Dancheng Lu

Abstract: A novel approach is introduced for computing the multi-graded Betti numbers of normal edge rings. This method is employed to delve into the edge rings of three distinct classes of simple graphs that adhere to the odd-cycle condition. These classes include compact graphs, which are devoid of even cycles and satisfy the odd-cycle condition; graphs comprised of multiple paths converging at two shared… ▽ More A novel approach is introduced for computing the multi-graded Betti numbers of normal edge rings. This method is employed to delve into the edge rings of three distinct classes of simple graphs that adhere to the odd-cycle condition. These classes include compact graphs, which are devoid of even cycles and satisfy the odd-cycle condition; graphs comprised of multiple paths converging at two shared vertices; and graphs introduced in \cite{HHKO} that exhibit both even and odd cycles. Explicit formulas are provided for the multi-graded Betti numbers pertaining to the edge rings of these graphs. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Comments: 35 pages

arXiv:2404.09314 [pdf, ps, other]

Modular data of non-semisimple modular categories

Authors: Liang Chang, Quinn T. Kolt, Zhenghan Wang, Qing Zhang

Abstract: We investigate non-semisimple modular categories with an eye towards a structure theory, low-rank classification, and applications to low dimensional topology and topological physics. We aim to extend the well-understood theory of semisimple modular categories to the non-semisimple case by using representations of factorizable ribbon Hopf algebras as a case study. We focus on the Cohen-Westreich m… ▽ More We investigate non-semisimple modular categories with an eye towards a structure theory, low-rank classification, and applications to low dimensional topology and topological physics. We aim to extend the well-understood theory of semisimple modular categories to the non-semisimple case by using representations of factorizable ribbon Hopf algebras as a case study. We focus on the Cohen-Westreich modular data, which is obtained from the Lyubashenko-Majid modular representation restricted to the Higman ideal of a factorizable ribbon Hopf algebra. The Cohen-Westreich $S$-matrix diagonalizes the mixed fusion rules and reduces to the usual $S$-matrix for semisimple modular categories. The paper includes detailed studies on small quantum groups $U_qsl(2)$ and the Drinfeld doubles of Nichols Hopf algebras, especially the $\mathrm{SL}(2, \mathbb{Z})$-representation on their centers, Cohen-Westreich modular data, and the congruence kernel theorem's validity. △ Less

Submitted 6 May, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

Comments: 51 pages. Minor changes to fix typos

arXiv:2404.07552 [pdf, ps, other]

Correspondence Research of the Most Probable Transition Paths between a Stochastic Interacting Particle System and its Mean Field Limit System

Authors: Jianyu Chen, Jianyu Hu, Zibo Wang, Ting Gao Jinqiao Duan

Abstract: This paper derived the indirect approximation theorem of the most probable transition pathway of a stochastic interacting particle system in the mean field sense. This paper studied the problem of indirect approximation of the most probable transition pathway of an interacting particle system (i.e., a high-dimensional stochastic dynamic system) and its mean field limit equation (McKean-Vlasov stoc… ▽ More This paper derived the indirect approximation theorem of the most probable transition pathway of a stochastic interacting particle system in the mean field sense. This paper studied the problem of indirect approximation of the most probable transition pathway of an interacting particle system (i.e., a high-dimensional stochastic dynamic system) and its mean field limit equation (McKean-Vlasov stochastic differential equation). This study is based on the Onsager-Machlup action functional, reformulated the problem as an optimal control problem. With the stochastic Pontryagin's Maximum Principle, this paper completed the derivation. This paper proved the existence and uniqueness theorem of the solution to the mean field optimal control problem of McKean-Vlasov stochastic differential equations, and also established a system of equations satisfying the control parameters $θ^{*}$ and $θ^{N}$ respectively. There are few studies on the most probable transition pathways of stochastic interacting particle systems, it is still a great challenge to solve the most probable transition pathways directly or to approximate it with the mean field limit system. Therefore, this paper first gave the proof of correspondence between the core equation of Pontryagin's Maximum Principle, that is, Hamiltonian extreme condition equation. That is to say, this correspondence indirectly explain the correspondence between the most probable transition pathways of stochastic interacting particle systems and the mean field systems. △ Less

Submitted 11 April, 2024; originally announced April 2024.

arXiv:2404.07323 [pdf, other]

Surrogate modeling for probability distribution estimation:uniform or adaptive design?

Authors: Maijia Su, Ziqi Wang, Oreste Salvatore Bursi, Marco Broccardo

Abstract: The active learning (AL) technique, one of the state-of-the-art methods for constructing surrogate models, has shown high accuracy and efficiency in forward uncertainty quantification (UQ) analysis. This paper provides a comprehensive study on AL-based global surrogates for computing the full distribution function, i.e., the cumulative distribution function (CDF) and the complementary CDF (CCDF).… ▽ More The active learning (AL) technique, one of the state-of-the-art methods for constructing surrogate models, has shown high accuracy and efficiency in forward uncertainty quantification (UQ) analysis. This paper provides a comprehensive study on AL-based global surrogates for computing the full distribution function, i.e., the cumulative distribution function (CDF) and the complementary CDF (CCDF). To this end, we investigate the three essential components for building surrogates, i.e., types of surrogate models, enrichment methods for experimental designs, and stopping criteria. For each component, we choose several representative methods and study their desirable configurations. In addition, we devise a uniform design (i.e., space-filling design) as a baseline for measuring the improvement of using AL. Combining all the representative methods, a total of 1,920 UQ analyses are carried out to solve 16 benchmark examples. The performance of the selected strategies is evaluated based on accuracy and efficiency. In the context of full distribution estimation, this study concludes that (i) AL techniques cannot provide a systematic improvement compared with uniform designs, (ii) the recommended surrogate modeling methods depend on the features of the problems (especially the local nonlinearity), target accuracy, and computational budget. △ Less

Submitted 10 April, 2024; originally announced April 2024.

arXiv:2404.03246 [pdf, ps, other]

On the Range of a class of Complex Monge-Ampère operators on compact Hermitian manifolds

Authors: Yinji Li, Zhiwei Wang, Xiangyu Zhou

Abstract: Let $(X,ω)$ be a compact Hermitian manifold of complex dimension $n$. Let $β$ be a smooth real closed $(1,1)$ form such that there exists a function $ρ\in \mbox{PSH}(X,β)\cap L^{\infty}(X)$. We study the range of the complex non-pluripolar Monge-Ampère operator $\langle(β+dd^c\cdot)^n\rangle$ on weighted Monge-Ampère energy classes on $X$. In particular, when $ρ$ is assumed to be continuous, we gi… ▽ More Let $(X,ω)$ be a compact Hermitian manifold of complex dimension $n$. Let $β$ be a smooth real closed $(1,1)$ form such that there exists a function $ρ\in \mbox{PSH}(X,β)\cap L^{\infty}(X)$. We study the range of the complex non-pluripolar Monge-Ampère operator $\langle(β+dd^c\cdot)^n\rangle$ on weighted Monge-Ampère energy classes on $X$. In particular, when $ρ$ is assumed to be continuous, we give a complete characterization of the range of the complex Monge-Ampère operator on the class $\mathcal E(X,β)$, which is the class of all $\varphi \in \mbox{PSH}(X,β)$ with full Monge-Ampère mass, i.e. $\int_X\langle (β+dd^c\varphi)^n\rangle=\int_Xβ^n$. △ Less

Submitted 4 April, 2024; originally announced April 2024.

Comments: Comments welcome!

arXiv:2404.01639 [pdf, ps, other]

On tight $(k,\ell)$-stable graphs

Authors: Xiaonan Liu, Zi-Xia Song, Zhiyu Wang

Abstract: For integers $k>\ell\ge0$, a graph $G$ is $(k,\ell)$-stable if $α(G-S)\geq α(G)-\ell$ for every $S\subseteq V(G)$ with $|S|=k$. A recent result of Dong and Wu [SIAM J. Discrete Math., 36 (2022) 229--240] shows that every $(k,\ell)$-stable graph $G$ satisfies $α(G) \le \lfloor ({|V(G)|-k+1})/{2}\rfloor+\ell$. A $(k,\ell)$-stable graph $G$ is tight if $α(G) = \lfloor ({|V(G)|-k+1})/{2}\rfloor+\ell$;… ▽ More For integers $k>\ell\ge0$, a graph $G$ is $(k,\ell)$-stable if $α(G-S)\geq α(G)-\ell$ for every $S\subseteq V(G)$ with $|S|=k$. A recent result of Dong and Wu [SIAM J. Discrete Math., 36 (2022) 229--240] shows that every $(k,\ell)$-stable graph $G$ satisfies $α(G) \le \lfloor ({|V(G)|-k+1})/{2}\rfloor+\ell$. A $(k,\ell)$-stable graph $G$ is tight if $α(G) = \lfloor ({|V(G)|-k+1})/{2}\rfloor+\ell$; and $q$-tight for some integer $q\ge0$ if $α(G) = \lfloor ({|V(G)|-k+1})/{2}\rfloor+\ell-q$. In this paper, we first prove that for all $k\geq 24$, the only tight $(k, 0)$-stable graphs are $K_{k+1}$ and $K_{k+2}$, answering a question of Dong and Luo [arXiv: 2401.16639]. We then prove that for all nonnegative integers $k, \ell, q$ with $k\geq 3\ell+3$, every $q$-tight $(k,\ell)$-stable graph has at most $k-3\ell-3+2^{3(\ell+2q+4)^2}$ vertices, answering a question of Dong and Luo in the negative. △ Less

Submitted 2 April, 2024; originally announced April 2024.

Comments: 11 pages

MSC Class: 05C69

arXiv:2404.00199 [pdf, other]

An Efficient Sparse Identification Algorithm For Stochastic Systems With General Observation Sequences

Authors: Ziming Wang, Xinghua Zhu

Abstract: This paper studies the sparse identification problem of unknown sparse parameter vectors in stochastic dynamic systems. Firstly, a novel sparse identification algorithm is proposed, which can generate sparse estimates based on least squares estimation by adaptively adjusting the threshold. Secondly, under a possibly weakest non-persistent excited condition, we prove that the proposed algorithm can… ▽ More This paper studies the sparse identification problem of unknown sparse parameter vectors in stochastic dynamic systems. Firstly, a novel sparse identification algorithm is proposed, which can generate sparse estimates based on least squares estimation by adaptively adjusting the threshold. Secondly, under a possibly weakest non-persistent excited condition, we prove that the proposed algorithm can correctly identify the zero and nonzero elements of the sparse parameter vector using a finite number of observations, and further estimates of the nonzero elements almost surely converge to the true values. Compared with the related works, e.g., LASSO, our method only requires the weakest assumptions and does not require solving additional optimization problems. Besides, our theoretical results do not require any statistical assumptions on the regression signals, including independence or stationarity, which makes our results promising for application to stochastic feedback systems. Thirdly, the number of finite observations that guarantee the convergence of the zero-element set of unknown sparse parameters of the Hammerstein system is derived for the first time. Finally, numerical simulations are provided, demonstrating the effectiveness of the proposed method. Since there is no additional optimization problem, i.e., no additional numerical error, the proposed algorithm performs much better than other related algorithms. △ Less

Submitted 29 March, 2024; originally announced April 2024.

Comments: arXiv admin note: text overlap with arXiv:2203.02737 by other authors

arXiv:2404.00055 [pdf, other]

Efficient Global Algorithms for Transmit Beamforming Design in ISAC Systems

Authors: Jiageng Wu, Zhiguo Wang, Ya-Feng Liu, Fan Liu

Abstract: In this paper, we propose a multi-input multi-output transmit beamforming optimization model for joint radar sensing and multi-user communications, where the design of the beamformers is formulated as an optimization problem whose objective is a weighted combination of the sum rate and the Cramér-Rao bound, subject to the transmit power budget. Obtaining the global solution for the formulated nonc… ▽ More In this paper, we propose a multi-input multi-output transmit beamforming optimization model for joint radar sensing and multi-user communications, where the design of the beamformers is formulated as an optimization problem whose objective is a weighted combination of the sum rate and the Cramér-Rao bound, subject to the transmit power budget. Obtaining the global solution for the formulated nonconvex problem is a challenging task, since the sum-rate maximization problem itself (even without considering the sensing metric) is known to be NP-hard. The main contributions of this paper are threefold. Firstly, we derive an optimal closed-form solution to the formulated problem in the single-user case and the multi-user case where the channel vectors of different users are orthogonal. Secondly, for the general multi-user case, we propose a novel branch and bound (B\&B) algorithm based on the McCormick envelope relaxation. The proposed algorithm is guaranteed to find the globally optimal solution to the formulated problem. Thirdly, we design a graph neural network (GNN) based pruning policy to determine irrelevant nodes that can be directly pruned in the proposed B\&B algorithm, thereby significantly reducing the number of unnecessary enumerations therein and improving its computational efficiency. Simulation results show the efficiency of the proposed vanilla and GNN-based accelerated B\&B algorithms. △ Less

Submitted 26 March, 2024; originally announced April 2024.

Comments: Submitted for possible publication

arXiv:2403.19413 [pdf, ps, other]

Carleman estimates for space semi-discrete approximations of one-dimensional stochastic parabolic equation and its applications

Authors: Bin Wu, Ying Wang, Zewen Wang

Abstract: In this paper, we study discrete Carleman estimates for space semi-discrete approximations of one-dimensional stochastic parabolic equation. As applications of these discrete Carleman estimates, we apply them to study two inverse problems for the spatial semi-discrete stochastic parabolic equations, including a discrete inverse random source problem and a discrete Cauchy problem. We firstly establ… ▽ More In this paper, we study discrete Carleman estimates for space semi-discrete approximations of one-dimensional stochastic parabolic equation. As applications of these discrete Carleman estimates, we apply them to study two inverse problems for the spatial semi-discrete stochastic parabolic equations, including a discrete inverse random source problem and a discrete Cauchy problem. We firstly establish two Carleman estimates for a one-dimensional semi-discrete stochastic parabolic equation, one for homogeneous boundary and the other for non-homogeneous boundary. Then we apply these two estimates separately to derive two stability results. The first one is the Lipschitz stability for the discrete inverse random source problem. The second one is the Hölder stability for the discrete Cauchy problem. △ Less

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.16825 [pdf, ps, other]

Weak Convergence Analysis of Online Neural Actor-Critic Algorithms

Authors: Samuel Chun-Hei Lam, Justin Sirignano, Ziheng Wang

Abstract: We prove that a single-layer neural network trained with the online actor critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of training steps $\rightarrow \infty$. In the online actor-critic algorithm, the distribution of the data samples dynamically changes as the model is updated, which is a key challenge for… ▽ More We prove that a single-layer neural network trained with the online actor critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of training steps $\rightarrow \infty$. In the online actor-critic algorithm, the distribution of the data samples dynamically changes as the model is updated, which is a key challenge for any convergence analysis. We establish the geometric ergodicity of the data samples under a fixed actor policy. Then, using a Poisson equation, we prove that the fluctuations of the model updates around the limit distribution due to the randomly-arriving data samples vanish as the number of parameter updates $\rightarrow \infty$. Using the Poisson equation and weak convergence techniques, we prove that the actor neural network and critic neural network converge to the solutions of a system of ODEs with random initial conditions. Analysis of the limit ODE shows that the limit critic network will converge to the true value function, which will provide the actor an asymptotically unbiased estimate of the policy gradient. We then prove that the limit actor network will converge to a stationary point. △ Less

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2403.14958 [pdf, other]

Adapprox: Adaptive Approximation in Adam Optimization via Randomized Low-Rank Matrices

Authors: Pengxiang Zhao, Ping Li, Yingjie Gu, Yi Zheng, Stephan Ludger Kölker, Zhefeng Wang, Xiaoming Yuan

Abstract: As deep learning models exponentially increase in size, optimizers such as Adam encounter significant memory consumption challenges due to the storage of first and second moment data. Current memory-efficient methods like Adafactor and CAME often compromise accuracy with their matrix factorization techniques. Addressing this, we introduce Adapprox, a novel approach that employs randomized low-rank… ▽ More As deep learning models exponentially increase in size, optimizers such as Adam encounter significant memory consumption challenges due to the storage of first and second moment data. Current memory-efficient methods like Adafactor and CAME often compromise accuracy with their matrix factorization techniques. Addressing this, we introduce Adapprox, a novel approach that employs randomized low-rank matrix approximation for a more effective and accurate approximation of Adam's second moment. Adapprox features an adaptive rank selection mechanism, finely balancing accuracy and memory efficiency, and includes an optional cosine similarity guidance strategy to enhance stability and expedite convergence. In GPT-2 training and downstream tasks, Adapprox surpasses AdamW by achieving 34.5% to 49.9% and 33.8% to 49.9% memory savings for the 117M and 345M models, respectively, with the first moment enabled, and further increases these savings without the first moment. Besides, it enhances convergence speed and improves downstream task performance relative to its counterparts. △ Less

Submitted 22 March, 2024; originally announced March 2024.

arXiv:2403.14936 [pdf, ps, other]

Some evaluations of interpolated multiple zeta values and interpolated multiple $t$-values

Authors: Zhonghua Li, Zhenlu Wang

Abstract: In this paper, we study the evaluation formulas of the interpolated multiple zeta values and the interpolated multiple $t$-values with indices involving $1,2,3$. To get these evaluations, we derive the corresponding algebraic relations in the harmonic algebra. In this paper, we study the evaluation formulas of the interpolated multiple zeta values and the interpolated multiple $t$-values with indices involving $1,2,3$. To get these evaluations, we derive the corresponding algebraic relations in the harmonic algebra. △ Less

Submitted 22 April, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

arXiv:2403.13081 [pdf, other]

Parameter Estimation from Single Patient, Single Time-Point Sequencing Data of Recurrent Tumors

Authors: Kevin Leder, Ruping Sun, Zicheng Wang, Xuanming Zhang

Abstract: In this study, we develop consistent estimators for key parameters that govern the dynamics of tumor cell populations when subjected to pharmacological treatments. While these treatments often lead to an initial reduction in the abundance of drug-sensitive cells, a population of drug-resistant cells frequently emerges over time, resulting in cancer recurrence. Samples from recurrent tumors present… ▽ More In this study, we develop consistent estimators for key parameters that govern the dynamics of tumor cell populations when subjected to pharmacological treatments. While these treatments often lead to an initial reduction in the abundance of drug-sensitive cells, a population of drug-resistant cells frequently emerges over time, resulting in cancer recurrence. Samples from recurrent tumors present as an invaluable data source that can offer crucial insights into the ability of cancer cells to adapt and withstand treatment interventions. To effectively utilize the data obtained from recurrent tumors, we derive several large number limit theorems, specifically focusing on the metrics that quantify the clonal diversity of cancer cell populations at the time of cancer recurrence. These theorems then serve as the foundation for constructing our estimators. A distinguishing feature of our approach is that our estimators only require a single time-point sequencing data from a single tumor, thereby enhancing the practicality of our approach and enabling the understanding of cancer recurrence at the individual level. △ Less

Submitted 19 March, 2024; originally announced March 2024.

arXiv:2403.12383 [pdf, ps, other]

New Regularity Criteria for Navier-Stokes and SQG Equations in Critical Spaces

Authors: Yiran Xu, Ly Kim Ha, Haina Li, Zexi Wang

Abstract: In this paper, we investigate some priori estimates to provide the critical regularity criteria for incompressible Navier-Stokes equations on $\mathbb{R}^3$ and super critical surface quasi-geostrophic equations on $\mathbb{R}^2$. Concerning the Navier-Stokes equation, we demonstrate that a Leray-Hopf solution $u$ is regular if… ▽ More In this paper, we investigate some priori estimates to provide the critical regularity criteria for incompressible Navier-Stokes equations on $\mathbb{R}^3$ and super critical surface quasi-geostrophic equations on $\mathbb{R}^2$. Concerning the Navier-Stokes equation, we demonstrate that a Leray-Hopf solution $u$ is regular if $u\in L_T^{\frac{2}{1-α}} \dot{B}^{-α}_{\infty,\infty}(\mathbb{R}^3)$, or $u$ in Lorentz space $ L_T^{p,r} \dot{B}^{-1+\frac{2}{p}}_{\infty,\infty}(\mathbb{R}^3)$, with $4\leq p\leq r<\infty$. Additionally, an alternative regularity condition is expressed as $u\in L_{T}^{\frac{2}{1-α}} \dot{B}^{-α}_{\infty,\infty}(\mathbb{R}^3)+{L_T^\infty\dot{B}^{-1}_{\infty,\infty}}(\mathbb{R}^3)$($α\in(0,1)$), contingent upon a smallness assumption on the norm $L_T^\infty\dot{B}^{-1}_{\infty,\infty}$. For the SQG equation, we derive that a Leray-Hopf weak solution $θ\in L_T^{\fracα{\varepsilon}} \dot{C}^{1-α+ε}(\mathbb{R}^2)$ is smooth for any $\varepsilon$ small enough. Similar to the case of Navier-Stokes equation, we derive regularity criterion in more refined spaces, i.e. Lorentz spaces $L_T^{\fracαε,r}\dot{C}^{1-α+ε}(\mathbb{R}^2)$ and addition of two critical spaces $L_{T}^{\fracαε}\dot{C}^{1-α+ε}(\mathbb{R}^2)+{L_T^\infty\dot{C}^{1-α}(\mathbb{R}^2)}$, with smallness assumption on $L_T^\infty\dot{C}^{1-α}(\mathbb{R}^2)$. △ Less

Submitted 12 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

Showing 1–50 of 1,256 results for author: Wang, Z