-
Spectral gaps and Fourier decay for self-conformal measures in the plane
Authors:
Amir Algom,
Federico Rodriguez Hertz,
Zhiren Wang
Abstract:
We show that every self-conformal measure with respect to a $C^ω(\mathbb{C})$ IFS has polynomial Fourier decay under some mild non-linearity and irreducibility conditions. A key step is the proof of a uniform spectral gap for the transfer operator that does not require the cylinder covering of the attractor to be a Markov partition. It is based on a cocycle version of a method of Oh-Winter (2017).
Submitted 16 July, 2024;
originally announced July 2024.
-
A Study on Lampreys Population Based on Sex-Ratio-Related Growth-Balance Model
Authors:
Zuhua Ji,
Jiarui Chen,
Zihang Wang
Abstract:
Lampreys are one of the oldest species in the world, having existed longer than dinosaurs, which is related to their ability to change the sex ratio during their lifespan. In this paper, to understand how sex ratio and food quantity affect the population growth rate of lampreys, the researchers draw inspiration from the logistic model and establish a model called EcoSexChange (ESC), which results in a population initially increasing and then stabilizing, a reasonable outcome that may apply to other organisms with significant differences in consumption between sexes. Subsequently, this paper develops the Sex Ratio Adaptation Eco Impact (SRAEI) model based on the ESC model, using the ABM algorithm to simulate how the population of lampreys, whose lives are divided into seven stages, grows and stabilizes. A sudden disaster factor is then introduced in the middle of the simulation, with a comparison against lampreys that cannot adjust their sex ratio. The results of this paper are of great reference significance for analyzing the population changes of lampreys in different living environments, and they are also easy to apply to other species with large differences between males and females.
Submitted 14 July, 2024;
originally announced July 2024.
-
Stable rank for crossed products by finite group actions with the weak tracial Rokhlin property
Authors:
Xiaochun Fang,
Zhongli Wang
Abstract:
Let $A$ be an infinite-dimensional stably finite simple unital C*-algebra, let $G$ be a finite group, and let $α\colon G\rightarrow \mathrm{Aut}(A)$ be an action of $G$ on $A$ which has the weak tracial Rokhlin property. We prove that if $A$ has property (TM), then the crossed product $A\rtimes_αG$ has property (TM). As a corollary, if $A$ is an infinite-dimensional separable simple unital C*-algebra which has stable rank one and strict comparison, $α\colon G\rightarrow \mathrm{Aut}(A)$ is an action of a finite group $G$ on $A$ with the weak tracial Rokhlin property, then $A\rtimes_αG$ has stable rank one.
Submitted 13 July, 2024;
originally announced July 2024.
-
$N$ -Laplacian and $N/2$-Hessian type equations with exponential reaction term and measure data
Authors:
Shiguang Ma,
Zijian Wang
Abstract:
In this article, we will prove existence results for the equations of the type $-Δ_{N}u=H_{l}(u)+μ$ and $F_{\frac{N}{2}}[-u]=H_{l}(u)+μ$ in a bounded domain $Ω$, with Dirichlet boundary condition, where the source term $H_{l}(r)$ takes the form $e^{r}-\sum_{j=0}^{l-1}\frac{r^{j}}{j!}$ and $μ$ is a nonnegative Radon measure.
Submitted 11 July, 2024;
originally announced July 2024.
-
Blowing-up solutions for the Choquard type Brezis-Nirenberg problem in dimension three
Authors:
Wenjing Chen,
Zexi Wang
Abstract:
In this paper, we are interested in the existence of solutions for the following Choquard type Brezis-Nirenberg problem \begin{align*}
\left\{
\begin{array}{ll}
-Δu=\displaystyle\Big(\int\limits_Ω\frac{u^{6-α}(y)}{|x-y|^α}dy\Big)u^{5-α}+λu, \ \ &\mbox{in}\ Ω,\\
u=0, \ \ &\mbox{on}\ \partial Ω,
\end{array}
\right.
\end{align*} where $Ω$ is a smooth bounded domain in $\mathbb{R}^3$, $α\in (0,3)$, $6-α$ is the upper critical exponent in the sense of the Hardy-Littlewood-Sobolev inequality, and $λ$ is a real positive parameter. By applying the reduction argument, we find and characterize a positive value $λ_0$ such that if $λ-λ_0>0$ is small enough, then the above problem admits a solution, which blows up and concentrates at the critical point of the Robin function as $λ\rightarrow λ_0$. Moreover, we consider the above problem under zero Neumann boundary condition.
Submitted 9 July, 2024;
originally announced July 2024.
-
PDEformer-1: A Foundation Model for One-Dimensional Partial Differential Equations
Authors:
Zhanhong Ye,
Xiang Huang,
Leheng Chen,
Zining Liu,
Bingyang Wu,
Hongsheng Liu,
Zidong Wang,
Bin Dong
Abstract:
This paper introduces PDEformer-1, a versatile neural solver capable of simultaneously addressing various partial differential equations (PDEs). With the PDE represented as a computational graph, we facilitate the seamless integration of symbolic and numeric information inherent in a PDE. A graph Transformer and an implicit neural representation (INR) are employed subsequently to generate mesh-free predicted solutions. We generated a dataset with up to three million samples involving diverse one-dimensional PDEs to pretrain our model. Compared with baseline models trained specifically on benchmark datasets, our pretrained model achieves comparable accuracy via zero-shot inference, and the advantage expands after finetuning. For PDEs new or unseen in the pretraining stage, our model can adapt quickly by finetuning on a relatively small set of examples from the target equation. Additionally, PDEformer-1 demonstrates promising results in the inverse problem of PDE scalar coefficient recovery and coefficient field recovery.
Submitted 9 July, 2024;
originally announced July 2024.
-
A Stochastic Interacting Particle-Field Algorithm for a Haptotaxis Advection-Diffusion System Modeling Cancer Cell Invasion
Authors:
Boyi Hu,
Zhongjian Wang,
Jack Xin,
Zhiwen Zhang
Abstract:
The investigation of tumor invasion and metastasis dynamics is crucial for advancements in cancer biology and treatment. Many mathematical models have been developed to study the invasion of host tissue by tumor cells. In this paper, we develop a novel stochastic interacting particle-field (SIPF) algorithm that accurately simulates the cancer cell invasion process within the haptotaxis advection-diffusion (HAD) system. Our approach approximates solutions using empirical measures of particle interactions, combined with a smoother field variable - the extracellular matrix concentration (ECM) - computed by the spectral method. We derive a one-step time recursion for both the positions of stochastic particles and the field variable using the implicit Euler discretization, which is based on the explicit Green's function of an elliptic operator characterized by the Laplacian minus a positive constant. Our numerical experiments demonstrate the superior performance of the proposed algorithm, especially in computing cancer cell growth with thin free boundaries in three-dimensional (3D) space. Numerical results show that the SIPF algorithm is mesh-free, self-adaptive, and low-cost. Moreover, it is more accurate and efficient than traditional numerical techniques such as the finite difference method (FDM) and spectral methods.
Submitted 8 July, 2024;
originally announced July 2024.
-
Self-absorption of Hankel systems on monoids --a seemingly universal property
Authors:
Yong Han,
Yanqi Qiu,
Zipeng Wang
Abstract:
Given any cancellative monoid $\mathcal{M}$, we study the Hankel system determined by its multiplication table. We prove that the Hankel system admits self-absorption property provided that the monoid $\mathcal{M}$ has the local algebraic structure: \[ \big(ax = by, cx=dy, az=bw \,\, \text{in $\mathcal{M}$}\big)\Longrightarrow \big(cz=dw \,\, \text{in $\mathcal{M}$}\big). \] Our result holds for all group-embeddable monoids and goes beyond. In particular, it works for all cancellative Abelian monoids and most common non-Abelian cancellative monoids such as $$ \mathrm{SL}_d(\mathbb{N}): = \big\{[a_{ij}]_{1\le i,j\le d}\in \mathrm{SL}_d(\mathbb{Z})\big| a_{ij} \in \mathbb{N}\big\}. $$ The Hankel system determined by the multiplication table of a monoid is further generalized to that determined by level sets of any abstract two-variable map. We introduce an algebraic notion of lunar maps and establish a stronger hereditary self-absorption property for the corresponding generalized Hankel systems. As a consequence, we prove the self-absorption property for arbitrary spatial compression of the regular representation system $\{λ_G(g)\}_{g\in G}$ of any discrete group $G$, as well as the Hankel system $\{Γ_\ell^Φ\}$ determined by the level sets of any rational map of the form $Φ(x,y)=a x^m + b y^n$ with $a,b,m,n\in \mathbb{Z}^*$: \[ Γ_\ell^Φ(x, y)= \mathbf{1}(a x^m + b y^n= \ell), \quad x, y\in \mathbb{N}^*, \, \ell\in Φ(\mathbb{N}^*\times \mathbb{N}^*). \] The self-absorption property is applied to the study of completely bounded Fourier multipliers between Hardy spaces. Further applications are: i) exact complete bounded norm of the Carleman embedding in any dimension; ii) mixed Fourier-Schur multiplier inequalities with critical exponent $4/3$; iii) failure of hyper-complete-contractivity for the Poisson semigroup.
Submitted 1 July, 2024;
originally announced July 2024.
-
New type of solutions for a critical Grushin-type problem with competing potentials
Authors:
Wenjing Chen,
Zexi Wang
Abstract:
In this paper, we consider a critical Grushin-type problem with double potentials. By applying the reduction argument and local Pohozaev identities, we construct a new family of solutions to this problem, which are concentrated at points lying on the top and the bottom circles of a cylinder.
Submitted 29 June, 2024;
originally announced July 2024.
-
DiffusionPDE: Generative PDE-Solving Under Partial Observation
Authors:
Jiahe Huang,
Guandao Yang,
Zichen Wang,
Jeong Joon Park
Abstract:
We introduce a general framework for solving partial differential equations (PDEs) using generative diffusion models. In particular, we focus on the scenarios where we do not have the full knowledge of the scene necessary to apply classical solvers. Most existing forward or inverse PDE approaches perform poorly when the observations on the data or the underlying coefficients are incomplete, which is a common assumption for real-world measurements. In this work, we propose DiffusionPDE that can simultaneously fill in the missing information and solve a PDE by modeling the joint distribution of the solution and coefficient spaces. We show that the learned generative priors lead to a versatile framework for accurately solving a wide range of PDEs under partial observation, significantly outperforming the state-of-the-art methods for both forward and inverse directions.
Submitted 25 June, 2024;
originally announced June 2024.
-
Lusztig's Jordan decomposition and a finite field instance of relative Langlands duality
Authors:
Zhicheng Wang
Abstract:
Lusztig \cite{L5,L6} gave a parametrization for $\rm{Irr}(G^F)$, where $G$ is a reductive algebraic group defined over $\mathbb{F}_q$, with Frobenius map $F$. This parametrization is known as Lusztig's Jordan decomposition or Lusztig correspondence. However, there is not a canonical choice of Lusztig correspondence. In this paper, we consider classical groups. We pick a canonical choice of Lusztig correspondence which is compatible with parabolic induction and is compatible with theta correspondence. This result extends Pan's result in \cite{P3}. As an application, we give a refinement of the results of the finite Gan-Gross-Prasad problem in \cite{Wang1} and prove a duality between Theta correspondence and finite Gan-Gross-Prasad problem, which can be regarded as a finite field instance of relative Langlands duality of Ben-Zvi-Sakellaridis-Venkatesh \cite{BZSV}.
Submitted 25 June, 2024;
originally announced June 2024.
-
The sparse Kaczmarz method with surrogate hyperplane for the regularized basis pursuit problem
Authors:
Ze Wang,
Jun-Feng Yin,
Ji-Chen Zhao
Abstract:
The sparse Kaczmarz method is a well-known and widely used iterative method for solving the regularized basis pursuit problem. A general scheme of the surrogate hyperplane sparse Kaczmarz method is proposed. In particular, a class of residual-based surrogate hyperplane sparse Kaczmarz methods is introduced and their implementations are discussed. Their convergence is proved, and the linear convergence rates are studied and compared in detail. Numerical experiments verify the efficiency of the proposed methods.
Submitted 21 June, 2024;
originally announced June 2024.
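For orientation on the sparse Kaczmarz entry above, the following is a minimal sketch of the classical sparse Kaczmarz iteration for the regularized basis pursuit problem $\min_x λ\|x\|_1 + \frac{1}{2}\|x\|_2^2$ subject to $Ax=b$. It is not the surrogate hyperplane variant proposed in the paper, whose details the abstract does not give. The function names, the cyclic row order, and the toy system are assumptions made for illustration only.

```python
def soft_threshold(v, lam):
    """Componentwise soft shrinkage: sign(t) * max(|t| - lam, 0)."""
    out = []
    for t in v:
        if t > lam:
            out.append(t - lam)
        elif t < -lam:
            out.append(t + lam)
        else:
            out.append(0.0)
    return out


def sparse_kaczmarz(A, b, lam=1.0, sweeps=100):
    """Classical sparse Kaczmarz with cyclic row selection (illustrative only).

    A is a list of rows, b a list of right-hand sides. Each step projects the
    auxiliary variable z using one row of A, then soft-thresholds z to get x.
    """
    n = len(A[0])
    z = [0.0] * n
    x = [0.0] * n
    for _ in range(sweeps):
        for a_i, b_i in zip(A, b):
            norm_sq = sum(t * t for t in a_i)
            if norm_sq == 0.0:
                continue  # skip zero rows, they carry no information
            # Scaled residual of the current iterate x on row i.
            step = (sum(p * q for p, q in zip(a_i, x)) - b_i) / norm_sq
            z = [zj - step * aj for zj, aj in zip(z, a_i)]
            x = soft_threshold(z, lam)
    return x


# Hypothetical toy system: A is the identity, so the constrained solution is b.
A = [[1.0, 0.0], [0.0, 1.0]]
b = [2.0, 0.0]
x = sparse_kaczmarz(A, b, lam=1.0, sweeps=5)
```

Randomized row selection is the more common choice in the literature; the cyclic order is used here only to keep the example deterministic.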
-
Achirality of Sol 3-Manifolds, Stevenhagen Conjecture and Shimizu's L-series
Authors:
Ye Tian,
Shicheng Wang,
Zhongzi Wang
Abstract:
A closed orientable manifold is {\em achiral} if it admits an orientation reversing homeomorphism. A commensurable class of closed manifolds is achiral if it contains an achiral element, or equivalently, each manifold in $\CM$ has an achiral finite cover.
Each commensurable class containing non-orientable elements must be achiral.
It is natural to wonder how many
commensurable classes are achiral and how many achiral classes have non-orientable elements.
We study this problem for Sol 3-manifolds. Each commensurable class $\CM$ of Sol 3-manifold has a complete topological invariant $D_{\CM}$, the discriminant of $\CM$. Our main result is:
(1) Among all commensurable classes of Sol 3-manifolds, there are infinitely many achiral classes; however ordered by discriminants, the density of achiral commensurable classes is 0.
(2) Among all achiral commensurable classes of Sol 3-manifolds, ordered by discriminants, the density of classes containing non-orientable elements is $1-ρ$,
where $$ρ:=\prod_{j=1}^\infty \left(1+2^{-j}\right)^{-1} = 0.41942\cdots.$$
Submitted 19 June, 2024;
originally announced June 2024.
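The constant $ρ$ in the Sol 3-manifolds entry above can be checked numerically. The sketch below truncates the infinite product $\prod_{j\ge 1}(1+2^{-j})^{-1}$ after a fixed number of factors; the function name and the truncation length are assumptions chosen for illustration, and the result should agree with the leading digits quoted in the abstract.

```python
def rho(terms=200):
    """Approximate prod_{j>=1} (1 + 2^{-j})^{-1} by truncating after `terms` factors."""
    prod = 1.0
    for j in range(1, terms + 1):
        prod *= 1.0 + 2.0 ** (-j)
    return 1.0 / prod


value = rho()
density_non_orientable = 1.0 - value  # the density 1 - rho from result (2)
print(f"rho ~ {value:.5f}, 1 - rho ~ {density_non_orientable:.5f}")
```

The factors approach 1 geometrically, so a few dozen terms already saturate double precision.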
-
Classification of Cellular Fake Surfaces
Authors:
Lucas Fagan,
Yang Qiu,
Zhenghan Wang
Abstract:
Generic polyhedra are interesting mathematical objects to study in their own right. In this paper, we initiate a systematic study of two-dimensional generic polyhedra with an eye towards applications to low-dimensional topology, especially the Andrews-Curtis and Zeeman conjectures. After recalling the basic notions of generic polyhedra and fake surfaces, we derive some interesting properties of fake surfaces. Our main result is a complete classification of acyclic cellular fake surfaces up to complexity 4 and a classification of acyclic cellular fake surfaces without small disks of complexity 5. From this classification, we prove the contractibility conjecture for acyclic cellular fake surfaces of complexity 4, and the embedded disk conjecture up to complexity 5. We provide evidence for the conjectures that the probability of being a spine among fake surfaces is 0 and that every contractible fake surface has an embedded disk.
Submitted 11 June, 2024;
originally announced June 2024.
-
Event-Triggered Optimal Tracking Control for Strict-Feedback Nonlinear Systems With Non-Affine Nonlinear Faults
Authors:
Ling Wang,
Xin Wang,
Ziming Wang
Abstract:
This article studies the control ideas of the optimal backstepping technique, proposing an event-triggered optimal tracking control scheme for a class of strict-feedback nonlinear systems with non-affine and nonlinear faults. A simplified identifier-critic-actor framework is employed in the reinforcement learning algorithm to achieve optimal control. The identifier estimates the unknown dynamic functions, the critic evaluates the system performance, and the actor implements control actions, enabling modeling and control of anonymous systems for achieving optimal control performance. In this paper, a simplified reinforcement learning algorithm is designed by deriving update rules from the negative gradient of a simple positive function related to the Hamilton-Jacobi-Bellman equation, and it also releases the stringent persistent excitation condition. Then, a fault-tolerant control method is developed by applying filtered signals for controller design. Additionally, to address communication resource reduction, an event-triggered mechanism is employed for designing the actual controller. Finally, the proposed scheme's feasibility is validated through theoretical analysis and simulation.
Submitted 12 June, 2024;
originally announced June 2024.
-
Arbitrarily slow decay in the logarithmically averaged Sarnak conjecture
Authors:
Amir Algom,
Zhiren Wang
Abstract:
In 2017 Tao proposed a variant of Sarnak's Möbius disjointness conjecture with logarithmic averaging: For any zero entropy dynamical system $(X,T)$, $\frac{1}{\log N} \sum_{n=1} ^N \frac{f(T^n x) μ(n)}{n}= o(1)$ for every $f\in \mathcal{C}(X)$ and every $x\in X$. We construct examples showing that this $o(1)$ can go to zero arbitrarily slowly. Nonetheless, all of our examples satisfy the conjecture.
Submitted 11 June, 2024;
originally announced June 2024.
-
Level proximal subdifferential, variational convexity, and pointwise Lipschitz smoothness
Authors:
Honglin Luo,
Xianfu Wang,
Ziyuan Wang,
Xinmin Yang
Abstract:
Level proximal subdifferential was introduced by Rockafellar recently as a tool for studying proximal mappings of possibly nonconvex functions. In this paper we give a systematic study of level proximal subdifferential, characterize variational convexity of the function by locally firm nonexpansiveness of proximal mappings or locally relative monotonicity of level proximal subdifferential, and investigate pointwise Lipschitz smoothness of the function. Integration and single-valuedness of level proximal subdifferential are also examined.
Submitted 2 June, 2024;
originally announced June 2024.
-
Policy Iteration for Exploratory Hamilton--Jacobi--Bellman Equations
Authors:
Hung Vinh Tran,
Zhenhua Wang,
Yuming Paul Zhang
Abstract:
We study the policy iteration algorithm (PIA) for entropy-regularized stochastic control problems on an infinite time horizon with a large discount rate, focusing on two main scenarios. First, we analyze PIA with bounded coefficients where the controls applied to the diffusion term satisfy a smallness condition. We demonstrate the convergence of PIA based on a uniform $\mathcal{C}^{2,α}$ estimate for the value sequence generated by PIA, and provide a quantitative convergence analysis for this scenario. Second, we investigate PIA with unbounded coefficients but no control over the diffusion term. In this scenario, we first establish the well-posedness of the exploratory Hamilton--Jacobi--Bellman equation with linear growth coefficients and polynomial growth reward function. Using this well-posedness result, we obtain PIA's convergence by establishing quantitative locally uniform $\mathcal{C}^{1,α}$ estimates for the generated value sequence.
Submitted 2 July, 2024; v1 submitted 2 June, 2024;
originally announced June 2024.
-
Improving Generalization and Convergence by Enhancing Implicit Regularization
Authors:
Mingze Wang,
Haotian He,
Jinbo Wang,
Zilin Wang,
Guanhua Huang,
Feiyu Xiong,
Zhiyu Li,
Weinan E,
Lei Wu
Abstract:
In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that IRE can be practically incorporated with {\em generic base optimizers} without introducing significant computational overload. Experiments show that IRE consistently improves the generalization performance for image classification tasks across a variety of benchmark datasets (CIFAR-10/100, ImageNet) and models (ResNets and ViTs). Surprisingly, IRE also achieves a $2\times$ {\em speed-up} compared to AdamW in the pre-training of Llama models (of sizes ranging from 60M to 229M) on datasets including Wikitext-103, Minipile, and Openwebtext. Moreover, we provide theoretical guarantees, showing that IRE can substantially accelerate the convergence towards flat minima in Sharpness-aware Minimization (SAM).
Submitted 31 May, 2024;
originally announced May 2024.
-
On a problem of Pavlović involving harmonic quasiconformal mappings
Authors:
Zhi-Gang Wang,
Xiao-Yuan Wang,
Antti Rasila,
Jia-Le Qiu
Abstract:
We obtain a sharp result on order of certain affine and linear invariant families of harmonic quasiconformal mappings with bounded Schwarzian norm. This problem is motivated by the work of Chuaqui, Hernández and Martín [Math. Ann. 367: 1099--1122, 2017]. Firstly, for $K\ge1$, we construct a harmonic $K$-quasiconformal counterpart of the classical Koebe function and use it to formulate the corresponding conjectures. Then we consider Hardy spaces $H^p$ of harmonic quasiconformal mappings by applying results for quasiconformal mappings obtained by Astala and Koskela [Pure Appl. Math. Q. 7: 19--50, 2011]. In particular, we determine the optimal order of the family of harmonic quasiconformal mappings with bounded Schwarzian norm to belong to a harmonic Hardy space. This partially solves an open problem posed by Pavlović in 2014. Finally, we derive pre-Schwarzian and Schwarzian norm estimates of certain harmonic mappings.
Submitted 14 July, 2024; v1 submitted 30 May, 2024;
originally announced May 2024.
-
Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization
Authors:
Xi Lin,
Yilu Liu,
Xiaoyuan Zhang,
Fei Liu,
Zhenkun Wang,
Qingfu Zhang
Abstract:
Multi-objective optimization can be found in many real-world applications where some conflicting objectives can not be optimized by a single solution. Existing optimization methods often focus on finding a set of Pareto solutions with different optimal trade-offs among the objectives. However, the required number of solutions to well approximate the whole Pareto optimal set could be exponentially large with respect to the number of objectives, which makes these methods unsuitable for handling many optimization objectives. In this work, instead of finding a dense set of Pareto solutions, we propose a novel Tchebycheff set scalarization method to find a few representative solutions (e.g., 5) to cover a large number of objectives (e.g., $>100$) in a collaborative and complementary manner. In this way, each objective can be well addressed by at least one solution in the small solution set. In addition, we further develop a smooth Tchebycheff set scalarization approach for efficient optimization with good theoretical guarantees. Experimental studies on different problems with many optimization objectives demonstrate the effectiveness of our proposed method.
Submitted 29 May, 2024;
originally announced May 2024.
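The idea in the Tchebycheff set scalarization entry above, that each objective should be handled well by at least one solution in a small set, can be sketched as a worst-case score over objectives of the best value achieved by any solution in the set. The formula below is one plausible reading of the abstract, not the paper's exact definition, and it omits the smooth variant. The function name, the weights, the ideal point, and the toy objective values are all hypothetical.

```python
def tch_set(objective_values, weights, ideal):
    """Worst weighted gap over objectives, taking the best solution per objective.

    objective_values[k][i] is the value of objective i at solution k
    (smaller is better); weights[i] and ideal[i] are per-objective parameters.
    """
    worst = float("-inf")
    for i in range(len(weights)):
        # Best value any solution in the set achieves on objective i.
        best_i = min(row[i] for row in objective_values)
        worst = max(worst, weights[i] * (best_i - ideal[i]))
    return worst


# Two hypothetical solutions evaluated on four objectives.
F = [
    [0.0, 1.0, 5.0, 4.0],  # solution 1 is good on objectives 1 and 2
    [6.0, 5.0, 0.5, 1.0],  # solution 2 is good on objectives 3 and 4
]
weights = [1.0, 1.0, 1.0, 1.0]
ideal = [0.0, 0.0, 0.0, 0.0]

score_pair = tch_set(F, weights, ideal)          # both solutions together
score_first = tch_set([F[0]], weights, ideal)    # solution 1 alone
score_second = tch_set([F[1]], weights, ideal)   # solution 2 alone
```

In this toy case the pair scores better than either solution alone, because the two solutions cover complementary objectives.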
-
A structure-preserving scheme for computing effective diffusivity and anomalous diffusion phenomena of random flows
Authors:
Tan Zhang,
Zhongjian Wang,
Jack Xin,
Zhiwen Zhang
Abstract:
This paper aims to investigate the diffusion behavior of particles moving in stochastic flows under a structure-preserving scheme. We compute the effective diffusivity for normal diffusive random flows and establish the power law between spatial and temporal variables for cases with anomalous diffusion phenomena. From a Lagrangian approach, we separate the corresponding stochastic differential equations (SDEs) into sub-problems and construct a one-step structure-preserving method to solve them. Then by modified equation systems, the convergence analysis in calculating the effective diffusivity is provided and compared between the structure-preserving scheme and the Euler-Maruyama scheme. Also, we provide the error estimate for the structure-preserving scheme in calculating the power law for a series of super-diffusive random flows. Finally, we calculate the effective diffusivity and anomalous diffusion phenomena for a series of 2D and 3D random fields.
Submitted 29 May, 2024;
originally announced May 2024.
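The abstract above uses the Euler-Maruyama scheme as its baseline. The sketch below shows that baseline on a one-dimensional toy SDE, dX = v(X) dt + sqrt(2 D0) dW, and estimates the effective diffusivity as E[X(T)^2] / (2T) from sample paths started at 0. It is not the paper's structure-preserving scheme. The function names, the 1D setting, and all parameter values are assumptions chosen for illustration.

```python
import math
import random

def euler_maruyama_endpoint(drift, d0, x0, dt, n_steps, rng):
    """Advance dX = drift(X) dt + sqrt(2 * d0) dW with Euler-Maruyama steps."""
    x = x0
    noise_scale = math.sqrt(2.0 * d0 * dt)
    for _ in range(n_steps):
        x += drift(x) * dt + noise_scale * rng.gauss(0.0, 1.0)
    return x

def effective_diffusivity(drift, d0, dt=0.01, n_steps=100, n_paths=2000, seed=0):
    """Monte Carlo estimate of E[X(T)^2] / (2 T) for paths started at 0."""
    rng = random.Random(seed)
    t_final = dt * n_steps
    total = 0.0
    for _ in range(n_paths):
        x_end = euler_maruyama_endpoint(drift, d0, 0.0, dt, n_steps, rng)
        total += x_end ** 2
    return total / n_paths / (2.0 * t_final)

# Sanity check: with zero drift the effective diffusivity should be close to d0.
estimate = effective_diffusivity(lambda x: 0.0, d0=0.5)
```

With zero drift the process is a scaled Brownian motion, so the estimate should sit near `d0` up to Monte Carlo error. A nonzero `drift` would stand in for the flow's velocity field.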
-
Boundary actions by higher-rank lattices: Classification and embedding in low dimensions, local rigidity, smooth factors
Authors:
Aaron Brown,
Federico Rodriguez Hertz,
Zhiren Wang
Abstract:
We study actions by lattices in higher-rank (semi)simple Lie groups on compact manifolds. By classifying certain measures invariant under a related higher-rank abelian action (the diagonal action on the suspension space) we deduce a number of new rigidity results related to standard projective actions (i.e. boundary actions) by such groups.
Specifically, in low dimensions we show all actions (with infinite image) are conjugate to boundary actions. We also show standard boundary actions (e.g. projective actions on generalized flag varieties) are locally rigid and classify all smooth actions that are topological factors of such actions. Finally, for volume-preserving actions in low dimensions (with infinite image) we provide a mechanism to detect the presence of "blow-ups" for the action by studying measures that are $P$-invariant but not $G$-invariant for the suspension action.
Submitted 2 June, 2024; v1 submitted 25 May, 2024;
originally announced May 2024.
-
Global Well-posedness and Convergence Analysis of Score-based Generative Models via Sharp Lipschitz Estimates
Authors:
Connor Mooney,
Zhongjian Wang,
Jack Xin,
Yifeng Yu
Abstract:
We establish global well-posedness and convergence of score-based generative models (SGM) under minimal general assumptions on the initial data for score estimation. For the smooth case, we start from a Lipschitz bound of the score function with optimal time length. The optimality is validated by an example in which the Lipschitz constant of the score is bounded initially but blows up in finite time. This necessitates the separation of time scales in conventional bounds for non-log-concave distributions. In contrast, our follow-up analysis only relies on a local Lipschitz condition and is valid globally in time. This leads to the convergence of the numerical scheme without time separation. For the non-smooth case, we show that the optimal Lipschitz bound is O(1/t) in the point-wise sense for distributions supported on a compact, smooth and low-dimensional manifold with boundary.
Submitted 25 May, 2024;
originally announced May 2024.
-
New type of solutions for the critical polyharmonic equation
Authors:
Wenjing Chen,
Zexi Wang
Abstract:
In this paper, we consider the following critical polyharmonic equation \begin{align*} (-Δ)^m u+V(|y'|,y'')u=u^{m^*-1},\quad u>0, \quad y=(y',y'')\in \mathbb{R}^3\times \mathbb{R}^{N-3},
\end{align*} where $m^*=\frac{2N}{N-2m}$, $N>4m+1$, $m\in \mathbb{N}^+$, and $V(|y'|,y'')$ is a bounded nonnegative function in $\mathbb{R}^+\times \mathbb{R}^{N-3}$. By using the reduction argument and local Pohozaev identities, we prove that if $r^{2m}V(r,y'')$ has a stable critical point $(r_0,y_0'')$ with $r_0>0$ and $V(r_0,y_0'')>0$, then the above problem has a new type of solutions, which concentrate at points lying on the top and the bottom circles of a cylinder.
Submitted 25 May, 2024;
originally announced May 2024.
-
Transport based particle methods for the Fokker-Planck-Landau equation
Authors:
Vasily Ilin,
Jingwei Hu,
Zhenfu Wang
Abstract:
We propose a particle method for numerically solving the Landau equation, inspired by the score-based transport modeling (SBTM) method for the Fokker-Planck equation. This method can preserve some important physical properties of the Landau equation, such as the conservation of mass, momentum, and energy, and decay of estimated entropy. We prove that matching the gradient of the logarithm of the approximate solution is enough to recover the true solution to the Landau equation with Maxwellian molecules. Several numerical experiments in low and moderately high dimensions are performed, with particular emphasis on comparing the proposed method with the traditional particle or blob method.
Submitted 16 May, 2024;
originally announced May 2024.
-
Adaptive Ensemble Control for Stochastic Systems with Mixed Asymmetric Laplace Noises
Authors:
Yajie Yu,
Xuehui Ma,
Shiliang Zhang,
Zhuzhu Wang,
Xubing Shi,
Yushuai Li,
Tingwen Huang
Abstract:
This paper presents an adaptive ensemble control for stochastic systems subject to asymmetric noises and outliers. Asymmetric noises skew system observations, and outliers with large amplitude deteriorate the observations even further. Such disturbances induce poor system estimation and degraded stochastic system control. In this work, we model the asymmetric noises and outliers by mixed asymmetric Laplace distributions (ALDs), and propose an optimal control for stochastic systems with mixed ALD noises. Particularly, we segregate the system disturbed by mixed ALD noises into subsystems, each of which is subject to a specific ALD noise. For each subsystem, we design an iterative quantile filter (IQF) to estimate the system parameters using system observations. With the estimated parameters by IQF, we derive the certainty equivalence (CE) control law for each subsystem. Then we use the Bayesian approach to ensemble the subsystem CE controllers, with each of the controllers weighted by their posterior probability. We finalize our control law as the weighted sum of the control signals by the sub-system CE controllers. To demonstrate our approach, we conduct numerical simulations and Monte Carlo analyses. The results show improved tracking performance by our approach for skew noises and its robustness to outliers, compared with single ALD based and RLS-based control policy.
Submitted 16 May, 2024;
originally announced May 2024.
-
Singular Integrals associated with Reflection Groups on Euclidean Space
Authors:
Yongsheng Han,
Ji Li,
Chaoqiang Tan,
Zipeng Wang,
Xinfeng Wu
Abstract:
In the field of harmonic analysis, geometric considerations are frequently crucial. In particular, group actions such as translations, dilations and rotations on Euclidean space are instrumental. The objective of this paper is to extend the study of singular integrals to include the effects of group reflections on Euclidean space, and to establish the T1 theorem for these singular integrals.
Submitted 12 May, 2024;
originally announced May 2024.
-
Structure of Dubrovin-Zhang free energy functions and universal identities
Authors:
Sergey Shadrin,
Zhe Wang
Abstract:
We prove a structural theorem relating the higher genera free energy functions of the Dubrovin-Zhang hierarchies to those of the trivial theory, that is, the Witten-Kontsevich free energy functions. As an important application, for any given genus $g\geq 1$, we construct a set of universal identities valid for the free energy functions of any Dubrovin-Zhang hierarchy.
Submitted 30 April, 2024;
originally announced May 2024.
-
On the derivatives of the Liouville currents
Authors:
Xinlong Dong,
Dragomir Šarić,
Zhe Wang
Abstract:
The Liouville map, introduced by Bonahon, assigns to each point in the Teichmüller space a natural Radon measure on the space of geodesics of the base surface. The Liouville map is real analytic and it even extends to a holomorphic map of a neighborhood of the Teichmüller space in the Quasi-Fuchsian space of an arbitrary conformally hyperbolic Riemann surface. Earthquake paths and their extensions, quake-bends, both introduced by Thurston, are particularly nice real-analytic and holomorphic paths in the Teichmüller and the Quasi-Fuchsian space, respectively. We find a geometric expression for the derivative of the Liouville map along earthquake paths.
Submitted 30 April, 2024;
originally announced April 2024.
-
Maximum spread of $K_{s,t}$-minor-free graphs
Authors:
William Linz,
Linyuan Lu,
Zhiyu Wang
Abstract:
The spread of a graph $G$ is the difference between the largest and smallest eigenvalue of the adjacency matrix of $G$. In this paper, we consider the family of graphs which contain no $K_{s,t}$-minor. We show that for any $t\geq s \geq 2$, there is an integer $ξ_{t}$ such that the extremal $n$-vertex $K_{s,t}$-minor-free graph attaining the maximum spread is the graph obtained by joining a graph $L$ on $(s-1)$ vertices to the disjoint union of $\lfloor \frac{2n+ξ_{t}}{3t}\rfloor$ copies of $K_t$ and $n-s+1 - t\lfloor \frac{2n+ξ_t}{3t}\rfloor$ isolated vertices. Furthermore, we give an explicit formula for $ξ_{t}$ and an explicit description for the graph $L$ for $t \geq \frac32(s-3) +\frac{4}{s-1}$.
Submitted 29 April, 2024;
originally announced April 2024.
-
Variational Optimization for Quantum Problems using Deep Generative Networks
Authors:
Lingxia Zhang,
Xiaodie Lin,
Peidong Wang,
Kaiyan Yang,
Xiao Zeng,
Zhaohui Wei,
Zizhu Wang
Abstract:
Optimization is one of the keystones of modern science and engineering. Its applications in quantum technology and machine learning helped nurture variational quantum algorithms and generative AI respectively. We propose a general approach to design variational optimization algorithms based on generative models: the Variational Generative Optimization Network (VGON). To demonstrate its broad applicability, we apply VGON to three quantum tasks: finding the best state in an entanglement-detection protocol, finding the ground state of a 1D quantum spin model with variational quantum circuits, and generating degenerate ground states of many-body quantum Hamiltonians. For the first task, VGON greatly reduces the optimization time compared to stochastic gradient descent while generating nearly optimal quantum states. For the second task, VGON alleviates the barren plateau problem in variational quantum circuits. For the final task, VGON can identify the degenerate ground state spaces after a single stage of training and generate a variety of states therein.
Submitted 27 April, 2024;
originally announced April 2024.
-
Plug-and-Play Algorithm Convergence Analysis From The Standpoint of Stochastic Differential Equation
Authors:
Zhongqi Wang,
Bingnan Wang,
Maosheng Xiang
Abstract:
The Plug-and-Play (PnP) algorithm is popular for solving inverse imaging problems. However, its convergence with more advanced plug-in denoisers lacks theoretical analysis. We demonstrate that the discrete PnP iteration can be described by a continuous stochastic differential equation (SDE). This transformation can also be achieved through a Markov process formulation of PnP. Viewing PnP algorithms from the standpoint of stochastic differential equations, we give a unified framework for the convergence property of PnP according to the solvability condition of its corresponding SDE. We reveal that a much weaker condition, a bounded denoiser with a Lipschitz continuous measurement function, is enough to guarantee convergence, instead of the previously required Lipschitz continuous denoiser.
Submitted 22 April, 2024;
originally announced April 2024.
-
Discrete non-commutative hungry Toda lattice and its application in matrix computation
Authors:
Zheng Wang,
Shi-Hao Li,
Kang-Ya Lu,
Jian-Qing Sun
Abstract:
In this paper, we present an eigenvalue algorithm for block Hessenberg matrices based on ideas from non-commutative integrable systems and matrix-valued orthogonal polynomials. We introduce adjacent families of matrix-valued $θ$-deformed bi-orthogonal polynomials, and derive the corresponding discrete non-commutative hungry Toda lattice from discrete spectral transformations for the polynomials. It is shown that this discrete system can be used as a pre-processing algorithm for block Hessenberg matrices. In addition, some convergence analysis and numerical examples of this algorithm are presented.
Submitted 20 April, 2024;
originally announced April 2024.
-
A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimax Optimization
Authors:
Yuchen Zhu,
Yufeng Zhang,
Zhaoran Wang,
Zhuoran Yang,
Xiaohong Chen
Abstract:
This paper studies minimax optimization problems defined over infinite-dimensional function classes of overparameterized two-layer neural networks. In particular, we consider the minimax optimization problem stemming from estimating linear functional equations defined by conditional expectations, where the objective functions are quadratic in the functional spaces. We address (i) the convergence of the stochastic gradient descent-ascent algorithm and (ii) the representation learning of the neural networks. We establish convergence under the mean-field regime by considering the continuous-time and infinite-width limit of the optimization dynamics. Under this regime, the stochastic gradient descent-ascent corresponds to a Wasserstein gradient flow over the space of probability measures defined over the space of neural network parameters. We prove that the Wasserstein gradient flow converges globally to a stationary point of the minimax objective at a $O(T^{-1} + α^{-1})$ sublinear rate, and additionally finds the solution to the functional equation when the regularizer of the minimax objective is strongly convex. Here $T$ denotes the time and $α$ is a scaling parameter of the neural networks. In terms of representation learning, our results show that the feature representation induced by the neural networks is allowed to deviate from the initial one by the magnitude of $O(α^{-1})$, measured in terms of the Wasserstein distance. Finally, we apply our general results to concrete examples including policy evaluation, nonparametric instrumental variable regression, asset pricing, and adversarial Riesz representer estimation.
Submitted 25 May, 2024; v1 submitted 18 April, 2024;
originally announced April 2024.
-
FCNCP: A Coupled Nonnegative CANDECOMP/PARAFAC Decomposition Based on Federated Learning
Authors:
Yukai Cai,
Hang Liu,
Xiulin Wang,
Hongjin Li,
Ziyi Wang,
Chuanshuai Yang,
Fengyu Cong
Abstract:
In the field of brain science, data sharing across servers is becoming increasingly challenging due to issues such as industry competition, privacy and security, and administrative policies and regulations. Therefore, there is an urgent need to develop new methods for data analysis and processing that enable scientific collaboration without data sharing. In view of this, this study proposes a series of efficient non-negative coupled tensor decomposition algorithm frameworks based on federated learning, called FCNCP, for EEG data stored on different servers. It combines the good discriminative performance of tensor decomposition in high-dimensional data representation and decomposition, the advantages of coupled tensor decomposition in cross-sample tensor data analysis, and the features of federated learning for joint modelling across distributed servers. The algorithm utilises federated learning to establish coupling constraints for data distributed across different servers. In the experiments, simulation experiments are first carried out using simulated data, and stable and consistent decomposition results are obtained, which verify the effectiveness of the proposed algorithms. Then the FCNCP algorithm is utilised to decompose the fifth-order event-related potential (ERP) tensor data collected by applying proprioceptive stimuli on the left and right hands. It was found that contralateral stimulation induced more symmetrical components in the activation areas of the left and right hemispheres. The conclusions drawn are consistent with the interpretations of related studies in cognitive neuroscience, demonstrating that the method can efficiently process higher-order EEG data and that some key hidden information can be preserved.
Submitted 18 April, 2024;
originally announced April 2024.
-
Betti numbers of normal edge rings
Authors:
Zexin Wang,
Dancheng Lu
Abstract:
A novel approach is introduced for computing the multi-graded Betti numbers of normal edge rings. This method is employed to delve into the edge rings of three distinct classes of simple graphs that adhere to the odd-cycle condition. These classes include compact graphs, which are devoid of even cycles and satisfy the odd-cycle condition; graphs comprised of multiple paths converging at two shared vertices; and graphs introduced in \cite{HHKO} that exhibit both even and odd cycles. Explicit formulas are provided for the multi-graded Betti numbers pertaining to the edge rings of these graphs.
Submitted 16 April, 2024;
originally announced April 2024.
-
Modular data of non-semisimple modular categories
Authors:
Liang Chang,
Quinn T. Kolt,
Zhenghan Wang,
Qing Zhang
Abstract:
We investigate non-semisimple modular categories with an eye towards a structure theory, low-rank classification, and applications to low dimensional topology and topological physics. We aim to extend the well-understood theory of semisimple modular categories to the non-semisimple case by using representations of factorizable ribbon Hopf algebras as a case study. We focus on the Cohen-Westreich modular data, which is obtained from the Lyubashenko-Majid modular representation restricted to the Higman ideal of a factorizable ribbon Hopf algebra. The Cohen-Westreich $S$-matrix diagonalizes the mixed fusion rules and reduces to the usual $S$-matrix for semisimple modular categories. The paper includes detailed studies on small quantum groups $U_qsl(2)$ and the Drinfeld doubles of Nichols Hopf algebras, especially the $\mathrm{SL}(2, \mathbb{Z})$-representation on their centers, Cohen-Westreich modular data, and the congruence kernel theorem's validity.
Submitted 6 May, 2024; v1 submitted 14 April, 2024;
originally announced April 2024.
-
Correspondence Research of the Most Probable Transition Paths between a Stochastic Interacting Particle System and its Mean Field Limit System
Authors:
Jianyu Chen,
Jianyu Hu,
Zibo Wang,
Ting Gao,
Jinqiao Duan
Abstract:
This paper derives an indirect approximation theorem for the most probable transition pathway of a stochastic interacting particle system in the mean field sense. It studies the problem of indirectly approximating the most probable transition pathway of an interacting particle system (i.e., a high-dimensional stochastic dynamic system) by that of its mean field limit equation (a McKean-Vlasov stochastic differential equation). Based on the Onsager-Machlup action functional, the problem is reformulated as an optimal control problem, and the derivation is completed using the stochastic Pontryagin's Maximum Principle. The paper proves an existence and uniqueness theorem for the solution to the mean field optimal control problem of McKean-Vlasov stochastic differential equations, and also establishes systems of equations satisfied by the control parameters $θ^{*}$ and $θ^{N}$, respectively. There are few studies on the most probable transition pathways of stochastic interacting particle systems, and it remains a great challenge to solve for these pathways directly or to approximate them with the mean field limit system. Therefore, this paper first proves the correspondence between the core equations of Pontryagin's Maximum Principle, that is, the Hamiltonian extremal condition equations. This correspondence indirectly explains the correspondence between the most probable transition pathways of stochastic interacting particle systems and those of the mean field systems.
Submitted 11 April, 2024;
originally announced April 2024.
-
Surrogate modeling for probability distribution estimation: uniform or adaptive design?
Authors:
Maijia Su,
Ziqi Wang,
Oreste Salvatore Bursi,
Marco Broccardo
Abstract:
The active learning (AL) technique, one of the state-of-the-art methods for constructing surrogate models, has shown high accuracy and efficiency in forward uncertainty quantification (UQ) analysis. This paper provides a comprehensive study on AL-based global surrogates for computing the full distribution function, i.e., the cumulative distribution function (CDF) and the complementary CDF (CCDF). To this end, we investigate the three essential components for building surrogates, i.e., types of surrogate models, enrichment methods for experimental designs, and stopping criteria. For each component, we choose several representative methods and study their desirable configurations. In addition, we devise a uniform design (i.e., space-filling design) as a baseline for measuring the improvement of using AL. Combining all the representative methods, a total of 1,920 UQ analyses are carried out to solve 16 benchmark examples. The performance of the selected strategies is evaluated based on accuracy and efficiency. In the context of full distribution estimation, this study concludes that (i) AL techniques cannot provide a systematic improvement compared with uniform designs, (ii) the recommended surrogate modeling methods depend on the features of the problems (especially the local nonlinearity), target accuracy, and computational budget.
Submitted 10 April, 2024;
originally announced April 2024.
-
On the Range of a class of Complex Monge-Ampère operators on compact Hermitian manifolds
Authors:
Yinji Li,
Zhiwei Wang,
Xiangyu Zhou
Abstract:
Let $(X,ω)$ be a compact Hermitian manifold of complex dimension $n$. Let $β$ be a smooth real closed $(1,1)$ form such that there exists a function $ρ\in \mbox{PSH}(X,β)\cap L^{\infty}(X)$. We study the range of the complex non-pluripolar Monge-Ampère operator $\langle(β+dd^c\cdot)^n\rangle$ on weighted Monge-Ampère energy classes on $X$. In particular, when $ρ$ is assumed to be continuous, we give a complete characterization of the range of the complex Monge-Ampère operator on the class $\mathcal E(X,β)$, which is the class of all $\varphi \in \mbox{PSH}(X,β)$ with full Monge-Ampère mass, i.e. $\int_X\langle (β+dd^c\varphi)^n\rangle=\int_Xβ^n$.
Submitted 4 April, 2024;
originally announced April 2024.
-
On tight $(k,\ell)$-stable graphs
Authors:
Xiaonan Liu,
Zi-Xia Song,
Zhiyu Wang
Abstract:
For integers $k>\ell\ge0$, a graph $G$ is $(k,\ell)$-stable if $α(G-S)\geq α(G)-\ell$ for every $S\subseteq V(G)$ with $|S|=k$. A recent result of Dong and Wu [SIAM J. Discrete Math., 36 (2022) 229--240] shows that every $(k,\ell)$-stable graph $G$ satisfies $α(G) \le \lfloor ({|V(G)|-k+1})/{2}\rfloor+\ell$. A $(k,\ell)$-stable graph $G$ is tight if $α(G) = \lfloor ({|V(G)|-k+1})/{2}\rfloor+\ell$; and $q$-tight for some integer $q\ge0$ if $α(G) = \lfloor ({|V(G)|-k+1})/{2}\rfloor+\ell-q$. In this paper, we first prove that for all $k\geq 24$, the only tight $(k, 0)$-stable graphs are $K_{k+1}$ and $K_{k+2}$, answering a question of Dong and Luo [arXiv: 2401.16639]. We then prove that for all nonnegative integers $k, \ell, q$ with $k\geq 3\ell+3$, every $q$-tight $(k,\ell)$-stable graph has at most $k-3\ell-3+2^{3(\ell+2q+4)^2}$ vertices, answering a question of Dong and Luo in the negative.
Submitted 2 April, 2024;
originally announced April 2024.
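The definitions in the abstract above can be checked by brute force on very small graphs. The sketch below computes the independence number $α$ by exhaustive search and tests the $(k,\ell)$-stable and tight conditions exactly as stated. The function names and the edge-list graph representation are assumptions for illustration, and the exponential running time makes this usable only for toy inputs.

```python
from itertools import combinations

def independence_number(vertices, edges):
    """Size of a largest independent set, by exhaustive search."""
    vertices = list(vertices)
    present = set(vertices)
    live = [(u, v) for u, v in edges if u in present and v in present]
    for size in range(len(vertices), 0, -1):
        for candidate in combinations(vertices, size):
            chosen = set(candidate)
            if not any(u in chosen and v in chosen for u, v in live):
                return size
    return 0

def is_stable(vertices, edges, k, ell):
    """True if alpha(G - S) >= alpha(G) - ell for every vertex set S of size k."""
    vertices = list(vertices)
    alpha = independence_number(vertices, edges)
    for removed in combinations(vertices, k):
        rest = [v for v in vertices if v not in removed]
        if independence_number(rest, edges) < alpha - ell:
            return False
    return True

def is_tight(vertices, edges, k, ell):
    """True if G is (k, ell)-stable and alpha(G) = floor((|V| - k + 1) / 2) + ell."""
    vertices = list(vertices)
    bound = (len(vertices) - k + 1) // 2 + ell
    return is_stable(vertices, edges, k, ell) and independence_number(vertices, edges) == bound

# Small checks with k = 2, ell = 0.
triangle = ([0, 1, 2], [(0, 1), (0, 2), (1, 2)])   # K_3
path = ([0, 1, 2], [(0, 1), (1, 2)])               # path on 3 vertices
triangle_is_tight = is_tight(*triangle, k=2, ell=0)
path_is_stable = is_stable(*path, k=2, ell=0)
```

Here the triangle $K_3$ is tight $(2,0)$-stable, while the three-vertex path is not $(2,0)$-stable because deleting its two endpoints drops $α$ from 2 to 1. These small values of $k$ only illustrate the definitions and fall outside the range $k\geq 24$ covered by the abstract's classification.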
-
An Efficient Sparse Identification Algorithm For Stochastic Systems With General Observation Sequences
Authors:
Ziming Wang,
Xinghua Zhu
Abstract:
This paper studies the sparse identification problem of unknown sparse parameter vectors in stochastic dynamic systems. Firstly, a novel sparse identification algorithm is proposed, which can generate sparse estimates based on least squares estimation by adaptively adjusting the threshold. Secondly, under a possibly weakest non-persistent excited condition, we prove that the proposed algorithm can correctly identify the zero and nonzero elements of the sparse parameter vector using a finite number of observations, and further estimates of the nonzero elements almost surely converge to the true values. Compared with the related works, e.g., LASSO, our method only requires the weakest assumptions and does not require solving additional optimization problems. Besides, our theoretical results do not require any statistical assumptions on the regression signals, including independence or stationarity, which makes our results promising for application to stochastic feedback systems. Thirdly, the number of finite observations that guarantee the convergence of the zero-element set of unknown sparse parameters of the Hammerstein system is derived for the first time. Finally, numerical simulations are provided, demonstrating the effectiveness of the proposed method. Since there is no additional optimization problem, i.e., no additional numerical error, the proposed algorithm performs much better than other related algorithms.
Submitted 29 March, 2024;
originally announced April 2024.
-
Efficient Global Algorithms for Transmit Beamforming Design in ISAC Systems
Authors:
Jiageng Wu,
Zhiguo Wang,
Ya-Feng Liu,
Fan Liu
Abstract:
In this paper, we propose a multi-input multi-output transmit beamforming optimization model for joint radar sensing and multi-user communications, where the design of the beamformers is formulated as an optimization problem whose objective is a weighted combination of the sum rate and the Cramér-Rao bound, subject to the transmit power budget. Obtaining the global solution for the formulated nonconvex problem is a challenging task, since the sum-rate maximization problem itself (even without considering the sensing metric) is known to be NP-hard. The main contributions of this paper are threefold. Firstly, we derive an optimal closed-form solution to the formulated problem in the single-user case and the multi-user case where the channel vectors of different users are orthogonal. Secondly, for the general multi-user case, we propose a novel branch and bound (B&B) algorithm based on the McCormick envelope relaxation. The proposed algorithm is guaranteed to find the globally optimal solution to the formulated problem. Thirdly, we design a graph neural network (GNN) based pruning policy to determine irrelevant nodes that can be directly pruned in the proposed B&B algorithm, thereby significantly reducing the number of unnecessary enumerations therein and improving its computational efficiency. Simulation results show the efficiency of the proposed vanilla and GNN-based accelerated B&B algorithms.
Submitted 26 March, 2024;
originally announced April 2024.
-
Carleman estimates for space semi-discrete approximations of one-dimensional stochastic parabolic equation and its applications
Authors:
Bin Wu,
Ying Wang,
Zewen Wang
Abstract:
In this paper, we study discrete Carleman estimates for space semi-discrete approximations of a one-dimensional stochastic parabolic equation. We apply these discrete Carleman estimates to two inverse problems for the spatial semi-discrete stochastic parabolic equations: a discrete inverse random source problem and a discrete Cauchy problem. We first establish two Carleman estimates for a one-dimensional semi-discrete stochastic parabolic equation, one for homogeneous boundary conditions and the other for non-homogeneous boundary conditions. Then we apply these two estimates separately to derive two stability results. The first is Lipschitz stability for the discrete inverse random source problem. The second is Hölder stability for the discrete Cauchy problem.
Submitted 28 March, 2024;
originally announced March 2024.
-
Weak Convergence Analysis of Online Neural Actor-Critic Algorithms
Authors:
Samuel Chun-Hei Lam,
Justin Sirignano,
Ziheng Wang
Abstract:
We prove that a single-layer neural network trained with the online actor-critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of training steps $\rightarrow \infty$. In the online actor-critic algorithm, the distribution of the data samples dynamically changes as the model is updated, which is a key challenge for any convergence analysis. We establish the geometric ergodicity of the data samples under a fixed actor policy. Then, using a Poisson equation, we prove that the fluctuations of the model updates around the limit distribution due to the randomly arriving data samples vanish as the number of parameter updates $\rightarrow \infty$. Using the Poisson equation and weak convergence techniques, we prove that the actor neural network and critic neural network converge to the solutions of a system of ODEs with random initial conditions. Analysis of the limit ODE shows that the limit critic network will converge to the true value function, which will provide the actor with an asymptotically unbiased estimate of the policy gradient. We then prove that the limit actor network will converge to a stationary point.
Submitted 25 March, 2024;
originally announced March 2024.
-
Adapprox: Adaptive Approximation in Adam Optimization via Randomized Low-Rank Matrices
Authors:
Pengxiang Zhao,
Ping Li,
Yingjie Gu,
Yi Zheng,
Stephan Ludger Kölker,
Zhefeng Wang,
Xiaoming Yuan
Abstract:
As deep learning models exponentially increase in size, optimizers such as Adam encounter significant memory consumption challenges due to the storage of first and second moment data. Current memory-efficient methods like Adafactor and CAME often compromise accuracy with their matrix factorization techniques. Addressing this, we introduce Adapprox, a novel approach that employs randomized low-rank matrix approximation for a more effective and accurate approximation of Adam's second moment. Adapprox features an adaptive rank selection mechanism, finely balancing accuracy and memory efficiency, and includes an optional cosine similarity guidance strategy to enhance stability and expedite convergence. In GPT-2 training and downstream tasks, Adapprox surpasses AdamW by achieving 34.5% to 49.9% and 33.8% to 49.9% memory savings for the 117M and 345M models, respectively, with the first moment enabled, and further increases these savings without the first moment. Besides, it enhances convergence speed and improves downstream task performance relative to its counterparts.
Submitted 22 March, 2024;
originally announced March 2024.
-
Some evaluations of interpolated multiple zeta values and interpolated multiple $t$-values
Authors:
Zhonghua Li,
Zhenlu Wang
Abstract:
In this paper, we study the evaluation formulas of the interpolated multiple zeta values and the interpolated multiple $t$-values with indices involving $1,2,3$. To get these evaluations, we derive the corresponding algebraic relations in the harmonic algebra.
Submitted 22 April, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Parameter Estimation from Single Patient, Single Time-Point Sequencing Data of Recurrent Tumors
Authors:
Kevin Leder,
Ruping Sun,
Zicheng Wang,
Xuanming Zhang
Abstract:
In this study, we develop consistent estimators for key parameters that govern the dynamics of tumor cell populations when subjected to pharmacological treatments. While these treatments often lead to an initial reduction in the abundance of drug-sensitive cells, a population of drug-resistant cells frequently emerges over time, resulting in cancer recurrence. Samples from recurrent tumors are an invaluable data source that can offer crucial insights into the ability of cancer cells to adapt and withstand treatment interventions. To effectively utilize the data obtained from recurrent tumors, we derive several large number limit theorems, specifically focusing on the metrics that quantify the clonal diversity of cancer cell populations at the time of cancer recurrence. These theorems then serve as the foundation for constructing our estimators. A distinguishing feature of our approach is that our estimators require only single time-point sequencing data from a single tumor, thereby enhancing the practicality of our approach and enabling the understanding of cancer recurrence at the individual level.
Submitted 19 March, 2024;
originally announced March 2024.
-
New Regularity Criteria for Navier-Stokes and SQG Equations in Critical Spaces
Authors:
Yiran Xu,
Ly Kim Ha,
Haina Li,
Zexi Wang
Abstract:
In this paper, we investigate some a priori estimates to provide critical regularity criteria for the incompressible Navier-Stokes equations on $\mathbb{R}^3$ and the supercritical surface quasi-geostrophic (SQG) equations on $\mathbb{R}^2$. Concerning the Navier-Stokes equations, we demonstrate that a Leray-Hopf solution $u$ is regular if $u\in L_T^{\frac{2}{1-α}} \dot{B}^{-α}_{\infty,\infty}(\mathbb{R}^3)$, or if $u$ lies in the Lorentz space $L_T^{p,r} \dot{B}^{-1+\frac{2}{p}}_{\infty,\infty}(\mathbb{R}^3)$ with $4\leq p\leq r<\infty$. Additionally, an alternative regularity condition is expressed as $u\in L_{T}^{\frac{2}{1-α}} \dot{B}^{-α}_{\infty,\infty}(\mathbb{R}^3)+L_T^\infty\dot{B}^{-1}_{\infty,\infty}(\mathbb{R}^3)$ with $α\in(0,1)$, contingent upon a smallness assumption on the norm in $L_T^\infty\dot{B}^{-1}_{\infty,\infty}$. For the SQG equation, we derive that a Leray-Hopf weak solution $θ\in L_T^{\frac{α}{\varepsilon}} \dot{C}^{1-α+\varepsilon}(\mathbb{R}^2)$ is smooth for any $\varepsilon$ small enough. Similar to the case of the Navier-Stokes equations, we derive regularity criteria in more refined spaces, i.e., the Lorentz spaces $L_T^{\frac{α}{\varepsilon},r}\dot{C}^{1-α+\varepsilon}(\mathbb{R}^2)$ and the sum of two critical spaces $L_{T}^{\frac{α}{\varepsilon}}\dot{C}^{1-α+\varepsilon}(\mathbb{R}^2)+L_T^\infty\dot{C}^{1-α}(\mathbb{R}^2)$, with a smallness assumption on the norm in $L_T^\infty\dot{C}^{1-α}(\mathbb{R}^2)$.
Submitted 12 April, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.