Skip to main content

Showing 1–50 of 106 results for author: Yang, G

  1. arXiv:2407.11272  [pdf, other

    cs.CV math.DG

    Differentiable Voxelization and Mesh Morphing

    Authors: Yihao Luo, Yikai Wang, Zhengrui Xiang, Yuliang Xiu, Guang Yang, ChoonHwai Yap

    Abstract: In this paper, we propose the differentiable voxelization of 3D meshes via the winding number and solid angles. The proposed approach achieves fast, flexible, and accurate voxelization of 3D meshes, admitting the computation of gradients with respect to the input mesh and GPU acceleration. We further demonstrate the application of the proposed voxelization in mesh morphing, where the voxelized mes… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.01281  [pdf, other

    cs.LG cs.AI math.FA

    Bridging Smoothness and Approximation: Theoretical Insights into Over-Smoothing in Graph Neural Networks

    Authors: Guangrui Yang, Jianfei Li, Ming Li, Han Feng, Ding-Xuan Zhou

    Abstract: In this paper, we explore the approximation theory of functions defined on graphs. Our study builds upon the approximation results derived from the $K$-functional. We establish a theoretical framework to assess the lower bounds of approximation for target functions using Graph Convolutional Networks (GCNs) and examine the over-smoothing phenomenon commonly observed in these networks. Initially, we… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.17763  [pdf, other

    cs.LG cs.AI cs.CV math.NA

    DiffusionPDE: Generative PDE-Solving Under Partial Observation

    Authors: Jiahe Huang, Guandao Yang, Zichen Wang, Jeong Joon Park

    Abstract: We introduce a general framework for solving partial differential equations (PDEs) using generative diffusion models. In particular, we focus on the scenarios where we do not have the full knowledge of the scene necessary to apply classical solvers. Most existing forward or inverse PDE approaches perform poorly when the observations on the data or the underlying coefficients are incomplete, which… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Project page: https://jhhuangchloe.github.io/Diffusion-PDE/

  4. arXiv:2404.18051  [pdf, ps, other

    math.AP

    Liouville type theorems for the 3D stationary MHD and Hall-MHD equations with non-zero constant vectors at infinity

    Authors: Wendong Wang, Guoxu Yang

    Abstract: In this paper, we investigate Liouville type theorems for the three-dimensional steady-state MHD or Hall-MHD system under some asymptotic assumptions at infinity. Firstly, for the Hall-MHD system we obtain that $u$ and $B$ are constant vectors for any fluid viscosity, magnetic resistivity or Hall-coefficient when the magnetic field $B$ tends to a non-zero constant vector at infinity while the velo… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  5. arXiv:2404.00697  [pdf

    math.OC

    A Lane Usage Strategy for General Traffic Access on Bus Lanes under Mixed Traffic Environment

    Authors: Haoran Li, Zhenzhou Yuan, Rui Yue, Guangchuan Yang, Chuang Zhu, Siyuan Chen

    Abstract: The strategy of permitting general traffic to use the bus lane for improved utilization while ensuring bus priority has gained increasingly attention, particularly with the support of vehicle-to-everything technology. In this study, we propose a novel lane usage strategy called Dynamic Spatial-Temporal Priority (DSTP) to ensure bus priority and optimize bus lane usage in a mixed traffic environmen… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 16 pages, 22 figures

  6. arXiv:2402.03541  [pdf, other

    cs.LG math.NA

    HAMLET: Graph Transformer Neural Operator for Partial Differential Equations

    Authors: Andrey Bryutkin, Jiahao Huang, Zhongying Deng, Guang Yang, Carola-Bibiane Schönlieb, Angelica Aviles-Rivero

    Abstract: We present a novel graph transformer framework, HAMLET, designed to address the challenges in solving partial differential equations (PDEs) using neural networks. The framework uses graph transformers with modular input encoders to directly incorporate differential equation information into the solution process. This modularity enhances parameter correspondence control, making HAMLET adaptable to… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 17 pages, 7 figures, 6 tables

  7. arXiv:2310.06560  [pdf, ps, other

    math.CO

    On friendship and cyclic parking functions

    Authors: Yujia Kang, Thomas Selig, Guanyi Yang, Yanting Zhang, Haoyue Zhu

    Abstract: In parking problems, a given number of cars enter a one-way street sequentially, and try to park according to a specified preferred spot in the street. Various models are possible depending on the chosen rule for collisions, when two cars have the same preferred spot. In classical parking functions, if a car's preferred spot is already occupied by a previous car, it drives forward and looks for th… ▽ More

    Submitted 4 January, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 19 pages, 5 figures

    MSC Class: 05A19 (Primary) 05A15; 05A05; 05C30 (Secondary)

  8. arXiv:2310.02808  [pdf, other

    math.PR math.DG

    Probabilistic Method to Fundamental gap problems on the sphere

    Authors: Gunhee Cho, Guofang Wei, Guang Yang

    Abstract: We provide a probabilistic proof of the fundamental gap estimate for Schrödinger operators in convex domains on the sphere, which extends the probabilistic proof of F. Gong, H. Li, and D. Luo for the Euclidean case. Our results further generalize the results achieved for the Laplacian by S. Seto, L. Wang, and G. Wei, as well as by C. He, G. Wei, and Qi S. Zhang. The essential ingredient in our ana… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  9. arXiv:2310.02244  [pdf, other

    cs.NE cond-mat.dis-nn math.PR

    Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks

    Authors: Greg Yang, Dingli Yu, Chen Zhu, Soufiane Hayou

    Abstract: By classifying infinite-width neural networks and identifying the *optimal* limit, Tensor Programs IV and V demonstrated a universal way, called $μ$P, for *widthwise hyperparameter transfer*, i.e., predicting optimal hyperparameters of wide neural networks from narrow ones. Here we investigate the analogous classification for *depthwise parametrizations* of deep residual networks (resnets). We cla… ▽ More

    Submitted 12 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  10. arXiv:2308.01814  [pdf, other

    cs.LG cond-mat.dis-nn cs.NE math.PR

    Tensor Programs IVb: Adaptive Optimization in the Infinite-Width Limit

    Authors: Greg Yang, Etai Littwin

    Abstract: Going beyond stochastic gradient descent (SGD), what new phenomena emerge in wide neural networks trained by adaptive optimizers like Adam? Here we show: The same dichotomy between feature learning and kernel behaviors (as in SGD) holds for general optimizers as well, including Adam -- albeit with a nonlinear notion of "kernel." We derive the corresponding "neural tangent" and "maximal update" lim… ▽ More

    Submitted 7 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: This is the complete version of "Adaptive Optimization in the Infinite-Width Limit" in ICLR 2023, https://openreview.net/forum?id=zgVDqw9ZUES

  11. arXiv:2307.02062  [pdf, other

    math.NA

    Convergence Analysis for Restarted Anderson Mixing and Beyond

    Authors: Fuchao Wei, Chenglong Bao, Yang Liu, Guangwen Yang

    Abstract: Anderson mixing (AM) is a classical method that can accelerate fixed-point iterations by exploring historical information. Despite the successful application of AM in scientific computing, the theoretical properties of AM are still under exploration. In this paper, we study the restarted version of the Type-I and Type-II AM methods, i.e., restarted AM. With a multi-step analysis, we give a unified… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  12. arXiv:2306.05707  [pdf, ps, other

    math.NA q-bio.GN

    On the Mathematics of RNA Velocity II: Algorithmic Aspects

    Authors: Tiejun Li, Yizhuo Wang, Guoguo Yang, Peijie Zhou

    Abstract: In a previous paper [CSIAM Trans. Appl. Math. 2 (2021), 1-55], the authors proposed a theoretical framework for the analysis of RNA velocity, which is a promising concept in scRNA-seq data analysis to reveal the cell state-transition dynamical processes underlying snapshot data. The current paper is devoted to the algorithmic study of some key components in RNA velocity workflow. Four important po… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 32 pages, 5 figures

  13. arXiv:2303.16872  [pdf, ps, other

    math.PR

    Multi-dimensional Mean-field Type Backward Stochastic Differential Equations with Diagonally Quadratic Generators

    Authors: Shanjian Tang, Guang Yang

    Abstract: In this paper, we study the multi-dimensional backward stochastic differential equations (BSDEs) whose generator depends also on the mean of both variables. When the generator is diagonally quadratic, we prove that the BSDE admits a unique local solution with a fixed point argument. When the generator has a logarithmic growth of the off-diagonal elements (i.e., for each $i$, the $i$-th component o… ▽ More

    Submitted 30 March, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: 16 pages. arXiv admin note: text overlap with arXiv:2302.12470

    MSC Class: 60H10

  14. arXiv:2303.11659  [pdf, ps, other

    math.PR math.NA

    Formulae for mixed moments of Wiener processes and a stochastic area integral

    Authors: Yoshio Komori, Guoguo Yang, Kevin Burrage

    Abstract: This paper deals with the expectation of monomials with respect to the stochastic area integral $A_{1,2}(t,t+h)=\int_{t}^{t+h}\int_{t}^{s}{\rm d} W_{1}(r){\rm d} W_{2}(s) -\int_{t}^{t+h}\int_{t}^{s}{\rm d} W_{2}(r){\rm d} W_{1}(s)$ and the increments of two Wiener processes, $Δ{W}_{i}(t,t+h)=W_{i}(t+h)-W_{i}(t),\ i=1,2$. In a monomial, if the exponent of one of the Wiener increments or the stochas… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: This is a preprint of a paper, which has been accepted for publication in SIAM Journal on Numerical Analysis

    MSC Class: 60H10; 60H30; 65C30

  15. arXiv:2303.09016  [pdf, ps, other

    math.PR

    Chaos processes as rough paths

    Authors: Guang Yang

    Abstract: In this article we investigate the rough paths structure of a process $X_t$ living in a fixed Wiener chaos. Specifically, we formulate various types of rough lifts of $X_t$ and study their properties. As application, we study the integrabilities of quantities related to rough differential equations driven by $X_t$.

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 34 pages

  16. arXiv:2302.12470  [pdf, ps, other

    math.PR

    Multi-dimensional Backward Stochastic Differential Equations of Diagonally Quadratic Generators with a Special Structure

    Authors: Guang Yang

    Abstract: The present paper is devoted to the well-posedness of a type of multi-dimensional backward stochastic differential equations (BSDEs) with a diagonally quadratic generator. We give a new priori estimate, and prove that the BSDE admits a unique solution on a given interval when the generator has a sufficiently small growth of the off-diagonal elements (i.e., for each $i$, the $i$-th component of the… ▽ More

    Submitted 15 April, 2024; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: 14 pages

    MSC Class: 60H10

  17. arXiv:2301.12483  [pdf, other

    eess.SY math.OC

    Topological entropy of switched nonlinear and interconnected systems

    Authors: Guosong Yang, Daniel Liberzon, João P. Hespanha

    Abstract: A general upper bound for topological entropy of switched nonlinear systems is constructed, using an asymptotic average of upper limits of the matrix measures of Jacobian matrices of strongly persistent individual modes, weighted by their active rates. A general lower bound is constructed as well, using a similar weighted average of lower limits of the traces of these Jacobian matrices. In a case… ▽ More

    Submitted 29 January, 2023; originally announced January 2023.

    Comments: 31 pages, 3 figures

    MSC Class: 37B40; 93C30 (Primary) 93C57 (Secondary)

  18. Least absolute deviation estimation for AR(1) processes with roots close to unity

    Authors: Nannan Ma, Hailin Sang, Guangyu Yang

    Abstract: We establish the asymptotic theory of least absolute deviation estimators for AR(1) processes with autoregressive parameter satisfying $n(ρ_n-1)\toγ$ for some fixed $γ$ as $n\to\infty$, which is parallel to the results of ordinary least squares estimators developed by Andrews and Guggenberger (2008) in the case $γ=0$ or Chan and Wei (1987) and Phillips (1987) in the case $γ\ne 0$. Simulation exper… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

    Comments: accepted by Annals of the Institute of Statistical Mathematics, 29 pages, 8 figures, 4 tables

    MSC Class: 62M10; 62F12

  19. arXiv:2301.00507  [pdf, ps, other

    math.DG

    On Geodesics of Sprays and Projective Completeness

    Authors: Guojun Yang

    Abstract: Geodesics, which play an important role in spray-Finsler geometry, are integral curves of a spray vector field on a manifold. Some comparison theorems and rigidity issues are established on the completeness of geodesics of a spray or a Finsler metric. In this paper, projectively flat sprays with weak Ricci constant (eps. constant curvature) are classified at the level of geodesics. Further, a geod… ▽ More

    Submitted 1 January, 2023; originally announced January 2023.

    Comments: 16 pages

    MSC Class: 53B40; 53C60

  20. arXiv:2211.16578  [pdf, other

    cs.CV cs.LG math.NA

    ButterflyNet2D: Bridging Classical Methods and Neural Network Methods in Image Processing

    Authors: Gengzhi Yang, Yingzhou Li

    Abstract: Both classical Fourier transform-based methods and neural network methods are widely used in image processing tasks. The former has better interpretability, whereas the latter often achieves better performance in practice. This paper introduces ButterflyNet2D, a regular CNN with sparse cross-channel connections. A Fourier initialization strategy for ButterflyNet2D is proposed to approximate Fourie… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  21. arXiv:2211.06197  [pdf, ps, other

    math.OC math.PR

    A convergence study of SGD-type methods for stochastic optimization

    Authors: Tiannan Xiao, Guoguo Yang

    Abstract: In this paper, we first reinvestigate the convergence of vanilla SGD method in the sense of $L^2$ under more general learning rates conditions and a more general convex assumption, which relieves the conditions on learning rates and do not need the problem to be strongly convex. Then, by taking advantage of the Lyapunov function technique, we present the convergence of the momentum SGD and Nestero… ▽ More

    Submitted 9 June, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 14 pages

    MSC Class: 60F05; 60J22; 37N40

  22. arXiv:2211.02221  [pdf, ps, other

    math.AC math.GR

    Gorenstein homological dimension and some invariants of groups

    Authors: Wei Ren, Gang Yang

    Abstract: For any group $G$, the Gorenstein homological dimension ${\rm Ghd}_RG$ is defined to be the Gorenstein flat dimension of the coefficient ring $R$, which is considered as an $RG$-module with trivial group action. We prove that ${\rm Ghd}_RG < \infty$ if and only if the Gorenstein flat dimension of any $RG$-module is finite, if and only if there exists an $R$-pure $RG$-monic $R\rightarrow A$ with… ▽ More

    Submitted 18 April, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: Revised version of the paper which was previously titled "Gorenstein flat dimension with group ring coefficients". We appreciate any comments and suggestions

    MSC Class: 18G20; 18G25; 20J05

  23. arXiv:2211.01151  [pdf, ps, other

    math.DG

    On stability of subelliptic harmonic maps with potential

    Authors: Tian Chong, Yuxin Dong, Guilin Yang

    Abstract: In this paper, we investigate the stability problem of subelliptic harmonic maps with potential. First, we derive the first and second variation formulas for subelliptic harmonic maps with potential. As a result, it is proved that a subelliptic harmonic map with potential is stable if the target manifold has nonpositive curvature and the Hessian of the potential is nonpositive definite. We also gi… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

  24. arXiv:2209.04157  [pdf, other

    eess.SY cs.CE math.OC

    A Fast Algorithm for Onboard Atmospheric Powered Descent Guidance

    Authors: Yushu Chen, Guangwen Yang, Lu Wang, Qingzhong Gan, Haipeng Chen, Quanyong Xu

    Abstract: Atmospheric powered descent guidance can be solved by successive convexification; however, its onboard application is impeded by the sharp increase in computation caused by nonlinear aerodynamic forces. The problem has to be converted into a sequence of convex subproblems instead of a single convex problem when aerodynamic forces are ignored. Besides, each subproblem is significantly more complica… ▽ More

    Submitted 6 June, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: The paper is accepted by IEEE Transactions on Aerospace and Electronic Systems, 2023

  25. arXiv:2208.09960  [pdf, ps, other

    math.DG math.CV math.PR

    The Stochastic Schwarz lemma on Kähler Manifolds by Couplings and Its Applications

    Authors: Myeongju Chae, Gunhee Cho, Maria Gordina, Guang Yang

    Abstract: We first provide a stochastic formula for the Carathéodory distance in terms of general Markovian couplings and prove a comparison result between the Carathéodory distance and the complete Kähler metric with a negative lower curvature bound using the Kendall-Cranston coupling. This probabilistic approach gives a version of the Schwarz lemma on complete non-compact Kähler manifolds with a further d… ▽ More

    Submitted 30 November, 2023; v1 submitted 21 August, 2022; originally announced August 2022.

    Comments: To appear JLMS

  26. arXiv:2208.01615  [pdf, ps, other

    math.PR

    On Non-degenerate Chaos Processes

    Authors: Guang Yang

    Abstract: We consider a process $\{X_t\}_{0\leq t\leq 1}$ in a fixed Wiener chaos $\mathcal{H}_n$. We establish some non-degenerate properties and related results for $\{X_t\}_{0\leq t\leq 1}$. As an application, we show that solution to SDE driven by $\{X_t\}_{0\leq t\leq 1}$ admits a density. Our approach relies on an interplay between Malliavin calculus and analysis on Wiener space.

    Submitted 2 August, 2022; originally announced August 2022.

  27. arXiv:2207.11755  [pdf, other

    math.OC math.PR

    Revisiting the central limit theorems for the SGD-type methods

    Authors: Tiejun Li, Tiannan Xiao, Guoguo Yang

    Abstract: We revisited the central limit theorem (CLT) for stochastic gradient descent (SGD) type methods, including the vanilla SGD, momentum SGD and Nesterov accelerated SGD methods with constant or vanishing damping parameters. By taking advantage of Lyapunov function technique and $L^p$ bound estimates, we established the CLT under more general conditions on learning rates for broader classes of SGD met… ▽ More

    Submitted 9 June, 2023; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: 23 pages, 2 figures

    MSC Class: 60F05; 60J22; 37N40

  28. arXiv:2207.03756  [pdf, ps, other

    math.DG

    Sprays on Hamel-Funk Functions Model

    Authors: Guojun Yang

    Abstract: Hamel functions of a spray play an important role in the study of the projective metrizability of the concerned spray, and Funk functions are special Hamel functions. A Finsler metric is a special Hamel function of the spray induced by the metric itself and a Funk metric is a special Funk function of a Minkowski spray. In this paper, we study sprays on a Hamel or Funk function model. Firstly, we g… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: 21 pages

    MSC Class: 53C60; 53B40

  29. arXiv:2205.11892  [pdf, ps, other

    math.DG

    On Sprays of Scalar Curvature and Metrizability

    Authors: Guojun Yang

    Abstract: Every Finsler metric naturally induces a spray but not so for the converse. The notion for sprays of scalar (resp. isotropic) curvature has been known as a generalization for Finsler metrics of scalar (resp. isotropic) flag curvature. In this paper, a new notion, sprays of constant curvature, is introduced and especially it shows that a spray of isotropic curvature is not necessarily of constant c… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 20 pages

    MSC Class: 53C60; 53B40

  30. arXiv:2205.08038  [pdf, other

    math.OC eess.SY

    Newton and interior-point methods for (constrained) nonconvex-nonconcave minmax optimization with stability and instability guarantees

    Authors: Raphael Chinchilla, Guosong Yang, Joao P. Hespanha

    Abstract: We address the problem of finding a local solution to a nonconvex-nonconcave minmax optimization using Newton type methods, including interior-point ones. We modify the Hessian matrix of these methods such that, at each step, the modified Newton update direction can be seen as the solution to a quadratic program that locally approximates the minmax problem. Moreover, we show that by selecting the… ▽ More

    Submitted 11 February, 2024; v1 submitted 16 May, 2022; originally announced May 2022.

    Comments: Published at the Journal of the Mathematics of Control, Signals, and Systems

  31. arXiv:2205.03570  [pdf, ps, other

    math.OC cs.CC

    Iteration Complexity of an Infeasible Interior Point Methods for Seconder-order Cone Programming and its Warmstarting

    Authors: Yushu Chen, Guangwen Yang, Lu Wang, Qingzhong Gan, Haipeng Chen

    Abstract: This paper studies the worst case iteration complexity of an infeasible interior point method (IPM) for seconder order cone programming (SOCP), which is more convenient for warmstarting compared with feasible IPMs. The method studied bases on the homogeneous and self-dual model and the Monteiro-Zhang family of searching directions. Its worst case iteration complexity is… ▽ More

    Submitted 24 January, 2023; v1 submitted 7 May, 2022; originally announced May 2022.

    ACM Class: F.2.1; G.1.6

  32. arXiv:2205.01445  [pdf, other

    stat.ML cs.LG math.ST

    High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation

    Authors: Jimmy Ba, Murat A. Erdogdu, Taiji Suzuki, Zhichao Wang, Denny Wu, Greg Yang

    Abstract: We study the first gradient descent step on the first-layer parameters $\boldsymbol{W}$ in a two-layer neural network: $f(\boldsymbol{x}) = \frac{1}{\sqrt{N}}\boldsymbol{a}^\topσ(\boldsymbol{W}^\top\boldsymbol{x})$, where $\boldsymbol{W}\in\mathbb{R}^{d\times N}, \boldsymbol{a}\in\mathbb{R}^{N}$ are randomly initialized, and the training objective is the empirical MSE loss:… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: 71 pages

  33. arXiv:2203.04126  [pdf, ps, other

    math.CO

    Some multivariable Rado numbers

    Authors: Gang Yang, Yaping Mao, Changxiang He, Zhao Wang

    Abstract: The Rado number of an equation is a Ramsey-theoretic quantity associated to the equation. Let $\mathcal{E}$ be a linear equation. Denote by $\operatorname{R}_r(\mathcal{E})$ the minimal integer, if it exists, such that any $r$-coloring of $[1,\operatorname{R}_r(\mathcal{E})]$ must admit a monochromatic solution to $\mathcal{E}$. In this paper, we give upper and lower bounds for the Rado number of… ▽ More

    Submitted 10 March, 2022; v1 submitted 5 March, 2022; originally announced March 2022.

  34. arXiv:2111.00534  [pdf, other

    eess.IV cs.AI cs.CV math.OC

    Focal Attention Networks: optimising attention for biomedical image segmentation

    Authors: Michael Yeung, Leonardo Rundo, Evis Sala, Carola-Bibiane Schönlieb, Guang Yang

    Abstract: In recent years, there has been increasing interest to incorporate attention into deep learning architectures for biomedical image segmentation. The modular design of attention mechanisms enables flexible integration into convolutional neural network architectures, such as the U-Net. Whether attention is appropriate to use, what type of attention to use, and where in the network to incorporate att… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

  35. arXiv:2111.00533  [pdf, other

    eess.IV cs.AI cs.CV math.OC

    Incorporating Boundary Uncertainty into loss functions for biomedical image segmentation

    Authors: Michael Yeung, Guang Yang, Evis Sala, Carola-Bibiane Schönlieb, Leonardo Rundo

    Abstract: Manual segmentation is used as the gold-standard for evaluating neural networks on automated image segmentation tasks. Due to considerable heterogeneity in shapes, colours and textures, demarcating object boundaries is particularly difficult in biomedical images, resulting in significant inter and intra-rater variability. Approaches, such as soft labelling and distance penalty term, apply a global… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

  36. arXiv:2111.00528  [pdf, other

    eess.IV cs.AI cs.CV math.OC

    Calibrating the Dice loss to handle neural network overconfidence for biomedical image segmentation

    Authors: Michael Yeung, Leonardo Rundo, Yang Nan, Evis Sala, Carola-Bibiane Schönlieb, Guang Yang

    Abstract: The Dice similarity coefficient (DSC) is both a widely used metric and loss function for biomedical image segmentation due to its robustness to class imbalance. However, it is well known that the DSC loss is poorly calibrated, resulting in overconfident predictions that cannot be usefully interpreted in biomedical and clinical practice. Performance is often the only metric used to evaluate segment… ▽ More

    Submitted 1 November, 2022; v1 submitted 31 October, 2021; originally announced November 2021.

  37. arXiv:2109.03688  [pdf, ps, other

    math.PR

    Limit theorems for linear random fields with innovations in the domain of attraction of a stable law

    Authors: Magda Peligrad, Hailin Sang, Yimin Xiao, Guangyu Yang

    Abstract: In this paper we study the convergence in distribution and the local limit theorem for the partial sums of linear random fields with i.i.d. innovations that have infinite second moment and belong to the domain of attraction of a stable law with index $0<α\leq2$ under the condition that the innovations are centered if $1<α\leq2$ and are symmetric if $α=1$. We establish these two types of limit theo… ▽ More

    Submitted 7 May, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 23 pages, accepted by Stochastic Processes and their Applications

    MSC Class: Primary 60G60; 60G52; Secondary 60F05; 62M40

  38. arXiv:2105.14588  [pdf, ps, other

    math.PR math.DG

    A note on first eigenvalue estimates by coupling methods in Kähler and quaternion Kähler manifolds

    Authors: Fabrice Baudoin, Gunhee Cho, Guang Yang

    Abstract: In this short note, using the Kendall-Cranston coupling, we study on Kähler (resp. quaternion Kähler) manifolds first eigenvalue estimates in terms of dimension, diameter, and lower bounds on the holomorphic (resp. quaternionic) sectional curvature.

    Submitted 8 December, 2021; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: v2: References added

  39. arXiv:2105.03703  [pdf, other

    cs.LG cs.NE math.PR

    Tensor Programs IIb: Architectural Universality of Neural Tangent Kernel Training Dynamics

    Authors: Greg Yang, Etai Littwin

    Abstract: Yang (2020a) recently showed that the Neural Tangent Kernel (NTK) at initialization has an infinite-width limit for a large class of architectures including modern staples such as ResNet and Transformers. However, their analysis does not apply to training. Here, we show the same neural networks (in the so-called NTK parametrization) during training follow a kernel gradient descent dynamics in func… ▽ More

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: ICML 2021

  40. arXiv:2101.03253  [pdf, other

    cs.GT eess.SY math.OC

    Adaptive Learning in Two-Player Stackelberg Games with Application to Network Security

    Authors: Guosong Yang, Radha Poovendran, João P. Hespanha

    Abstract: We study a two-player Stackelberg game with incomplete information such that the follower's strategy belongs to a known family of parameterized functions with an unknown parameter vector. We design an adaptive learning approach to simultaneously estimate the unknown parameter and minimize the leader's cost, based on adaptive control techniques and hysteresis switching. Our approach guarantees that… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

    MSC Class: 91A26; 91A65 (Primary) 37N40; 93C40; 65L20 (Secondary)

  41. arXiv:2009.10685  [pdf, other

    cs.NE math.PR

    Tensor Programs III: Neural Matrix Laws

    Authors: Greg Yang

    Abstract: In a neural network (NN), *weight matrices* linearly transform inputs into *preactivations* that are then transformed nonlinearly into *activations*. A typical NN interleaves multitudes of such linear and nonlinear transforms to express complex functions. Thus, the (pre-)activations depend on the weights in an intricate manner. We show that, surprisingly, (pre-)activations of a randomly initialize… ▽ More

    Submitted 8 May, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

  42. arXiv:2008.10116  [pdf, ps, other

    math.PR

    Octonionic Brownian Windings

    Authors: Gunhee Cho, Guang Yang

    Abstract: We define and study the windings along Brownian paths in the octonionic Euclidean, projective and hyperbolic spaces which are isometric to 8-dimensional Riemannian model spaces. In particular, the asymptotic laws of these windings are shown to be Gaussian for the flat and spherical geometries while the hyperbolic winding exhibits a different long time-behavior.

    Submitted 20 August, 2021; v1 submitted 23 August, 2020; originally announced August 2020.

    Comments: To appear in Journal of Theoretical Probability

  43. arXiv:2005.09192  [pdf, ps, other

    math.PR

    A Version of Hörmander's Theorem for Markovian Rough Paths

    Authors: Guang Yang

    Abstract: We consider a rough differential equation of the form \(dY_t=\sum_i V_i(Y_t)d\boldsymbol{X}^i_t+V_0(Y_t)dt \), where \(\boldsymbol{X}_t \) is a Markovian rough path. We demonstrate that if the vector fields \((V_i)_{0\leq i\leq d} \) satisfy Hörmander's bracket generating condition, then \(Y_t\) admits a smooth density with a Gaussian type upper bound, given that the generator of \(X_t\) satisfy c… ▽ More

    Submitted 1 February, 2022; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: Improved the writing, results unchanged

  44. arXiv:2003.05603  [pdf, ps, other

    math.DG math.PR

    Brownian motions and heat kernel lower bounds on Kähler and quaternion Kähler manifolds

    Authors: Fabrice Baudoin, Guang Yang

    Abstract: We study the radial parts of the Brownian motions on Kähler and quaternion Kähler manifolds. Thanks to sharp Laplacian comparison theorems, we deduce as a consequence a sharp Cheeger-Yau type lower bound for the heat kernels of such manifolds and also sharp Cheng's type estimates for the Dirichlet eigenvalues of metric balls.

    Submitted 11 March, 2020; originally announced March 2020.

  45. arXiv:1912.01820  [pdf, ps, other

    math.FA

    The equivalent theorem of a new generalized Bernstein-Bezier operators

    Authors: Qiu-Lan Qi, Dan-Dan Guo, Ge Yang

    Abstract: In this paper, a new generalized Bernstein-Bezier type operators is constructed.The estimates of the moments of these operators are investigated. The rate of convergence in terms of modulus of continuity is given. Then, the equivalent theorem of these operators is studied.

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: 15 pages

  46. arXiv:1910.03764  [pdf, ps, other

    math.RT

    Geck's Conjecture and the Generalized Gelfand-Graev Representations in Bad Characteristic

    Authors: Junbin Dong, Gao Yang

    Abstract: For a connected reductive algebraic group $G$ defined over a finite field $\mathbb F_q$, Kawanaka introduced the generalized Gelfand-Graev representations (GGGRs for short) of the finite group $G(\mathbb F_q)$ in the case where $q$ is a power of a good prime for $G$. This representation has been widely studied and used in various contexts. Recently, Geck proposed a conjecture, characterizing Luszt… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: 33 pages

  47. arXiv:1909.02159  [pdf, ps, other

    math.AC cs.LG math.CO

    Free resolutions of function classes via order complexes

    Authors: Justin Chen, Christopher Eur, Greg Yang, Mengyuan Zhang

    Abstract: Function classes are collections of Boolean functions on a finite set, which are fundamental objects of study in theoretical computer science. We study algebraic properties of ideals associated to function classes previously defined by the third author. We consider the broad family of intersection-closed function classes, and describe cellular free resolutions of their ideals by order complexes of… ▽ More

    Submitted 16 June, 2020; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: 18 pages with figures. Final journal version, to appear in Advances in Applied Mathematics

    MSC Class: 13D02; 68Q32; 05E40; 06A12; 05B35

  48. arXiv:1905.01422  [pdf, other

    cs.LG math.OC stat.ML

    An Adaptive Remote Stochastic Gradient Method for Training Neural Networks

    Authors: Yushu Chen, Hao Jing, Wenlai Zhao, Zhiqiang Liu, Ouyi Li, Liang Qiao, Wei Xue, Guangwen Yang

    Abstract: We present the remote stochastic gradient (RSG) method, which computes the gradients at configurable remote observation points, in order to improve the convergence rate and suppress gradient noise at the same time for different curvatures. RSG is further combined with adaptive methods to construct ARSG for acceleration. The method is efficient in computation and memory, and is straightforward to i… ▽ More

    Submitted 6 September, 2020; v1 submitted 3 May, 2019; originally announced May 2019.

    Comments: The generalization is improved by modifying the preconditioner. For training ResNet-50 on ImageNet, ARSG outperforms ADAM in convergence speed and meanwhile it surpasses SGD in generalization. We also present a convergence bound in non-convex settings

  49. arXiv:1902.08129  [pdf, other

    cs.NE cond-mat.dis-nn cs.LG math.DS

    A Mean Field Theory of Batch Normalization

    Authors: Greg Yang, Jeffrey Pennington, Vinay Rao, Jascha Sohl-Dickstein, Samuel S. Schoenholz

    Abstract: We develop a mean field theory for batch normalization in fully-connected feedforward neural networks. In so doing, we provide a precise characterization of signal propagation and gradient backpropagation in wide batch-normalized networks at initialization. Our theory shows that gradient signals grow exponentially in depth and that these exploding gradients cannot be eliminated by tuning the initi… ▽ More

    Submitted 5 March, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: To appear in ICLR 2019

  50. arXiv:1809.10210  [pdf, ps, other

    stat.ML cs.LG math.OC

    A Machine Learning Approach to Shipping Box Design

    Authors: Guang Yang, Cun Mu

    Abstract: Having the right assortment of shipping boxes in the fulfillment warehouse to pack and ship customer's online orders is an indispensable and integral part of nowadays eCommerce business, as it will not only help maintain a profitable business but also create great experiences for customers. However, it is an extremely challenging operations task to strategically select the best combination of tens… ▽ More

    Submitted 25 March, 2019; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: Accepted by 2019 Intelligent Systems Conference (A shorter version of the paper is presented at the 13th INFORMS Workshop on Data Mining and Decision Analytics)