PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration

Authors: Runzhao Yao, Shaoyi Du, Wenting Cui, Canhui Tang, Chengwu Yang

Abstract: Learning rotation-invariant distinctive features is a fundamental requirement for point cloud registration. Existing methods often use rotation-sensitive networks to extract features, while employing rotation augmentation to crudely learn an approximate invariant mapping. This makes networks fragile to rotations and overweight, and hinders the distinctiveness of features. To tackle these problems, we propose a novel position-aware rotation-equivariant network for efficient, lightweight, and robust registration. The network can provide a strong model inductive bias to learn rotation-equivariant/invariant features, thus addressing the aforementioned limitations. To further improve the distinctiveness of descriptors, we propose a position-aware convolution, which can better learn spatial information of local structures. Moreover, we also propose a feature-based hypothesis proposer. It leverages rotation-equivariant features that encode fine-grained structure orientations to generate reliable model hypotheses. Each correspondence can generate a hypothesis, thus it is more efficient than classic estimators that require multiple reliable correspondences. Accordingly, a contrastive rotation loss is presented to enhance the robustness of rotation-equivariant features against data degradation. Extensive experiments on indoor and outdoor datasets demonstrate that our method significantly outperforms the SOTA methods in terms of registration recall while being lightweight and keeping a fast speed. Moreover, experiments on rotated datasets demonstrate its robustness against rotation variations. Code is available at https://github.com/yaorz97/PARENet.

Submitted 14 July, 2024; originally announced July 2024.

arXiv:2406.10057 [pdf, other]

First Multi-Dimensional Evaluation of Flowchart Comprehension for Multimodal Large Language Models

Authors: Enming Zhang, Ruobing Yao, Huanyong Liu, Junhui Yu, Jiale Wang

Abstract: With the development of Multimodal Large Language Models (MLLMs) technology, its general capabilities are increasingly powerful. To evaluate the various abilities of MLLMs, numerous evaluation systems have emerged. However, there is still a lack of a comprehensive method to evaluate MLLMs in the tasks related to flowcharts, which are very important in daily life and work. We propose the first comprehensive method, FlowCE, to assess MLLMs across various dimensions for tasks related to flowcharts. It encompasses evaluating MLLMs' abilities in Reasoning, Localization Recognition, Information Extraction, Logical Verification, and Summarization on flowcharts. However, we find that even the GPT4o model achieves only a score of 56.63. Among open-source models, Phi-3-Vision obtained the highest score of 49.97. We hope that FlowCE can contribute to future research on MLLMs for tasks based on flowcharts. https://github.com/360AILAB-NLP/FlowCE

Submitted 18 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

arXiv:2406.07952 [pdf, other]

Spatial-Frequency Dual Progressive Attention Network For Medical Image Segmentation

Authors: Zhenhuan Zhou, Along He, Yanlin Wu, Rui Yao, Xueshuo Xie, Tao Li

Abstract: In medical images, various types of lesions often manifest significant differences in their shape and texture. Accurate medical image segmentation demands deep learning models with robust capabilities in multi-scale and boundary feature learning. However, previous networks still have limitations in addressing the above issues. Firstly, previous networks simultaneously fuse multi-level features or employ deep supervision to enhance multi-scale learning. However, this may lead to feature redundancy and excessive computational overhead, which is not conducive to network training and clinical deployment. Secondly, the majority of medical image segmentation networks exclusively learn features in the spatial domain, disregarding the abundant global information in the frequency domain. This results in a bias towards low-frequency components, neglecting crucial high-frequency information. To address these problems, we introduce SF-UNet, a spatial-frequency dual-domain attention network. It comprises two main components: the Multi-scale Progressive Channel Attention (MPCA) block, which progressively extracts multi-scale features across adjacent encoder layers, and the lightweight Frequency-Spatial Attention (FSA) block, with only 0.05M parameters, enabling concurrent learning of texture and boundary features from both spatial and frequency domains. We validate the effectiveness of the proposed SF-UNet on three public datasets. Experimental results show that compared to previous state-of-the-art (SOTA) medical image segmentation networks, SF-UNet achieves the best performance, and achieves up to 9.4% and 10.78% improvement in DSC and IOU. Codes will be released at https://github.com/nkicsl/SF-UNet.

Submitted 12 June, 2024; originally announced June 2024.

Comments: 8 pages

arXiv:2406.04790 [pdf, ps, other]

On location of maximal gradient of torsion function over some non-symmetric planar domains

Authors: Qinfeng Li, Shuangquan Xie, Hang Yang, Ruofei Yao

Abstract: We investigate the location of the maximal gradient of the torsion function on some non-symmetric planar domains. First, for triangles, by reflection method, we show that the maximal gradient of the torsion function always occurs on the longest sides, lying between the foot of the altitude and the middle point. Moreover, via nodal line analysis and continuity method, we demonstrate that restricted on each side, the critical point of gradient of the torsion function is unique and nondegenerate. Second, by establishing uniform estimates for narrow domains, we prove that as a planar domain bounded by two graphs of function becomes increasingly narrow, the location of maximal gradient of its torsion tends toward the endpoint of the longest vertical line segment, with smaller curvature among them. This shows that Saint-Venant's conjecture on location of fail points is valid for asymptotically narrow domains. Third, using the reflection method, we prove that for a non-concentric annulus, maximal gradient of torsion always occurs at the point on the inner ring closest to the center of the outer ring.

Submitted 14 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

arXiv:2405.04628 [pdf, other]

Wasserstein Proximal Coordinate Gradient Algorithms

Authors: Rentian Yao, Xiaohui Chen, Yun Yang

Abstract: Motivated by approximate Bayesian computation using mean-field variational approximation and the computation of equilibrium in multi-species systems with cross-interaction, this paper investigates the composite geodesically convex optimization problem over multiple distributions. The objective functional under consideration is composed of a convex potential energy on a product of Wasserstein spaces and a sum of convex self-interaction and internal energies associated with each distribution. To efficiently solve this problem, we introduce the Wasserstein Proximal Coordinate Gradient (WPCG) algorithms with parallel, sequential and random update schemes. Under a quadratic growth (QG) condition that is weaker than the usual strong convexity requirement on the objective functional, we show that WPCG converges exponentially fast to the unique global optimum. In the absence of the QG condition, WPCG is still demonstrated to converge to the global optimal solution, albeit at a slower polynomial rate. Numerical results for both motivating examples are consistent with our theoretical findings.

Submitted 7 May, 2024; originally announced May 2024.

arXiv:2404.19401 [pdf, other]

UniFS: Universal Few-shot Instance Perception with Point Representations

Authors: Sheng Jin, Ruijie Yao, Lumin Xu, Wentao Liu, Chen Qian, Ji Wu, Ping Luo

Abstract: Instance perception tasks (object detection, instance segmentation, pose estimation, counting) play a key role in industrial applications of visual models. As supervised learning methods suffer from high labeling cost, few-shot learning methods which effectively learn from a limited number of labeled examples are desired. Existing few-shot learning methods primarily focus on a restricted set of tasks, presumably due to the challenges involved in designing a generic model capable of representing diverse tasks in a unified manner. In this paper, we propose UniFS, a universal few-shot instance perception model that unifies a wide range of instance perception tasks by reformulating them into a dynamic point representation learning framework. Additionally, we propose Structure-Aware Point Learning (SAPL) to exploit the higher-order structural relationship among points to further enhance representation learning. Our approach makes minimal assumptions about the tasks, yet it achieves competitive results compared to highly specialized and well optimized specialist models. Codes will be released soon.

Submitted 15 July, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

Comments: Accepted by ECCV 2024

arXiv:2404.14701 [pdf, other]

Deep neural networks for choice analysis: Enhancing behavioral regularity with gradient regularization

Authors: Siqi Feng, Rui Yao, Stephane Hess, Ricardo A. Daziano, Timothy Brathwaite, Joan Walker, Shenhao Wang

Abstract: Deep neural networks (DNNs) frequently present behaviorally irregular patterns, significantly limiting their practical potential and theoretical validity in travel behavior modeling. This study proposes strong and weak behavioral regularities as novel metrics to evaluate the monotonicity of individual demand functions (a.k.a. law of demand), and further designs a constrained optimization framework with six gradient regularizers to enhance DNNs' behavioral regularity. The proposed framework is applied to travel survey data from Chicago and London to examine the trade-off between predictive power and behavioral regularity for large vs. small sample scenarios and in-domain vs. out-of-domain generalizations. The results demonstrate that, unlike models with strong behavioral foundations such as the multinomial logit, the benchmark DNNs cannot guarantee behavioral regularity. However, gradient regularization (GR) increases DNNs' behavioral regularity by around 6 percentage points (pp) while retaining their relatively high predictive power. In the small sample scenario, GR is more effective than in the large sample scenario, simultaneously improving behavioral regularity by about 20 pp and log-likelihood by around 1.7%. Compared with the in-domain generalization of DNNs, GR works more effectively in out-of-domain generalization: it drastically improves the behavioral regularity of poorly performing benchmark DNNs by around 65 pp, indicating the criticality of behavioral regularization for enhancing model transferability and application in forecasting. Moreover, the proposed framework is applicable to other NN-based choice models such as TasteNets. Future studies could use behavioral regularity as a metric along with log-likelihood in evaluating travel demand models, and investigate other methods to further enhance behavioral regularity when adopting complex machine learning models.

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2402.10834 [pdf, other]

Agent-based Simulation Evaluation of CBD Tolling: A Case Study from New York City

Authors: Qingnan Liang, Ruili Yao, Ruixuan Zhang, Zhibin Chen, Guoyuan Wu

Abstract: Congestion tolling schemes have been widely developed and adopted as an effective tool to mitigate urban traffic congestion and enhance transportation system sustainability. Nevertheless, these tolling schemes are often tailored on a city-by-city or even area-by-area basis, and the cost of conducting field experiments often makes the design and evaluation process challenging. In this work, we leverage MATSim, a simulation platform that provides microscopic behaviors at the agent level, to evaluate the performance of tolling schemes. Specifically, we conduct a case study of the Manhattan Central Business District (CBD) in New York City (NYC) using a fine-granularity traffic network model in the large-scale agent behavior setting. The flexibility of MATSim enables the implementation of a customized tolling policy proposed yet not deployed by the NYC agency while providing detailed interpretations. The quantitative and qualitative results indicate that the tested tolling program can regulate the personal vehicle volume in the CBD area and encourage the usage of public transportation, which proves to be a practical move towards sustainable transportation systems. More importantly, our work demonstrates that agent-based simulation helps better understand the travel pattern change subject to tolling in dense and complex urban environments, and it has the potential to facilitate efficient decision-making for the devotion to sustainable traffic management.

Submitted 16 February, 2024; originally announced February 2024.

Comments: Accepted by 2024 IEEE Forum on Integrated and Sustainable Transportation Systems

arXiv:2402.07461 [pdf, other]

Simulating the spin-boson model with a controllable reservoir in an ion trap

Authors: G. -X. Wang, Y. -K. Wu, R. Yao, W. -Q. Lian, Z. -J. Cheng, Y. -L. Xu, C. Zhang, Y. Jiang, Y. -Z. Xu, B. -X. Qi, P. -Y. Hou, Z. -C. Zhou, L. He, L. -M. Duan

Abstract: The spin-boson model is a prototypical model for open quantum dynamics. Here we simulate the spin-boson model using a chain of trapped ions where a spin is coupled to a structured reservoir of bosonic modes. We engineer the spectral density of the reservoir by adjusting the ion number, the target ion location, the laser detuning to the phonon sidebands, and the number of frequency components in the laser, and we observe their effects on the collapse and revival of the initially encoded information. Our work demonstrates the ion trap as a powerful platform for simulating open quantum dynamics with complicated reservoir structures.

Submitted 12 February, 2024; originally announced February 2024.

arXiv:2401.17912 [pdf, ps, other]

Monotonicity of positive solutions to semilinear elliptic equations with mixed boundary conditions in triangles

Authors: Rui Li, Ruofei Yao

Abstract: This paper investigates semilinear elliptic problems in planar triangles with Dirichlet conditions specified on one side of the boundary and Neumann conditions imposed on the remaining two sides. By employing the moving plane method, we establish that the positive solution is monotone in the normal direction of the Dirichlet side when the Neumann vertex is non-obtuse. In the case where the Neumann vertex is obtuse, the positive solution is monotone in the normal direction of the longer Neumann side provided some technical conditions hold. Furthermore, this monotonicity property extends to the first mixed eigenfunction in triangles through the continuity method via domain deformation. It is noteworthy that the maximum of the first positive eigenfunction in a triangle with mixed boundary conditions, consisting of two Neumann sides and one Dirichlet side, is uniquely located on the Neumann side with the greater length. This maximum point coincides with the Neumann vertex if and only if either the Neumann vertex is non-obtuse or the two Neumann sides have equal lengths. This result successfully resolves a specific problem posed within the Polymath project: Polymath7 research thread 1.

Submitted 31 January, 2024; originally announced January 2024.

Comments: 29 pages

arXiv:2401.01829 [pdf, other]

Constraints on Axion-like Particles from the Observation of Galactic Sources by LHAASO

Authors: Jun Li, Xiao-Jun Bi, Lin-Qing Gao, Xiaoyuan Huang, Run-Min Yao, Peng-Fei Yin

Abstract: High-energy photons may oscillate with axion-like particles (ALPs) when they propagate through the Milky Way's magnetic field, resulting in an alteration in the observed photon energy spectrum. The ultra-high energy gamma-ray spectra, measured by the Large High Altitude Air Shower Observatory (LHAASO) up to $\mathcal{O}(1)~\mathrm{PeV}$, provide a promising opportunity to investigate the ALP-photon oscillation effect. In this study, we utilize the gamma-ray spectra of four Galactic sources measured by LHAASO, including the Crab Nebula, LHAASO J2226+6057, LHAASO J1908+0621, and LHAASO J1825-1326, to explore this effect. We employ the $\rm CL_s$ method to set constraints on the ALP parameters. Combining the observations of the four sources, our analysis reveals that the ALP-photon coupling $g_{a\gamma}$ is constrained to be smaller than $1.4\times10^{-10}$ ${\rm GeV}^{-1}$ for the ALP mass of $\sim 4\times10^{-7}~\mathrm{eV}$ at the 95% C.L. By combining the observations of the Crab Nebula from LHAASO and other experiments, we find that the ALP-photon coupling could be set to be about $7.2\times10^{-11}$ ${\rm GeV}^{-1}$ for the ALP mass $\sim 4\times10^{-7}~\mathrm{eV}$, which is in close proximity to the CAST constraint.

Submitted 3 January, 2024; originally announced January 2024.

arXiv:2312.14181 [pdf, other]

doi 10.1103/PhysRevB.109.155407

Reversal of Orbital Hall Conductivity and Emergence of Tunable Topological Quantum States in Orbital Hall Insulator

Authors: Shilei Ji, Chuye Quan, Ruijia Yao, Jianping Yang, Xing'ao Li

Abstract: Recent findings indicate that orbital angular momentum (OAM) has the capability to induce the intrinsic orbital Hall effect (OHE), which is characterized by orbital Chern number in the orbital Hall insulator. Unlike the spin-polarized channel in the quantum anomalous Hall insulator, the OAM is valley-locked, posing challenges in manipulating the corresponding edge state. Here we demonstrate the sign-reversal orbital Chern number through strain engineering by combining the $k \cdot p$ model and first-principles calculation. Under the manipulation of strain, we observe the transfer of non-zero OAM from the valence band to the conduction band, aligning with the orbital contribution in the electronic structure. Our investigation reveals that electrons and holes with OAM exhibit opposing trajectories, resulting in a reversal of the orbital Hall conductivity. Furthermore, we explore the topological quantum state between the sign-reversible OHE.

Submitted 21 February, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

arXiv:2312.12611 [pdf]

A Semi-Analytical Approach for State-Space Electromagnetic Transient Simulation Using the Differential Transformation

Authors: Min Xiong, Kaiyang Huang, Yang Liu, Rui Yao, Kai Sun, Feng Qiu

Abstract: Electromagnetic transient (EMT) simulation is a crucial tool for power system dynamic analysis because of its detailed component modeling and high simulation accuracy. However, it suffers from computational burdens for large power grids since a tiny time step is typically required for accuracy. This paper proposes an efficient and accurate semi-analytical approach for state-space EMT simulations of power grids. It employs high-order semi-analytical solutions derived using the differential transformation from the state-space EMT grid model. The approach incorporates a proposed variable time step strategy based on equation imbalance, leveraging structural information of the grid model, to enlarge the time step and accelerate simulations, while high resolution is maintained by reconstructing detailed fast EMT dynamics through an efficient dense output mechanism. It also addresses limit-induced switches during large time steps by using a binary search-enhanced quadratic interpolation algorithm. Case studies are conducted on EMT models of the IEEE 39-bus system and a synthetic 390-bus system to demonstrate the merits of the new simulation approach against traditional methods.

Submitted 19 December, 2023; originally announced December 2023.

arXiv:2312.00848 [pdf, other]

Perturbed utility stochastic traffic assignment

Authors: Rui Yao, Mogens Fosgerau, Mads Paulsen, Thomas Kjær Rasmussen

Abstract: This paper develops a fast algorithm for computing the equilibrium assignment with the perturbed utility route choice (PURC) model. Without compromise, this allows the significant advantages of the PURC model to be used in large-scale applications. We formulate the PURC equilibrium assignment problem as a convex minimization problem and find a closed-form stochastic network loading expression that allows us to formulate the Lagrangian dual of the assignment problem as an unconstrained optimization problem. To solve this dual problem, we formulate a quasi-Newton accelerated gradient descent algorithm (qN-AGD*). Our numerical evidence shows that qN-AGD* clearly outperforms a conventional primal algorithm as well as a plain accelerated gradient descent algorithm. qN-AGD* is fast with a runtime that scales about linearly with the problem size, indicating that solving the perturbed utility assignment problem is feasible also with very large networks.

Submitted 1 December, 2023; originally announced December 2023.

arXiv:2311.17629 [pdf, other]

Efficient Decoder for End-to-End Oriented Object Detection in Remote Sensing Images

Authors: Jiaqi Zhao, Zeyu Ding, Yong Zhou, Hancheng Zhu, Wenliang Du, Rui Yao, Abdulmotaleb El Saddik

Abstract: Object instances in remote sensing images often appear with multiple orientations, varying scales, and dense distribution. These issues bring challenges to end-to-end oriented object detectors, including multi-scale feature alignment and a large number of queries. To address these limitations, we propose an end-to-end oriented detector equipped with an efficient decoder, which incorporates two techniques, Rotated RoI attention (RRoI attention) and Selective Distinct Queries (SDQ). Specifically, RRoI attention effectively focuses on oriented regions of interest through a cross-attention mechanism and aligns multi-scale features. SDQ collects queries from intermediate decoder layers and then filters similar queries to obtain distinct queries. The proposed SDQ can facilitate the optimization of one-to-one label assignment, without introducing redundant initial queries or extra auxiliary branches. Extensive experiments on five datasets demonstrate the effectiveness of our method. Notably, our method achieves state-of-the-art performance on DIOR-R (67.31% mAP), DOTA-v1.5 (67.43% mAP), and DOTA-v2.0 (53.28% mAP) with the ResNet50 backbone.

Submitted 1 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

Comments: 11 pages, 7 figures, 13 tables

arXiv:2311.17163 [pdf, other]

A Site-Resolved 2D Quantum Simulator with Hundreds of Trapped Ions

Authors: S. -A. Guo, Y. -K. Wu, J. Ye, L. Zhang, W. -Q. Lian, R. Yao, Y. Wang, R. -Y. Yan, Y. -J. Yi, Y. -L. Xu, B. -W. Li, Y. -H. Hou, Y. -Z. Xu, W. -X. Guo, C. Zhang, B. -X. Qi, Z. -C. Zhou, L. He, L. -M. Duan

Abstract: A large qubit capacity and an individual readout capability are two crucial requirements for large-scale quantum computing and simulation. As one of the leading physical platforms for quantum information processing, the ion trap has achieved quantum simulation of tens of ions with site-resolved readout in 1D Paul trap, and that of hundreds of ions with global observables in 2D Penning trap. However, integrating these two features into a single system is still very challenging. Here we report the stable trapping of 512 ions in a 2D Wigner crystal and the sideband cooling of their transverse motion. We demonstrate the quantum simulation of long-range quantum Ising models with tunable coupling strengths and patterns, with or without frustration, using 300 ions. Enabled by the site resolution in the single-shot measurement, we observe rich spatial correlation patterns in the quasi-adiabatically prepared ground states, which allows us to verify quantum simulation results by comparing with the calculated collective phonon modes and with classical simulated annealing. We further probe the quench dynamics of the Ising model in a transverse field to demonstrate quantum sampling tasks. Our work paves the way for simulating classically intractable quantum dynamics and for running NISQ algorithms using 2D ion trap quantum simulators.

Submitted 11 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

arXiv:2311.12659 [pdf, ps, other]

Uniqueness of critical points of the second Neumann eigenfunctions on triangles

Authors: Hongbin Chen, Changfeng Gui, Ruofei Yao

Abstract: This paper deals with the second Neumann eigenfunction ${u}$ of any planar triangle ${T}$. In a recent work by C. Judge and S. Mondal [Ann. Math., 2022], it was established that ${u}$ does not have any critical point within the interior of ${T}$. In this paper, we show the uniqueness of non-vertex critical point and the monotonicity property of the second eigenfunction. To be more precise, when ${T}$ is not an equilateral triangle, the non-vertex critical point exists if and only if ${T}$ is an acute triangle that is not a super-equilateral triangle, and the global extrema of ${u}$ are achieved at and only at the endpoints of the longest side. This establishes the origin theorem and conjecture 13.6 initially posed by C. Judge and S. Mondal [Ann. Math., 2020]. Our proof relies heavily on continuity methods, eigenvalue inequalities, and the maximum principle to establish these results.

Submitted 21 November, 2023; originally announced November 2023.

Comments: 41 pages, 6 figures

arXiv:2311.00894 [pdf, other]

Minimizing Convex Functionals over Space of Probability Measures via KL Divergence Gradient Flow

Authors: Rentian Yao, Linjun Huang, Yun Yang

Abstract: Motivated by the computation of the non-parametric maximum likelihood estimator (NPMLE) and the Bayesian posterior in statistics, this paper explores the problem of convex optimization over the space of all probability distributions. We introduce an implicit scheme, called the implicit KL proximal descent (IKLPD) algorithm, for discretizing a continuous-time gradient flow relative to the Kullback-Leibler divergence for minimizing a convex target functional. We show that IKLPD converges to a global optimum at a polynomial rate from any initialization; moreover, if the objective functional is strongly convex relative to the KL divergence, for example, when the target functional itself is a KL divergence as in the context of Bayesian posterior computation, IKLPD exhibits globally exponential convergence. Computationally, we propose a numerical method based on normalizing flow to realize IKLPD. Conversely, our numerical method can also be viewed as a new approach that sequentially trains a normalizing flow for minimizing a convex functional with a strong theoretical guarantee.

Submitted 1 November, 2023; originally announced November 2023.

arXiv:2310.19113 [pdf, other]

Dynamic V2X Autonomous Perception from Road-to-Vehicle Vision

Authors: Jiayao Tan, Fan Lyu, Linyan Li, Fuyuan Hu, Tingliang Feng, Fenglei Xu, Rui Yao

Abstract: Vehicle-to-everything (V2X) perception is an innovative technology that enhances vehicle perception accuracy, thereby elevating the security and reliability of autonomous systems. However, existing V2X perception methods focus on static scenes from mainly vehicle-based vision, which is constrained by sensor capabilities and communication loads. To adapt V2X perception models to dynamic scenes, we propose to build V2X perception from road-to-vehicle vision and present the Adaptive Road-to-Vehicle Perception (AR2VP) method. In AR2VP, we leverage roadside units to offer stable, wide-range sensing capabilities and serve as communication hubs. AR2VP is devised to tackle both intra-scene and inter-scene changes. For the former, we construct a dynamic perception representing module, which efficiently integrates vehicle perceptions, enabling vehicles to capture a more comprehensive range of dynamic factors within the scene. Moreover, we introduce a road-to-vehicle perception compensating module, aimed at preserving the maximized roadside unit perception information in the presence of intra-scene changes. For inter-scene changes, we implement an experience replay mechanism leveraging the roadside unit's storage capacity to retain a subset of historical scene data, maintaining model robustness in response to inter-scene shifts. We conduct perception experiments on 3D object detection and segmentation, and the results show that AR2VP excels in both performance-bandwidth trade-offs and adaptability within dynamic environments.

Submitted 29 October, 2023; originally announced October 2023.

arXiv:2310.16499 [pdf, other]

Data Optimization in Deep Learning: A Survey

Authors: Ou Wu, Rujing Yao

Abstract: Large-scale, high-quality data are considered an essential factor for the successful application of many deep learning techniques. Meanwhile, numerous real-world deep learning tasks still have to contend with the lack of sufficient amounts of high-quality data. Additionally, issues such as model robustness, fairness, and trustworthiness are also closely related to training data. Consequently, a huge number of studies in the existing literature have focused on the data aspect in deep learning tasks. Some typical data optimization techniques include data augmentation, logit perturbation, sample weighting, and data condensation. These techniques usually come from different deep learning divisions and their theoretical inspirations or heuristic motivations may seem unrelated to each other. This study aims to organize a wide range of existing data optimization methodologies for deep learning from the previous literature, and makes the effort to construct a comprehensive taxonomy for them. The constructed taxonomy considers the diversity of split dimensions, and deep sub-taxonomies are constructed for each dimension. On the basis of the taxonomy, connections among the extensive data optimization methods for deep learning are built in terms of four aspects. We also discuss several promising and interesting future directions. The constructed taxonomy and the revealed connections are intended to support a better understanding of existing methods and the design of novel data optimization techniques.
Furthermore, our aspiration for this survey is to promote data optimization as an independent subdivision of deep learning. A curated, up-to-date list of resources related to data optimization in deep learning is available at https://github.com/YaoRujing/Data-Optimization.

Submitted 25 October, 2023; originally announced October 2023.

arXiv:2310.11391 [pdf, other]

doi 10.1088/1475-7516/2024/01/026

Constraints on Axion-like Particles from the Observation of GRB 221009A by LHAASO

Authors: Lin-Qing Gao, Xiao-Jun Bi, Jun Li, Run-Min Yao, Peng-Fei Yin

Abstract: The LHAASO collaboration recently reported the measurement of the gamma-ray spectra of GRB 221009A, which is the brightest burst ever, covering an energy range from 0.3 $\mathrm{TeV}$ to about 10 $\mathrm{TeV}$. Based on the observation, we investigate the ALP-photon oscillation effect in the host galaxy of GRB 221009A and the Milky Way. The ${\rm CL_s}$ method is applied to set constraints on the ALP parameters in this study. Given the uncertain magnetic field configuration in the host galaxy, we use three different models: a homogeneous magnetic field model, a magnetic field model identical to that of the Milky Way, and a model constructed from the HST observations of the host galaxy. We find that the constraints derived using these three host galaxy magnetic field models are comparable. Our results are complementary in the small ALP mass regions compared with other experiments.

Submitted 15 January, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: 8 pages, 13 figures

arXiv:2310.08285 [pdf, other]

How would mobility-as-a-service (MaaS) platform survive as an intermediary? From the viewpoint of stability in many-to-many matching

Authors: Rui Yao, Kenan Zhang

Abstract: Mobility-as-a-service (MaaS) provides seamless door-to-door trips by integrating different transport modes. Although many MaaS platforms have emerged in recent years, most of them remain at a limited integration level. This study investigates the assignment and pricing problem for a MaaS platform as an intermediary in a multi-modal transportation network, which purchases capacity from service operators and sells multi-modal trips to travelers. The analysis framework of many-to-many stable matching is adopted to decompose the joint design problem and to derive the stability condition such that both operators and travelers are willing to participate in the MaaS system. To maximize the flexibility in route choice and remove boundaries between modes, we design an origin-destination pricing scheme for MaaS trips. On the supply side, we propose a wholesale purchase price for service capacity. Accordingly, the assignment problem is reformulated and solved as a bi-level program, where MaaS travelers make multi-modal trips to minimize their travel costs while interacting with non-MaaS travelers in the multi-modal transport system. We prove that, under the proposed pricing scheme, there always exists a stable outcome to the overall many-to-many matching problem. Further, given an optimal assignment and under some mild conditions, a unique optimal pricing scheme is ensured.
Numerical experiments conducted on the extended Sioux Falls network also demonstrate that the proposed MaaS system could create a win-win-win situation: the MaaS platform is profitable, and both traveler welfare and transit operator revenues increase from a baseline scenario without MaaS.

Submitted 12 October, 2023; originally announced October 2023.

arXiv:2309.02510 [pdf, other]

doi 10.5802/crphys.173

Geometric squeezing of rotating quantum gases into the lowest Landau level

Authors: Valentin Crépel, Ruixiao Yao, Biswaroop Mukherjee, Richard J. Fletcher, Martin Zwierlein

Abstract: The simulation of quantum Hall physics with rotating quantum gases is witnessing a revival due to recent experimental advances that enabled the observation of a Bose-Einstein condensate entirely contained in its lowest kinetic energy state, i.e. the lowest Landau level. We theoretically describe this experimental result, and show that it can be interpreted as a squeezing of the geometric degree of freedom of the problem, the guiding center metric. This "geometric squeezing" offers an unprecedented experimental control over the quantum geometry in Landau-level analogues, and at the same time opens a realistic path towards achieving correlated quantum phases akin to quantum Hall states with neutral atoms.

Submitted 5 September, 2023; originally announced September 2023.

Journal ref: Comptes Rendus. Physique Volume 24 (2023) no. S3, pp. 241-262

arXiv:2308.14378 [pdf, other]

GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition

Authors: Ruijie Yao, Sheng Jin, Lumin Xu, Wang Zeng, Wentao Liu, Chen Qian, Ping Luo, Ji Wu

Abstract: Multi-Label Image Recognition (MLIR) is a challenging task that aims to predict multiple object labels in a single image while modeling the complex relationships between labels and image regions. Although convolutional neural networks and vision transformers have succeeded in processing images as regular grids of pixels or patches, these representations are sub-optimal for capturing irregular and discontinuous regions of interest. In this work, we present the first fully graph convolutional model, Group K-nearest neighbor based Graph convolutional Network (GKGNet), which models the connections between semantic label embeddings and image patches in a flexible and unified graph structure. To address the scale variance of different objects and to capture information from multiple perspectives, we propose the Group KGCN module for dynamic graph construction and message passing. Our experiments demonstrate that GKGNet achieves state-of-the-art performance with significantly lower computational costs on the challenging multi-label datasets, i.e., the MS-COCO and VOC2007 datasets. We will release the code and models to facilitate future research in this area.

Submitted 15 July, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

Comments: Accepted by ECCV 2024

arXiv:2308.08273 [pdf, ps, other]

On location of maximum of gradient of torsion function

Authors: Qinfeng Li, Ruofei Yao

Abstract: It has been widely believed that for a planar convex domain with two coordinate axes of symmetry, the location of the maximal norm of the gradient of the torsion function is either linked to contact points of the largest inscribed circle or connected to points on the boundary of minimal curvature. However, we show that this is not quite true in general. Actually, we derive the precise formula for the location of the maximal norm of the gradient of the torsion function on nearly ball domains in $\mathbb{R}^n$, which displays a nonlocal nature and thus does not inherently establish a connection to the aforementioned two types of points. Consequently, explicit counterexamples can be straightforwardly constructed to illustrate this deviation from conventional understanding. We also prove that for a rectangular domain, the maximum of the norm of the gradient of the torsion function occurs exactly at the centers of the faces of largest $(n-1)$-volume.

Submitted 4 November, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

arXiv:2307.13310 [pdf, other]

doi 10.1109/TCSVT.2023.3299087

CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer

Authors: Zhiwen Shao, Yuchen Su, Yong Zhou, Fanrong Meng, Hancheng Zhu, Bing Liu, Rui Yao

Abstract: Contour-based scene text detection methods have rapidly developed recently, but still suffer from inaccurate frontend contour initialization, multi-stage error accumulation, or deficient local information aggregation. To tackle these limitations, we propose a novel arbitrary-shaped scene text detection framework named CT-Net by progressive contour regression with contour transformers. Specifically, we first employ a contour initialization module that generates coarse text contours without any post-processing. Then, we adopt contour refinement modules to adaptively refine text contours in an iterative manner, which are beneficial for context information capturing and progressive global contour deformation. Besides, we propose an adaptive training strategy to enable the contour transformers to learn more potential deformation paths, and introduce a re-score mechanism that can effectively suppress false positives. Extensive experiments are conducted on four challenging datasets, which demonstrate the accuracy and efficiency of our CT-Net over state-of-the-art methods. Particularly, CT-Net achieves F-measure of 86.1 at 11.2 frames per second (FPS) and F-measure of 87.8 at 10.1 FPS for CTW1500 and Total-Text datasets, respectively.

Submitted 25 July, 2023; originally announced July 2023.

Comments: This paper has been accepted by IEEE Transactions on Circuits and Systems for Video Technology

arXiv:2306.08854 [pdf, other]

A Gromov--Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening

Authors: Yifan Chen, Rentian Yao, Yun Yang, Jie Chen

Abstract: Graph coarsening is a technique for solving large-scale graph problems by working on a smaller version of the original graph, and possibly interpolating the results back to the original graph. It has a long history in scientific computing and has recently gained popularity in machine learning, particularly in methods that preserve the graph spectrum. This work studies graph coarsening from a different perspective, developing a theory for preserving graph distances and proposing a method to achieve this. The geometric approach is useful when working with a collection of graphs, such as in graph classification and regression. In this study, we consider a graph as an element on a metric space equipped with the Gromov--Wasserstein (GW) distance, and bound the difference between the distance of two graphs and their coarsened versions. Minimizing this difference can be done using the popular weighted kernel $K$-means method, which improves existing spectrum-preserving methods with the proper choice of the kernel. The study includes a set of experiments to support the theory and method, including approximating the GW distance, preserving the graph spectrum, classifying graphs using spectral information, and performing regression using graph convolutional networks. Code is available at https://github.com/ychen-stat-ml/GW-Graph-Coarsening.

Submitted 15 June, 2023; originally announced June 2023.

Comments: To appear at ICML 2023. Code is available at https://github.com/ychen-stat-ml/GW-Graph-Coarsening

arXiv:2306.06624 [pdf, other]

RestGPT: Connecting Large Language Models with Real-World RESTful APIs

Authors: Yifan Song, Weimin Xiong, Dawei Zhu, Wenhao Wu, Han Qian, Mingbo Song, Hailiang Huang, Cheng Li, Ke Wang, Rong Yao, Ye Tian, Sujian Li

Abstract: Tool-augmented large language models (LLMs) have achieved remarkable progress in tackling a broad range of tasks. However, existing methods are mainly restricted to specifically designed tools and fail to fulfill complex instructions, having great limitations when confronted with real-world scenarios. In this paper, we explore a more realistic scenario by connecting LLMs with RESTful APIs, which adhere to the widely adopted REST software architectural style for web service development. To address the practical challenges of tackling complex instructions, we propose RestGPT, which exploits the power of LLMs and conducts a coarse-to-fine online planning mechanism to enhance the abilities of task decomposition and API selection. RestGPT also contains an API executor tailored for calling RESTful APIs, which can meticulously formulate parameters and parse API responses. To fully evaluate the performance of RestGPT, we propose RestBench, a high-quality benchmark which consists of two real-world scenarios and human-annotated instructions with gold solution paths. Experiments show that RestGPT is able to achieve impressive results in complex tasks and has strong robustness, which paves a new way towards AGI. RestGPT and RestBench are publicly available at https://restgpt.github.io/.

Submitted 26 August, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

Comments: Add RestBench to evaluate RestGPT

arXiv:2306.00127 [pdf, other]

Surrogate Model Extension (SME): A Fast and Accurate Weight Update Attack on Federated Learning

Authors: Junyi Zhu, Ruicong Yao, Matthew B. Blaschko

Abstract: In Federated Learning (FL) and many other distributed training frameworks, collaborators can hold their private data locally and only share the network weights trained with the local data after multiple iterations. Gradient inversion is a family of privacy attacks that recovers data from its generated gradients. Seemingly, FL can provide a degree of protection against gradient inversion attacks on weight updates, since the gradient of a single step is concealed by the accumulation of gradients over multiple local iterations. In this work, we propose a principled way to extend gradient inversion attacks to weight updates in FL, thereby better exposing weaknesses in the presumed privacy protection inherent in FL. In particular, we propose a surrogate model method based on the characteristic of two-dimensional gradient flow and low-rank property of local updates. Our method largely boosts the ability of gradient inversion attacks on weight updates containing many iterations and achieves state-of-the-art (SOTA) performance. Additionally, our method runs up to $100\times$ faster than the SOTA baseline in the common FL scenario. Our work re-evaluates and highlights the privacy risk of sharing network weights. Our code is available at https://github.com/JunyiZhu-AI/surrogate_model_extension.

Submitted 31 May, 2023; originally announced June 2023.

Comments: Accepted at ICML 2023

arXiv:2305.15583 [pdf, other]

Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps

Authors: Mingxiao Li, Tingyu Qu, Ruicong Yao, Wei Sun, Marie-Francine Moens

Abstract: Diffusion Probabilistic Models (DPM) have shown remarkable efficacy in the synthesis of high-quality images. However, their inference process characteristically requires numerous, potentially hundreds, of iterative steps, which could exaggerate the problem of exposure bias due to the training and inference discrepancy. Previous work has attempted to mitigate this issue by perturbing inputs during training, which consequently mandates the retraining of the DPM. In this work, we conduct a systematic study of exposure bias in DPM and, intriguingly, we find that the exposure bias could be alleviated with a novel sampling method that we propose, without retraining the model. We empirically and theoretically show that, during inference, for each backward time step $t$ and corresponding state $\hat{x}_t$, there might exist another time step $t_s$ which exhibits superior coupling with $\hat{x}_t$. Based on this finding, we introduce a sampling method named Time-Shift Sampler. Our framework can be seamlessly integrated into existing sampling algorithms, such as DDPM, DDIM and other high-order solvers, inducing merely minimal additional computations. Experimental results show our method brings significant and consistent improvements in FID scores on different datasets and sampling methods. For example, integrating Time-Shift Sampler into F-PNDM yields a FID=3.88, achieving 44.49% improvements as compared to F-PNDM, on CIFAR-10 with 10 sampling steps, which is more performant than the vanilla DDIM with 100 sampling steps.
Our code is available at https://github.com/Mingxiao-Li/TS-DPM.

Submitted 16 June, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: Accepted at International Conference on Learning Representations (ICLR2024); typo correction

arXiv:2304.10468 [pdf, other]

Observation of chiral edge transport in a rapidly-rotating quantum gas

Authors: Ruixiao Yao, Sungjae Chi, Biswaroop Mukherjee, Airlia Shaffer, Martin Zwierlein, Richard J. Fletcher

Abstract: The frictionless, directional propagation of particles at the boundary of topological materials is one of the most striking phenomena in transport. These chiral edge modes lie at the heart of the integer and fractional quantum Hall effects, and their extraordinary robustness against noise and disorder reflects the quantization of Hall conductivity in these systems. Despite their central importance, controllable injection of edge modes, and direct imaging of their propagation, structure, and dynamics, is challenging. Here, we demonstrate the distillation of individual chiral edge states in a rapidly-rotating bosonic superfluid confined by an optical boundary. Tuning the wall sharpness, we reveal the smooth crossover between soft wall behaviour in which the propagation speed is proportional to wall steepness, and the hard wall regime exhibiting chiral free particles. From the skipping motion of atoms along the boundary, we spectroscopically infer the energy gap between the ground and first excited edge bands, and reveal its evolution from the bulk Landau level splitting for a soft boundary, to the hard wall limit.

Submitted 1 May, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

Comments: 9 pages, 5+2 figures, v3 added a new figure

arXiv:2304.07569 [pdf]

Optical dielectric huygens metagrating performing near unity anomalous refraction at TM mode with extremely simple design

Authors: Rui Yao

Abstract: We numerically demonstrate a highly efficient optical Huygens metagrating with an unprecedentedly simple structure (only one meta-atom per period), designed via an aggressively discretized method that originated from discretized metasurface design, performing nearly lossless anomalous refraction under TM-polarized incident light. A 2D full-wave Floquet simulation shows that the proposed metagrating anomalously transmits incident light of 800 nm wavelength with up to 94% power efficiency, where the specular transmission is substantially suppressed. Our findings can also help demarcate the boundary between metagrating and metasurface.

Submitted 15 April, 2023; originally announced April 2023.

arXiv:2303.16595 [pdf, other]

doi 10.1016/j.trb.2023.05.012

A general equilibrium model for multi-passenger ridesharing systems with stable matching

Authors: Rui Yao, Shlomo Bekhor

Abstract: This paper proposes a general equilibrium model for multi-passenger ridesharing systems, in which interactions between ridesharing drivers, passengers, platforms, and transportation networks are endogenously captured. Stable matching is modeled as an equilibrium problem in which no ridesharing driver or passenger can reduce ridesharing disutility by unilaterally switching to another matching sequence. This paper is one of the first studies that explicitly integrates the ridesharing platform multi-passenger matching problem into the model. By integrating matching sequence with hyper-network, ridesharing-passenger transfers are avoided in a multi-passenger ridesharing system. Moreover, the matching stability between the ridesharing drivers and passengers is extended to address the multi-OD multi-passenger case in terms of matching sequence. The paper provides a proof for the existence of the proposed general equilibrium. A sequence-bush algorithm is developed for solving the multi-passenger ridesharing equilibrium problem. This algorithm is capable of handling complex ridesharing constraints implicitly. Results illustrate that the proposed sequence-bush algorithm outperforms a general-purpose solver, and provides insights into the equilibrium of the joint stable matching and route choice problem. Numerical experiments indicate that ridesharing trips are typically longer than average trip lengths.
Sensitivity analysis suggests that a properly designed ridesharing unit price is necessary to achieve network benefits, and travelers with relatively lower values of time are more likely to participate in ridesharing.

Submitted 5 December, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

Journal ref: Transportation Research Part B: Methodological, 175, 102775 (2023)

arXiv:2303.16526 [pdf, other]

doi 10.1109/ICME55011.2023.00346

HybridPoint: Point Cloud Registration Based on Hybrid Point Sampling and Matching

Authors: Yiheng Li, Canhui Tang, Runzhao Yao, Aixue Ye, Feng Wen, Shaoyi Du

Abstract: Patch-to-point matching has become a robust way of point cloud registration. However, previous patch-matching methods employ superpoints with poor localization precision as nodes, which may lead to ambiguous patch partitions. In this paper, we propose a HybridPoint-based network to find more robust and accurate correspondences. Firstly, we propose to use salient points with prominent local features as nodes to increase patch repeatability, and introduce some uniformly distributed points to complete the point cloud, thus constituting hybrid points. Hybrid points not only have better localization precision but also give a complete picture of the whole point cloud. Furthermore, based on the characteristic of hybrid points, we propose a dual-classes patch matching module, which leverages the matching results of salient points and filters the matching noise of non-salient points. Experiments show that our model achieves state-of-the-art performance on 3DMatch, 3DLoMatch, and KITTI odometry, especially with 93.0% Registration Recall on the 3DMatch dataset. Our code and models are available at https://github.com/liyih/HybridPoint.

Submitted 23 April, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

Comments: Accepted by IEEE International Conference on Multimedia and Expo (ICME), 2023

arXiv:2302.09342 [pdf]

Semi-Analytical Electromagnetic Transient Simulation Using Differential Transformation

Authors: Min Xiong, Rui Yao, Yang Liu, Kai Sun, Feng Qiu

Abstract: For electromagnetic transient (EMT) simulation of a power system, a state-space-based approach needs to solve state-space EMT equations by using numerical integration methods, e.g., the Euler method, Runge-Kutta methods, and trapezoidal-rule method, at small time steps. The simulation can be slow on a power system having multiple generators. To speed up state-space-based EMT simulations, this paper proposes a Differential Transformation based semi-analytical method that repeatedly utilizes a high-order semi-analytical solution of the EMT equations at longer time steps. The proposed semi-analytical method is tested on the detailed EMT model of a four-generator two-area system. Simulation results show the significant potential of the proposed method to accelerate EMT simulations of power systems compared with traditional numerical methods.

Submitted 18 February, 2023; originally announced February 2023.

Journal ref: Presented at the 4th International Conference on Smart Power & Internet Energy Systems, Beijing, China, December 2022

arXiv:2302.03931 [pdf, other]

doi 10.1007/s10994-024-06590-3

Fast Linear Model Trees by PILOT

Authors: Jakob Raymaekers, Peter J. Rousseeuw, Tim Verdonck, Ruicong Yao

Abstract: Linear model trees are regression trees that incorporate linear models in the leaf nodes. This preserves the intuitive interpretation of decision trees and at the same time enables them to better capture linear relationships, which is hard for standard decision trees. But most existing methods for fitting linear model trees are time consuming and therefore not scalable to large data sets. In addition, they are more prone to overfitting and extrapolation issues than standard regression trees. In this paper we introduce PILOT, a new algorithm for linear model trees that is fast, regularized, stable and interpretable. PILOT trains in a greedy fashion like classic regression trees, but incorporates an $L^2$ boosting approach and a model selection rule for fitting linear models in the nodes. The abbreviation PILOT stands for $PI$ecewise $L$inear $O$rganic $T$ree, where `organic' refers to the fact that no pruning is carried out. PILOT has the same low time and space complexity as CART without its pruning. An empirical study indicates that PILOT tends to outperform standard decision trees and other linear model trees on a variety of data sets. Moreover, we prove its consistency in an additive model setting under weak assumptions. When the data is generated by a linear model, the convergence rate is polynomial.

Submitted 8 February, 2023; originally announced February 2023.

Journal ref: Machine Learning, 2024

arXiv:2301.01429 [pdf, other]

doi 10.1088/1674-1056/aca7ed

Atlas of dynamic spectra of fast radio burst FRB 20201124A

Authors: Bo-Jun Wang, Heng Xu, Jin-Chen Jiang, Jiang-Wei Xu, Jia-Rui Niu, Ping Chen, Ke-Jia Lee, Bing Zhang, Wei-Wei Zhu, Su-Bo Dong, Chun-Feng Zhang, Hai Fu, De-Jiang Zhou, Yong-Kun Zhang, Pei Wang, Yi Feng, Ye Li, Dong-Zi Li, Wen-Bin Lu, Yuan-Pei Yang, R. N. Caballero, Ce Cai, Mao-Zheng Chen, Zi-Gao Dai, A. Esamdin , et al. (42 additional authors not shown)

Abstract: Fast radio bursts (FRBs) are highly dispersed millisecond-duration radio bursts, of which the physical origin is still not fully understood. FRB 20201124A is one of the most actively repeating FRBs. In this paper, we present the collection of 1863 burst dynamic spectra of FRB 20201124A measured with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The current collection, taken from the observation during the FRB active phase from April to June 2021, is the largest burst sample detected in any FRB so far. The standard PSRFITS format is adopted, including the dynamic spectra of the bursts and the time information of the dynamic spectra; in addition, mask files that help readers identify the pulse positions are also provided.

Submitted 3 January, 2023; originally announced January 2023.

arXiv:2211.15110 [pdf, ps, other]

doi 10.1007/s00208-024-02873-1

On Laplacian eigenvalue equation with constant Neumann boundary data

Authors: Yong Huang, Qinfeng Li, Qiuqi Li, Ruofei Yao

Abstract: Let $Ω$ be a bounded Lipschitz domain in $\mathbb{R}^n$ and we study boundary behaviors of solutions to the Laplacian eigenvalue equation with constant Neumann data. \begin{align} \label{cequation0} \begin{cases} -Δu=cu\quad &\mbox{in $Ω$}\\ \frac{\partial u}{\partial ν}=-1\quad &\mbox{on $\partial Ω$}. \end{cases} \end{align}First, by using properties of Bessel functions and proving new inequalities on elementary symmetric polynomials, we obtain the following inequality for rectangular boxes, balls and equilateral triangles: \begin{align} \label{bbb} \lim_{c\rightarrow μ_2^-}c\int_{\partial Ω}u_c\, dσ\ge \frac{n-1}{n}\frac{P^2(Ω)}{|Ω|}, \end{align}with equality achieved only at cubes and balls. In the above, $u_c$ is the solution to the eigenvalue equation and $μ_2$ is the second Neumann Laplacian eigenvalue. Second, let $κ_1$ be the best constant for the Poincaré inequality with mean zero on $\partial Ω$, and we prove that $κ_1\le μ_2$, with equality holding if and only if $\int_{\partial Ω}u_c\, dσ>0$ for any $c\in (0,μ_2)$. As a consequence, $κ_1=μ_2$ on balls, rectangular boxes and equilateral triangles, and balls maximize $κ_1$ over all Lipschitz domains with fixed volume. As an application, we extend the symmetry breaking results from ball domains obtained in Bucur-Buttazzo-Nitsch [J. Math. Pures Appl., 2017] to a wider class of domains, and give quantitative estimates for the precise breaking threshold at balls and rectangular boxes. It is a direct consequence that for domains with $κ_1<μ_2$, the above boundary limit inequality is never true, while whether it is valid for domains on which $κ_1=μ_2$ remains open.

Submitted 28 April, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

Comments: A revised version compared to the previous one

arXiv:2211.12794 [pdf, ps, other]

Zero Forcing Uplink Detection through Large-Scale RIS: System Performance and Phase Shift Design

Authors: Nikolaos I. Miridakis, Theodoros A. Tsiftsis, Rugui Yao

Abstract: A multiple-input multiple-output wireless communication system is analytically studied, which operates with the aid of a large-scale reconfigurable intelligent surface (LRIS). LRIS is equipped with multiple passive elements with discrete phase adjustment capabilities, and independent Rician fading conditions are assumed for both the transmitter-to-LRIS and LRIS-to-receiver links. A direct transceiver link is also considered, which is modeled by a Rayleigh fading distribution. The system performance is analytically studied when the linear yet efficient zero-forcing detection is implemented at the receiver. In particular, the outage performance is derived in closed-form expression for different system configuration setups with regard to the available channel state information (CSI) at the receiver. In fact, the case of both perfect and imperfect CSI is analyzed. Also, an efficient phase shift design approach at LRIS is introduced, which is linear in the number of passive elements and receive antennas. The proposed phase shift design can be applied in two different modes of operation; namely, when the system strives to adapt to either the instantaneous or the statistical CSI. Finally, some impactful engineering insights are provided, such as how the channel fading conditions, CSI, discrete phase shift resolution, and volume of antenna/LRIS element arrays impact the overall system performance.

Submitted 23 November, 2022; originally announced November 2022.

Comments: Accepted for publication to IEEE Transactions on Communications

arXiv:2211.00168 [pdf, other]

Improving Fairness in Image Classification via Sketching

Authors: Ruichen Yao, Ziteng Cui, Xiaoxiao Li, Lin Gu

Abstract: Fairness is a fundamental requirement for trustworthy and human-centered Artificial Intelligence (AI) systems. However, deep neural networks (DNNs) tend to make unfair predictions when the training data are collected from different sub-populations with different attributes (e.g., color, sex, age), leading to biased DNN predictions. We notice that such a troubling phenomenon is often caused by the data itself, which means that bias information is encoded into the DNN along with the useful information (e.g., class information, semantic information). Therefore, we propose to use sketching to handle this phenomenon. Without losing the utility of data, we explore the image-to-sketching methods that can maintain useful semantic information for the target classification while filtering out the useless bias information. In addition, we design a fair loss to further improve the model fairness. We evaluate our method through extensive experiments on both a general scene dataset and a medical scene dataset. Our results show that the desired image-to-sketching method improves model fairness and achieves satisfactory results among state-of-the-art methods.

Submitted 31 October, 2022; originally announced November 2022.

Comments: 8 pages, 2 figures. To appear in 2022 Trustworthy and Socially Responsible Machine Learning (TSRML 2022) co-located with NeurIPS 2022

arXiv:2210.03298 [pdf, other]

Simulation of Transients in Natural Gas Networks via A Semi-analytical Solution Approach

Authors: Xin Xu, Rui Yao, Kai Sun, Feng Qiu

Abstract: Simulation and control of the transient flow in natural gas networks involve solving partial differential equations (PDEs). This paper proposes a semi-analytical solution (SAS) approach for fast and accurate simulation of the natural gas transients. The region of interest is divided into a grid, and an SAS is derived for each grid cell in the form of multivariate polynomials, of which the coefficients are identified according to the initial value and boundary value conditions. The solutions are obtained in a ``time-stepping'' manner; that is, within one time step, the coefficients of the SAS are identified and the initial value of the next time step is evaluated. This approach achieves a much larger grid cell than the widely used finite difference method, and thus enhances the computational efficiency significantly. To further reduce the computation burden, the nonlinear terms in the model are simplified, which induces another SAS scheme that can greatly reduce the time consumption and have minor impact on accuracy. The simulation results on a single pipeline case and a 6-node network case validate the advantages of the proposed SAS approach in accuracy and computational efficiency.

Submitted 6 October, 2022; originally announced October 2022.

arXiv:2209.15459 [pdf, other]

doi 10.1103/PhysRevA.106.062617

Experimental realization of a 218-ion multi-qubit quantum memory

Authors: R. Yao, W. -Q. Lian, Y. -K. Wu, G. -X. Wang, B. -W. Li, Q. -X. Mei, B. -X. Qi, L. Yao, Z. -C. Zhou, L. He, L. -M. Duan

Abstract: Storage lifetime and capacity are two important factors to characterize the performance of a quantum memory. Here we report the stable trapping of more than 200 ions in a cryogenic setup, and demonstrate the combination of the multi-qubit capacity and long storage lifetime by measuring the coherence time of randomly chosen ions to be on the order of hundreds of milliseconds. We apply composite microwave pulses to manipulate qubit states globally for efficient characterization of different storage units simultaneously, and we compare the performance of the quantum memory with and without the sympathetic cooling laser, thus unambiguously showing the necessity of sympathetic cooling for the long-time storage of multiple ionic qubits.

Submitted 30 September, 2022; originally announced September 2022.

arXiv:2209.14214 [pdf, other]

doi 10.1103/PhysRevD.107.043031

Optical circular polarization induced by axionlike particles in blazars

Authors: Run-Min Yao, Xiao-Jun Bi, Jin-Wei Wang, Peng-Fei Yin

Abstract: We propose that the interaction between the axionlike particles (ALPs) and photons can be a possible origin of optical circular polarization (CP) in blazars. Given that there is no definite detection of optical CP at $\sim0.1\%$ level, a rough limit on ALP-photon coupling can be obtained, specifically $g_{aγ}\cdot B_\mathrm{T0}\lesssim7.9\times10^{-12}~\mathrm{G\cdot GeV}^{-1}$ for $m_{a}\lesssim 10^{-13}~\mathrm{eV}$, depending on the magnetic field configuration of the blazar jet. Obviously, for the blazar models with a larger magnetic field strength, such as hadronic radiation models, this constraint could be more stringent. We also perform a dedicated analysis of the tentative observations of optical CP in two blazars, namely 3C 66A and OJ 287, and we find that these observations could be explained by the ALP-photon mixing with $g_{aγ} \sim 10^{-11}~\mathrm{GeV}^{-1}$. As an outlook, our analysis can be improved by further research on the radiation models of blazars and high-precision joint measurements of optical CP and linear polarization.

Submitted 7 March, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

Comments: 15 pages, 9 figures, 4 tables

arXiv:2208.12042 [pdf, other]

Efficient Truncated Linear Regression with Unknown Noise Variance

Authors: Constantinos Daskalakis, Patroklos Stefanou, Rui Yao, Manolis Zampetakis

Abstract: Truncated linear regression is a classical challenge in Statistics, wherein a label, $y = w^T x + \varepsilon$, and its corresponding feature vector, $x \in \mathbb{R}^k$, are only observed if the label falls in some subset $S \subseteq \mathbb{R}$; otherwise the existence of the pair $(x, y)$ is hidden from observation. Linear regression with truncated observations has remained a challenge, in its general form, since the early works of~\citet{tobin1958estimation,amemiya1973regression}. When the distribution of the error is normal with known variance, recent work of~\citet{daskalakis2019truncatedregression} provides computationally and statistically efficient estimators of the linear model, $w$. In this paper, we provide the first computationally and statistically efficient estimators for truncated linear regression when the noise variance is unknown, estimating both the linear model and the variance of the noise. Our estimator is based on an efficient implementation of Projected Stochastic Gradient Descent on the negative log-likelihood of the truncated sample. Importantly, we show that the error of our estimates is asymptotically normal, and we use this to provide explicit confidence regions for our estimates.

Submitted 25 August, 2022; originally announced August 2022.

arXiv:2208.03589 [pdf, other]

D-optimal Data Fusion: Exact and Approximation Algorithms

Authors: Yongchun Li, Marcia Fampa, Jon Lee, Feng Qiu, Weijun Xie, Rui Yao

Abstract: We study the D-optimal Data Fusion (DDF) problem, which aims to select new data points, given an existing Fisher information matrix, so as to maximize the logarithm of the determinant of the overall Fisher information matrix. We show that the DDF problem is NP-hard and has no constant-factor polynomial-time approximation algorithm unless P $=$ NP. Therefore, to solve the DDF problem effectively, we propose two convex integer-programming formulations and investigate their corresponding complementary and Lagrangian-dual problems. We also develop scalable randomized-sampling and local-search algorithms with provable performance guarantees. Leveraging the concavity of the objective functions in the two proposed formulations, we design an exact algorithm, aimed at solving the DDF problem to optimality. We further derive a family of submodular valid inequalities and optimality cuts, which can significantly enhance the algorithm performance. Finally, we test our algorithms using real-world data on the new phasor-measurement-units placement problem for modern power grids, considering the existing conventional sensors. Our numerical study demonstrates the efficiency of our exact algorithm and the scalability and high-quality outputs of our approximation algorithms.

Submitted 6 August, 2022; originally announced August 2022.

arXiv:2208.03060 [pdf, other]

Probing critical behavior of long-range transverse-field Ising model through quantum Kibble-Zurek mechanism

Authors: B. -W. Li, Y. -K. Wu, Q. -X. Mei, R. Yao, W. -Q. Lian, M. -L. Cai, Y. Wang, B. -X. Qi, L. Yao, L. He, Z. -C. Zhou, L. -M. Duan

Abstract: The trapped ion quantum simulator has demonstrated qualitative properties of different physical models for up to tens of ions. In particular, a linear ion chain naturally hosts long-range Ising interactions under the laser driving, which has been used for various phenomena such as quantum phase transition, localization, thermalization and information propagation. For near-term practical usage, a central task is to find more quantitative applications of the noisy quantum simulators that are robust to small errors in the parameters. Here we report the quantum simulation of a long-range transverse-field Ising model using up to 61 ions and probe the critical behavior of its quantum phase transition through the Kibble-Zurek mechanism. By calibrating and verifying the coupling coefficients, we realize the same model for increasing ion numbers, so as to extract a critical exponent free of the finite size effect. For ferromagnetic interaction, our experimental result agrees well with the previous numerical predictions. As for the anti-ferromagnetic case, signals are too weak to fit a critical exponent due to the frustration in the interaction, but are still consistent with the theory.

Submitted 30 December, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

arXiv:2207.13250 [pdf, other]

Spatio-Temporal Wildfire Prediction using Multi-Modal Data

Authors: Chen Xu, Yao Xie, Daniel A. Zuniga Vazquez, Rui Yao, Feng Qiu

Abstract: Due to severe societal and environmental impacts, wildfire prediction using multi-modal sensing data has become a highly sought-after data-analytical tool by various stakeholders (such as state governments and power utility companies) to achieve a more informed understanding of wildfire activities and plan preventive measures. A desirable algorithm should precisely predict fire risk and magnitude for a location in real time. In this paper, we develop a flexible spatio-temporal wildfire prediction framework using multi-modal time series data. We first predict the wildfire risk (the chance of a wildfire event) in real-time, considering the historical events using discrete mutually exciting point process models. Then we further develop a wildfire magnitude prediction set method based on the flexible distribution-free time-series conformal prediction (CP) approach. Theoretically, we prove a risk model parameter recovery guarantee, as well as coverage and set size guarantees for the CP sets. Through extensive real-data experiments with wildfire data in California, we demonstrate the effectiveness of our methods, as well as their flexibility and scalability in large regions.

Submitted 10 October, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

arXiv:2207.08074 [pdf, other]

Mean-field Variational Inference via Wasserstein Gradient Flow

Authors: Rentian Yao, Yun Yang

Abstract: Variational inference, such as the mean-field (MF) approximation, requires certain conjugacy structures for efficient computation. These can impose unnecessary restrictions on the viable prior distribution family and further constraints on the variational approximation family. In this work, we introduce a general computational framework to implement MF variational inference for Bayesian models, with or without latent variables, using the Wasserstein gradient flow (WGF), a modern mathematical technique for realizing a gradient flow over the space of probability measures. Theoretically, we analyze the algorithmic convergence of the proposed approaches, providing an explicit expression for the contraction factor. We also strengthen existing results on MF variational posterior concentration from a polynomial to an exponential contraction, by utilizing the fixed point equation of the time-discretized WGF. Computationally, we propose a new constraint-free function approximation method using neural networks to numerically realize our algorithm. This method is shown to be more precise and efficient than traditional particle approximation methods based on Langevin dynamics.

Submitted 8 September, 2023; v1 submitted 17 July, 2022; originally announced July 2022.

arXiv:2206.13381 [pdf, other]

doi 10.1109/TMM.2022.3186431

TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask

Authors: Yuchen Su, Zhiwen Shao, Yong Zhou, Fanrong Meng, Hancheng Zhu, Bing Liu, Rui Yao

Abstract: Arbitrary-shaped scene text detection is a challenging task due to the variety of text changes in font, size, color, and orientation. Most existing regression-based methods resort to regressing the masks or contour points of text regions to model the text instances. However, regressing the complete masks requires high training complexity, and contour points are not sufficient to capture the details of highly curved texts. To tackle the above limitations, we propose a novel light-weight anchor-free text detection framework called TextDCT, which adopts the discrete cosine transform (DCT) to encode the text masks as compact vectors. Further, considering the imbalanced number of training samples among pyramid layers, we only employ a single-level head for top-down prediction. To model the multi-scale texts in a single-level head, we introduce a novel positive sampling strategy by treating the shrunk text region as positive samples, and design a feature awareness module (FAM) for spatial-awareness and scale-awareness by fusing rich contextual information and focusing on more significant features. Moreover, we propose a segmented non-maximum suppression (S-NMS) method that can filter low-quality mask regressions. Extensive experiments are conducted on four challenging datasets, which demonstrate that our TextDCT obtains competitive performance on both accuracy and efficiency. Specifically, TextDCT achieves an F-measure of 85.1 at 17.2 frames per second (FPS) and an F-measure of 84.9 at 15.1 FPS for the CTW1500 and Total-Text datasets, respectively.

Submitted 27 June, 2022; originally announced June 2022.

Comments: This paper has been accepted by IEEE Transactions on Multimedia

arXiv:2205.07937 [pdf, ps, other]

Mean-Field Nonparametric Estimation of Interacting Particle Systems

Authors: Rentian Yao, Xiaohui Chen, Yun Yang

Abstract: This paper concerns the nonparametric estimation problem of the distribution-state dependent drift vector field in an interacting $N$-particle system. Observing single-trajectory data for each particle, we derive the mean-field rate of convergence for the maximum likelihood estimator (MLE), which depends on both Gaussian complexity and Rademacher complexity of the function class. In particular, when the function class contains $α$-smooth Hölder functions, our rate of convergence is minimax optimal on the order of $N^{-\frac{α}{d+2α}}$. Combining with a Fourier analytical deconvolution argument, we derive the consistency of MLE for the external force and interaction kernel in the McKean-Vlasov equation.

Submitted 26 June, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

Showing 1–50 of 125 results for author: Yao, R