subscribe to arXiv mailings

arXiv:2407.08731 [pdf, other]

Massive-ish Particles from Small-ish Scales: Non-Perturbative Techniques for Cosmological Collider Physics from Large-Scale Structure Surveys

Authors: Samuel Goldstein, Oliver H. E. Philcox, J. Colin Hill, Lam Hui

Abstract: Massive particles produced during inflation impact soft limits of primordial correlators. Searching for these signatures presents an exciting opportunity to uncover the particle spectrum in the inflationary epoch. We present non-perturbative methods to constrain intermediate-mass scalars ($0\leq m/H<3/2$, where $H$ is the inflationary Hubble scale) produced during inflation, which give rise to a p… ▽ More Massive particles produced during inflation impact soft limits of primordial correlators. Searching for these signatures presents an exciting opportunity to uncover the particle spectrum in the inflationary epoch. We present non-perturbative methods to constrain intermediate-mass scalars ($0\leq m/H<3/2$, where $H$ is the inflationary Hubble scale) produced during inflation, which give rise to a power-law scaling in the squeezed primordial bispectrum. Exploiting the large-scale structure consistency relations and the separate universe approach, we derive models for the late-time squeezed matter bispectrum and collapsed matter trispectrum sourced by these fields. To validate our models, we run $N$-body simulations with the "Cosmological Collider" squeezed bispectrum for two different particle masses. Our models yield unbiased constraints on the amplitude of non-Gaussianity, $f_{\rm NL}^Δ$, from the squeezed bispectrum and collapsed trispectrum deep into the non-linear regime ($k_{\rm max}\approx 2~h/{\rm Mpc}$ at $z=0$). We assess the information content of these summary statistics, emphasizing the importance of sample variance cancellation in the matter sector. We also study the scale-dependent halo bias in our simulations. For mass-selected halos, the non-Gaussian bias estimated from our simulations agrees with predictions based on (i) separate universe simulations and (ii) universal mass functions. With further work, these results can be used to search for inflationary massive particle production with upcoming galaxy surveys. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 25 pages, 11 figures; comments welcome

arXiv:2407.07955 [pdf, other]

Fragmention in Gravitationally-Unstable Collapsar Disks and Sub-Solar Neutron Star Mergers

Authors: Brian D. Metzger, Lam Hui, Matteo Cantiello

Abstract: Although stable neutron stars (NS) can in principle exist down to masses Mns ~ 0.1Msun, standard models of stellar core-collapse predict a robust lower limit Mns >~ 1.2Msun, roughly commensurate with the Chandrasekhar mass Mch of the progenitor's iron core (electron fraction Ye ~ 0.5). However, this limit may be circumvented in sufficiently dense neutron-rich environments (Ye << 0.5) for which Mch… ▽ More Although stable neutron stars (NS) can in principle exist down to masses Mns ~ 0.1Msun, standard models of stellar core-collapse predict a robust lower limit Mns >~ 1.2Msun, roughly commensurate with the Chandrasekhar mass Mch of the progenitor's iron core (electron fraction Ye ~ 0.5). However, this limit may be circumvented in sufficiently dense neutron-rich environments (Ye << 0.5) for which Mch ~ Ye^2 is reduced to < Msun. Such physical conditions could arise in the black hole accretion disks formed from the collapse of rapidly-rotating stars (``collapsars''), as a result of gravitational instabilities and cooling-induced fragmentation, similar to models for planet formation in protostellar disks. We confirm that the conditions to form sub-solar mass NS (ssNS) may be marginally satisfied in the outer regions of massive neutrino-cooled collapsar disks. If the disk fragments into multiple ssNS, their subsequent coalescence offers a channel for precipitating sub-solar mass LIGO/Virgo gravitational-wave mergers that does not implicate primordial black holes. The model makes several additional predictions: (1) ~Hz frequency Doppler modulation of the ssNS-merger gravitational wave signals due to the binary's orbital motion in the disk; (2) at least one additional gravitational wave event (coincident within <~ hours), from the coalescence of the ssNS-merger remnant(s) with the central black hole; (3) an associated gamma-ray burst and supernova counterpart, the latter boosted in energy and enriched with r-process elements from the NS merger(s) embedded within the exploding stellar envelope (``kilonovae inside a supernova''). △ Less

Submitted 10 July, 2024; originally announced July 2024.

Comments: 9 pages, 2 figures

arXiv:2405.04692 [pdf]

doi 10.18374/JABE-24-1.11

Enhancing Organizational Performance: Harnessing AI and NLP for User Feedback Analysis in Product Development

Authors: Tian Tian, Liu Ze hui, Huang Zichen, Yubing Tang

Abstract: This paper explores the application of AI and NLP techniques for user feedback analysis in the context of heavy machine crane products. By leveraging AI and NLP, organizations can gain insights into customer perceptions, improve product development, enhance satisfaction and loyalty, inform decision-making, and gain a competitive advantage. The paper highlights the impact of user feedback analysis… ▽ More This paper explores the application of AI and NLP techniques for user feedback analysis in the context of heavy machine crane products. By leveraging AI and NLP, organizations can gain insights into customer perceptions, improve product development, enhance satisfaction and loyalty, inform decision-making, and gain a competitive advantage. The paper highlights the impact of user feedback analysis on organizational performance and emphasizes the reasons for using AI and NLP, including scalability, objectivity, improved accuracy, increased insights, and time savings. The methodology involves data collection, cleaning, text and rating analysis, interpretation, and feedback implementation. Results include sentiment analysis, word cloud visualizations, and radar charts comparing product attributes. These findings provide valuable information for understanding customer sentiment, identifying improvement areas, and making data-driven decisions to enhance the customer experience. In conclusion, promising AI and NLP techniques in user feedback analysis offer organizations a powerful tool to understand customers, improve product development, increase satisfaction, and drive business success △ Less

Submitted 7 May, 2024; originally announced May 2024.

Journal ref: Journal of Academy of Business and Economics 2024/3

arXiv:2404.06270 [pdf, other]

3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis

Authors: Zhicheng Lu, Xiang Guo, Le Hui, Tianrui Chen, Min Yang, Xiao Tang, Feng Zhu, Yuchao Dai

Abstract: In this paper, we propose a 3D geometry-aware deformable Gaussian Splatting method for dynamic view synthesis. Existing neural radiance fields (NeRF) based solutions learn the deformation in an implicit manner, which cannot incorporate 3D scene geometry. Therefore, the learned deformation is not necessarily geometrically coherent, which results in unsatisfactory dynamic view synthesis and 3D dynam… ▽ More In this paper, we propose a 3D geometry-aware deformable Gaussian Splatting method for dynamic view synthesis. Existing neural radiance fields (NeRF) based solutions learn the deformation in an implicit manner, which cannot incorporate 3D scene geometry. Therefore, the learned deformation is not necessarily geometrically coherent, which results in unsatisfactory dynamic view synthesis and 3D dynamic reconstruction. Recently, 3D Gaussian Splatting provides a new representation of the 3D scene, building upon which the 3D geometry could be exploited in learning the complex 3D deformation. Specifically, the scenes are represented as a collection of 3D Gaussian, where each 3D Gaussian is optimized to move and rotate over time to model the deformation. To enforce the 3D scene geometry constraint during deformation, we explicitly extract 3D geometry features and integrate them in learning the 3D deformation. In this way, our solution achieves 3D geometry-aware deformation modeling, which enables improved dynamic view synthesis and 3D dynamic reconstruction. Extensive experimental results on both synthetic and real datasets prove the superiority of our solution, which achieves new state-of-the-art performance. The project is available at https://npucvr.github.io/GaGS/ △ Less

Submitted 14 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

Comments: Accepted by CVPR 2024. Project page: https://npucvr.github.io/GaGS/

arXiv:2312.13641 [pdf, other]

SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection

Authors: Yun Zhu, Le Hui, Yaqi Shen, Jin Xie

Abstract: Current 3D object detection methods for indoor scenes mainly follow the voting-and-grouping strategy to generate proposals. However, most methods utilize instance-agnostic groupings, such as ball query, leading to inconsistent semantic information and inaccurate regression of the proposals. To this end, we propose a novel superpoint grouping network for indoor anchor-free one-stage 3D object detec… ▽ More Current 3D object detection methods for indoor scenes mainly follow the voting-and-grouping strategy to generate proposals. However, most methods utilize instance-agnostic groupings, such as ball query, leading to inconsistent semantic information and inaccurate regression of the proposals. To this end, we propose a novel superpoint grouping network for indoor anchor-free one-stage 3D object detection. Specifically, we first adopt an unsupervised manner to partition raw point clouds into superpoints, areas with semantic consistency and spatial similarity. Then, we design a geometry-aware voting module that adapts to the centerness in anchor-free detection by constraining the spatial relationship between superpoints and object centers. Next, we present a superpoint-based grouping module to explore the consistent representation within proposals. This module includes a superpoint attention layer to learn feature interaction between neighboring superpoints, and a superpoint-voxel fusion layer to propagate the superpoint-level information to the voxel level. Finally, we employ effective multiple matching to capitalize on the dynamic receptive fields of proposals based on superpoints during the training. Experimental results demonstrate our method achieves state-of-the-art performance on ScanNet V2, SUN RGB-D, and S3DIS datasets in the indoor one-stage 3D object detection. Source code is available at https://github.com/zyrant/SPGroup3D. △ Less

Submitted 21 December, 2023; originally announced December 2023.

Comments: Accepted by AAAI 2024

arXiv:2312.08440 [pdf, other]

$S$-matrix positivity without Lorentz invariance: a case study

Authors: Lam Hui, Ioanna Kourkoulou, Alberto Nicolis, Alessandro Podo, Shengjia Zhou

Abstract: We investigate the analytic structure of scattering amplitudes in theories in which Lorentz invariance is spontaneously broken. We do so by computing and studying the S-matrix for a simple example: a superfluid described by a complex scalar with quartic interactions. The computation is confined to tree-level, for there are no absolutely stable single-particle states, though the lifetime can be mad… ▽ More We investigate the analytic structure of scattering amplitudes in theories in which Lorentz invariance is spontaneously broken. We do so by computing and studying the S-matrix for a simple example: a superfluid described by a complex scalar with quartic interactions. The computation is confined to tree-level, for there are no absolutely stable single-particle states, though the lifetime can be made long by lowering the chemical potential. For the $2 \to 2$ amplitude in center-of-mass configurations, not only is crossing symmetry violated, there appears a {\it tree level} branch cut for unphysical kinematics. Its appearance is a consequence of non-analyticity in the dispersion relation. The branch point defines a new scale in the problem, which scales inversely with the chemical potential. In this example, even derivatives of the forward amplitude are positive while odd derivatives are negative. This pattern can be understood in a general way in the limit of a small chemical potential, or weak Lorentz breaking. △ Less

Submitted 13 December, 2023; originally announced December 2023.

Comments: 36 pages, 5 figures

arXiv:2310.12959 [pdf, other]

Consistently constraining $f_{\rm NL}$ with the squeezed lensing bispectrum using consistency relations

Authors: Samuel Goldstein, Oliver H. E. Philcox, J. Colin Hill, Angelo Esposito, Lam Hui

Abstract: We introduce a non-perturbative method to constrain the amplitude of local-type primordial non-Gaussianity ($f_{\rm NL}$) using squeezed configurations of the CMB lensing convergence and cosmic shear bispectra. First, we use cosmological consistency relations to derive a model for the squeezed limit of angular auto- and cross-bispectra of lensing convergence fields in the presence of $f_{\rm NL}$.… ▽ More We introduce a non-perturbative method to constrain the amplitude of local-type primordial non-Gaussianity ($f_{\rm NL}$) using squeezed configurations of the CMB lensing convergence and cosmic shear bispectra. First, we use cosmological consistency relations to derive a model for the squeezed limit of angular auto- and cross-bispectra of lensing convergence fields in the presence of $f_{\rm NL}$. Using this model, we perform a Fisher forecast with specifications expected for upcoming CMB lensing measurements from the Simons Observatory and CMB-S4, as well as cosmic shear measurements from a Rubin LSST/Euclid-like experiment. Assuming a minimum multipole $\ell_{\rm min}=10$ and maximum multipole $\ell_{\rm max}=1400$, we forecast $σ_{f_{\rm NL}}=175$ ($95$) for Simons Observatory (CMB-S4). Our forecasts improve considerably for an LSST/Euclid-like cosmic shear experiment with three tomographic bins and $\ell_{\rm min}=10$ and $\ell_{\rm max}=1400$ ($5000$) with $σ_{f_{\rm NL}}=31$ ($16$). A joint analysis of CMB-S4 lensing and LSST/Euclid-like shear yields little gain over the shear-only forecasts; however, we show that a joint analysis could be useful if the CMB lensing convergence can be reliably reconstructed at larger angular scales than the shear field. The method presented in this work is a novel and robust technique to constrain local primordial non-Gaussianity from upcoming large-scale structure surveys that is completely independent of the galaxy field (and therefore any nuisance parameters such as $b_φ$), thus complementing existing techniques to constrain $f_{\rm NL}$ using the scale-dependent halo bias. △ Less

Submitted 19 October, 2023; originally announced October 2023.

Comments: 14 pages, 5 figures, comments welcome!

arXiv:2309.00655 [pdf, other]

RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion

Authors: Zhiqiang Yan, Xiang Li, Le Hui, Zhenyu Zhang, Jun Li, Jian Yang

Abstract: Depth completion aims to recover dense depth maps from sparse ones, where color images are often used to facilitate this task. Recent depth methods primarily focus on image guided learning frameworks. However, blurry guidance in the image and unclear structure in the depth still impede their performance. To tackle these challenges, we explore a repetitive design in our image guided network to grad… ▽ More Depth completion aims to recover dense depth maps from sparse ones, where color images are often used to facilitate this task. Recent depth methods primarily focus on image guided learning frameworks. However, blurry guidance in the image and unclear structure in the depth still impede their performance. To tackle these challenges, we explore a repetitive design in our image guided network to gradually and sufficiently recover depth values. Specifically, the repetition is embodied in both the image guidance branch and depth generation branch. In the former branch, we design a dense repetitive hourglass network (DRHN) to extract discriminative image features of complex environments, which can provide powerful contextual instruction for depth prediction. In the latter branch, we present a repetitive guidance (RG) module based on dynamic convolution, in which an efficient convolution factorization is proposed to reduce the complexity while modeling high-frequency structures progressively. Furthermore, in the semantic guidance branch, we utilize the well-known large vision model, i.e., segment anything (SAM), to supply RG with semantic prior. In addition, we propose a region-aware spatial propagation network (RASPN) for further depth refinement based on the semantic prior constraint. Finally, we collect a new dataset termed TOFDC for the depth completion task, which is acquired by the time-of-flight (TOF) sensor and the color camera on smartphones. Extensive experiments demonstrate that our method achieves state-of-the-art performance on KITTI, NYUv2, Matterport3D, 3D60, VKITTI, and our TOFDC. △ Less

Submitted 28 February, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

Comments: 20 pages

arXiv:2308.11017 [pdf, other]

doi 10.1093/mnras/stad2244

Statistical analysis of the onset temperature of solar flares in 2010-2011

Authors: Douglas Félix da Silva, Li Hui, Paulo J. A. Simões, Adriana Valio, Joaquim C. E. R., Hugh S. Hudson, Paulo J. A. Simoes, Lyndsay Fletcher, Laura A. Hayes, Iain G. Hannah

Abstract: Understanding the physical processes that trigger solar flares is paramount to help with forecasting space weather and mitigating the effects on our technological infrastructure. A previously unknown phenomenon was recently identified in solar flares: the plasma temperature, derived from soft X-ray (SXR) data, at the onset of four flares, was revealed to be in the range 10-15 MK, without evidence… ▽ More Understanding the physical processes that trigger solar flares is paramount to help with forecasting space weather and mitigating the effects on our technological infrastructure. A previously unknown phenomenon was recently identified in solar flares: the plasma temperature, derived from soft X-ray (SXR) data, at the onset of four flares, was revealed to be in the range 10-15 MK, without evidence of gradual heating. To investigate how common the hot-onset phenomenon may be, we extend this investigation to solar flares of B1.2- X6.9 classes recorded by the X-ray Sensor (XRS) on-board the GOES-14 and GOES-15 satellites between 2010 and 2011. For this statistical study, we employed the same methodology as in recent work, where the pre-flare SXR flux of each flare is obtained manually, and the temperature and emission measure values are obtained by the flux ratio of the two GOES/XRS channels using the standard software. From 3224 events listed in the GOES flare catalog for 2010-2011, we have selected and analyzed 745 events for which the flare heliographic location was provided in the list, to investigate center-to-limb effects of the hot-onset phenomenon. Our results show that 559 out of 745 flares (75%) exhibit an onset temperature above 8.6 MK (the first quartile), with respective log10 of the emission measure values between 46.0 - 47.25 cm-3, indicating that small amounts of plasma are quickly heated to high temperatures. These results suggest that the hot-onset phenomenon is very common in solar flares. △ Less

Submitted 21 August, 2023; originally announced August 2023.

Comments: 6 pages,7 figures

arXiv:2305.10492 [pdf, other]

doi 10.1103/PhysRevD.108.L121502

Relativistic drag forces on black holes from scalar dark matter clouds of all sizes

Authors: Dina Traykova, Rodrigo Vicente, Katy Clough, Thomas Helfer, Emanuele Berti, Pedro G. Ferreira, Lam Hui

Abstract: We use numerical simulations of scalar field dark matter evolving on a moving black hole background to confirm the regime of validity of (semi-)analytic expressions derived from first principles for both dynamical friction and momentum accretion in the relativistic regime. We cover both small and large clouds (relative to the de Broglie wavelength of the scalars), and light and heavy particle mass… ▽ More We use numerical simulations of scalar field dark matter evolving on a moving black hole background to confirm the regime of validity of (semi-)analytic expressions derived from first principles for both dynamical friction and momentum accretion in the relativistic regime. We cover both small and large clouds (relative to the de Broglie wavelength of the scalars), and light and heavy particle masses (relative to the BH size). In the case of a small dark matter cloud, the effect of accretion is a non-negligible contribution to the total force on the black hole, even for small scalar masses. We confirm that this momentum accretion transitions between two regimes (wave- and particle-like) and we identify the mass of the scalar at which the transition between regimes occurs. △ Less

Submitted 19 February, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

Comments: 11 pages, 5 figures. Minor corrections and references added to match published version

Journal ref: Phys.Rev.D 108 (2023) 12, L121502

arXiv:2305.08813 [pdf, other]

ReLU soothes the NTK condition number and accelerates optimization for wide neural networks

Authors: Chaoyue Liu, Like Hui

Abstract: Rectified linear unit (ReLU), as a non-linear activation function, is well known to improve the expressivity of neural networks such that any continuous function can be approximated to arbitrary precision by a sufficiently wide neural network. In this work, we present another interesting and important feature of ReLU activation function. We show that ReLU leads to: {\it better separation} for simi… ▽ More Rectified linear unit (ReLU), as a non-linear activation function, is well known to improve the expressivity of neural networks such that any continuous function can be approximated to arbitrary precision by a sufficiently wide neural network. In this work, we present another interesting and important feature of ReLU activation function. We show that ReLU leads to: {\it better separation} for similar data, and {\it better conditioning} of neural tangent kernel (NTK), which are closely related. Comparing with linear neural networks, we show that a ReLU activated wide neural network at random initialization has a larger angle separation for similar data in the feature space of model gradient, and has a smaller condition number for NTK. Note that, for a linear neural network, the data separation and NTK condition number always remain the same as in the case of a linear model. Furthermore, we show that a deeper ReLU network (i.e., with more ReLU activation operations), has a smaller NTK condition number than a shallower one. Our results imply that ReLU activation, as well as the depth of ReLU network, helps improve the gradient descent convergence rate, which is closely related to the NTK condition number. △ Less

Submitted 15 May, 2023; originally announced May 2023.

arXiv:2305.02528 [pdf, other]

Self-Supervised 3D Scene Flow Estimation Guided by Superpoints

Authors: Yaqi Shen, Le Hui, Jin Xie, Jian Yang

Abstract: 3D scene flow estimation aims to estimate point-wise motions between two consecutive frames of point clouds. Superpoints, i.e., points with similar geometric features, are usually employed to capture similar motions of local regions in 3D scenes for scene flow estimation. However, in existing methods, superpoints are generated with the offline clustering methods, which cannot characterize local re… ▽ More 3D scene flow estimation aims to estimate point-wise motions between two consecutive frames of point clouds. Superpoints, i.e., points with similar geometric features, are usually employed to capture similar motions of local regions in 3D scenes for scene flow estimation. However, in existing methods, superpoints are generated with the offline clustering methods, which cannot characterize local regions with similar motions for complex 3D scenes well, leading to inaccurate scene flow estimation. To this end, we propose an iterative end-to-end superpoint based scene flow estimation framework, where the superpoints can be dynamically updated to guide the point-level flow prediction. Specifically, our framework consists of a flow guided superpoint generation module and a superpoint guided flow refinement module. In our superpoint generation module, we utilize the bidirectional flow information at the previous iteration to obtain the matching points of points and superpoint centers for soft point-to-superpoint association construction, in which the superpoints are generated for pairwise point clouds. With the generated superpoints, we first reconstruct the flow for each point by adaptively aggregating the superpoint-level flow, and then encode the consistency between the reconstructed flow of pairwise point clouds. Finally, we feed the consistency encoding along with the reconstructed flow into GRU to refine point-level flow. Extensive experiments on several different datasets show that our method can achieve promising performance. △ Less

Submitted 3 May, 2023; originally announced May 2023.

Comments: CVPR 2023

arXiv:2302.03952 [pdf, other]

Cut your Losses with Squentropy

Authors: Like Hui, Mikhail Belkin, Stephen Wright

Abstract: Nearly all practical neural models for classification are trained using cross-entropy loss. Yet this ubiquitous choice is supported by little theoretical or empirical evidence. Recent work (Hui & Belkin, 2020) suggests that training using the (rescaled) square loss is often superior in terms of the classification accuracy. In this paper we propose the "squentropy" loss, which is the sum of two ter… ▽ More Nearly all practical neural models for classification are trained using cross-entropy loss. Yet this ubiquitous choice is supported by little theoretical or empirical evidence. Recent work (Hui & Belkin, 2020) suggests that training using the (rescaled) square loss is often superior in terms of the classification accuracy. In this paper we propose the "squentropy" loss, which is the sum of two terms: the cross-entropy loss and the average square loss over the incorrect classes. We provide an extensive set of experiments on multi-class classification problems showing that the squentropy loss outperforms both the pure cross entropy and rescaled square losses in terms of the classification accuracy. We also demonstrate that it provides significantly better model calibration than either of these alternative losses and, furthermore, has less variance with respect to the random initialization. Additionally, in contrast to the square loss, squentropy loss can typically be trained using exactly the same optimization parameters, including the learning rate, as the standard cross-entropy loss, making it a true "plug-and-play" replacement. Finally, unlike the rescaled square loss, multiclass squentropy contains no parameters that need to be adjusted. △ Less

Submitted 8 February, 2023; originally announced February 2023.

Comments: 18 pages, 16 figures, 6 tables

arXiv:2212.09367 [pdf, ps, other]

doi 10.1088/1475-7516/2023/06/056

Ladder Symmetries of Black Holes and de Sitter Space: Love Numbers and Quasinormal Modes

Authors: Roman Berens, Lam Hui, Zimo Sun

Abstract: In this note, we present a synopsis of geometric symmetries for (spin 0) perturbations around (4D) black holes and de Sitter space. For black holes, we focus on static perturbations, for which the (exact) geometric symmetries have the group structure of SO(1,3). The generators consist of three spatial rotations, and three conformal Killing vectors obeying a special melodic condition. The static pe… ▽ More In this note, we present a synopsis of geometric symmetries for (spin 0) perturbations around (4D) black holes and de Sitter space. For black holes, we focus on static perturbations, for which the (exact) geometric symmetries have the group structure of SO(1,3). The generators consist of three spatial rotations, and three conformal Killing vectors obeying a special melodic condition. The static perturbation solutions form a unitary (principal series) representation of the group. The recently uncovered ladder symmetries follow from this representation structure; they explain the well-known vanishing of the black hole Love numbers. For dynamical perturbations around de Sitter space, the geometric symmetries are less surprising, following from the SO(1,4) isometry. As is well known, the quasinormal solutions form a non-unitary representation of the isometry group. We provide explicit expressions for the ladder operators associated with this representation. In both cases, the ladder structures help connect the boundary condition at the horizon with that at infinity (black hole) or origin (de Sitter space), and they manifest as contiguous relations of the hypergeometric solutions. △ Less

Submitted 19 April, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

Comments: 37 pages, no figures v2: Author ZS added, section 2.1 extended

arXiv:2210.16276 [pdf, other]

doi 10.1007/JHEP02(2023)123

Soft theorems for boosts and other time symmetries

Authors: Lam Hui, Austin Joyce, Ilia Komissarov, Klaas Parmentier, Luca Santoni, Sam S. C. Wong

Abstract: We derive soft theorems for theories in which time symmetries -- symmetries that involve the transformation of time, an example of which are Lorentz boosts -- are spontaneously broken. The soft theorems involve unequal-time correlation functions with the insertion of a soft Goldstone in the far past. Explicit checks are provided for several examples, including the effective theory of a relativisti… ▽ More We derive soft theorems for theories in which time symmetries -- symmetries that involve the transformation of time, an example of which are Lorentz boosts -- are spontaneously broken. The soft theorems involve unequal-time correlation functions with the insertion of a soft Goldstone in the far past. Explicit checks are provided for several examples, including the effective theory of a relativistic superfluid and the effective field theory of inflation. We discuss how in certain cases these unequal-time identities capture information at the level of observables that cannot be seen purely in terms of equal-time correlators of the field alone. We also discuss when it is possible to phrase these soft theorems as identities involving equal-time correlators. △ Less

Submitted 28 October, 2022; originally announced October 2022.

Comments: 50 pages

arXiv:2210.10788 [pdf, other]

doi 10.1007/JHEP03(2023)060

An analytic approach to quasinormal modes for coupled linear systems

Authors: Lam Hui, Alessandro Podo, Luca Santoni, Enrico Trincherini

Abstract: Quasinormal modes describe the ringdown of compact objects deformed by small perturbations. In generic theories of gravity that extend General Relativity, the linearized dynamics of these perturbations is described by a system of coupled linear differential equations of second order. We first show, under general assumptions, that such a system can be brought to a Schrödinger-like form. We then dev… ▽ More Quasinormal modes describe the ringdown of compact objects deformed by small perturbations. In generic theories of gravity that extend General Relativity, the linearized dynamics of these perturbations is described by a system of coupled linear differential equations of second order. We first show, under general assumptions, that such a system can be brought to a Schrödinger-like form. We then devise an analytic approximation scheme to compute the spectrum of quasinormal modes. We validate our approach using a toy model with a controllable mixing parameter $\varepsilon$ and showing that the analytic approximation for the fundamental mode agrees with the numerical computation when the approximation is justified. The accuracy of the analytic approximation is at the (sub-) percent level for the real part and at the level of a few percent for the imaginary part, even when $\varepsilon$ is of order one. Our approximation scheme can be seen as an extension of the approach of Schutz and Will to the case of coupled systems of equations, although our approach is not phrased in terms of a WKB analysis, and offers a new viewpoint even in the case of a single equation. △ Less

Submitted 2 October, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

Comments: 30 pages. v2: matches published version

arXiv:2210.05534 [pdf, other]

Learning Inter-Superpoint Affinity for Weakly Supervised 3D Instance Segmentation

Authors: Linghua Tang, Le Hui, Jin Xie

Abstract: Due to the few annotated labels of 3D point clouds, how to learn discriminative features of point clouds to segment object instances is a challenging problem. In this paper, we propose a simple yet effective 3D instance segmentation framework that can achieve good performance by annotating only one point for each instance. Specifically, to tackle extremely few labels for instance segmentation, we… ▽ More Due to the few annotated labels of 3D point clouds, how to learn discriminative features of point clouds to segment object instances is a challenging problem. In this paper, we propose a simple yet effective 3D instance segmentation framework that can achieve good performance by annotating only one point for each instance. Specifically, to tackle extremely few labels for instance segmentation, we first oversegment the point cloud into superpoints in an unsupervised manner and extend the point-level annotations to the superpoint level. Then, based on the superpoint graph, we propose an inter-superpoint affinity mining module that considers the semantic and spatial relations to adaptively learn inter-superpoint affinity to generate high-quality pseudo labels via semantic-aware random walk. Finally, we propose a volume-aware instance refinement module to segment high-quality instances by applying volume constraints of objects in clustering on the superpoint graph. Extensive experiments on the ScanNet-v2 and S3DIS datasets demonstrate that our method achieves state-of-the-art performance in the weakly supervised point cloud instance segmentation task, and even outperforms some fully supervised methods. △ Less

Submitted 11 October, 2022; originally announced October 2022.

Comments: accepted by ACCV 2022

arXiv:2209.06395 [pdf, other]

Point Cloud Registration-Driven Robust Feature Matching for 3D Siamese Object Tracking

Authors: Haobo Jiang, Kaihao Lan, Le Hui, Guangyu Li, Jin Xie, Jian Yang

Abstract: Learning robust feature matching between the template and search area is crucial for 3D Siamese tracking. The core of Siamese feature matching is how to assign high feature similarity on the corresponding points between the template and search area for precise object localization. In this paper, we propose a novel point cloud registration-driven Siamese tracking framework, with the intuition that… ▽ More Learning robust feature matching between the template and search area is crucial for 3D Siamese tracking. The core of Siamese feature matching is how to assign high feature similarity on the corresponding points between the template and search area for precise object localization. In this paper, we propose a novel point cloud registration-driven Siamese tracking framework, with the intuition that spatially aligned corresponding points (via 3D registration) tend to achieve consistent feature representations. Specifically, our method consists of two modules, including a tracking-specific nonlocal registration module and a registration-aided Sinkhorn template-feature aggregation module. The registration module targets at the precise spatial alignment between the template and search area. The tracking-specific spatial distance constraint is proposed to refine the cross-attention weights in the nonlocal module for discriminative feature learning. Then, we use the weighted SVD to compute the rigid transformation between the template and search area, and align them to achieve the desired spatially aligned corresponding points. For the feature aggregation model, we formulate the feature matching between the transformed template and search area as an optimal transport problem and utilize the Sinkhorn optimization to search for the outlier-robust matching solution. Also, a registration-aided spatial distance map is built to improve the matching robustness in indistinguishable regions (e.g., smooth surface). Finally, guided by the obtained feature matching map, we aggregate the target information from the template into the search area to construct the target-specific feature, which is then fed into a CenterPoint-like detection head for object localization. Extensive experiments on KITTI, NuScenes and Waymo datasets verify the effectiveness of our proposed method. △ Less

Submitted 3 December, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

arXiv:2209.06228 [pdf, other]

doi 10.1103/PhysRevD.106.123525

Squeezing $f_{\rm NL}$ out of the matter bispectrum with consistency relations

Authors: Samuel Goldstein, Angelo Esposito, Oliver H. E. Philcox, Lam Hui, J. Colin Hill, Roman Scoccimarro, Maximilian H. Abitbol

Abstract: We show how consistency relations can be used to robustly extract the amplitude of local primordial non-Gaussianity ($f_{\rm NL}$) from the squeezed limit of the matter bispectrum, well into the non-linear regime. First, we derive a non-perturbative relation between primordial non-Gaussianity and the leading term in the squeezed bispectrum, revising some results present in the literature. This rel… ▽ More We show how consistency relations can be used to robustly extract the amplitude of local primordial non-Gaussianity ($f_{\rm NL}$) from the squeezed limit of the matter bispectrum, well into the non-linear regime. First, we derive a non-perturbative relation between primordial non-Gaussianity and the leading term in the squeezed bispectrum, revising some results present in the literature. This relation is then used to successfully measure $f_{\rm NL}$ from $N$-body simulations. We discuss the dependence of our results on different scale cuts and redshifts. Specifically, the analysis is strongly dependent on the choice of the smallest soft momentum, $q_{\rm min}$, which is the most sensitive to primordial bispectrum contributions, but is largely independent of the choice of the largest hard momentum, $k_{\rm max}$, due to the non-Gaussian nature of the covariance. We also show how the constraints on $f_{\rm NL}$ improve at higher redshift, due to a reduced off-diagonal covariance. In particular, for a simulation with $f_{\rm NL} = 100$ and a volume of $(2.4 \text{ Gpc}/h)^3$, we measure $f_{\rm NL} = 98 \pm 12$ at redshift $z=0$ and $f_{\rm NL} = 97 \pm 8$ at $z=0.97$. Finally, we compare our results with a Fisher forecast, showing that the current version of the analysis is satisfactorily close to the Fisher error. We regard this as a first step towards the realistic application of consistency relations to constrain primordial non-Gaussianity using observations. △ Less

Submitted 6 January, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

Comments: 17 pages, 8 figures. Minor changes. Matches version published in PRD

Journal ref: Phys. Rev. D 106, 123525 (2022)

arXiv:2208.07380 [pdf, other]

doi 10.1103/PhysRevLett.130.081402

Nonlinearities in Black Hole Ringdowns

Authors: Keefe Mitman, Macarena Lagos, Leo C. Stein, Sizheng Ma, Lam Hui, Yanbei Chen, Nils Deppe, François Hébert, Lawrence E. Kidder, Jordan Moxon, Mark A. Scheel, Saul A. Teukolsky, William Throwe, Nils L. Vu

Abstract: The gravitational wave strain emitted by a perturbed black hole (BH) ringing down is typically modeled analytically using first-order BH perturbation theory. In this Letter we show that second-order effects are necessary for modeling ringdowns from BH merger simulations. Focusing on the strain's $(\ell,m)=(4,4)$ angular harmonic, we show the presence of a quadratic effect across a range of binary… ▽ More The gravitational wave strain emitted by a perturbed black hole (BH) ringing down is typically modeled analytically using first-order BH perturbation theory. In this Letter we show that second-order effects are necessary for modeling ringdowns from BH merger simulations. Focusing on the strain's $(\ell,m)=(4,4)$ angular harmonic, we show the presence of a quadratic effect across a range of binary BH mass ratios that agrees with theoretical expectations. We find that the quadratic $(4,4)$ mode's amplitude exhibits quadratic scaling with the fundamental $(2,2)$ mode -- its parent mode. The nonlinear mode's amplitude is comparable to or even larger than that of the linear $(4,4)$ mode. Therefore, correctly modeling the ringdown of higher harmonics -- improving mode mismatches by up to 2 orders of magnitude -- requires the inclusion of nonlinear effects. △ Less

Submitted 22 February, 2023; v1 submitted 15 August, 2022; originally announced August 2022.

Comments: 6+2 pages, 4 figures, 1 table. Matches PRL version

Journal ref: Phys. Rev. Lett. 130, 081402 (2023)

arXiv:2208.07379 [pdf, other]

doi 10.1103/PhysRevD.107.044040

Generation and propagation of nonlinear quasi-normal modes of a Schwarzschild black hole

Authors: Macarena Lagos, Lam Hui

Abstract: In the analysis of a binary black hole coalescence, it is necessary to include gravitational self-interactions in order to describe the transition of the gravitational wave signal from the merger to the ringdown stage. In this paper we study the phenomenology of the generation and propagation of nonlinearities in the ringdown of a Schwarzschild black hole, using second-order perturbation theory. F… ▽ More In the analysis of a binary black hole coalescence, it is necessary to include gravitational self-interactions in order to describe the transition of the gravitational wave signal from the merger to the ringdown stage. In this paper we study the phenomenology of the generation and propagation of nonlinearities in the ringdown of a Schwarzschild black hole, using second-order perturbation theory. Following earlier work, we show that the Green's function and its causal structure determines how both first-order and second-order perturbations are generated, and hence highlight that both of these solutions share some physical properties. In particular, we discuss the sense in which both linear and quadratic quasi-normal modes (QNMs) are generated in the vicinity of the peak of the gravitational potential barrier (loosely referred to as the light ring). Among the second-order perturbations, there are solutions with linear QNM frequencies (whose amplitudes are thus renormalized from their linear values), as well as quadratic QNM frequencies with a distinct spectrum. Moreover, we show using a WKB analysis that, in the eikonal limit, waves generated inside the light ring propagate towards the black hole horizon, and only waves generated outside propagate towards an asymptotic observer. These results might be relevant for recent discussions on the validity of perturbation theory close to the merger. Finally, we argue that even if nonlinearities are small, quadratic QNMs may be detectable and would likely be useful for improving ringdown models of higher angular harmonics and future tests of gravity. △ Less

Submitted 9 January, 2023; v1 submitted 15 August, 2022; originally announced August 2022.

Comments: Version accepted in PRD

arXiv:2208.06408 [pdf, other]

doi 10.1103/PhysRevD.107.104018

Black hole superradiance with (dark) matter accretion

Authors: Lam Hui, Y. T. Albert Law, Luca Santoni, Guanhao Sun, Giovanni Maria Tomaselli, Enrico Trincherini

Abstract: Studies of black hole superradiance often focus on the growth of a cloud in isolation, accompanied by the spin-down of the black hole. In this paper, we consider the additional effect of the accretion of matter and angular momentum from the environment. We show that, in many cases, the black hole evolves by drifting along the superradiance threshold, in which case the evolution of its parameters c… ▽ More Studies of black hole superradiance often focus on the growth of a cloud in isolation, accompanied by the spin-down of the black hole. In this paper, we consider the additional effect of the accretion of matter and angular momentum from the environment. We show that, in many cases, the black hole evolves by drifting along the superradiance threshold, in which case the evolution of its parameters can be described analytically or semi-analytically. We quantify the conditions under which accretion can serve as a mechanism to increase the cloud-to-black hole mass ratio, beyond the standard maximum of about 10%. This occurs by a process we call over-superradiance, whereby accretion effectively feeds the superradiance cloud, by way of the black hole. We give two explicit examples: accretion from a vortex expected in wave dark matter and accretion from a baryonic disk. In the former case, we estimate the accretion rate by using an analytical fit to the asymptotic behavior of the confluent Heun function. Level transition, whereby one cloud level grows while the other shrinks, can be understood in a similar way. △ Less

Submitted 25 May, 2023; v1 submitted 12 August, 2022; originally announced August 2022.

Comments: 30+21 pages, 14 figures

arXiv:2208.04510 [pdf, other]

Unsupervised Domain Adaptation for Point Cloud Semantic Segmentation via Graph Matching

Authors: Yikai Bian, Le Hui, Jianjun Qian, Jin Xie

Abstract: Unsupervised domain adaptation for point cloud semantic segmentation has attracted great attention due to its effectiveness in learning with unlabeled data. Most of existing methods use global-level feature alignment to transfer the knowledge from the source domain to the target domain, which may cause the semantic ambiguity of the feature space. In this paper, we propose a graph-based framework t… ▽ More Unsupervised domain adaptation for point cloud semantic segmentation has attracted great attention due to its effectiveness in learning with unlabeled data. Most of existing methods use global-level feature alignment to transfer the knowledge from the source domain to the target domain, which may cause the semantic ambiguity of the feature space. In this paper, we propose a graph-based framework to explore the local-level feature alignment between the two domains, which can reserve semantic discrimination during adaptation. Specifically, in order to extract local-level features, we first dynamically construct local feature graphs on both domains and build a memory bank with the graphs from the source domain. In particular, we use optimal transport to generate the graph matching pairs. Then, based on the assignment matrix, we can align the feature distributions between the two domains with the graph-based local feature loss. Furthermore, we consider the correlation between the features of different categories and formulate a category-guided contrastive loss to guide the segmentation model to learn discriminative features on the target domain. Extensive experiments on different synthetic-to-real and real-to-real domain adaptation scenarios demonstrate that our method can achieve state-of-the-art performance. △ Less

Submitted 8 August, 2022; originally announced August 2022.

arXiv:2207.11996 [pdf, other]

Generative Subgraph Contrast for Self-Supervised Graph Representation Learning

Authors: Yuehui Han, Le Hui, Haobo Jiang, Jianjun Qian, Jin Xie

Abstract: Contrastive learning has shown great promise in the field of graph representation learning. By manually constructing positive/negative samples, most graph contrastive learning methods rely on the vector inner product based similarity metric to distinguish the samples for graph representation. However, the handcrafted sample construction (e.g., the perturbation on the nodes or edges of the graph) m… ▽ More Contrastive learning has shown great promise in the field of graph representation learning. By manually constructing positive/negative samples, most graph contrastive learning methods rely on the vector inner product based similarity metric to distinguish the samples for graph representation. However, the handcrafted sample construction (e.g., the perturbation on the nodes or edges of the graph) may not effectively capture the intrinsic local structures of the graph. Also, the vector inner product based similarity metric cannot fully exploit the local structures of the graph to characterize the graph difference well. To this end, in this paper, we propose a novel adaptive subgraph generation based contrastive learning framework for efficient and robust self-supervised graph representation learning, and the optimal transport distance is utilized as the similarity metric between the subgraphs. It aims to generate contrastive samples by capturing the intrinsic structures of the graph and distinguish the samples based on the features and structures of subgraphs simultaneously. Specifically, for each center node, by adaptively learning relation weights to the nodes of the corresponding neighborhood, we first develop a network to generate the interpolated subgraph. We then construct the positive and negative pairs of subgraphs from the same and different nodes, respectively. Finally, we employ two types of optimal transport distances (i.e., Wasserstein distance and Gromov-Wasserstein distance) to construct the structured contrastive loss. Extensive node classification experiments on benchmark datasets verify the effectiveness of our graph contrastive learning method. △ Less

Submitted 26 July, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

Comments: ECCV 2022

arXiv:2207.11995 [pdf, other]

3D Siamese Transformer Network for Single Object Tracking on Point Clouds

Authors: Le Hui, Lingpeng Wang, Linghua Tang, Kaihao Lan, Jin Xie, Jian Yang

Abstract: Siamese network based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area. Due to the large appearance variation between the template and search area during tracking, how to learn the robust cross correlation between them for identifying the potential target in the search area is still a challenging problem. In this pape… ▽ More Siamese network based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area. Due to the large appearance variation between the template and search area during tracking, how to learn the robust cross correlation between them for identifying the potential target in the search area is still a challenging problem. In this paper, we explicitly use Transformer to form a 3D Siamese Transformer network for learning robust cross correlation between the template and the search area of point clouds. Specifically, we develop a Siamese point Transformer network to learn shape context information of the target. Its encoder uses self-attention to capture non-local information of point clouds to characterize the shape information of the object, and the decoder utilizes cross-attention to upsample discriminative point features. After that, we develop an iterative coarse-to-fine correlation network to learn the robust cross correlation between the template and the search area. It formulates the cross-feature augmentation to associate the template with the potential target in the search area via cross attention. To further enhance the potential target, it employs the ego-feature augmentation that applies self-attention to the local k-NN graph of the feature space to aggregate target features. Experiments on the KITTI, nuScenes, and Waymo datasets show that our method achieves state-of-the-art performance on the 3D single object tracking task. △ Less

Submitted 26 July, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

Comments: Accepted to ECCV'22

arXiv:2207.11984 [pdf, other]

RA-Depth: Resolution Adaptive Self-Supervised Monocular Depth Estimation

Authors: Mu He, Le Hui, Yikai Bian, Jian Ren, Jin Xie, Jian Yang

Abstract: Existing self-supervised monocular depth estimation methods can get rid of expensive annotations and achieve promising results. However, these methods suffer from severe performance degradation when directly adopting a model trained on a fixed resolution to evaluate at other different resolutions. In this paper, we propose a resolution adaptive self-supervised monocular depth estimation method (RA… ▽ More Existing self-supervised monocular depth estimation methods can get rid of expensive annotations and achieve promising results. However, these methods suffer from severe performance degradation when directly adopting a model trained on a fixed resolution to evaluate at other different resolutions. In this paper, we propose a resolution adaptive self-supervised monocular depth estimation method (RA-Depth) by learning the scale invariance of the scene depth. Specifically, we propose a simple yet efficient data augmentation method to generate images with arbitrary scales for the same scene. Then, we develop a dual high-resolution network that uses the multi-path encoder and decoder with dense interactions to aggregate multi-scale features for accurate depth inference. Finally, to explicitly learn the scale invariance of the scene depth, we formulate a cross-scale depth consistency loss on depth predictions with different scales. Extensive experiments on the KITTI, Make3D and NYU-V2 datasets demonstrate that RA-Depth not only achieves state-of-the-art performance, but also exhibits a good ability of resolution adaptation. △ Less

Submitted 26 July, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

Comments: Accepted to ECCV'22

arXiv:2206.00310 [pdf, other]

Digital Twin for Networking: A Data-driven Performance Modeling Perspective

Authors: Linbo Hui, Mowei Wang, Liang Zhang, Lu Lu, Yong Cui

Abstract: Emerging technologies and applications make the network unprecedentedly complex and heterogeneous, leading physical network practices to be costly and risky. The digital twin network (DTN) can ease these burdens by virtually enabling users to understand how performance changes accordingly with modifications. For this "What-if" performance evaluation, conventional simulation and analytical approach… ▽ More Emerging technologies and applications make the network unprecedentedly complex and heterogeneous, leading physical network practices to be costly and risky. The digital twin network (DTN) can ease these burdens by virtually enabling users to understand how performance changes accordingly with modifications. For this "What-if" performance evaluation, conventional simulation and analytical approaches are inefficient, inaccurate, and inflexible, and we argue that data-driven methods are most promising. In this article, we identify three requirements (fidelity, efficiency, and flexibility) for performance evaluation. Then we present a comparison of selected data-driven methods and investigate their potential trends in data, models, and applications. Although extensive applications have been enabled, there are still significant conflicts between models' capacities to handle diversified inputs and limited data collected from the production network. We further illustrate the opportunities for data collection, model construction, and application prospects. This survey aims to provide a reference for performance evaluation while also facilitating future DTN research. △ Less

Submitted 1 June, 2022; originally announced June 2022.

arXiv:2203.08832 [pdf, other]

doi 10.1007/JHEP09(2022)049

Near-Zone Symmetries of Kerr Black Holes

Authors: Lam Hui, Austin Joyce, Riccardo Penco, Luca Santoni, Adam R. Solomon

Abstract: We study the near-zone symmetries of a massless scalar field on four-dimensional black hole backgrounds. We provide a geometric understanding that unifies various recently discovered symmetries as part of an SO(4,2) group. Of these, a subset are exact symmetries of the static sector and give rise to the ladder symmetries responsible for the vanishing of Love numbers. In the Kerr case, we compare d… ▽ More We study the near-zone symmetries of a massless scalar field on four-dimensional black hole backgrounds. We provide a geometric understanding that unifies various recently discovered symmetries as part of an SO(4,2) group. Of these, a subset are exact symmetries of the static sector and give rise to the ladder symmetries responsible for the vanishing of Love numbers. In the Kerr case, we compare different near-zone approximations in the literature, and focus on the implementation that retains the symmetries of the static limit. We also describe the relation to spin-1 and 2 perturbations. △ Less

Submitted 16 March, 2022; originally announced March 2022.

Comments: 4+3 pages, 1 figure

Journal ref: JHEP09(2022)049

arXiv:2202.11948 [pdf, other]

Domain Disentangled Generative Adversarial Network for Zero-Shot Sketch-Based 3D Shape Retrieval

Authors: Rui Xu, Zongyan Han, Le Hui, Jianjun Qian, Jin Xie

Abstract: Sketch-based 3D shape retrieval is a challenging task due to the large domain discrepancy between sketches and 3D shapes. Since existing methods are trained and evaluated on the same categories, they cannot effectively recognize the categories that have not been used during training. In this paper, we propose a novel domain disentangled generative adversarial network (DD-GAN) for zero-shot sketch-… ▽ More Sketch-based 3D shape retrieval is a challenging task due to the large domain discrepancy between sketches and 3D shapes. Since existing methods are trained and evaluated on the same categories, they cannot effectively recognize the categories that have not been used during training. In this paper, we propose a novel domain disentangled generative adversarial network (DD-GAN) for zero-shot sketch-based 3D retrieval, which can retrieve the unseen categories that are not accessed during training. Specifically, we first generate domain-invariant features and domain-specific features by disentangling the learned features of sketches and 3D shapes, where the domain-invariant features are used to align with the corresponding word embeddings. Then, we develop a generative adversarial network that combines the domain-specific features of the seen categories with the aligned domain-invariant features to synthesize samples, where the synthesized samples of the unseen categories are generated by using the corresponding word embeddings. Finally, we use the synthesized samples of the unseen categories combined with the real samples of the seen categories to train the network for retrieval, so that the unseen categories can be recognized. In order to reduce the domain shift problem, we utilized unlabeled unseen samples to enhance the discrimination ability of the discriminator. With the discriminator distinguishing the generated samples from the unlabeled unseen samples, the generator can generate more realistic unseen samples. Extensive experiments on the SHREC'13 and SHREC'14 datasets show that our method significantly improves the retrieval performance of the unseen categories. △ Less

Submitted 29 June, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

Comments: Accepted by AAAI 2022

arXiv:2202.11292 [pdf, other]

Reliable Inlier Evaluation for Unsupervised Point Cloud Registration

Authors: Yaqi Shen, Le Hui, Haobo Jiang, Jin Xie, Jian Yang

Abstract: Unsupervised point cloud registration algorithm usually suffers from the unsatisfied registration precision in the partially overlapping problem due to the lack of effective inlier evaluation. In this paper, we propose a neighborhood consensus based reliable inlier evaluation method for robust unsupervised point cloud registration. It is expected to capture the discriminative geometric difference… ▽ More Unsupervised point cloud registration algorithm usually suffers from the unsatisfied registration precision in the partially overlapping problem due to the lack of effective inlier evaluation. In this paper, we propose a neighborhood consensus based reliable inlier evaluation method for robust unsupervised point cloud registration. It is expected to capture the discriminative geometric difference between the source neighborhood and the corresponding pseudo target neighborhood for effective inlier distinction. Specifically, our model consists of a matching map refinement module and an inlier evaluation module. In our matching map refinement module, we improve the point-wise matching map estimation by integrating the matching scores of neighbors into it. The aggregated neighborhood information potentially facilitates the discriminative map construction so that high-quality correspondences can be provided for generating the pseudo target point cloud. Based on the observation that the outlier has the significant structure-wise difference between its source neighborhood and corresponding pseudo target neighborhood while this difference for inlier is small, the inlier evaluation module exploits this difference to score the inlier confidence for each estimated correspondence. In particular, we construct an effective graph representation for capturing this geometric difference between the neighborhoods. Finally, with the learned correspondences and the corresponding inlier confidence, we use the weighted SVD algorithm for transformation estimation. Under the unsupervised setting, we exploit the Huber function based global alignment loss, the local neighborhood consensus loss, and spatial consistency loss for model optimization. The experimental results on extensive datasets demonstrate that our unsupervised point cloud registration method can yield comparable performance. △ Less

Submitted 22 February, 2022; originally announced February 2022.

Comments: Accepted by AAAI 2022

arXiv:2202.08384 [pdf, other]

Limitations of Neural Collapse for Understanding Generalization in Deep Learning

Authors: Like Hui, Mikhail Belkin, Preetum Nakkiran

Abstract: The recent work of Papyan, Han, & Donoho (2020) presented an intriguing "Neural Collapse" phenomenon, showing a structural property of interpolating classifiers in the late stage of training. This opened a rich area of exploration studying this phenomenon. Our motivation is to study the upper limits of this research program: How far will understanding Neural Collapse take us in understanding deep… ▽ More The recent work of Papyan, Han, & Donoho (2020) presented an intriguing "Neural Collapse" phenomenon, showing a structural property of interpolating classifiers in the late stage of training. This opened a rich area of exploration studying this phenomenon. Our motivation is to study the upper limits of this research program: How far will understanding Neural Collapse take us in understanding deep learning? First, we investigate its role in generalization. We refine the Neural Collapse conjecture into two separate conjectures: collapse on the train set (an optimization property) and collapse on the test distribution (a generalization property). We find that while Neural Collapse often occurs on the train set, it does not occur on the test set. We thus conclude that Neural Collapse is primarily an optimization phenomenon, with as-yet-unclear connections to generalization. Second, we investigate the role of Neural Collapse in feature learning. We show simple, realistic experiments where training longer leads to worse last-layer features, as measured by transfer-performance on a downstream task. This suggests that neural collapse is not always desirable for representation learning, as previously claimed. Finally, we give preliminary evidence of a "cascading collapse" phenomenon, wherein some form of Neural Collapse occurs not only for the last layer, but in earlier layers as well. We hope our work encourages the community to continue the rich line of Neural Collapse research, while also considering its inherent limitations. △ Less

Submitted 16 February, 2022; originally announced February 2022.

arXiv:2111.04426 [pdf, other]

3D Siamese Voxel-to-BEV Tracker for Sparse Point Clouds

Authors: Le Hui, Lingpeng Wang, Mingmei Cheng, Jin Xie, Jian Yang

Abstract: 3D object tracking in point clouds is still a challenging problem due to the sparsity of LiDAR points in dynamic environments. In this work, we propose a Siamese voxel-to-BEV tracker, which can significantly improve the tracking performance in sparse 3D point clouds. Specifically, it consists of a Siamese shape-aware feature learning network and a voxel-to-BEV target localization network. The Siam… ▽ More 3D object tracking in point clouds is still a challenging problem due to the sparsity of LiDAR points in dynamic environments. In this work, we propose a Siamese voxel-to-BEV tracker, which can significantly improve the tracking performance in sparse 3D point clouds. Specifically, it consists of a Siamese shape-aware feature learning network and a voxel-to-BEV target localization network. The Siamese shape-aware feature learning network can capture 3D shape information of the object to learn the discriminative features of the object so that the potential target from the background in sparse point clouds can be identified. To this end, we first perform template feature embedding to embed the template's feature into the potential target and then generate a dense 3D shape to characterize the shape information of the potential target. For localizing the tracked target, the voxel-to-BEV target localization network regresses the target's 2D center and the $z$-axis center from the dense bird's eye view (BEV) feature map in an anchor-free manner. Concretely, we compress the voxelized point cloud along $z$-axis through max pooling to obtain a dense BEV feature map, where the regression of the 2D center and the $z$-axis center can be performed more effectively. Extensive evaluation on the KITTI and nuScenes datasets shows that our method significantly outperforms the current state-of-the-art methods by a large margin. △ Less

Submitted 17 November, 2021; v1 submitted 8 November, 2021; originally announced November 2021.

Comments: Accepted by NeurIPS 2021

arXiv:2111.02072 [pdf, other]

doi 10.1007/JHEP12(2021)183

Effective Field Theory for the Perturbations of a Slowly Rotating Black Hole

Authors: Lam Hui, Alessandro Podo, Luca Santoni, Enrico Trincherini

Abstract: We develop the effective theory for perturbations around black holes with scalar hair, in two directions. First, we show that the scalar-Gauss--Bonnet theory, often used as an example exhibiting scalar black hole hair, can be deformed by galileon operators leading to order unity changes to its predictions. The effective theory for perturbations thus provides an efficient framework for describing a… ▽ More We develop the effective theory for perturbations around black holes with scalar hair, in two directions. First, we show that the scalar-Gauss--Bonnet theory, often used as an example exhibiting scalar black hole hair, can be deformed by galileon operators leading to order unity changes to its predictions. The effective theory for perturbations thus provides an efficient framework for describing and constraining broad classes of scalar-tensor theories, of which the addition of galileon operators is an example. Second, we extend the effective theory to perturbations around an axisymmetric, slowly rotating black hole, at linear order in the black hole spin. We also discuss the inclusion of parity-breaking operators in the effective theory. △ Less

Submitted 14 January, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

arXiv:2109.06125 [pdf, other]

doi 10.1103/PhysRevD.105.023512

Construction of Wave Dark Matter Halos: Numerical Algorithm and Analytical Constraints

Authors: Tomer D. Yavetz, Xinyu Li, Lam Hui

Abstract: We present a wave generalization of the classic Schwarzschild method for constructing self-consistent halos -- such a halo consists of a suitable superposition of waves instead of particle orbits, chosen to yield a desired mean density profile. As an illustration, the method is applied to spherically symmetric halos. We derive an analytic relation between the particle distribution function and the… ▽ More We present a wave generalization of the classic Schwarzschild method for constructing self-consistent halos -- such a halo consists of a suitable superposition of waves instead of particle orbits, chosen to yield a desired mean density profile. As an illustration, the method is applied to spherically symmetric halos. We derive an analytic relation between the particle distribution function and the wave superposition amplitudes, and show how it simplifies in the high energy (WKB) limit. We verify the stability of such constructed halos by numerically evolving the Schrödinger-Poisson system. The algorithm provides an efficient and accurate way to simulate the time-dependent halo substructures from wave interference. We use this method to construct halos with a variety of density profiles, all of which have a core from the ground-state wave function, though the core-halo relation need not be the standard one. △ Less

Submitted 5 January, 2023; v1 submitted 13 September, 2021; originally announced September 2021.

Comments: 22 pages, 15 figures; published in Phys. Rev. D

Journal ref: Phys. Rev. D 105, 023512, Published 10 January 2022

arXiv:2108.00454 [pdf, other]

SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering

Authors: Yifan Zhao, Le Hui, Jin Xie

Abstract: Point clouds obtained from 3D sensors are usually sparse. Existing methods mainly focus on upsampling sparse point clouds in a supervised manner by using dense ground truth point clouds. In this paper, we propose a self-supervised point cloud upsampling network (SSPU-Net) to generate dense point clouds without using ground truth. To achieve this, we exploit the consistency between the input sparse… ▽ More Point clouds obtained from 3D sensors are usually sparse. Existing methods mainly focus on upsampling sparse point clouds in a supervised manner by using dense ground truth point clouds. In this paper, we propose a self-supervised point cloud upsampling network (SSPU-Net) to generate dense point clouds without using ground truth. To achieve this, we exploit the consistency between the input sparse point cloud and generated dense point cloud for the shapes and rendered images. Specifically, we first propose a neighbor expansion unit (NEU) to upsample the sparse point clouds, where the local geometric structures of the sparse point clouds are exploited to learn weights for point interpolation. Then, we develop a differentiable point cloud rendering unit (DRU) as an end-to-end module in our network to render the point cloud into multi-view images. Finally, we formulate a shape-consistent loss and an image-consistent loss to train the network so that the shapes of the sparse and dense point clouds are as consistent as possible. Extensive results on the CAD and scanned datasets demonstrate that our method can achieve impressive results in a self-supervised manner. Code is available at https://github.com/fpthink/SSPU-Net. △ Less

Submitted 3 August, 2021; v1 submitted 1 August, 2021; originally announced August 2021.

Comments: Accepted by ACM Multimedia 2021

arXiv:2106.08280 [pdf, other]

doi 10.1103/PhysRevD.104.103014

Dynamical friction from scalar dark matter in the relativistic regime

Authors: Dina Traykova, Katy Clough, Thomas Helfer, Emanuele Berti, Pedro G. Ferreira, Lam Hui

Abstract: Light bosonic scalars (e.g. axions) may form clouds around black holes via superradiant instabilities, or via accretion if they form some component of the dark matter. It has been suggested that their presence may lead to a distinctive dephasing of the gravitational wave signal when a small compact object spirals into a larger black hole. Motivated by this, we study numerically the dynamical frict… ▽ More Light bosonic scalars (e.g. axions) may form clouds around black holes via superradiant instabilities, or via accretion if they form some component of the dark matter. It has been suggested that their presence may lead to a distinctive dephasing of the gravitational wave signal when a small compact object spirals into a larger black hole. Motivated by this, we study numerically the dynamical friction force on a black hole moving at relativistic velocities in a background scalar field with an asymptotically homogeneous energy density. We show that the relativistic scaling is analogous to that found for supersonic collisional fluids, assuming an approximate expression for the pressure correction which depends on the velocity and scalar mass. While we focus on a complex scalar field, our results confirm the expectation that real scalars would exert a force which oscillates between positive and negative values in time with a frequency set by the scalar mass. The complex field describes the time averaged value of this force, but in a real scalar the rapid force oscillations could in principle leave an imprint on the trajectory. The approximation we obtain can be used to inform estimates of dephasing in the final stages of an extreme mass ratio inspiral. △ Less

Submitted 27 October, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

Comments: 17 pages, 13 figures, 1 table; Version as accepted in PRD - minor changes

arXiv:2105.01069 [pdf, other]

doi 10.1088/1475-7516/2022/01/032

Ladder Symmetries of Black Holes: Implications for Love Numbers and No-Hair Theorems

Authors: Lam Hui, Austin Joyce, Riccardo Penco, Luca Santoni, Adam R. Solomon

Abstract: It is well known that asymptotically flat black holes in general relativity have a vanishing static, conservative tidal response. We show that this is a result of linearly realized symmetries governing static (spin 0,1,2) perturbations around black holes. The symmetries have a geometric origin: in the scalar case, they arise from the (E)AdS isometries of a dimensionally reduced black hole spacetim… ▽ More It is well known that asymptotically flat black holes in general relativity have a vanishing static, conservative tidal response. We show that this is a result of linearly realized symmetries governing static (spin 0,1,2) perturbations around black holes. The symmetries have a geometric origin: in the scalar case, they arise from the (E)AdS isometries of a dimensionally reduced black hole spacetime. Underlying the symmetries is a ladder structure which can be used to construct the full tower of solutions, and derive their general properties: (1) solutions that decay with radius spontaneously break the symmetries, and must diverge at the horizon; (2) solutions regular at the horizon respect the symmetries, and take the form of a finite polynomial that grows with radius. Taken together, these two properties imply that static response coefficients -- and in particular Love numbers -- vanish. Moreover, property (1) is consistent with the absence of black holes with linear (perturbative) hair. We also discuss the manifestation of these symmetries in the effective point particle description of a black hole, showing explicitly that for scalar probes the worldline couplings associated with a non-trivial tidal response and scalar hair must vanish in order for the symmetries to be preserved. △ Less

Submitted 14 January, 2022; v1 submitted 3 May, 2021; originally announced May 2021.

Comments: Some sentences rephrased for clarity. Equations and conclusions unchanged

Journal ref: JCAP 01 (2022) 01, 032

arXiv:2104.07861 [pdf, other]

SSPC-Net: Semi-supervised Semantic 3D Point Cloud Segmentation Network

Authors: Mingmei Cheng, Le Hui, Jin Xie, Jian Yang

Abstract: Point cloud semantic segmentation is a crucial task in 3D scene understanding. Existing methods mainly focus on employing a large number of annotated labels for supervised semantic segmentation. Nonetheless, manually labeling such large point clouds for the supervised segmentation task is time-consuming. In order to reduce the number of annotated labels, we propose a semi-supervised semantic point… ▽ More Point cloud semantic segmentation is a crucial task in 3D scene understanding. Existing methods mainly focus on employing a large number of annotated labels for supervised semantic segmentation. Nonetheless, manually labeling such large point clouds for the supervised segmentation task is time-consuming. In order to reduce the number of annotated labels, we propose a semi-supervised semantic point cloud segmentation network, named SSPC-Net, where we train the semantic segmentation network by inferring the labels of unlabeled points from the few annotated 3D points. In our method, we first partition the whole point cloud into superpoints and build superpoint graphs to mine the long-range dependencies in point clouds. Based on the constructed superpoint graph, we then develop a dynamic label propagation method to generate the pseudo labels for the unsupervised superpoints. Particularly, we adopt a superpoint dropout strategy to dynamically select the generated pseudo labels. In order to fully exploit the generated pseudo labels of the unsupervised superpoints, we furthermore propose a coupled attention mechanism for superpoint feature embedding. Finally, we employ the cross-entropy loss to train the semantic segmentation network with the labels of the supervised superpoints and the pseudo labels of the unsupervised superpoints. Experiments on various datasets demonstrate that our semi-supervised segmentation method can achieve better performance than the current semi-supervised segmentation method with fewer annotated 3D points. Our code is available at https://github.com/MMCheng/SSPC-Net. △ Less

Submitted 24 May, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

Comments: Accepted by AAAI 2021; Project page: \<https://github.com/MMCheng/SSPC-Net>

arXiv:2101.11735 [pdf, other]

doi 10.1146/annurev-astro-120920-010024

Wave Dark Matter

Authors: Lam Hui

Abstract: We review the physics and phenomenology of wave dark matter: a bosonic dark matter candidate lighter than about 30 eV. Such particles have a de Broglie wavelength exceeding the average inter-particle separation in a galaxy like the Milky Way, and are well described as classical waves. We outline the particle physics motivations for them, including the QCD axion and ultra-light axion-like-particles… ▽ More We review the physics and phenomenology of wave dark matter: a bosonic dark matter candidate lighter than about 30 eV. Such particles have a de Broglie wavelength exceeding the average inter-particle separation in a galaxy like the Milky Way, and are well described as classical waves. We outline the particle physics motivations for them, including the QCD axion and ultra-light axion-like-particles such as fuzzy dark matter. The wave nature of the dark matter implies a rich phenomenology: (1) Wave interference leads to order unity density fluctuations on de Broglie scale. A manifestation is vortices where the density vanishes and around which the velocity circulates. There is one vortex ring per de Broglie volume on average. (2) For sufficiently low masses, soliton condensation occurs at centers of halos. The soliton oscillates and random walks, another manifestation of wave interference. The halo/subhalo abundance is suppressed at small masses, but the precise prediction from numerical wave simulations remains to be determined. (3) For ultra-light ~$10^{-22}$ eV dark matter, the wave interference substructures can be probed by tidal streams/gravitational lensing. The signal can be distinguished from that due to subhalos by the dependence on stream orbital radius/image separation. (4) Axion detection experiments are sensitive to interference substructures for moderately light masses. The stochastic nature of the waves affects the interpretation of experiments and motivates the measurement of correlation functions. Current constraints and open questions, covering detection experiments and cosmological/galactic/black-hole observations, are discussed. △ Less

Submitted 27 January, 2021; originally announced January 2021.

Comments: 44 pages, to appear in Annual Review of Astronomy and Astrophysics

arXiv:2101.02374 [pdf, other]

doi 10.1109/TIP.2021.3136714

Efficient 3D Point Cloud Feature Learning for Large-Scale Place Recognition

Authors: Le Hui, Mingmei Cheng, Jin Xie, Jian Yang

Abstract: Point cloud based retrieval for place recognition is still a challenging problem due to drastic appearance and illumination changes of scenes in changing environments. Existing deep learning based global descriptors for the retrieval task usually consume a large amount of computation resources (e.g., memory), which may not be suitable for the cases of limited hardware resources. In this paper, we… ▽ More Point cloud based retrieval for place recognition is still a challenging problem due to drastic appearance and illumination changes of scenes in changing environments. Existing deep learning based global descriptors for the retrieval task usually consume a large amount of computation resources (e.g., memory), which may not be suitable for the cases of limited hardware resources. In this paper, we develop an efficient point cloud learning network (EPC-Net) to form a global descriptor for visual place recognition, which can obtain good performance and reduce computation memory and inference time. First, we propose a lightweight but effective neural network module, called ProxyConv, to aggregate the local geometric features of point clouds. We leverage the spatial adjacent matrix and proxy points to simplify the original edge convolution for lower memory consumption. Then, we design a lightweight grouped VLAD network (G-VLAD) to form global descriptors for retrieval. Compared with the original VLAD network, we propose a grouped fully connected (GFC) layer to decompose the high-dimensional vectors into a group of low-dimensional vectors, which can reduce the number of parameters of the network and maintain the discrimination of the feature vector. Finally, to further reduce the inference time, we develop a simple version of EPC-Net, called EPC-Net-L, which consists of two ProxyConv modules and one max pooling layer to aggregate global descriptors. By distilling the knowledge from EPC-Net, EPC-Net-L can obtain discriminative global descriptors for retrieval. Extensive experiments on the Oxford dataset and three in-house datasets demonstrate that our proposed method can achieve state-of-the-art performance with lower parameters, FLOPs, and runtime per frame. △ Less

Submitted 7 January, 2021; originally announced January 2021.

Comments: Project page: https://github.com/fpthink/EPC-Net

arXiv:2011.13141 [pdf, other]

doi 10.1088/1475-7516/2021/03/076

Don't cross the streams: caustics from Fuzzy Dark Matter

Authors: Neal Dalal, Jo Bovy, Lam Hui, Xinyu Li

Abstract: We study how tidal streams from globular clusters may be used to constrain the mass of ultra-light dark matter particles, called `fuzzy' dark matter (FDM). A general feature of FDM models is the presence of ubiquitous density fluctuations in bound, virialized dark matter structures, on the scale of the de Broglie wavelength, arising from wave interference in the evolving dark matter distribution.… ▽ More We study how tidal streams from globular clusters may be used to constrain the mass of ultra-light dark matter particles, called `fuzzy' dark matter (FDM). A general feature of FDM models is the presence of ubiquitous density fluctuations in bound, virialized dark matter structures, on the scale of the de Broglie wavelength, arising from wave interference in the evolving dark matter distribution. These time-varying fluctuations can disturb the motions of stars, leading to potentially observable signatures in cold thin tidal streams in our own Galaxy. The study of this effect has been hindered by the difficulty in simulating the FDM wavefunction in Milky Way-sized systems. We present a simple method to evolve realistic wavefunctions in nearly static potentials, that should provide an accurate estimate of this granulation effect. We quantify the impact of FDM perturbations on tidal streams, and show that initially, while stream perturbations are small in amplitude, their power spectra exhibit a sharp cutoff corresponding to the de Broglie wavelength of the FDM potential fluctuations. Eventually, when stream perturbations become nonlinear, fold caustics generically arise that lead to density fluctuations with universal behavior. This erases the signature of the de Broglie wavelength in the stream density power spectrum, but we show that it will still be possible to determine the FDM mass in this regime, by considering the fluctuations in quantities like angular momenta or actions. △ Less

Submitted 26 November, 2020; originally announced November 2020.

Comments: Comments welcome. To be submitted to JCAP

arXiv:2011.11416 [pdf, other]

doi 10.1103/PhysRevD.103.023508

Oscillations and Random Walk of the Soliton Core in a Fuzzy Dark Matter Halo

Authors: Xinyu Li, Lam Hui, Tomer D. Yavetz

Abstract: A Fuzzy Dark Matter (FDM) halo consists of a soliton core close to the center and an NFW-like density profile in the outer region. Previous investigations found that the soliton core exhibits temporal oscillations and random walk excursions around the halo center. Analyzing a set of numerical simulations, we show that both phenomena can be understood as the results of wave interference -- a suitab… ▽ More A Fuzzy Dark Matter (FDM) halo consists of a soliton core close to the center and an NFW-like density profile in the outer region. Previous investigations found that the soliton core exhibits temporal oscillations and random walk excursions around the halo center. Analyzing a set of numerical simulations, we show that both phenomena can be understood as the results of wave interference -- a suitable superposition of the ground (solitonic) state and excited states in a fixed potential suffices to account for the main features of these phenomena. Such an eigenmode analysis can shed light on the evolution of a satellite halo undergoing tidal disruption. As the outer halo is stripped away, reducing the amplitudes of the excited states, the ground state evolves adiabatically. This suggests diminished soliton oscillations and random walk excursions, an effect to consider in deducing constraints from stellar heating. △ Less

Submitted 23 November, 2020; originally announced November 2020.

Comments: 9 pages, 12 figures

Journal ref: Phys. Rev. D 103, 023508 (2021)

arXiv:2011.07870 [pdf, other]

doi 10.1103/PhysRevD.103.044059

Growth of accretion driven scalar hair around Kerr black holes

Authors: Jamie Bamber, Katy Clough, Pedro G. Ferreira, Lam Hui, Macarena Lagos

Abstract: Scalar fields around compact objects are of interest for scalar-tensor theories of gravity and dark matter models consisting of a massive scalar, e.g. axions. We study the behaviour of a scalar field around a Kerr black hole with non trivial asymptotic boundary conditions - both non zero density and non zero angular momentum. Starting from an initial radially homogeneous configuration, a scalar cl… ▽ More Scalar fields around compact objects are of interest for scalar-tensor theories of gravity and dark matter models consisting of a massive scalar, e.g. axions. We study the behaviour of a scalar field around a Kerr black hole with non trivial asymptotic boundary conditions - both non zero density and non zero angular momentum. Starting from an initial radially homogeneous configuration, a scalar cloud is accreted, which asymptotes to known stationary configurations over time. We study the cloud growth for different parameters including black hole spin, scalar field mass, and the scalar field density and angular momentum far from the black hole. We characterise the transient growth of the mass and angular momentum in the cloud, and the spatial profile of the scalar around the black hole, and relate the results of fully non-linear simulations to an analytic perturbative expansion. We also highlight the potential for these accreted clouds to create monochromatic gravitational wave signals - similar to the signals from superradiant clouds, although significantly weaker in amplitude. △ Less

Submitted 11 March, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

Comments: 21 pages, 24 figures

Journal ref: Phys. Rev. D 103, 044059 (2021)

arXiv:2010.00593 [pdf, other]

doi 10.1088/1475-7516/2021/04/052

Static response and Love numbers of Schwarzschild black holes

Authors: Lam Hui, Austin Joyce, Riccardo Penco, Luca Santoni, Adam R. Solomon

Abstract: We derive the quadratic action for the physical degrees of freedom of massless spin-0, spin-1, and spin-2 perturbations on a Schwarzschild--(A)dS background in arbitrary dimensions. We then use these results to compute the static response of asymptotically flat Schwarzschild black holes to external fields. Our analysis reproduces known facts about black hole Love numbers, in particular that they v… ▽ More We derive the quadratic action for the physical degrees of freedom of massless spin-0, spin-1, and spin-2 perturbations on a Schwarzschild--(A)dS background in arbitrary dimensions. We then use these results to compute the static response of asymptotically flat Schwarzschild black holes to external fields. Our analysis reproduces known facts about black hole Love numbers, in particular that they vanish for all types of perturbation in four spacetime dimensions, but also leads to new results. For instance, we find that neutral Schwarzschild black holes polarize in the presence of an electromagnetic background in any number of spacetime dimensions except four. Moreover, we calculate for the first time black hole Love numbers for vector-type gravitational perturbations in higher dimensions and find that they generically do not vanish. Along the way, we shed some light on an apparent discrepancy between previous results in the literature, and clarify some aspects of the matching between perturbative calculations of static response on a Schwarzschild background and the point-particle effective theory △ Less

Submitted 11 February, 2024; v1 submitted 1 October, 2020; originally announced October 2020.

Comments: 78 pages, 1 figure v2: minor corrections, v3: minor corrections, v4: fixed minor error in spin-2 matching

Journal ref: JCAP 04 (2021) 052

arXiv:2009.06253 [pdf, ps, other]

doi 10.3847/2041-8213/abb76b

Fast Magnetic Reconnection with Turbulence in High Lundquist Number Limit

Authors: Yang Liping, Li Hui, Guo Fan, Li Xiaocan, Li Shengtai, He jiansen, Zhang Lei, Feng Xueshang

Abstract: We use extensive 3D resistive MHD simulations to study how large-scale current sheets will undergo fast reconnection in the high Lundquist number $S$ limit (above $\sim 10^4$), when the system is subject to different externally driven turbulence levels and the self-generated turbulence produced by 3D reconnection dynamics. We find that the normalized global reconnection rate $\sim 0.01 - 0.13$, we… ▽ More We use extensive 3D resistive MHD simulations to study how large-scale current sheets will undergo fast reconnection in the high Lundquist number $S$ limit (above $\sim 10^4$), when the system is subject to different externally driven turbulence levels and the self-generated turbulence produced by 3D reconnection dynamics. We find that the normalized global reconnection rate $\sim 0.01 - 0.13$, weakly dependent on $S$. Global reconnection with the classic inflow/outflow configurations is observed, and 3D flux ropes are hierarchically formed and ejected from reconnection regions. A statistical separation of the reconnected magnetic field lines follows a super-diffusive behavior, from which the rate is measured to be very similar to that obtained from the mixing of tracer populations. We find that the reconnection rate scales roughly linearly with the turbulence level during the peak of reconnection. This scaling is consistent with the turbulence properties produced by both the externally driven and self-generation processes. These results imply that large-scale thin current sheets tend to undergo rigorous reconnection. △ Less

Submitted 14 September, 2020; originally announced September 2020.

Comments: Accepted By ApJL

arXiv:2007.15488 [pdf, other]

Cascaded Non-local Neural Network for Point Cloud Semantic Segmentation

Authors: Mingmei Cheng, Le Hui, Jin Xie, Jian Yang, Hui Kong

Abstract: In this paper, we propose a cascaded non-local neural network for point cloud segmentation. The proposed network aims to build the long-range dependencies of point clouds for the accurate segmentation. Specifically, we develop a novel cascaded non-local module, which consists of the neighborhood-level, superpoint-level and global-level non-local blocks. First, in the neighborhood-level block, we e… ▽ More In this paper, we propose a cascaded non-local neural network for point cloud segmentation. The proposed network aims to build the long-range dependencies of point clouds for the accurate segmentation. Specifically, we develop a novel cascaded non-local module, which consists of the neighborhood-level, superpoint-level and global-level non-local blocks. First, in the neighborhood-level block, we extract the local features of the centroid points of point clouds by assigning different weights to the neighboring points. The extracted local features of the centroid points are then used to encode the superpoint-level block with the non-local operation. Finally, the global-level block aggregates the non-local features of the superpoints for semantic segmentation in an encoder-decoder framework. Benefiting from the cascaded structure, geometric structure information of different neighborhoods with the same label can be propagated. In addition, the cascaded structure can largely reduce the computational cost of the original non-local operation on point clouds. Experiments on different indoor and outdoor datasets show that our method achieves state-of-the-art performance and effectively reduces the time consumption and memory occupation. △ Less

Submitted 30 July, 2020; originally announced July 2020.

Comments: Accepted by IEEE/RSJ International Conference on Intelligent Robots and Systems 2020 (IROS)

arXiv:2007.12887 [pdf, other]

Approximated Bilinear Modules for Temporal Modeling

Authors: Xinqi Zhu, Chang Xu, Langwen Hui, Cewu Lu, Dacheng Tao

Abstract: We consider two less-emphasized temporal properties of video: 1. Temporal cues are fine-grained; 2. Temporal modeling needs reasoning. To tackle both problems at once, we exploit approximated bilinear modules (ABMs) for temporal modeling. There are two main points making the modules effective: two-layer MLPs can be seen as a constraint approximation of bilinear operations, thus can be used to cons… ▽ More We consider two less-emphasized temporal properties of video: 1. Temporal cues are fine-grained; 2. Temporal modeling needs reasoning. To tackle both problems at once, we exploit approximated bilinear modules (ABMs) for temporal modeling. There are two main points making the modules effective: two-layer MLPs can be seen as a constraint approximation of bilinear operations, thus can be used to construct deep ABMs in existing CNNs while reusing pretrained parameters; frame features can be divided into static and dynamic parts because of visual repetition in adjacent frames, which enables temporal modeling to be more efficient. Multiple ABM variants and implementations are investigated, from high performance to high efficiency. Specifically, we show how two-layer subnets in CNNs can be converted to temporal bilinear modules by adding an auxiliary-branch. Besides, we introduce snippet sampling and shifting inference to boost sparse-frame video classification performance. Extensive ablation studies are conducted to show the effectiveness of proposed techniques. Our models can outperform most state-of-the-art methods on Something-Something v1 and v2 datasets without Kinetics pretraining, and are also competitive on other YouTube-like action recognition datasets. Our code is available on https://github.com/zhuxinqimac/abm-pytorch. △ Less

Submitted 25 July, 2020; originally announced July 2020.

Comments: 8 pages, ICCV19

arXiv:2007.05361 [pdf, other]

Progressive Point Cloud Deconvolution Generation Network

Authors: Le Hui, Rui Xu, Jin Xie, Jianjun Qian, Jian Yang

Abstract: In this paper, we propose an effective point cloud generation method, which can generate multi-resolution point clouds of the same shape from a latent vector. Specifically, we develop a novel progressive deconvolution network with the learning-based bilateral interpolation. The learning-based bilateral interpolation is performed in the spatial and feature spaces of point clouds so that local geome… ▽ More In this paper, we propose an effective point cloud generation method, which can generate multi-resolution point clouds of the same shape from a latent vector. Specifically, we develop a novel progressive deconvolution network with the learning-based bilateral interpolation. The learning-based bilateral interpolation is performed in the spatial and feature spaces of point clouds so that local geometric structure information of point clouds can be exploited. Starting from the low-resolution point clouds, with the bilateral interpolation and max-pooling operations, the deconvolution network can progressively output high-resolution local and global feature maps. By concatenating different resolutions of local and global feature maps, we employ the multi-layer perceptron as the generation network to generate multi-resolution point clouds. In order to keep the shapes of different resolutions of point clouds consistent, we propose a shape-preserving adversarial loss to train the point cloud deconvolution generation network. Experimental results demonstrate the effectiveness of our proposed method. △ Less

Submitted 10 July, 2020; originally announced July 2020.

Comments: Accepted to ECCV 2020; Project page: https://github.com/fpthink/PDGN

arXiv:2006.07322 [pdf, other]

Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks

Authors: Like Hui, Mikhail Belkin

Abstract: Modern neural architectures for classification tasks are trained using the cross-entropy loss, which is widely believed to be empirically superior to the square loss. In this work we provide evidence indicating that this belief may not be well-founded. We explore several major neural architectures and a range of standard benchmark datasets for NLP, automatic speech recognition (ASR) and computer v… ▽ More Modern neural architectures for classification tasks are trained using the cross-entropy loss, which is widely believed to be empirically superior to the square loss. In this work we provide evidence indicating that this belief may not be well-founded. We explore several major neural architectures and a range of standard benchmark datasets for NLP, automatic speech recognition (ASR) and computer vision tasks to show that these architectures, with the same hyper-parameter settings as reported in the literature, perform comparably or better when trained with the square loss, even after equalizing computational resources. Indeed, we observe that the square loss produces better results in the dominant majority of NLP and ASR experiments. Cross-entropy appears to have a slight edge on computer vision tasks. We argue that there is little compelling empirical or theoretical evidence indicating a clear-cut advantage to the cross-entropy loss. Indeed, in our experiments, performance on nearly all non-vision tasks can be improved, sometimes significantly, by switching to the square loss. Furthermore, training with square loss appears to be less sensitive to the randomness in initialization. We posit that training using the square loss for classification needs to be a part of best practices of modern deep learning on equal footing with cross-entropy. △ Less

Submitted 22 October, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

Comments: An extended version of the paper published at ICLR2021. Added material includes evaluations of Transformer architectures

arXiv:2004.06718 [pdf, other]

Line Art Correlation Matching Feature Transfer Network for Automatic Animation Colorization

Authors: Zhang Qian, Wang Bo, Wen Wei, Li Hai, Liu Jun Hui

Abstract: Automatic animation line art colorization is a challenging computer vision problem, since the information of the line art is highly sparse and abstracted and there exists a strict requirement for the color and style consistency between frames. Recently, a lot of Generative Adversarial Network (GAN) based image-to-image translation methods for single line art colorization have emerged. They can gen… ▽ More Automatic animation line art colorization is a challenging computer vision problem, since the information of the line art is highly sparse and abstracted and there exists a strict requirement for the color and style consistency between frames. Recently, a lot of Generative Adversarial Network (GAN) based image-to-image translation methods for single line art colorization have emerged. They can generate perceptually appealing results conditioned on line art images. However, these methods can not be adopted for the purpose of animation colorization because there is a lack of consideration of the in-between frame consistency. Existing methods simply input the previous colored frame as a reference to color the next line art, which will mislead the colorization due to the spatial misalignment of the previous colored frame and the next line art especially at positions where apparent changes happen. To address these challenges, we design a kind of correlation matching feature transfer model (called CMFT) to align the colored reference feature in a learnable way and integrate the model into an U-Net based generator in a coarse-to-fine manner. This enables the generator to transfer the layer-wise synchronized features from the deep semantic code to the content progressively. Extension evaluation shows that CMFT model can effectively improve the in-between consistency and the quality of colored frames especially when the motion is intense and diverse. △ Less

Submitted 10 November, 2020; v1 submitted 14 April, 2020; originally announced April 2020.

Comments: 8pages,6 figures

Showing 1–50 of 177 results for author: Hui, L