-
Massive-ish Particles from Small-ish Scales: Non-Perturbative Techniques for Cosmological Collider Physics from Large-Scale Structure Surveys
Authors:
Samuel Goldstein,
Oliver H. E. Philcox,
J. Colin Hill,
Lam Hui
Abstract:
Massive particles produced during inflation impact soft limits of primordial correlators. Searching for these signatures presents an exciting opportunity to uncover the particle spectrum in the inflationary epoch. We present non-perturbative methods to constrain intermediate-mass scalars ($0\leq m/H<3/2$, where $H$ is the inflationary Hubble scale) produced during inflation, which give rise to a power-law scaling in the squeezed primordial bispectrum. Exploiting the large-scale structure consistency relations and the separate universe approach, we derive models for the late-time squeezed matter bispectrum and collapsed matter trispectrum sourced by these fields. To validate our models, we run $N$-body simulations with the "Cosmological Collider" squeezed bispectrum for two different particle masses. Our models yield unbiased constraints on the amplitude of non-Gaussianity, $f_{\rm NL}^Δ$, from the squeezed bispectrum and collapsed trispectrum deep into the non-linear regime ($k_{\rm max}\approx 2~h/{\rm Mpc}$ at $z=0$). We assess the information content of these summary statistics, emphasizing the importance of sample variance cancellation in the matter sector. We also study the scale-dependent halo bias in our simulations. For mass-selected halos, the non-Gaussian bias estimated from our simulations agrees with predictions based on (i) separate universe simulations and (ii) universal mass functions. With further work, these results can be used to search for inflationary massive particle production with upcoming galaxy surveys.
Submitted 11 July, 2024;
originally announced July 2024.
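As a reading aid for the abstract above, the power-law scaling it mentions can be written out. This is the standard relation for a scalar of mass $m$ in de Sitter space, not an expression taken from the paper; the proportionality constant and the exact normalization of $f_{\rm NL}^Δ$ are left unspecified because the abstract does not state them, and the symbols $k_L$, $k_S$, and $P_\zeta$ are notation assumed here.

```latex
\Delta = \frac{3}{2} - \sqrt{\frac{9}{4} - \frac{m^2}{H^2}},
\qquad 0 \le \Delta < \frac{3}{2}
\quad \text{for} \quad 0 \le \frac{m}{H} < \frac{3}{2},
\qquad
B_\zeta(k_L, k_S) \;\propto\; f_{\rm NL}^{\Delta}
\left(\frac{k_L}{k_S}\right)^{\Delta} P_\zeta(k_L)\, P_\zeta(k_S)
\quad (k_L \ll k_S).
```

In this notation a massless field ($m = 0$) gives $\Delta = 0$, which recovers the familiar local-type squeezed limit.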
-
Fragmentation in Gravitationally-Unstable Collapsar Disks and Sub-Solar Neutron Star Mergers
Authors:
Brian D. Metzger,
Lam Hui,
Matteo Cantiello
Abstract:
Although stable neutron stars (NS) can in principle exist down to masses Mns ~ 0.1Msun, standard models of stellar core-collapse predict a robust lower limit Mns >~ 1.2Msun, roughly commensurate with the Chandrasekhar mass Mch of the progenitor's iron core (electron fraction Ye ~ 0.5). However, this limit may be circumvented in sufficiently dense neutron-rich environments (Ye << 0.5) for which Mch ~ Ye^2 is reduced to < Msun. Such physical conditions could arise in the black hole accretion disks formed from the collapse of rapidly-rotating stars (``collapsars''), as a result of gravitational instabilities and cooling-induced fragmentation, similar to models for planet formation in protostellar disks. We confirm that the conditions to form sub-solar mass NS (ssNS) may be marginally satisfied in the outer regions of massive neutrino-cooled collapsar disks. If the disk fragments into multiple ssNS, their subsequent coalescence offers a channel for precipitating sub-solar mass LIGO/Virgo gravitational-wave mergers that does not implicate primordial black holes. The model makes several additional predictions: (1) ~Hz frequency Doppler modulation of the ssNS-merger gravitational wave signals due to the binary's orbital motion in the disk; (2) at least one additional gravitational wave event (coincident within <~ hours), from the coalescence of the ssNS-merger remnant(s) with the central black hole; (3) an associated gamma-ray burst and supernova counterpart, the latter boosted in energy and enriched with r-process elements from the NS merger(s) embedded within the exploding stellar envelope (``kilonovae inside a supernova'').
Submitted 10 July, 2024;
originally announced July 2024.
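The scaling Mch ~ Ye^2 quoted in the abstract above can be made concrete with the textbook zero-temperature Chandrasekhar mass. The numerical prefactor below is the standard idealized value, ignoring thermal and general-relativistic corrections, and is an assumption not taken from the paper.

```latex
M_{\rm Ch} \;\approx\; 1.46\, M_\odot \left(\frac{Y_e}{0.5}\right)^{2}
\;\approx\; 5.8\, Y_e^{2}\, M_\odot ,
\qquad
M_{\rm Ch} < M_\odot \;\Longleftrightarrow\; Y_e \lesssim 0.41 .
```

Under this idealization, $Y_e = 0.5$ gives roughly $1.46\,M_\odot$, while $Y_e = 0.1$ gives roughly $0.06\,M_\odot$, which illustrates why neutron-rich conditions are needed for sub-solar masses.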
-
Enhancing Organizational Performance: Harnessing AI and NLP for User Feedback Analysis in Product Development
Authors:
Tian Tian,
Liu Ze hui,
Huang Zichen,
Yubing Tang
Abstract:
This paper explores the application of AI and NLP techniques for user feedback analysis in the context of heavy machine crane products. By leveraging AI and NLP, organizations can gain insights into customer perceptions, improve product development, enhance satisfaction and loyalty, inform decision-making, and gain a competitive advantage. The paper highlights the impact of user feedback analysis on organizational performance and emphasizes the reasons for using AI and NLP, including scalability, objectivity, improved accuracy, increased insights, and time savings. The methodology involves data collection, cleaning, text and rating analysis, interpretation, and feedback implementation. Results include sentiment analysis, word cloud visualizations, and radar charts comparing product attributes. These findings provide valuable information for understanding customer sentiment, identifying improvement areas, and making data-driven decisions to enhance the customer experience. In conclusion, AI and NLP techniques show promise in user feedback analysis, offering organizations a powerful tool to understand customers, improve product development, increase satisfaction, and drive business success.
Submitted 7 May, 2024;
originally announced May 2024.
-
3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis
Authors:
Zhicheng Lu,
Xiang Guo,
Le Hui,
Tianrui Chen,
Min Yang,
Xiao Tang,
Feng Zhu,
Yuchao Dai
Abstract:
In this paper, we propose a 3D geometry-aware deformable Gaussian Splatting method for dynamic view synthesis. Existing neural radiance fields (NeRF) based solutions learn the deformation in an implicit manner, which cannot incorporate 3D scene geometry. Therefore, the learned deformation is not necessarily geometrically coherent, which results in unsatisfactory dynamic view synthesis and 3D dynamic reconstruction. Recently, 3D Gaussian Splatting provides a new representation of the 3D scene, building upon which the 3D geometry could be exploited in learning the complex 3D deformation. Specifically, the scenes are represented as a collection of 3D Gaussian, where each 3D Gaussian is optimized to move and rotate over time to model the deformation. To enforce the 3D scene geometry constraint during deformation, we explicitly extract 3D geometry features and integrate them in learning the 3D deformation. In this way, our solution achieves 3D geometry-aware deformation modeling, which enables improved dynamic view synthesis and 3D dynamic reconstruction. Extensive experimental results on both synthetic and real datasets prove the superiority of our solution, which achieves new state-of-the-art performance.
The project is available at https://npucvr.github.io/GaGS/
Submitted 14 April, 2024; v1 submitted 9 April, 2024;
originally announced April 2024.
-
SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection
Authors:
Yun Zhu,
Le Hui,
Yaqi Shen,
Jin Xie
Abstract:
Current 3D object detection methods for indoor scenes mainly follow the voting-and-grouping strategy to generate proposals. However, most methods utilize instance-agnostic groupings, such as ball query, leading to inconsistent semantic information and inaccurate regression of the proposals. To this end, we propose a novel superpoint grouping network for indoor anchor-free one-stage 3D object detection. Specifically, we first adopt an unsupervised manner to partition raw point clouds into superpoints, areas with semantic consistency and spatial similarity. Then, we design a geometry-aware voting module that adapts to the centerness in anchor-free detection by constraining the spatial relationship between superpoints and object centers. Next, we present a superpoint-based grouping module to explore the consistent representation within proposals. This module includes a superpoint attention layer to learn feature interaction between neighboring superpoints, and a superpoint-voxel fusion layer to propagate the superpoint-level information to the voxel level. Finally, we employ effective multiple matching to capitalize on the dynamic receptive fields of proposals based on superpoints during the training. Experimental results demonstrate our method achieves state-of-the-art performance on ScanNet V2, SUN RGB-D, and S3DIS datasets in the indoor one-stage 3D object detection. Source code is available at https://github.com/zyrant/SPGroup3D.
Submitted 21 December, 2023;
originally announced December 2023.
-
$S$-matrix positivity without Lorentz invariance: a case study
Authors:
Lam Hui,
Ioanna Kourkoulou,
Alberto Nicolis,
Alessandro Podo,
Shengjia Zhou
Abstract:
We investigate the analytic structure of scattering amplitudes in theories in which Lorentz invariance is spontaneously broken. We do so by computing and studying the S-matrix for a simple example: a superfluid described by a complex scalar with quartic interactions. The computation is confined to tree-level, for there are no absolutely stable single-particle states, though the lifetime can be made long by lowering the chemical potential. For the $2 \to 2$ amplitude in center-of-mass configurations, not only is crossing symmetry violated, there appears a {\it tree level} branch cut for unphysical kinematics. Its appearance is a consequence of non-analyticity in the dispersion relation. The branch point defines a new scale in the problem, which scales inversely with the chemical potential. In this example, even derivatives of the forward amplitude are positive while odd derivatives are negative. This pattern can be understood in a general way in the limit of a small chemical potential, or weak Lorentz breaking.
Submitted 13 December, 2023;
originally announced December 2023.
-
Consistently constraining $f_{\rm NL}$ with the squeezed lensing bispectrum using consistency relations
Authors:
Samuel Goldstein,
Oliver H. E. Philcox,
J. Colin Hill,
Angelo Esposito,
Lam Hui
Abstract:
We introduce a non-perturbative method to constrain the amplitude of local-type primordial non-Gaussianity ($f_{\rm NL}$) using squeezed configurations of the CMB lensing convergence and cosmic shear bispectra. First, we use cosmological consistency relations to derive a model for the squeezed limit of angular auto- and cross-bispectra of lensing convergence fields in the presence of $f_{\rm NL}$. Using this model, we perform a Fisher forecast with specifications expected for upcoming CMB lensing measurements from the Simons Observatory and CMB-S4, as well as cosmic shear measurements from a Rubin LSST/Euclid-like experiment. Assuming a minimum multipole $\ell_{\rm min}=10$ and maximum multipole $\ell_{\rm max}=1400$, we forecast $σ_{f_{\rm NL}}=175$ ($95$) for Simons Observatory (CMB-S4). Our forecasts improve considerably for an LSST/Euclid-like cosmic shear experiment with three tomographic bins and $\ell_{\rm min}=10$ and $\ell_{\rm max}=1400$ ($5000$) with $σ_{f_{\rm NL}}=31$ ($16$). A joint analysis of CMB-S4 lensing and LSST/Euclid-like shear yields little gain over the shear-only forecasts; however, we show that a joint analysis could be useful if the CMB lensing convergence can be reliably reconstructed at larger angular scales than the shear field. The method presented in this work is a novel and robust technique to constrain local primordial non-Gaussianity from upcoming large-scale structure surveys that is completely independent of the galaxy field (and therefore any nuisance parameters such as $b_φ$), thus complementing existing techniques to constrain $f_{\rm NL}$ using the scale-dependent halo bias.
Submitted 19 October, 2023;
originally announced October 2023.
-
RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion
Authors:
Zhiqiang Yan,
Xiang Li,
Le Hui,
Zhenyu Zhang,
Jun Li,
Jian Yang
Abstract:
Depth completion aims to recover dense depth maps from sparse ones, where color images are often used to facilitate this task. Recent depth methods primarily focus on image guided learning frameworks. However, blurry guidance in the image and unclear structure in the depth still impede their performance. To tackle these challenges, we explore a repetitive design in our image guided network to gradually and sufficiently recover depth values. Specifically, the repetition is embodied in both the image guidance branch and depth generation branch. In the former branch, we design a dense repetitive hourglass network (DRHN) to extract discriminative image features of complex environments, which can provide powerful contextual instruction for depth prediction. In the latter branch, we present a repetitive guidance (RG) module based on dynamic convolution, in which an efficient convolution factorization is proposed to reduce the complexity while modeling high-frequency structures progressively. Furthermore, in the semantic guidance branch, we utilize the well-known large vision model, i.e., segment anything (SAM), to supply RG with semantic prior. In addition, we propose a region-aware spatial propagation network (RASPN) for further depth refinement based on the semantic prior constraint. Finally, we collect a new dataset termed TOFDC for the depth completion task, which is acquired by the time-of-flight (TOF) sensor and the color camera on smartphones. Extensive experiments demonstrate that our method achieves state-of-the-art performance on KITTI, NYUv2, Matterport3D, 3D60, VKITTI, and our TOFDC.
Submitted 28 February, 2024; v1 submitted 1 September, 2023;
originally announced September 2023.
-
Statistical analysis of the onset temperature of solar flares in 2010-2011
Authors:
Douglas Félix da Silva,
Li Hui,
Paulo J. A. Simões,
Adriana Valio,
Joaquim C. E. R.,
Hugh S. Hudson,
Lyndsay Fletcher,
Laura A. Hayes,
Iain G. Hannah
Abstract:
Understanding the physical processes that trigger solar flares is paramount to help with forecasting space weather and mitigating the effects on our technological infrastructure. A previously unknown phenomenon was recently identified in solar flares: the plasma temperature, derived from soft X-ray (SXR) data, at the onset of four flares, was revealed to be in the range 10-15 MK, without evidence of gradual heating. To investigate how common the hot-onset phenomenon may be, we extend this investigation to solar flares of classes B1.2 to X6.9 recorded by the X-ray Sensor (XRS) onboard the GOES-14 and GOES-15 satellites between 2010 and 2011. For this statistical study, we employed the same methodology as in recent work, where the pre-flare SXR flux of each flare is obtained manually, and the temperature and emission measure values are obtained from the flux ratio of the two GOES/XRS channels using the standard software. From 3224 events listed in the GOES flare catalog for 2010-2011, we have selected and analyzed 745 events for which the flare heliographic location was provided in the list, to investigate center-to-limb effects of the hot-onset phenomenon. Our results show that 559 out of 745 flares (75%) exhibit an onset temperature above 8.6 MK (the first quartile), with the respective log10 of the emission measure values between 46.0 and 47.25 cm$^{-3}$, indicating that small amounts of plasma are quickly heated to high temperatures. These results suggest that the hot-onset phenomenon is very common in solar flares.
Submitted 21 August, 2023;
originally announced August 2023.
-
Relativistic drag forces on black holes from scalar dark matter clouds of all sizes
Authors:
Dina Traykova,
Rodrigo Vicente,
Katy Clough,
Thomas Helfer,
Emanuele Berti,
Pedro G. Ferreira,
Lam Hui
Abstract:
We use numerical simulations of scalar field dark matter evolving on a moving black hole background to confirm the regime of validity of (semi-)analytic expressions derived from first principles for both dynamical friction and momentum accretion in the relativistic regime. We cover both small and large clouds (relative to the de Broglie wavelength of the scalars), and light and heavy particle masses (relative to the BH size). In the case of a small dark matter cloud, the effect of accretion is a non-negligible contribution to the total force on the black hole, even for small scalar masses. We confirm that this momentum accretion transitions between two regimes (wave- and particle-like) and we identify the mass of the scalar at which the transition between regimes occurs.
Submitted 19 February, 2024; v1 submitted 17 May, 2023;
originally announced May 2023.
-
ReLU soothes the NTK condition number and accelerates optimization for wide neural networks
Authors:
Chaoyue Liu,
Like Hui
Abstract:
Rectified linear unit (ReLU), as a non-linear activation function, is well known to improve the expressivity of neural networks such that any continuous function can be approximated to arbitrary precision by a sufficiently wide neural network. In this work, we present another interesting and important feature of the ReLU activation function. We show that ReLU leads to {\it better separation} for similar data and {\it better conditioning} of the neural tangent kernel (NTK), two properties that are closely related. Compared with linear neural networks, we show that a ReLU-activated wide neural network at random initialization has a larger angle separation for similar data in the feature space of the model gradient, and has a smaller condition number for the NTK. Note that, for a linear neural network, the data separation and NTK condition number always remain the same as in the case of a linear model. Furthermore, we show that a deeper ReLU network (i.e., with more ReLU activation operations) has a smaller NTK condition number than a shallower one. Our results imply that ReLU activation, as well as the depth of the ReLU network, helps improve the gradient descent convergence rate, which is closely related to the NTK condition number.
Submitted 15 May, 2023;
originally announced May 2023.
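The conditioning claim in the abstract above can be illustrated with a small numerical experiment. The following is a minimal sketch, not the authors' code: it assumes a two-layer network $f(x) = a^\top \phi(Wx)/\sqrt{m}$ with standard normal initialization, two unit-norm inputs separated by a small angle, and hypothetical names (`empirical_ntk`, `condition_numbers`) and parameter values (`theta`, `d`, `m`, `seed`) chosen purely for illustration.

```python
import numpy as np


def empirical_ntk(X, W, a, relu):
    """Empirical NTK Gram matrix of f(x) = a . act(W x) / sqrt(m).

    Both layers (W and a) are treated as trainable parameters.
    """
    m = W.shape[0]
    pre = X @ W.T  # pre-activations, shape (n, m)
    if relu:
        act = np.maximum(pre, 0.0)
        dact = (pre > 0).astype(float)
    else:  # linear network: identity activation
        act = pre
        dact = np.ones_like(pre)
    # Contribution of gradients w.r.t. the output weights a.
    k_a = act @ act.T / m
    # Contribution of gradients w.r.t. the hidden weights W:
    # d f / d W_j = a_j * act'(w_j . x) * x / sqrt(m).
    g = dact * a
    k_w = (g @ g.T) * (X @ X.T) / m
    return k_a + k_w


def condition_numbers(theta=0.1, d=10, m=20000, seed=0):
    """Return (linear, ReLU) NTK condition numbers for two similar inputs."""
    rng = np.random.default_rng(seed)
    x1 = np.zeros(d)
    x1[0] = 1.0
    x2 = np.zeros(d)
    x2[0], x2[1] = np.cos(theta), np.sin(theta)  # angle theta from x1
    X = np.stack([x1, x2])
    W = rng.standard_normal((m, d))
    a = rng.standard_normal(m)
    k_lin = empirical_ntk(X, W, a, relu=False)
    k_relu = empirical_ntk(X, W, a, relu=True)
    return np.linalg.cond(k_lin), np.linalg.cond(k_relu)


cond_linear, cond_relu = condition_numbers()
print(f"linear NTK condition number: {cond_linear:.1f}")
print(f"ReLU NTK condition number:   {cond_relu:.1f}")
```

In this toy setting the ReLU kernel should come out better conditioned than the linear one, because the ReLU gradient features of two nearby inputs differ on every neuron whose activation pattern flips between them.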
-
Self-Supervised 3D Scene Flow Estimation Guided by Superpoints
Authors:
Yaqi Shen,
Le Hui,
Jin Xie,
Jian Yang
Abstract:
3D scene flow estimation aims to estimate point-wise motions between two consecutive frames of point clouds. Superpoints, i.e., points with similar geometric features, are usually employed to capture similar motions of local regions in 3D scenes for scene flow estimation. However, in existing methods, superpoints are generated with the offline clustering methods, which cannot characterize local regions with similar motions for complex 3D scenes well, leading to inaccurate scene flow estimation. To this end, we propose an iterative end-to-end superpoint based scene flow estimation framework, where the superpoints can be dynamically updated to guide the point-level flow prediction. Specifically, our framework consists of a flow guided superpoint generation module and a superpoint guided flow refinement module. In our superpoint generation module, we utilize the bidirectional flow information at the previous iteration to obtain the matching points of points and superpoint centers for soft point-to-superpoint association construction, in which the superpoints are generated for pairwise point clouds. With the generated superpoints, we first reconstruct the flow for each point by adaptively aggregating the superpoint-level flow, and then encode the consistency between the reconstructed flow of pairwise point clouds. Finally, we feed the consistency encoding along with the reconstructed flow into GRU to refine point-level flow. Extensive experiments on several different datasets show that our method can achieve promising performance.
Submitted 3 May, 2023;
originally announced May 2023.
-
Cut your Losses with Squentropy
Authors:
Like Hui,
Mikhail Belkin,
Stephen Wright
Abstract:
Nearly all practical neural models for classification are trained using cross-entropy loss. Yet this ubiquitous choice is supported by little theoretical or empirical evidence. Recent work (Hui & Belkin, 2020) suggests that training using the (rescaled) square loss is often superior in terms of the classification accuracy. In this paper we propose the "squentropy" loss, which is the sum of two terms: the cross-entropy loss and the average square loss over the incorrect classes. We provide an extensive set of experiments on multi-class classification problems showing that the squentropy loss outperforms both the pure cross entropy and rescaled square losses in terms of the classification accuracy. We also demonstrate that it provides significantly better model calibration than either of these alternative losses and, furthermore, has less variance with respect to the random initialization. Additionally, in contrast to the square loss, squentropy loss can typically be trained using exactly the same optimization parameters, including the learning rate, as the standard cross-entropy loss, making it a true "plug-and-play" replacement. Finally, unlike the rescaled square loss, multiclass squentropy contains no parameters that need to be adjusted.
Submitted 8 February, 2023;
originally announced February 2023.
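The squentropy definition given in the abstract above (cross-entropy plus the average square loss over the incorrect classes) can be sketched in a few lines. This is a minimal NumPy illustration rather than the authors' implementation; it assumes the square term is applied to the raw logits of the incorrect classes with a target of zero, and the function name `squentropy` is chosen here for illustration.

```python
import numpy as np


def squentropy(logits, labels):
    """Mean squentropy loss over a batch.

    logits: array of shape (n, C) with raw class scores.
    labels: integer array of shape (n,) with the true class indices.
    Assumption: the square term penalizes the logits of the C - 1
    incorrect classes toward zero and is averaged over those classes.
    """
    logits = np.asarray(logits, dtype=float)
    labels = np.asarray(labels)
    n, c = logits.shape
    rows = np.arange(n)

    # Numerically stable log-softmax for the cross-entropy term.
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    cross_entropy = -log_probs[rows, labels]

    # Average squared logit over the incorrect classes only.
    incorrect = np.ones((n, c), dtype=bool)
    incorrect[rows, labels] = False
    square = (logits**2 * incorrect).sum(axis=1) / (c - 1)

    return float((cross_entropy + square).mean())


example_loss = squentropy([[2.0, 1.0, -1.0]], [0])
print(example_loss)
```

Because the extra term has no tunable weight in this form, the function takes the same arguments as an ordinary cross-entropy loss, which matches the "plug-and-play" framing in the abstract.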
-
Ladder Symmetries of Black Holes and de Sitter Space: Love Numbers and Quasinormal Modes
Authors:
Roman Berens,
Lam Hui,
Zimo Sun
Abstract:
In this note, we present a synopsis of geometric symmetries for (spin 0) perturbations around (4D) black holes and de Sitter space. For black holes, we focus on static perturbations, for which the (exact) geometric symmetries have the group structure of SO(1,3). The generators consist of three spatial rotations, and three conformal Killing vectors obeying a special melodic condition. The static perturbation solutions form a unitary (principal series) representation of the group. The recently uncovered ladder symmetries follow from this representation structure; they explain the well-known vanishing of the black hole Love numbers. For dynamical perturbations around de Sitter space, the geometric symmetries are less surprising, following from the SO(1,4) isometry. As is well known, the quasinormal solutions form a non-unitary representation of the isometry group. We provide explicit expressions for the ladder operators associated with this representation. In both cases, the ladder structures help connect the boundary condition at the horizon with that at infinity (black hole) or origin (de Sitter space), and they manifest as contiguous relations of the hypergeometric solutions.
Submitted 19 April, 2023; v1 submitted 19 December, 2022;
originally announced December 2022.
-
Soft theorems for boosts and other time symmetries
Authors:
Lam Hui,
Austin Joyce,
Ilia Komissarov,
Klaas Parmentier,
Luca Santoni,
Sam S. C. Wong
Abstract:
We derive soft theorems for theories in which time symmetries -- symmetries that involve the transformation of time, an example of which are Lorentz boosts -- are spontaneously broken. The soft theorems involve unequal-time correlation functions with the insertion of a soft Goldstone in the far past. Explicit checks are provided for several examples, including the effective theory of a relativistic superfluid and the effective field theory of inflation. We discuss how in certain cases these unequal-time identities capture information at the level of observables that cannot be seen purely in terms of equal-time correlators of the field alone. We also discuss when it is possible to phrase these soft theorems as identities involving equal-time correlators.
Submitted 28 October, 2022;
originally announced October 2022.
-
An analytic approach to quasinormal modes for coupled linear systems
Authors:
Lam Hui,
Alessandro Podo,
Luca Santoni,
Enrico Trincherini
Abstract:
Quasinormal modes describe the ringdown of compact objects deformed by small perturbations. In generic theories of gravity that extend General Relativity, the linearized dynamics of these perturbations is described by a system of coupled linear differential equations of second order. We first show, under general assumptions, that such a system can be brought to a Schrödinger-like form. We then devise an analytic approximation scheme to compute the spectrum of quasinormal modes. We validate our approach using a toy model with a controllable mixing parameter $\varepsilon$ and showing that the analytic approximation for the fundamental mode agrees with the numerical computation when the approximation is justified. The accuracy of the analytic approximation is at the (sub-) percent level for the real part and at the level of a few percent for the imaginary part, even when $\varepsilon$ is of order one. Our approximation scheme can be seen as an extension of the approach of Schutz and Will to the case of coupled systems of equations, although our approach is not phrased in terms of a WKB analysis, and offers a new viewpoint even in the case of a single equation.
Submitted 2 October, 2023; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Learning Inter-Superpoint Affinity for Weakly Supervised 3D Instance Segmentation
Authors:
Linghua Tang,
Le Hui,
Jin Xie
Abstract:
Due to the few annotated labels of 3D point clouds, how to learn discriminative features of point clouds to segment object instances is a challenging problem. In this paper, we propose a simple yet effective 3D instance segmentation framework that can achieve good performance by annotating only one point for each instance. Specifically, to tackle extremely few labels for instance segmentation, we first oversegment the point cloud into superpoints in an unsupervised manner and extend the point-level annotations to the superpoint level. Then, based on the superpoint graph, we propose an inter-superpoint affinity mining module that considers the semantic and spatial relations to adaptively learn inter-superpoint affinity to generate high-quality pseudo labels via semantic-aware random walk. Finally, we propose a volume-aware instance refinement module to segment high-quality instances by applying volume constraints of objects in clustering on the superpoint graph. Extensive experiments on the ScanNet-v2 and S3DIS datasets demonstrate that our method achieves state-of-the-art performance in the weakly supervised point cloud instance segmentation task, and even outperforms some fully supervised methods.
Submitted 11 October, 2022;
originally announced October 2022.
-
Point Cloud Registration-Driven Robust Feature Matching for 3D Siamese Object Tracking
Authors:
Haobo Jiang,
Kaihao Lan,
Le Hui,
Guangyu Li,
Jin Xie,
Jian Yang
Abstract:
Learning robust feature matching between the template and search area is crucial for 3D Siamese tracking. The core of Siamese feature matching is how to assign high feature similarity on the corresponding points between the template and search area for precise object localization. In this paper, we propose a novel point cloud registration-driven Siamese tracking framework, with the intuition that spatially aligned corresponding points (via 3D registration) tend to achieve consistent feature representations. Specifically, our method consists of two modules: a tracking-specific nonlocal registration module and a registration-aided Sinkhorn template-feature aggregation module. The registration module targets precise spatial alignment between the template and search area. The tracking-specific spatial distance constraint is proposed to refine the cross-attention weights in the nonlocal module for discriminative feature learning. Then, we use the weighted SVD to compute the rigid transformation between the template and search area, and align them to achieve the desired spatially aligned corresponding points. For the feature aggregation module, we formulate the feature matching between the transformed template and search area as an optimal transport problem and utilize the Sinkhorn optimization to search for the outlier-robust matching solution. Also, a registration-aided spatial distance map is built to improve the matching robustness in indistinguishable regions (e.g., smooth surfaces). Finally, guided by the obtained feature matching map, we aggregate the target information from the template into the search area to construct the target-specific feature, which is then fed into a CenterPoint-like detection head for object localization. Extensive experiments on the KITTI, nuScenes and Waymo datasets verify the effectiveness of our proposed method.
Submitted 3 December, 2022; v1 submitted 13 September, 2022;
originally announced September 2022.
-
Squeezing $f_{\rm NL}$ out of the matter bispectrum with consistency relations
Authors:
Samuel Goldstein,
Angelo Esposito,
Oliver H. E. Philcox,
Lam Hui,
J. Colin Hill,
Roman Scoccimarro,
Maximilian H. Abitbol
Abstract:
We show how consistency relations can be used to robustly extract the amplitude of local primordial non-Gaussianity ($f_{\rm NL}$) from the squeezed limit of the matter bispectrum, well into the non-linear regime. First, we derive a non-perturbative relation between primordial non-Gaussianity and the leading term in the squeezed bispectrum, revising some results present in the literature. This relation is then used to successfully measure $f_{\rm NL}$ from $N$-body simulations. We discuss the dependence of our results on different scale cuts and redshifts. Specifically, the analysis is strongly dependent on the choice of the smallest soft momentum, $q_{\rm min}$, which is the most sensitive to primordial bispectrum contributions, but is largely independent of the choice of the largest hard momentum, $k_{\rm max}$, due to the non-Gaussian nature of the covariance. We also show how the constraints on $f_{\rm NL}$ improve at higher redshift, due to a reduced off-diagonal covariance. In particular, for a simulation with $f_{\rm NL} = 100$ and a volume of $(2.4 \text{ Gpc}/h)^3$, we measure $f_{\rm NL} = 98 \pm 12$ at redshift $z=0$ and $f_{\rm NL} = 97 \pm 8$ at $z=0.97$. Finally, we compare our results with a Fisher forecast, showing that the current version of the analysis is satisfactorily close to the Fisher error. We regard this as a first step towards the realistic application of consistency relations to constrain primordial non-Gaussianity using observations.
Submitted 6 January, 2023; v1 submitted 13 September, 2022;
originally announced September 2022.
-
Nonlinearities in Black Hole Ringdowns
Authors:
Keefe Mitman,
Macarena Lagos,
Leo C. Stein,
Sizheng Ma,
Lam Hui,
Yanbei Chen,
Nils Deppe,
François Hébert,
Lawrence E. Kidder,
Jordan Moxon,
Mark A. Scheel,
Saul A. Teukolsky,
William Throwe,
Nils L. Vu
Abstract:
The gravitational wave strain emitted by a perturbed black hole (BH) ringing down is typically modeled analytically using first-order BH perturbation theory. In this Letter we show that second-order effects are necessary for modeling ringdowns from BH merger simulations. Focusing on the strain's $(\ell,m)=(4,4)$ angular harmonic, we show the presence of a quadratic effect across a range of binary BH mass ratios that agrees with theoretical expectations. We find that the quadratic $(4,4)$ mode's amplitude exhibits quadratic scaling with the fundamental $(2,2)$ mode -- its parent mode. The nonlinear mode's amplitude is comparable to or even larger than that of the linear $(4,4)$ mode. Therefore, correctly modeling the ringdown of higher harmonics -- improving mode mismatches by up to 2 orders of magnitude -- requires the inclusion of nonlinear effects.
Submitted 22 February, 2023; v1 submitted 15 August, 2022;
originally announced August 2022.
-
Generation and propagation of nonlinear quasi-normal modes of a Schwarzschild black hole
Authors:
Macarena Lagos,
Lam Hui
Abstract:
In the analysis of a binary black hole coalescence, it is necessary to include gravitational self-interactions in order to describe the transition of the gravitational wave signal from the merger to the ringdown stage. In this paper we study the phenomenology of the generation and propagation of nonlinearities in the ringdown of a Schwarzschild black hole, using second-order perturbation theory. Following earlier work, we show that the Green's function and its causal structure determine how both first-order and second-order perturbations are generated, and hence highlight that both of these solutions share some physical properties. In particular, we discuss the sense in which both linear and quadratic quasi-normal modes (QNMs) are generated in the vicinity of the peak of the gravitational potential barrier (loosely referred to as the light ring). Among the second-order perturbations, there are solutions with linear QNM frequencies (whose amplitudes are thus renormalized from their linear values), as well as quadratic QNM frequencies with a distinct spectrum. Moreover, we show using a WKB analysis that, in the eikonal limit, waves generated inside the light ring propagate towards the black hole horizon, and only waves generated outside propagate towards an asymptotic observer. These results might be relevant for recent discussions on the validity of perturbation theory close to the merger. Finally, we argue that even if nonlinearities are small, quadratic QNMs may be detectable and would likely be useful for improving ringdown models of higher angular harmonics and future tests of gravity.
Submitted 9 January, 2023; v1 submitted 15 August, 2022;
originally announced August 2022.
-
Black hole superradiance with (dark) matter accretion
Authors:
Lam Hui,
Y. T. Albert Law,
Luca Santoni,
Guanhao Sun,
Giovanni Maria Tomaselli,
Enrico Trincherini
Abstract:
Studies of black hole superradiance often focus on the growth of a cloud in isolation, accompanied by the spin-down of the black hole. In this paper, we consider the additional effect of the accretion of matter and angular momentum from the environment. We show that, in many cases, the black hole evolves by drifting along the superradiance threshold, in which case the evolution of its parameters can be described analytically or semi-analytically. We quantify the conditions under which accretion can serve as a mechanism to increase the cloud-to-black hole mass ratio, beyond the standard maximum of about 10%. This occurs by a process we call over-superradiance, whereby accretion effectively feeds the superradiance cloud, by way of the black hole. We give two explicit examples: accretion from a vortex expected in wave dark matter and accretion from a baryonic disk. In the former case, we estimate the accretion rate by using an analytical fit to the asymptotic behavior of the confluent Heun function. Level transition, whereby one cloud level grows while the other shrinks, can be understood in a similar way.
Submitted 25 May, 2023; v1 submitted 12 August, 2022;
originally announced August 2022.
-
Unsupervised Domain Adaptation for Point Cloud Semantic Segmentation via Graph Matching
Authors:
Yikai Bian,
Le Hui,
Jianjun Qian,
Jin Xie
Abstract:
Unsupervised domain adaptation for point cloud semantic segmentation has attracted great attention due to its effectiveness in learning with unlabeled data. Most existing methods use global-level feature alignment to transfer the knowledge from the source domain to the target domain, which may cause semantic ambiguity in the feature space. In this paper, we propose a graph-based framework to explore the local-level feature alignment between the two domains, which can preserve semantic discrimination during adaptation. Specifically, in order to extract local-level features, we first dynamically construct local feature graphs on both domains and build a memory bank with the graphs from the source domain. In particular, we use optimal transport to generate the graph matching pairs. Then, based on the assignment matrix, we can align the feature distributions between the two domains with the graph-based local feature loss. Furthermore, we consider the correlation between the features of different categories and formulate a category-guided contrastive loss to guide the segmentation model to learn discriminative features on the target domain. Extensive experiments on different synthetic-to-real and real-to-real domain adaptation scenarios demonstrate that our method can achieve state-of-the-art performance.
Submitted 8 August, 2022;
originally announced August 2022.
-
Generative Subgraph Contrast for Self-Supervised Graph Representation Learning
Authors:
Yuehui Han,
Le Hui,
Haobo Jiang,
Jianjun Qian,
Jin Xie
Abstract:
Contrastive learning has shown great promise in the field of graph representation learning. By manually constructing positive/negative samples, most graph contrastive learning methods rely on the vector inner product based similarity metric to distinguish the samples for graph representation. However, the handcrafted sample construction (e.g., the perturbation on the nodes or edges of the graph) may not effectively capture the intrinsic local structures of the graph. Also, the vector inner product based similarity metric cannot fully exploit the local structures of the graph to characterize the graph difference well. To this end, in this paper, we propose a novel adaptive subgraph generation based contrastive learning framework for efficient and robust self-supervised graph representation learning, and the optimal transport distance is utilized as the similarity metric between the subgraphs. It aims to generate contrastive samples by capturing the intrinsic structures of the graph and distinguish the samples based on the features and structures of subgraphs simultaneously. Specifically, for each center node, by adaptively learning relation weights to the nodes of the corresponding neighborhood, we first develop a network to generate the interpolated subgraph. We then construct the positive and negative pairs of subgraphs from the same and different nodes, respectively. Finally, we employ two types of optimal transport distances (i.e., Wasserstein distance and Gromov-Wasserstein distance) to construct the structured contrastive loss. Extensive node classification experiments on benchmark datasets verify the effectiveness of our graph contrastive learning method.
Submitted 26 July, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
3D Siamese Transformer Network for Single Object Tracking on Point Clouds
Authors:
Le Hui,
Lingpeng Wang,
Linghua Tang,
Kaihao Lan,
Jin Xie,
Jian Yang
Abstract:
Siamese network based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area. Due to the large appearance variation between the template and search area during tracking, how to learn the robust cross correlation between them for identifying the potential target in the search area is still a challenging problem. In this paper, we explicitly use Transformer to form a 3D Siamese Transformer network for learning robust cross correlation between the template and the search area of point clouds. Specifically, we develop a Siamese point Transformer network to learn shape context information of the target. Its encoder uses self-attention to capture non-local information of point clouds to characterize the shape information of the object, and the decoder utilizes cross-attention to upsample discriminative point features. After that, we develop an iterative coarse-to-fine correlation network to learn the robust cross correlation between the template and the search area. It formulates the cross-feature augmentation to associate the template with the potential target in the search area via cross attention. To further enhance the potential target, it employs the ego-feature augmentation that applies self-attention to the local k-NN graph of the feature space to aggregate target features. Experiments on the KITTI, nuScenes, and Waymo datasets show that our method achieves state-of-the-art performance on the 3D single object tracking task.
Submitted 26 July, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
RA-Depth: Resolution Adaptive Self-Supervised Monocular Depth Estimation
Authors:
Mu He,
Le Hui,
Yikai Bian,
Jian Ren,
Jin Xie,
Jian Yang
Abstract:
Existing self-supervised monocular depth estimation methods can get rid of expensive annotations and achieve promising results. However, these methods suffer from severe performance degradation when directly adopting a model trained on a fixed resolution to evaluate at other different resolutions. In this paper, we propose a resolution adaptive self-supervised monocular depth estimation method (RA-Depth) by learning the scale invariance of the scene depth. Specifically, we propose a simple yet efficient data augmentation method to generate images with arbitrary scales for the same scene. Then, we develop a dual high-resolution network that uses the multi-path encoder and decoder with dense interactions to aggregate multi-scale features for accurate depth inference. Finally, to explicitly learn the scale invariance of the scene depth, we formulate a cross-scale depth consistency loss on depth predictions with different scales. Extensive experiments on the KITTI, Make3D and NYU-V2 datasets demonstrate that RA-Depth not only achieves state-of-the-art performance, but also exhibits a good ability of resolution adaptation.
Submitted 26 July, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
Digital Twin for Networking: A Data-driven Performance Modeling Perspective
Authors:
Linbo Hui,
Mowei Wang,
Liang Zhang,
Lu Lu,
Yong Cui
Abstract:
Emerging technologies and applications make the network unprecedentedly complex and heterogeneous, leading physical network practices to be costly and risky. The digital twin network (DTN) can ease these burdens by virtually enabling users to understand how performance changes accordingly with modifications. For this "What-if" performance evaluation, conventional simulation and analytical approaches are inefficient, inaccurate, and inflexible, and we argue that data-driven methods are most promising. In this article, we identify three requirements (fidelity, efficiency, and flexibility) for performance evaluation. Then we present a comparison of selected data-driven methods and investigate their potential trends in data, models, and applications. Although extensive applications have been enabled, there are still significant conflicts between models' capacities to handle diversified inputs and limited data collected from the production network. We further illustrate the opportunities for data collection, model construction, and application prospects. This survey aims to provide a reference for performance evaluation while also facilitating future DTN research.
Submitted 1 June, 2022;
originally announced June 2022.
-
Near-Zone Symmetries of Kerr Black Holes
Authors:
Lam Hui,
Austin Joyce,
Riccardo Penco,
Luca Santoni,
Adam R. Solomon
Abstract:
We study the near-zone symmetries of a massless scalar field on four-dimensional black hole backgrounds. We provide a geometric understanding that unifies various recently discovered symmetries as part of an SO(4,2) group. Of these, a subset are exact symmetries of the static sector and give rise to the ladder symmetries responsible for the vanishing of Love numbers. In the Kerr case, we compare different near-zone approximations in the literature, and focus on the implementation that retains the symmetries of the static limit. We also describe the relation to spin-1 and 2 perturbations.
Submitted 16 March, 2022;
originally announced March 2022.
-
Domain Disentangled Generative Adversarial Network for Zero-Shot Sketch-Based 3D Shape Retrieval
Authors:
Rui Xu,
Zongyan Han,
Le Hui,
Jianjun Qian,
Jin Xie
Abstract:
Sketch-based 3D shape retrieval is a challenging task due to the large domain discrepancy between sketches and 3D shapes. Since existing methods are trained and evaluated on the same categories, they cannot effectively recognize the categories that have not been used during training. In this paper, we propose a novel domain disentangled generative adversarial network (DD-GAN) for zero-shot sketch-based 3D retrieval, which can retrieve the unseen categories that are not accessed during training. Specifically, we first generate domain-invariant features and domain-specific features by disentangling the learned features of sketches and 3D shapes, where the domain-invariant features are used to align with the corresponding word embeddings. Then, we develop a generative adversarial network that combines the domain-specific features of the seen categories with the aligned domain-invariant features to synthesize samples, where the synthesized samples of the unseen categories are generated by using the corresponding word embeddings. Finally, we use the synthesized samples of the unseen categories combined with the real samples of the seen categories to train the network for retrieval, so that the unseen categories can be recognized. In order to reduce the domain shift problem, we utilize unlabeled unseen samples to enhance the discrimination ability of the discriminator. With the discriminator distinguishing the generated samples from the unlabeled unseen samples, the generator can generate more realistic unseen samples. Extensive experiments on the SHREC'13 and SHREC'14 datasets show that our method significantly improves the retrieval performance of the unseen categories.
Submitted 29 June, 2022; v1 submitted 24 February, 2022;
originally announced February 2022.
-
Reliable Inlier Evaluation for Unsupervised Point Cloud Registration
Authors:
Yaqi Shen,
Le Hui,
Haobo Jiang,
Jin Xie,
Jian Yang
Abstract:
Unsupervised point cloud registration algorithms usually suffer from unsatisfactory registration precision in the partially overlapping problem due to the lack of effective inlier evaluation. In this paper, we propose a neighborhood consensus based reliable inlier evaluation method for robust unsupervised point cloud registration. It is expected to capture the discriminative geometric difference between the source neighborhood and the corresponding pseudo target neighborhood for effective inlier distinction. Specifically, our model consists of a matching map refinement module and an inlier evaluation module. In our matching map refinement module, we improve the point-wise matching map estimation by integrating the matching scores of neighbors into it. The aggregated neighborhood information potentially facilitates the discriminative map construction so that high-quality correspondences can be provided for generating the pseudo target point cloud. Based on the observation that an outlier has a significant structure-wise difference between its source neighborhood and the corresponding pseudo target neighborhood, while this difference is small for an inlier, the inlier evaluation module exploits this difference to score the inlier confidence for each estimated correspondence. In particular, we construct an effective graph representation for capturing this geometric difference between the neighborhoods. Finally, with the learned correspondences and the corresponding inlier confidence, we use the weighted SVD algorithm for transformation estimation. Under the unsupervised setting, we exploit the Huber function based global alignment loss, the local neighborhood consensus loss, and spatial consistency loss for model optimization. The experimental results on extensive datasets demonstrate that our unsupervised point cloud registration method can yield comparable performance.
Submitted 22 February, 2022;
originally announced February 2022.
-
Limitations of Neural Collapse for Understanding Generalization in Deep Learning
Authors:
Like Hui,
Mikhail Belkin,
Preetum Nakkiran
Abstract:
The recent work of Papyan, Han, & Donoho (2020) presented an intriguing "Neural Collapse" phenomenon, showing a structural property of interpolating classifiers in the late stage of training. This opened a rich area of exploration studying this phenomenon. Our motivation is to study the upper limits of this research program: How far will understanding Neural Collapse take us in understanding deep learning? First, we investigate its role in generalization. We refine the Neural Collapse conjecture into two separate conjectures: collapse on the train set (an optimization property) and collapse on the test distribution (a generalization property). We find that while Neural Collapse often occurs on the train set, it does not occur on the test set. We thus conclude that Neural Collapse is primarily an optimization phenomenon, with as-yet-unclear connections to generalization. Second, we investigate the role of Neural Collapse in feature learning. We show simple, realistic experiments where training longer leads to worse last-layer features, as measured by transfer-performance on a downstream task. This suggests that neural collapse is not always desirable for representation learning, as previously claimed. Finally, we give preliminary evidence of a "cascading collapse" phenomenon, wherein some form of Neural Collapse occurs not only for the last layer, but in earlier layers as well. We hope our work encourages the community to continue the rich line of Neural Collapse research, while also considering its inherent limitations.
Submitted 16 February, 2022;
originally announced February 2022.
-
3D Siamese Voxel-to-BEV Tracker for Sparse Point Clouds
Authors:
Le Hui,
Lingpeng Wang,
Mingmei Cheng,
Jin Xie,
Jian Yang
Abstract:
3D object tracking in point clouds is still a challenging problem due to the sparsity of LiDAR points in dynamic environments. In this work, we propose a Siamese voxel-to-BEV tracker, which can significantly improve the tracking performance in sparse 3D point clouds. Specifically, it consists of a Siamese shape-aware feature learning network and a voxel-to-BEV target localization network. The Siamese shape-aware feature learning network can capture 3D shape information of the object to learn the discriminative features of the object so that the potential target from the background in sparse point clouds can be identified. To this end, we first perform template feature embedding to embed the template's feature into the potential target and then generate a dense 3D shape to characterize the shape information of the potential target. For localizing the tracked target, the voxel-to-BEV target localization network regresses the target's 2D center and the $z$-axis center from the dense bird's eye view (BEV) feature map in an anchor-free manner. Concretely, we compress the voxelized point cloud along the $z$-axis through max pooling to obtain a dense BEV feature map, where the regression of the 2D center and the $z$-axis center can be performed more effectively. Extensive evaluation on the KITTI and nuScenes datasets shows that our method outperforms the current state-of-the-art methods by a large margin.
Submitted 17 November, 2021; v1 submitted 8 November, 2021;
originally announced November 2021.
-
Effective Field Theory for the Perturbations of a Slowly Rotating Black Hole
Authors:
Lam Hui,
Alessandro Podo,
Luca Santoni,
Enrico Trincherini
Abstract:
We develop the effective theory for perturbations around black holes with scalar hair, in two directions. First, we show that the scalar-Gauss--Bonnet theory, often used as an example exhibiting scalar black hole hair, can be deformed by galileon operators leading to order unity changes to its predictions. The effective theory for perturbations thus provides an efficient framework for describing and constraining broad classes of scalar-tensor theories, of which the addition of galileon operators is an example. Second, we extend the effective theory to perturbations around an axisymmetric, slowly rotating black hole, at linear order in the black hole spin. We also discuss the inclusion of parity-breaking operators in the effective theory.
Submitted 14 January, 2022; v1 submitted 3 November, 2021;
originally announced November 2021.
-
Construction of Wave Dark Matter Halos: Numerical Algorithm and Analytical Constraints
Authors:
Tomer D. Yavetz,
Xinyu Li,
Lam Hui
Abstract:
We present a wave generalization of the classic Schwarzschild method for constructing self-consistent halos -- such a halo consists of a suitable superposition of waves instead of particle orbits, chosen to yield a desired mean density profile. As an illustration, the method is applied to spherically symmetric halos. We derive an analytic relation between the particle distribution function and the wave superposition amplitudes, and show how it simplifies in the high energy (WKB) limit. We verify the stability of such constructed halos by numerically evolving the Schrödinger-Poisson system. The algorithm provides an efficient and accurate way to simulate the time-dependent halo substructures from wave interference. We use this method to construct halos with a variety of density profiles, all of which have a core from the ground-state wave function, though the core-halo relation need not be the standard one.
Submitted 5 January, 2023; v1 submitted 13 September, 2021;
originally announced September 2021.
-
SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering
Authors:
Yifan Zhao,
Le Hui,
Jin Xie
Abstract:
Point clouds obtained from 3D sensors are usually sparse. Existing methods mainly focus on upsampling sparse point clouds in a supervised manner by using dense ground truth point clouds. In this paper, we propose a self-supervised point cloud upsampling network (SSPU-Net) to generate dense point clouds without using ground truth. To achieve this, we exploit the consistency between the input sparse point cloud and generated dense point cloud for the shapes and rendered images. Specifically, we first propose a neighbor expansion unit (NEU) to upsample the sparse point clouds, where the local geometric structures of the sparse point clouds are exploited to learn weights for point interpolation. Then, we develop a differentiable point cloud rendering unit (DRU) as an end-to-end module in our network to render the point cloud into multi-view images. Finally, we formulate a shape-consistent loss and an image-consistent loss to train the network so that the shapes of the sparse and dense point clouds are as consistent as possible. Extensive results on the CAD and scanned datasets demonstrate that our method can achieve impressive results in a self-supervised manner. Code is available at https://github.com/fpthink/SSPU-Net.
Submitted 3 August, 2021; v1 submitted 1 August, 2021;
originally announced August 2021.
-
Dynamical friction from scalar dark matter in the relativistic regime
Authors:
Dina Traykova,
Katy Clough,
Thomas Helfer,
Emanuele Berti,
Pedro G. Ferreira,
Lam Hui
Abstract:
Light bosonic scalars (e.g. axions) may form clouds around black holes via superradiant instabilities, or via accretion if they form some component of the dark matter. It has been suggested that their presence may lead to a distinctive dephasing of the gravitational wave signal when a small compact object spirals into a larger black hole. Motivated by this, we study numerically the dynamical friction force on a black hole moving at relativistic velocities in a background scalar field with an asymptotically homogeneous energy density. We show that the relativistic scaling is analogous to that found for supersonic collisional fluids, assuming an approximate expression for the pressure correction which depends on the velocity and scalar mass. While we focus on a complex scalar field, our results confirm the expectation that real scalars would exert a force which oscillates between positive and negative values in time with a frequency set by the scalar mass. The complex field describes the time averaged value of this force, but in a real scalar the rapid force oscillations could in principle leave an imprint on the trajectory. The approximation we obtain can be used to inform estimates of dephasing in the final stages of an extreme mass ratio inspiral.
Submitted 27 October, 2021; v1 submitted 15 June, 2021;
originally announced June 2021.
-
Ladder Symmetries of Black Holes: Implications for Love Numbers and No-Hair Theorems
Authors:
Lam Hui,
Austin Joyce,
Riccardo Penco,
Luca Santoni,
Adam R. Solomon
Abstract:
It is well known that asymptotically flat black holes in general relativity have a vanishing static, conservative tidal response. We show that this is a result of linearly realized symmetries governing static (spin 0,1,2) perturbations around black holes. The symmetries have a geometric origin: in the scalar case, they arise from the (E)AdS isometries of a dimensionally reduced black hole spacetime. Underlying the symmetries is a ladder structure which can be used to construct the full tower of solutions, and derive their general properties: (1) solutions that decay with radius spontaneously break the symmetries, and must diverge at the horizon; (2) solutions regular at the horizon respect the symmetries, and take the form of a finite polynomial that grows with radius. Taken together, these two properties imply that static response coefficients -- and in particular Love numbers -- vanish. Moreover, property (1) is consistent with the absence of black holes with linear (perturbative) hair. We also discuss the manifestation of these symmetries in the effective point particle description of a black hole, showing explicitly that for scalar probes the worldline couplings associated with a non-trivial tidal response and scalar hair must vanish in order for the symmetries to be preserved.
Submitted 14 January, 2022; v1 submitted 3 May, 2021;
originally announced May 2021.
-
SSPC-Net: Semi-supervised Semantic 3D Point Cloud Segmentation Network
Authors:
Mingmei Cheng,
Le Hui,
Jin Xie,
Jian Yang
Abstract:
Point cloud semantic segmentation is a crucial task in 3D scene understanding. Existing methods mainly focus on employing a large number of annotated labels for supervised semantic segmentation. Nonetheless, manually labeling such large point clouds for the supervised segmentation task is time-consuming. In order to reduce the number of annotated labels, we propose a semi-supervised semantic point cloud segmentation network, named SSPC-Net, where we train the semantic segmentation network by inferring the labels of unlabeled points from the few annotated 3D points. In our method, we first partition the whole point cloud into superpoints and build superpoint graphs to mine the long-range dependencies in point clouds. Based on the constructed superpoint graph, we then develop a dynamic label propagation method to generate the pseudo labels for the unsupervised superpoints. Particularly, we adopt a superpoint dropout strategy to dynamically select the generated pseudo labels. In order to fully exploit the generated pseudo labels of the unsupervised superpoints, we furthermore propose a coupled attention mechanism for superpoint feature embedding. Finally, we employ the cross-entropy loss to train the semantic segmentation network with the labels of the supervised superpoints and the pseudo labels of the unsupervised superpoints. Experiments on various datasets demonstrate that our semi-supervised segmentation method can achieve better performance than the current semi-supervised segmentation method with fewer annotated 3D points. Our code is available at https://github.com/MMCheng/SSPC-Net.
Submitted 24 May, 2021; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Wave Dark Matter
Authors:
Lam Hui
Abstract:
We review the physics and phenomenology of wave dark matter: a bosonic dark matter candidate lighter than about 30 eV. Such particles have a de Broglie wavelength exceeding the average inter-particle separation in a galaxy like the Milky Way, and are well described as classical waves. We outline the particle physics motivations for them, including the QCD axion and ultra-light axion-like-particles such as fuzzy dark matter. The wave nature of the dark matter implies a rich phenomenology: (1) Wave interference leads to order unity density fluctuations on de Broglie scale. A manifestation is vortices where the density vanishes and around which the velocity circulates. There is one vortex ring per de Broglie volume on average. (2) For sufficiently low masses, soliton condensation occurs at centers of halos. The soliton oscillates and random walks, another manifestation of wave interference. The halo/subhalo abundance is suppressed at small masses, but the precise prediction from numerical wave simulations remains to be determined. (3) For ultra-light ~$10^{-22}$ eV dark matter, the wave interference substructures can be probed by tidal streams/gravitational lensing. The signal can be distinguished from that due to subhalos by the dependence on stream orbital radius/image separation. (4) Axion detection experiments are sensitive to interference substructures for moderately light masses. The stochastic nature of the waves affects the interpretation of experiments and motivates the measurement of correlation functions. Current constraints and open questions, covering detection experiments and cosmological/galactic/black-hole observations, are discussed.
Submitted 27 January, 2021;
originally announced January 2021.
-
Efficient 3D Point Cloud Feature Learning for Large-Scale Place Recognition
Authors:
Le Hui,
Mingmei Cheng,
Jin Xie,
Jian Yang
Abstract:
Point cloud based retrieval for place recognition is still a challenging problem due to drastic appearance and illumination changes of scenes in changing environments. Existing deep learning based global descriptors for the retrieval task usually consume a large amount of computation resources (e.g., memory), which may not be suitable for the cases of limited hardware resources. In this paper, we develop an efficient point cloud learning network (EPC-Net) to form a global descriptor for visual place recognition, which can obtain good performance and reduce computation memory and inference time. First, we propose a lightweight but effective neural network module, called ProxyConv, to aggregate the local geometric features of point clouds. We leverage the spatial adjacency matrix and proxy points to simplify the original edge convolution for lower memory consumption. Then, we design a lightweight grouped VLAD network (G-VLAD) to form global descriptors for retrieval. Compared with the original VLAD network, we propose a grouped fully connected (GFC) layer to decompose the high-dimensional vectors into a group of low-dimensional vectors, which can reduce the number of parameters of the network and maintain the discrimination of the feature vector. Finally, to further reduce the inference time, we develop a simple version of EPC-Net, called EPC-Net-L, which consists of two ProxyConv modules and one max pooling layer to aggregate global descriptors. By distilling the knowledge from EPC-Net, EPC-Net-L can obtain discriminative global descriptors for retrieval. Extensive experiments on the Oxford dataset and three in-house datasets demonstrate that our proposed method can achieve state-of-the-art performance with fewer parameters, fewer FLOPs, and lower runtime per frame.
Submitted 7 January, 2021;
originally announced January 2021.
-
Don't cross the streams: caustics from Fuzzy Dark Matter
Authors:
Neal Dalal,
Jo Bovy,
Lam Hui,
Xinyu Li
Abstract:
We study how tidal streams from globular clusters may be used to constrain the mass of ultra-light dark matter particles, called `fuzzy' dark matter (FDM). A general feature of FDM models is the presence of ubiquitous density fluctuations in bound, virialized dark matter structures, on the scale of the de Broglie wavelength, arising from wave interference in the evolving dark matter distribution. These time-varying fluctuations can disturb the motions of stars, leading to potentially observable signatures in cold thin tidal streams in our own Galaxy. The study of this effect has been hindered by the difficulty in simulating the FDM wavefunction in Milky Way-sized systems. We present a simple method to evolve realistic wavefunctions in nearly static potentials, that should provide an accurate estimate of this granulation effect. We quantify the impact of FDM perturbations on tidal streams, and show that initially, while stream perturbations are small in amplitude, their power spectra exhibit a sharp cutoff corresponding to the de Broglie wavelength of the FDM potential fluctuations. Eventually, when stream perturbations become nonlinear, fold caustics generically arise that lead to density fluctuations with universal behavior. This erases the signature of the de Broglie wavelength in the stream density power spectrum, but we show that it will still be possible to determine the FDM mass in this regime, by considering the fluctuations in quantities like angular momenta or actions.
Submitted 26 November, 2020;
originally announced November 2020.
-
Oscillations and Random Walk of the Soliton Core in a Fuzzy Dark Matter Halo
Authors:
Xinyu Li,
Lam Hui,
Tomer D. Yavetz
Abstract:
A Fuzzy Dark Matter (FDM) halo consists of a soliton core close to the center and an NFW-like density profile in the outer region. Previous investigations found that the soliton core exhibits temporal oscillations and random walk excursions around the halo center. Analyzing a set of numerical simulations, we show that both phenomena can be understood as the results of wave interference -- a suitable superposition of the ground (solitonic) state and excited states in a fixed potential suffices to account for the main features of these phenomena. Such an eigenmode analysis can shed light on the evolution of a satellite halo undergoing tidal disruption. As the outer halo is stripped away, reducing the amplitudes of the excited states, the ground state evolves adiabatically. This suggests diminished soliton oscillations and random walk excursions, an effect to consider in deducing constraints from stellar heating.
Submitted 23 November, 2020;
originally announced November 2020.
-
Growth of accretion driven scalar hair around Kerr black holes
Authors:
Jamie Bamber,
Katy Clough,
Pedro G. Ferreira,
Lam Hui,
Macarena Lagos
Abstract:
Scalar fields around compact objects are of interest for scalar-tensor theories of gravity and dark matter models consisting of a massive scalar, e.g. axions. We study the behaviour of a scalar field around a Kerr black hole with non-trivial asymptotic boundary conditions -- both non-zero density and non-zero angular momentum. Starting from an initial radially homogeneous configuration, a scalar cloud is accreted, which asymptotes to known stationary configurations over time. We study the cloud growth for different parameters including black hole spin, scalar field mass, and the scalar field density and angular momentum far from the black hole. We characterise the transient growth of the mass and angular momentum in the cloud, and the spatial profile of the scalar around the black hole, and relate the results of fully non-linear simulations to an analytic perturbative expansion. We also highlight the potential for these accreted clouds to create monochromatic gravitational wave signals -- similar to the signals from superradiant clouds, although significantly weaker in amplitude.
Submitted 11 March, 2021; v1 submitted 16 November, 2020;
originally announced November 2020.
-
Static response and Love numbers of Schwarzschild black holes
Authors:
Lam Hui,
Austin Joyce,
Riccardo Penco,
Luca Santoni,
Adam R. Solomon
Abstract:
We derive the quadratic action for the physical degrees of freedom of massless spin-0, spin-1, and spin-2 perturbations on a Schwarzschild--(A)dS background in arbitrary dimensions. We then use these results to compute the static response of asymptotically flat Schwarzschild black holes to external fields. Our analysis reproduces known facts about black hole Love numbers, in particular that they vanish for all types of perturbation in four spacetime dimensions, but also leads to new results. For instance, we find that neutral Schwarzschild black holes polarize in the presence of an electromagnetic background in any number of spacetime dimensions except four. Moreover, we calculate for the first time black hole Love numbers for vector-type gravitational perturbations in higher dimensions and find that they generically do not vanish. Along the way, we shed some light on an apparent discrepancy between previous results in the literature, and clarify some aspects of the matching between perturbative calculations of static response on a Schwarzschild background and the point-particle effective theory.
Submitted 11 February, 2024; v1 submitted 1 October, 2020;
originally announced October 2020.
-
Fast Magnetic Reconnection with Turbulence in High Lundquist Number Limit
Authors:
Yang Liping,
Li Hui,
Guo Fan,
Li Xiaocan,
Li Shengtai,
He Jiansen,
Zhang Lei,
Feng Xueshang
Abstract:
We use extensive 3D resistive MHD simulations to study how large-scale current sheets will undergo fast reconnection in the high Lundquist number $S$ limit (above $\sim 10^4$), when the system is subject to different externally driven turbulence levels and the self-generated turbulence produced by 3D reconnection dynamics. We find that the normalized global reconnection rate is $\sim 0.01 - 0.13$ and only weakly dependent on $S$. Global reconnection with the classic inflow/outflow configurations is observed, and 3D flux ropes are hierarchically formed and ejected from reconnection regions. A statistical separation of the reconnected magnetic field lines follows a super-diffusive behavior, from which the rate is measured to be very similar to that obtained from the mixing of tracer populations. We find that the reconnection rate scales roughly linearly with the turbulence level during the peak of reconnection. This scaling is consistent with the turbulence properties produced by both the externally driven and self-generation processes. These results imply that large-scale thin current sheets tend to undergo rigorous reconnection.
Submitted 14 September, 2020;
originally announced September 2020.
-
Cascaded Non-local Neural Network for Point Cloud Semantic Segmentation
Authors:
Mingmei Cheng,
Le Hui,
Jin Xie,
Jian Yang,
Hui Kong
Abstract:
In this paper, we propose a cascaded non-local neural network for point cloud segmentation. The proposed network aims to build the long-range dependencies of point clouds for the accurate segmentation. Specifically, we develop a novel cascaded non-local module, which consists of the neighborhood-level, superpoint-level and global-level non-local blocks. First, in the neighborhood-level block, we extract the local features of the centroid points of point clouds by assigning different weights to the neighboring points. The extracted local features of the centroid points are then used to encode the superpoint-level block with the non-local operation. Finally, the global-level block aggregates the non-local features of the superpoints for semantic segmentation in an encoder-decoder framework. Benefiting from the cascaded structure, geometric structure information of different neighborhoods with the same label can be propagated. In addition, the cascaded structure can largely reduce the computational cost of the original non-local operation on point clouds. Experiments on different indoor and outdoor datasets show that our method achieves state-of-the-art performance and effectively reduces the time consumption and memory occupation.
Submitted 30 July, 2020;
originally announced July 2020.
-
Approximated Bilinear Modules for Temporal Modeling
Authors:
Xinqi Zhu,
Chang Xu,
Langwen Hui,
Cewu Lu,
Dacheng Tao
Abstract:
We consider two less-emphasized temporal properties of video: 1. Temporal cues are fine-grained; 2. Temporal modeling needs reasoning. To tackle both problems at once, we exploit approximated bilinear modules (ABMs) for temporal modeling. There are two main points making the modules effective: two-layer MLPs can be seen as a constrained approximation of bilinear operations, and thus can be used to construct deep ABMs in existing CNNs while reusing pretrained parameters; frame features can be divided into static and dynamic parts because of visual repetition in adjacent frames, which enables temporal modeling to be more efficient. Multiple ABM variants and implementations are investigated, from high performance to high efficiency. Specifically, we show how two-layer subnets in CNNs can be converted to temporal bilinear modules by adding an auxiliary branch. Besides, we introduce snippet sampling and shifting inference to boost sparse-frame video classification performance. Extensive ablation studies are conducted to show the effectiveness of the proposed techniques. Our models can outperform most state-of-the-art methods on Something-Something v1 and v2 datasets without Kinetics pretraining, and are also competitive on other YouTube-like action recognition datasets. Our code is available at https://github.com/zhuxinqimac/abm-pytorch.
Submitted 25 July, 2020;
originally announced July 2020.
-
Progressive Point Cloud Deconvolution Generation Network
Authors:
Le Hui,
Rui Xu,
Jin Xie,
Jianjun Qian,
Jian Yang
Abstract:
In this paper, we propose an effective point cloud generation method, which can generate multi-resolution point clouds of the same shape from a latent vector. Specifically, we develop a novel progressive deconvolution network with the learning-based bilateral interpolation. The learning-based bilateral interpolation is performed in the spatial and feature spaces of point clouds so that local geometric structure information of point clouds can be exploited. Starting from the low-resolution point clouds, with the bilateral interpolation and max-pooling operations, the deconvolution network can progressively output high-resolution local and global feature maps. By concatenating different resolutions of local and global feature maps, we employ the multi-layer perceptron as the generation network to generate multi-resolution point clouds. In order to keep the shapes of different resolutions of point clouds consistent, we propose a shape-preserving adversarial loss to train the point cloud deconvolution generation network. Experimental results demonstrate the effectiveness of our proposed method.
Submitted 10 July, 2020;
originally announced July 2020.
-
Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks
Authors:
Like Hui,
Mikhail Belkin
Abstract:
Modern neural architectures for classification tasks are trained using the cross-entropy loss, which is widely believed to be empirically superior to the square loss. In this work we provide evidence indicating that this belief may not be well-founded. We explore several major neural architectures and a range of standard benchmark datasets for NLP, automatic speech recognition (ASR) and computer vision tasks to show that these architectures, with the same hyper-parameter settings as reported in the literature, perform comparably or better when trained with the square loss, even after equalizing computational resources. Indeed, we observe that the square loss produces better results in the dominant majority of NLP and ASR experiments. Cross-entropy appears to have a slight edge on computer vision tasks.
We argue that there is little compelling empirical or theoretical evidence indicating a clear-cut advantage to the cross-entropy loss. Indeed, in our experiments, performance on nearly all non-vision tasks can be improved, sometimes significantly, by switching to the square loss. Furthermore, training with square loss appears to be less sensitive to the randomness in initialization. We posit that training using the square loss for classification needs to be a part of best practices of modern deep learning on equal footing with cross-entropy.
Submitted 22 October, 2021; v1 submitted 12 June, 2020;
originally announced June 2020.
-
Line Art Correlation Matching Feature Transfer Network for Automatic Animation Colorization
Authors:
Zhang Qian,
Wang Bo,
Wen Wei,
Li Hai,
Liu Jun Hui
Abstract:
Automatic animation line art colorization is a challenging computer vision problem, since the information of the line art is highly sparse and abstracted and there exists a strict requirement for the color and style consistency between frames. Recently, a lot of Generative Adversarial Network (GAN) based image-to-image translation methods for single line art colorization have emerged. They can generate perceptually appealing results conditioned on line art images. However, these methods cannot be adopted for the purpose of animation colorization because there is a lack of consideration of the in-between frame consistency. Existing methods simply input the previous colored frame as a reference to color the next line art, which will mislead the colorization due to the spatial misalignment of the previous colored frame and the next line art, especially at positions where apparent changes happen. To address these challenges, we design a correlation matching feature transfer model (called CMFT) to align the colored reference feature in a learnable way and integrate the model into a U-Net based generator in a coarse-to-fine manner. This enables the generator to transfer the layer-wise synchronized features from the deep semantic code to the content progressively. Extensive evaluation shows that the CMFT model can effectively improve the in-between consistency and the quality of colored frames, especially when the motion is intense and diverse.
Submitted 10 November, 2020; v1 submitted 14 April, 2020;
originally announced April 2020.