-
Muon Collider Forum Report
Authors:
K. M. Black,
S. Jindariani,
D. Li,
F. Maltoni,
P. Meade,
D. Stratakis,
D. Acosta,
R. Agarwal,
K. Agashe,
C. Aime,
D. Ally,
A. Apresyan,
A. Apyan,
P. Asadi,
D. Athanasakos,
Y. Bao,
E. Barzi,
N. Bartosik,
L. A. T. Bauerdick,
J. Beacham,
S. Belomestnykh,
J. S. Berg,
J. Berryhill,
A. Bertolin,
P. C. Bhat
, et al. (160 additional authors not shown)
Abstract:
A multi-TeV muon collider offers a spectacular opportunity in the direct exploration of the energy frontier. Offering a combination of unprecedented energy collisions in a comparatively clean leptonic environment, a high energy muon collider has the unique potential to provide both precision measurements and the highest energy reach in one machine that cannot be paralleled by any currently availab…
▽ More
A multi-TeV muon collider offers a spectacular opportunity in the direct exploration of the energy frontier. Offering a combination of unprecedented energy collisions in a comparatively clean leptonic environment, a high energy muon collider has the unique potential to provide both precision measurements and the highest energy reach in one machine that cannot be paralleled by any currently available technology. The topic generated a lot of excitement in Snowmass meetings and continues to attract a large number of supporters, including many from the early career community. In light of this very strong interest within the US particle physics community, Snowmass Energy, Theory and Accelerator Frontiers created a cross-frontier Muon Collider Forum in November of 2020. The Forum has been meeting on a monthly basis and organized several topical workshops dedicated to physics, accelerator technology, and detector R&D. Findings of the Forum are summarized in this report.
△ Less
Submitted 8 August, 2023; v1 submitted 2 September, 2022;
originally announced September 2022.
-
MIME: Minority Inclusion for Majority Group Enhancement of AI Performance
Authors:
Pradyumna Chari,
Yunhao Ba,
Shreeram Athreya,
Achuta Kadambi
Abstract:
Several papers have rightly included minority groups in artificial intelligence (AI) training data to improve test inference for minority groups and/or society-at-large. A society-at-large consists of both minority and majority stakeholders. A common misconception is that minority inclusion does not increase performance for majority groups alone. In this paper, we make the surprising finding that…
▽ More
Several papers have rightly included minority groups in artificial intelligence (AI) training data to improve test inference for minority groups and/or society-at-large. A society-at-large consists of both minority and majority stakeholders. A common misconception is that minority inclusion does not increase performance for majority groups alone. In this paper, we make the surprising finding that including minority samples can improve test error for the majority group. In other words, minority group inclusion leads to majority group enhancements (MIME) in performance. A theoretical existence proof of the MIME effect is presented and found to be consistent with experimental results on six different datasets. Project webpage: https://visual.ee.ucla.edu/mime.htm/
△ Less
Submitted 1 September, 2022;
originally announced September 2022.
-
Anti-Retroactive Interference for Lifelong Learning
Authors:
Runqi Wang,
Yuxiang Bao,
Baochang Zhang,
Jianzhuang Liu,
Wentao Zhu,
Guodong Guo
Abstract:
Humans can continuously learn new knowledge. However, machine learning models suffer from drastic dropping in performance on previous tasks after learning new tasks. Cognitive science points out that the competition of similar knowledge is an important cause of forgetting. In this paper, we design a paradigm for lifelong learning based on meta-learning and associative mechanism of the brain. It ta…
▽ More
Humans can continuously learn new knowledge. However, machine learning models suffer from drastic dropping in performance on previous tasks after learning new tasks. Cognitive science points out that the competition of similar knowledge is an important cause of forgetting. In this paper, we design a paradigm for lifelong learning based on meta-learning and associative mechanism of the brain. It tackles the problem from two aspects: extracting knowledge and memorizing knowledge. First, we disrupt the sample's background distribution through a background attack, which strengthens the model to extract the key features of each task. Second, according to the similarity between incremental knowledge and base knowledge, we design an adaptive fusion of incremental knowledge, which helps the model allocate capacity to the knowledge of different difficulties. It is theoretically analyzed that the proposed learning paradigm can make the models of different tasks converge to the same optimum. The proposed method is validated on the MNIST, CIFAR100, CUB200 and ImageNet100 datasets.
△ Less
Submitted 29 October, 2022; v1 submitted 27 August, 2022;
originally announced August 2022.
-
FusePose: IMU-Vision Sensor Fusion in Kinematic Space for Parametric Human Pose Estimation
Authors:
Yiming Bao,
Xu Zhao,
Dahong Qian
Abstract:
There exist challenging problems in 3D human pose estimation mission, such as poor performance caused by occlusion and self-occlusion. Recently, IMU-vision sensor fusion is regarded as valuable for solving these problems. However, previous researches on the fusion of IMU and vision data, which is heterogeneous, fail to adequately utilize either IMU raw data or reliable high-level vision features.…
▽ More
There exist challenging problems in 3D human pose estimation mission, such as poor performance caused by occlusion and self-occlusion. Recently, IMU-vision sensor fusion is regarded as valuable for solving these problems. However, previous researches on the fusion of IMU and vision data, which is heterogeneous, fail to adequately utilize either IMU raw data or reliable high-level vision features. To facilitate a more efficient sensor fusion, in this work we propose a framework called \emph{FusePose} under a parametric human kinematic model. Specifically, we aggregate different information of IMU or vision data and introduce three distinctive sensor fusion approaches: NaiveFuse, KineFuse and AdaDeepFuse. NaiveFuse servers as a basic approach that only fuses simplified IMU data and estimated 3D pose in euclidean space. While in kinematic space, KineFuse is able to integrate the calibrated and aligned IMU raw data with converted 3D pose parameters. AdaDeepFuse further develops this kinematical fusion process to an adaptive and end-to-end trainable manner. Comprehensive experiments with ablation studies demonstrate the rationality and superiority of the proposed framework. The performance of 3D human pose estimation is improved compared to the baseline result. On Total Capture dataset, KineFuse surpasses previous state-of-the-art which uses IMU only for testing by 8.6\%. AdaDeepFuse surpasses state-of-the-art which uses IMU for both training and testing by 8.5\%. Moreover, we validate the generalization capability of our framework through experiments on Human3.6M dataset.
△ Less
Submitted 25 August, 2022;
originally announced August 2022.
-
End boundary effects on wakes dynamics of inclined circular cylinders
Authors:
Kai Zhang,
Yan Bao,
Dai Zhou,
Zhaolong Han
Abstract:
We perform direct numerical simulations to characterize the three-dimensional wake dynamics of long inclined circular cylinders with inhomogeneous end boundary conditions. Three Reynolds numbers, $\Rey=100$, 200 and 300 are considered to reveal the roles of the intrinsic secondary instabilities and the extrinsic end boundary effects in shaping the three-dimensional flows. At $\Rey=100$, the end bo…
▽ More
We perform direct numerical simulations to characterize the three-dimensional wake dynamics of long inclined circular cylinders with inhomogeneous end boundary conditions. Three Reynolds numbers, $\Rey=100$, 200 and 300 are considered to reveal the roles of the intrinsic secondary instabilities and the extrinsic end boundary effects in shaping the three-dimensional flows. At $\Rey=100$, the end boundary effects are felt over the entire cylinder span by inducing oblique vortex shedding, which is associated with stronger spanwise flow in the wake than a parallel shedding. The Strouhal number of the oblique shedding is related to that of the parallel shedding of straight cylinder by the cosine law, considering the combined inclination angle and oblique angle. At $\Rey=200$, the intrinsic secondary instability results in large-scale vortex dislocation, precluding the propagation of the end boundary effects towards further span. Nevertheless, for the inclined cases, oblique shedding is still observed within limited span from the upstream end boundary. The oblique vortices are related to the two-dimensional flow at $\Rey=200$ from the viewpoint of cosine law. Further along the span, the oblique vortices destabilizes with the formation of small-scale vortices, and the flow transitions to the typical mode A* wake. At $\Rey=300$, the highly three-dimensional flow near the end boundary creates disturbances that travel along the cylinder span, creating vortex dislocations for cases with low inclination angles. For high inclination angle, oblique vortex shedding is again observed over the cylinder span, and is not disrupted by vortex dislocations of either intrinsic or extrinsic causes.
△ Less
Submitted 24 August, 2022;
originally announced August 2022.
-
Development of a Scanning Tunneling Microscope for Variable Temperature Electron Spin Resonance
Authors:
Jiyoon Hwang,
Denis Krylov,
Robertus J. G. Elbertse,
Sangwon Yoon,
Taehong Ahn,
Jeongmin Oh,
Lei Fang,
Won-jun Jang,
Franklin H. Cho,
Andreas J. Heinrich,
Yujeong Bae
Abstract:
Recent advances in increasing the spectroscopic energy resolution in scanning tunneling microscopy (STM) have been achieved by integrating electron spin resonance (ESR) with STM. Here, we demonstrate the design and performance of a home-built STM capable of ESR at temperatures ranging from 1 K to 10 K. The STM is incorporated with a home-built Joule-Thomson refrigerator and a 2-axis vector magnet.…
▽ More
Recent advances in increasing the spectroscopic energy resolution in scanning tunneling microscopy (STM) have been achieved by integrating electron spin resonance (ESR) with STM. Here, we demonstrate the design and performance of a home-built STM capable of ESR at temperatures ranging from 1 K to 10 K. The STM is incorporated with a home-built Joule-Thomson refrigerator and a 2-axis vector magnet. Our STM design allows for the deposition of atoms and molecules directly into the cold STM, eliminating the need to extract the sample for deposition. In addition, we adopt two methods to apply radio-frequency (RF) voltages to the tunnel junction, the early design of wiring to the STM tip directly, and a more recent idea to use an RF antenna. Direct comparisons of ESR results measured using the two methods and simulations of electric field distribution around the tunnel junction show that, despite their different designs and capacitive couplings to the tunnel junction, there is no discernible difference in the driving and detection of ESR. Furthermore, at a magnetic field of 1.6 T, we observe ESR signals (near 40 GHz) sustained up to 10 K, which is the highest temperature for ESR-STM measurement reported to date, to the best of our knowledge. Although the ESR intensity exponentially decreases with increasing temperature, our ESR-STM system with low noise at the tunnel junction allows us to measure weak ESR signals with intensities in the sub-fA range. Our new design of ESR-STM, which is operational in a large frequency and temperature range, can broaden the use of ESR spectroscopy in STM and enable the simple modification of existing STM systems, which will hopefully accelerate a generalized use of ESR-STM.
△ Less
Submitted 23 August, 2022;
originally announced August 2022.
-
Counting surfaces on Calabi-Yau 4-folds I: Foundations
Authors:
Younghan Bae,
Martijn Kool,
Hyeonjun Park
Abstract:
This is the first part in a series of papers on counting surfaces on Calabi-Yau 4-folds. Besides the Hilbert scheme of 2-dimensional subschemes, we introduce \emph{two} types of moduli spaces of stable pairs. We show that all three moduli spaces are related by GIT wall-crossing and parametrize stable objects in the bounded derived category.
We construct \emph{reduced} Oh-Thomas virtual cycles on…
▽ More
This is the first part in a series of papers on counting surfaces on Calabi-Yau 4-folds. Besides the Hilbert scheme of 2-dimensional subschemes, we introduce \emph{two} types of moduli spaces of stable pairs. We show that all three moduli spaces are related by GIT wall-crossing and parametrize stable objects in the bounded derived category.
We construct \emph{reduced} Oh-Thomas virtual cycles on the moduli spaces via Kiem-Li cosection localization and prove that they are deformation invariant along Hodge loci. As an application, we show that the variational Hodge conjecture holds for any family of Calabi-Yau 4-folds supporting a non-zero reduced virtual cycle.
△ Less
Submitted 19 August, 2022;
originally announced August 2022.
-
Observational limits on the rate of radiation-driven binary black hole capture events
Authors:
Michael Ebersold,
Shubhanshu Tiwari,
Leigh Smith,
Yeong-Bok Bae,
Gungwong Kang,
Daniel Williams,
Achamveedu Gopakumar,
Ik Siong Heng,
Maria Haney
Abstract:
Dense astrophysical environments like globular clusters and galactic nuclei can host hyperbolic encounters of black holes which can lead to gravitational-wave driven capture. There are several astrophysical models which predict a fraction of binary black hole mergers to come from these radiation-driven capture scenarios. In this paper we present the sensitivity of a search towards gravitational-wa…
▽ More
Dense astrophysical environments like globular clusters and galactic nuclei can host hyperbolic encounters of black holes which can lead to gravitational-wave driven capture. There are several astrophysical models which predict a fraction of binary black hole mergers to come from these radiation-driven capture scenarios. In this paper we present the sensitivity of a search towards gravitational-wave driven capture events for O3, the third observing run of LIGO and Virgo. We use capture waveforms produced by numerical relativity simulations covering four different mass ratios and at least two different values of initial angular momentum per mass ratio. We employed the most generic search for short-duration transients in O3 to evaluate the search sensitivity in this parameter space for a wide range in total mass in terms of visible spacetime volume. From the visible spacetime volume we determine for the first time the merger rate upper limit of such systems. The most stringent estimate of rate upper limits at 90\% confidence is $0.2~\mathrm{Gpc}^{-3}\,\mathrm{yr}^{-1}$ for an equal mass $200~M_\odot$ binary. Furthermore, in recent studies the event GW190521 has been suggested to be a capture event. With this interpretation of GW190521, we find the merger rate of similar events to be $0.47~\mathrm{Gpc}^{-3}\,\mathrm{yr}^{-1}$.
△ Less
Submitted 27 October, 2022; v1 submitted 16 August, 2022;
originally announced August 2022.
-
Semi-Supervised Cross-Modal Salient Object Detection with U-Structure Networks
Authors:
Yunqing Bao,
Hang Dai,
Abdulmotaleb Elsaddik
Abstract:
Salient Object Detection (SOD) is a popular and important topic aimed at precise detection and segmentation of the interesting regions in the images. We integrate the linguistic information into the vision-based U-Structure networks designed for salient object detection tasks. The experiments are based on the newly created DUTS Cross Modal (DUTS-CM) dataset, which contains both visual and linguist…
▽ More
Salient Object Detection (SOD) is a popular and important topic aimed at precise detection and segmentation of the interesting regions in the images. We integrate the linguistic information into the vision-based U-Structure networks designed for salient object detection tasks. The experiments are based on the newly created DUTS Cross Modal (DUTS-CM) dataset, which contains both visual and linguistic labels. We propose a new module called efficient Cross-Modal Self-Attention (eCMSA) to combine visual and linguistic features and improve the performance of the original U-structure networks. Meanwhile, to reduce the heavy burden of labeling, we employ a semi-supervised learning method by training an image caption model based on the DUTS-CM dataset, which can automatically label other datasets like DUT-OMRON and HKU-IS. The comprehensive experiments show that the performance of SOD can be improved with the natural language input and is competitive compared with other SOD methods.
△ Less
Submitted 8 August, 2022;
originally announced August 2022.
-
Flux Variations of Cosmic Ray Air Showers Detected by LHAASO-KM2A During a Thunderstorm on 10 June 2021
Authors:
LHAASO Collaboration,
F. Aharonian,
Q. An,
Axikegu,
L. X. Bai,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Zhe Cao,
Zhen Cao,
J. Chang,
J. F. Chang,
E. S. Chen,
Liang Chen,
Liang Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
S. H. Chen,
S. Z. Chen,
T. L. Chen,
X. J. Chen
, et al. (248 additional authors not shown)
Abstract:
The Large High Altitude Air Shower Observatory (LHAASO) has three sub-arrays, KM2A, WCDA and WFCTA. The flux variations of cosmic ray air showers were studied by analyzing the KM2A data during the thunderstorm on 10 June 2021. The number of shower events that meet the trigger conditions increases significantly in atmospheric electric fields, with maximum fractional increase of 20%. The variations…
▽ More
The Large High Altitude Air Shower Observatory (LHAASO) has three sub-arrays, KM2A, WCDA and WFCTA. The flux variations of cosmic ray air showers were studied by analyzing the KM2A data during the thunderstorm on 10 June 2021. The number of shower events that meet the trigger conditions increases significantly in atmospheric electric fields, with maximum fractional increase of 20%. The variations of trigger rates (increases or decreases) are found to be strongly dependent on the primary zenith angle. The flux of secondary particles increases significantly, following a similar trend with that of the shower events. To better understand the observed behavior, Monte Carlo simulations are performed with CORSIKA and G4KM2A (a code based on GEANT4). We find that the experimental data (in saturated negative fields) are in good agreement with simulations, assuming the presence of a uniform upward electric field of 700 V/cm with a thickness of 1500 m in the atmosphere above the observation level. Due to the acceleration/deceleration and deflection by the atmospheric electric field, the number of secondary particles with energy above the detector threshold is modified, resulting in the changes in shower detection rate.
△ Less
Submitted 6 December, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
W2N:Switching From Weak Supervision to Noisy Supervision for Object Detection
Authors:
Zitong Huang,
Yiping Bao,
Bowen Dong,
Erjin Zhou,
Wangmeng Zuo
Abstract:
Weakly-supervised object detection (WSOD) aims to train an object detector only requiring the image-level annotations. Recently, some works have managed to select the accurate boxes generated from a well-trained WSOD network to supervise a semi-supervised detection framework for better performance. However, these approaches simply divide the training set into labeled and unlabeled sets according t…
▽ More
Weakly-supervised object detection (WSOD) aims to train an object detector only requiring the image-level annotations. Recently, some works have managed to select the accurate boxes generated from a well-trained WSOD network to supervise a semi-supervised detection framework for better performance. However, these approaches simply divide the training set into labeled and unlabeled sets according to the image-level criteria, such that sufficient mislabeled or wrongly localized box predictions are chosen as pseudo ground-truths, resulting in a sub-optimal solution of detection performance. To overcome this issue, we propose a novel WSOD framework with a new paradigm that switches from weak supervision to noisy supervision (W2N). Generally, with given pseudo ground-truths generated from the well-trained WSOD network, we propose a two-module iterative training algorithm to refine pseudo labels and supervise better object detector progressively. In the localization adaptation module, we propose a regularization loss to reduce the proportion of discriminative parts in original pseudo ground-truths, obtaining better pseudo ground-truths for further training. In the semi-supervised module, we propose a two tasks instance-level split method to select high-quality labels for training a semi-supervised detector. Experimental results on different benchmarks verify the effectiveness of W2N, and our W2N outperforms all existing pure WSOD methods and transfer learning methods. Our code is publicly available at https://github.com/1170300714/w2n_wsod.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
Fast Composite Optimization and Statistical Recovery in Federated Learning
Authors:
Yajie Bao,
Michael Crawshaw,
Shan Luo,
Mingrui Liu
Abstract:
As a prevalent distributed learning paradigm, Federated Learning (FL) trains a global model on a massive amount of devices with infrequent communication. This paper investigates a class of composite optimization and statistical recovery problems in the FL setting, whose loss function consists of a data-dependent smooth loss and a non-smooth regularizer. Examples include sparse linear regression us…
▽ More
As a prevalent distributed learning paradigm, Federated Learning (FL) trains a global model on a massive amount of devices with infrequent communication. This paper investigates a class of composite optimization and statistical recovery problems in the FL setting, whose loss function consists of a data-dependent smooth loss and a non-smooth regularizer. Examples include sparse linear regression using Lasso, low-rank matrix recovery using nuclear norm regularization, etc. In the existing literature, federated composite optimization algorithms are designed only from an optimization perspective without any statistical guarantees. In addition, they do not consider commonly used (restricted) strong convexity in statistical recovery problems. We advance the frontiers of this problem from both optimization and statistical perspectives. From optimization upfront, we propose a new algorithm named \textit{Fast Federated Dual Averaging} for strongly convex and smooth loss and establish state-of-the-art iteration and communication complexity in the composite setting. In particular, we prove that it enjoys a fast rate, linear speedup, and reduced communication rounds. From statistical upfront, for restricted strongly convex and smooth loss, we design another algorithm, namely \textit{Multi-stage Federated Dual Averaging}, and prove a high probability complexity bound with linear speedup up to optimal statistical precision. Experiments in both synthetic and real data demonstrate that our methods perform better than other baselines. To the best of our knowledge, this is the first work providing fast optimization algorithms and statistical recovery guarantees for composite problems in FL.
△ Less
Submitted 3 October, 2022; v1 submitted 17 July, 2022;
originally announced July 2022.
-
Anisotropic hyperfine interaction of surface-adsorbed single atoms
Authors:
Jinkyung Kim,
Kyungju Noh,
Yi Chen,
Fabio Donati,
Andreas J. Heinrich,
Christoph Wolf,
Yujeong Bae
Abstract:
Hyperfine interactions between electron and nuclear spins have been widely used in material science, organic chemistry, and structural biology as a sensitive probe to the local chemical environment through spatial identification of nuclear spins. With the nuclear spins identified, the isotropic and anisotropic components of the hyperfine interactions in turn offer unique insight into the electroni…
▽ More
Hyperfine interactions between electron and nuclear spins have been widely used in material science, organic chemistry, and structural biology as a sensitive probe to the local chemical environment through spatial identification of nuclear spins. With the nuclear spins identified, the isotropic and anisotropic components of the hyperfine interactions in turn offer unique insight into the electronic ground-state properties of the paramagnetic centers. However, traditional ensemble measurements of hyperfine interactions average over a macroscopic number of spins with different geometrical locations and nuclear isotopes. Here, we use a scanning tunneling microscope (STM) combined with electron spin resonance (ESR) to measure hyperfine spectra of hydrogenated-titanium (Ti) atoms on MgO/Ag(100) and thereby determine the isotropic and anisotropic hyperfine interactions at the single-atom level. By combining vector-field ESR spectroscopy with STM-based atom manipulation, we characterize the full hyperfine tensor of individual Ti-47 and Ti-49 atoms and identify significant spatial anisotropy of hyperfine interaction for both isotopes when they are adsorbed at low-symmetry binding sites. Density functional theory calculations reveal that the large hyperfine anisotropy arises from a highly anisotropic distribution of the ground-state electron spin density. Our work highlights the power of ESR-STM-enabled single-atom hyperfine spectroscopy as a powerful tool in revealing ground-state electronic structures and atomic-scale chemical environments with nano-electronvolt resolution.
△ Less
Submitted 13 July, 2022; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Spin excitations in the quantum dipolar magnet Yb(BaBO$_3$)$_3$
Authors:
C. Y. Jiang,
Y. X. Yang,
Y. X. Gao,
Z. T. Wan,
Z. H. Zhu,
T. Shiroka,
C. S. Chen,
Q. Wu,
X. Li,
J. C. Jiao,
K. W. Chen,
Y. Bao,
Z. M. Tian,
L. Shu
Abstract:
We report results of magnetization, specific-heat and muon-spin relaxation measurements on single crystals of disorder-free Yb$^{3+}$ triangular lattice Yb(BaBO$_3$)$_3$. The magnetization experiments show anisotropic magnetic properties with Curie-Weiss temperatures $θ_{\perp}=-1.40$~K ($H \perp c$) and $θ_{\parallel}=-1.16$~K ($H \parallel c$) determined from low temperature data. The absence of…
▽ More
We report results of magnetization, specific-heat and muon-spin relaxation measurements on single crystals of disorder-free Yb$^{3+}$ triangular lattice Yb(BaBO$_3$)$_3$. The magnetization experiments show anisotropic magnetic properties with Curie-Weiss temperatures $θ_{\perp}=-1.40$~K ($H \perp c$) and $θ_{\parallel}=-1.16$~K ($H \parallel c$) determined from low temperature data. The absence of both long-range antiferromagnetic order and spin freezing is confirmed down to 0.27 K at zero field. A two-level Schottky anomaly due to the opening of the ground-state Kramers doublet is observed from the low-temperature specific-heat measurements when the applied magnetic fields $μ_0H >0.7$~T. At zero field, the increase of both $C_{\rm mag}/T$ and the muon spin relaxation rate $λ$ below 1~K is due to the electronic spin excitations, which often exist in quantum magnets where dipole-dipole interaction creates an anisotropy of magnetic properties. The spin excitation is also supported by the unusual maximum of field dependence of $λ$ due to the field-induced increase of the density of excitations. We argue that dipolar interaction is dominant and induces the spin dynamics in the quantum magnet Yb(BaBO$_3$)$_3$.
△ Less
Submitted 1 July, 2022;
originally announced July 2022.
-
Universal Learned Image Compression With Low Computational Cost
Authors:
Bowen Li,
Yao Xin,
Youneng Bao,
Fanyang Meng,
Yongsheng Liang,
Wen Tan
Abstract:
Recently, learned image compression methods have developed rapidly and exhibited excellent rate-distortion performance when compared to traditional standards, such as JPEG, JPEG2000 and BPG. However, the learning-based methods suffer from high computational costs, which is not beneficial for deployment on devices with limited resources. To this end, we propose shift-addition parallel modules (SAPM…
▽ More
Recently, learned image compression methods have developed rapidly and exhibited excellent rate-distortion performance when compared to traditional standards, such as JPEG, JPEG2000 and BPG. However, the learning-based methods suffer from high computational costs, which is not beneficial for deployment on devices with limited resources. To this end, we propose shift-addition parallel modules (SAPMs), including SAPM-E for the encoder and SAPM-D for the decoder, to largely reduce the energy consumption. To be specific, they can be taken as plug-and-play components to upgrade existing CNN-based architectures, where the shift branch is used to extract large-grained features as compared to small-grained features learned by the addition branch. Furthermore, we thoroughly analyze the probability distribution of latent representations and propose to use Laplace Mixture Likelihoods for more accurate entropy estimation. Experimental results demonstrate that the proposed methods can achieve comparable or even better performance on both PSNR and MS-SSIM metrics to that of the convolutional counterpart with an about 2x energy reduction.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
Not Just Streaks: Towards Ground Truth for Single Image Deraining
Authors:
Yunhao Ba,
Howard Zhang,
Ethan Yang,
Akira Suzuki,
Arnold Pfahnl,
Chethan Chinder Chandrappa,
Celso de Melo,
Suya You,
Stefano Soatto,
Alex Wong,
Achuta Kadambi
Abstract:
We propose a large-scale dataset of real-world rainy and clean image pairs and a method to remove degradations, induced by rain streaks and rain accumulation, from the image. As there exists no real-world dataset for deraining, current state-of-the-art methods rely on synthetic data and thus are limited by the sim2real domain gap; moreover, rigorous evaluation remains a challenge due to the absenc…
▽ More
We propose a large-scale dataset of real-world rainy and clean image pairs and a method to remove degradations, induced by rain streaks and rain accumulation, from the image. As there exists no real-world dataset for deraining, current state-of-the-art methods rely on synthetic data and thus are limited by the sim2real domain gap; moreover, rigorous evaluation remains a challenge due to the absence of a real paired dataset. We fill this gap by collecting a real paired deraining dataset through meticulous control of non-rain variations. Our dataset enables paired training and quantitative evaluation for diverse real-world rain phenomena (e.g. rain streaks and rain accumulation). To learn a representation robust to rain phenomena, we propose a deep neural network that reconstructs the underlying scene by minimizing a rain-robust loss between rainy and clean images. Extensive experiments demonstrate that our model outperforms the state-of-the-art deraining methods on real rainy images under various conditions. Project website: https://visual.ee.ucla.edu/gt_rain.htm/.
△ Less
Submitted 28 August, 2022; v1 submitted 21 June, 2022;
originally announced June 2022.
-
Beating the fault-tolerance bound and security loopholes for Byzantine agreement with a quantum solution
Authors:
Chen-Xun Weng,
Rui-Qi Gao,
Yu Bao,
Bing-Hong Li,
Wen-Bo Liu,
Yuan-Mei Xie,
Yu-Shuo Lu,
Hua-Lei Yin,
Zeng-Bing Chen
Abstract:
Byzantine agreement, the underlying core of blockchain, aims to make every node in a decentralized network reach consensus. Classical Byzantine agreements unavoidably face two major problems. One is $1/3$ fault-tolerance bound, which means that the system to tolerate $f$ malicious players requires at least $3f+1$ players. The other is the security loopholes from its classical cryptography methods.…
▽ More
Byzantine agreement, the underlying core of blockchain, aims to make every node in a decentralized network reach consensus. Classical Byzantine agreements unavoidably face two major problems. One is $1/3$ fault-tolerance bound, which means that the system to tolerate $f$ malicious players requires at least $3f+1$ players. The other is the security loopholes from its classical cryptography methods. Here, we propose a Byzantine agreement framework with unconditional security to break this bound with nearly $1/2$ fault tolerance due to multiparty correlation provided by quantum digital signatures. \textcolor{black}{It is intriguing that quantum entanglement is not necessary to break the $1/3$ fault-tolerance bound, and we show that weaker correlation, such as asymmetric relationship of quantum digital signature, can also work.} Our work strictly obeys two Byzantine conditions and can be extended to any number of players without requirements for multiparticle entanglement. We experimentally demonstrate three-party and five-party consensus for a digital ledger. Our work indicates the quantum advantage in terms of consensus problems and suggests an important avenue for quantum blockchain and quantum consensus networks.
△ Less
Submitted 22 November, 2023; v1 submitted 18 June, 2022;
originally announced June 2022.
-
A novel MDPSO-SVR hybrid model for feature selection in electricity consumption forecasting
Authors:
Yukun Bao,
Liang Shen,
Xiaoyuan Zhang,
Yanmei Huang,
Changrui Deng
Abstract:
Electricity consumption forecasting has vital importance for the energy planning of a country. Of the enabling machine learning models, support vector regression (SVR) has been widely used to set up forecasting models due to its superior generalization for unseen data. However, one key procedure for the predictive modeling is feature selection, which might hurt the prediction accuracy if improper…
▽ More
Electricity consumption forecasting has vital importance for the energy planning of a country. Of the enabling machine learning models, support vector regression (SVR) has been widely used to set up forecasting models due to its superior generalization for unseen data. However, one key procedure for the predictive modeling is feature selection, which might hurt the prediction accuracy if improper features were selected. In this regard, a modified discrete particle swarm optimization (MDPSO) was employed for feature selection in this study, and then MDPSO-SVR hybrid mode was built to predict future electricity consumption. Compared with other well-established counterparts, MDPSO-SVR model consistently performs best in two real-world electricity consumption datasets, which indicates that MDPSO for feature selection can improve the prediction accuracy and the SVR equipped with the MDPSO can be a promised alternative for electricity consumption forecasting.
△ Less
Submitted 15 September, 2022; v1 submitted 14 June, 2022;
originally announced June 2022.
-
Predicting Corporate Risk by Jointly Modeling Company Networks and Dialogues in Earnings Conference Calls
Authors:
Yunxin Sang,
Yang Bao
Abstract:
Earnings conference calls are significant information events for volatility forecasting, which is essential for financial risk management and asset pricing. Although some recent volatility forecasting models have utilized the textual content of conference calls, the dialogue structures of conference calls and company relationships are almost ignored in extant literature. To bridge this gap, we pro…
▽ More
Earnings conference calls are significant information events for volatility forecasting, which is essential for financial risk management and asset pricing. Although some recent volatility forecasting models have utilized the textual content of conference calls, the dialogue structures of conference calls and company relationships are almost ignored in extant literature. To bridge this gap, we propose a new model called Temporal Virtual Graph Neural Network (TVGNN) for volatility forecasting by jointly modeling conference call dialogues and company networks. Our model differs from existing models in several important ways. First, we propose to exploit more dialogue structures by encoding position, utterance, speaker role, and Q\&A segments. Second, we propose to encode the market states for volatility forecasting by extending the Gated Recurrent Units (GRU). Third, we propose a new method for constructing temporal company networks in which the messages can only flow from temporally preceding to successive nodes, and extend the Graph Attention Networks (GAT) for modeling company relationships. We collect conference call transcripts of S\&P500 companies from 2008 to 2019, and construct a dataset of conference call dialogues with additional information on dialogue structures and company networks. Empirical results on our dataset demonstrate the superiority of our model over competitive baselines for volatility forecasting. We also conduct supplementary analyses to examine the effectiveness of our model's key components and interpretability.
△ Less
Submitted 16 August, 2022; v1 submitted 25 May, 2022;
originally announced June 2022.
-
Noise subtraction from KAGRA O3GK data using Independent Component Analysis
Authors:
KAGRA collaboration,
H. Abe,
T. Akutsu,
M. Ando,
A. Araya,
N. Aritomi,
H. Asada,
Y. Aso,
S. Bae,
Y. Bae,
R. Bajpai,
K. Cannon,
Z. Cao,
E. Capocasa,
M. Chan,
C. Chen,
D. Chen,
K. Chen,
Y. Chen,
C-Y. Chiang,
Y-K. Chu,
S. Eguchi,
M. Eisenmann,
Y. Enomoto,
R. Flaminio
, et al. (178 additional authors not shown)
Abstract:
In April 2020, KAGRA conducted its first science observation in combination with the GEO~600 detector (O3GK) for two weeks. According to the noise budget estimation, suspension control noise in the low frequency band and acoustic noise in the middle frequency band are identified as the dominant contribution. In this study, we show that such noise can be reduced in offline data analysis by utilizin…
▽ More
In April 2020, KAGRA conducted its first science observation in combination with the GEO~600 detector (O3GK) for two weeks. According to the noise budget estimation, suspension control noise in the low frequency band and acoustic noise in the middle frequency band are identified as the dominant contribution. In this study, we show that such noise can be reduced in offline data analysis by utilizing a method called Independent Component Analysis (ICA). Here the ICA model is extended from the one studied in iKAGRA data analysis by incorporating frequency dependence while linearity and stationarity of the couplings are still assumed. By using optimal witness sensors, those two dominant contributions are mitigated in the real observational data. We also analyze the stability of the transfer functions for whole two weeks data in order to investigate how the current subtraction method can be practically used in gravitational wave search.
△ Less
Submitted 12 June, 2022;
originally announced June 2022.
-
Kaggle Kinship Recognition Challenge: Introduction of Convolution-Free Model to boost conventional
Authors:
Mingchuan Tian,
Guangway Teng,
Yipeng Bao
Abstract:
This work aims to explore a convolution-free base classifier that can be used to widen the variations of the conventional ensemble classifier. Specifically, we propose Vision Transformers as base classifiers to combine with CNNs for a unique ensemble solution in Kaggle kinship recognition. In this paper, we verify our proposed idea by implementing and optimizing variants of the Vision Transformer…
▽ More
This work aims to explore a convolution-free base classifier that can be used to widen the variations of the conventional ensemble classifier. Specifically, we propose Vision Transformers as base classifiers to combine with CNNs for a unique ensemble solution in Kaggle kinship recognition. In this paper, we verify our proposed idea by implementing and optimizing variants of the Vision Transformer model on top of the existing CNN models. The combined models achieve better scores than conventional ensemble classifiers based solely on CNN variants. We demonstrate that highly optimized CNN ensembles publicly available on the Kaggle Discussion board can easily achieve a significant boost in ROC score by simply ensemble with variants of the Vision Transformer model due to low correlation.
△ Less
Submitted 11 June, 2022;
originally announced June 2022.
-
Speaker-Guided Encoder-Decoder Framework for Emotion Recognition in Conversation
Authors:
Yinan Bao,
Qianwen Ma,
Lingwei Wei,
Wei Zhou,
Songlin Hu
Abstract:
The emotion recognition in conversation (ERC) task aims to predict the emotion label of an utterance in a conversation. Since the dependencies between speakers are complex and dynamic, which consist of intra- and inter-speaker dependencies, the modeling of speaker-specific information is a vital role in ERC. Although existing researchers have proposed various methods of speaker interaction modelin…
▽ More
The emotion recognition in conversation (ERC) task aims to predict the emotion label of an utterance in a conversation. Since the dependencies between speakers are complex and dynamic, which consist of intra- and inter-speaker dependencies, the modeling of speaker-specific information is a vital role in ERC. Although existing researchers have proposed various methods of speaker interaction modeling, they cannot explore dynamic intra- and inter-speaker dependencies jointly, leading to the insufficient comprehension of context and further hindering emotion prediction. To this end, we design a novel speaker modeling scheme that explores intra- and inter-speaker dependencies jointly in a dynamic manner. Besides, we propose a Speaker-Guided Encoder-Decoder (SGED) framework for ERC, which fully exploits speaker information for the decoding of emotion. We use different existing methods as the conversational context encoder of our framework, showing the high scalability and flexibility of the proposed framework. Experimental results demonstrate the superiority and effectiveness of SGED.
△ Less
Submitted 7 June, 2022;
originally announced June 2022.
-
A Learning- and Scenario-based MPC Design for Nonlinear Systems in LPV Framework with Safety and Stability Guarantees
Authors:
Yajie Bao,
Hossam S. Abbas,
Javad Mohammadpour Velni
Abstract:
This paper presents a learning- and scenario-based model predictive control (MPC) design approach for systems modeled in linear parameter-varying (LPV) framework. Using input-output data collected from the system, a state-space LPV model with uncertainty quantification is first learned through the variational Bayesian inference Neural Network (BNN) approach. The learned probabilistic model is assu…
▽ More
This paper presents a learning- and scenario-based model predictive control (MPC) design approach for systems modeled in linear parameter-varying (LPV) framework. Using input-output data collected from the system, a state-space LPV model with uncertainty quantification is first learned through the variational Bayesian inference Neural Network (BNN) approach. The learned probabilistic model is assumed to contain the true dynamics of the system with a high probability and used to generate scenarios which ensure safety for a scenario-based MPC. Moreover, to guarantee stability and enhance performance of the closed-loop system, a parameter-dependent terminal cost and controller, as well as a terminal robust positive invariant set are designed. Numerical examples will be used to demonstrate that the proposed control design approach can ensure safety and achieve desired control performance.
△ Less
Submitted 6 May, 2023; v1 submitted 6 June, 2022;
originally announced June 2022.
-
DELMAR: Deep Linear Matrix Approximately Reconstruction to Extract Hierarchical Functional Connectivity in the Human Brain
Authors:
Wei Zhang,
Yu Bao
Abstract:
The Matrix Decomposition techniques have been a vital computational approach to analyzing the hierarchy of functional connectivity in the human brain. However, there are still four shortcomings of these methodologies: 1). Large training samples; 2). Manually tuning hyperparameters; 3). Time-consuming and require extensive computational source; 4). It cannot guarantee convergence to a unique fixed…
▽ More
The Matrix Decomposition techniques have been a vital computational approach to analyzing the hierarchy of functional connectivity in the human brain. However, there are still four shortcomings of these methodologies: 1). Large training samples; 2). Manually tuning hyperparameters; 3). Time-consuming and require extensive computational source; 4). It cannot guarantee convergence to a unique fixed point.
Therefore, we propose a novel deep matrix factorization technique called Deep Linear Matrix Approximate Reconstruction (DELMAR) to bridge the abovementioned gaps. The advantages of the proposed method are: at first, proposed DELMAR can estimate the important hyperparameters automatically; furthermore, DELMAR employs the matrix backpropagation to reduce the potential accumulative errors; finally, an orthogonal projection is introduced to update all variables of DELMAR rather than directly calculating the inverse matrices.
The validation experiments of three peer methods and DELMAR using real functional MRI signal of the human brain demonstrates that our proposed method can efficiently identify the spatial feature in fMRI signal even faster and more accurately than other peer methods. Moreover, the theoretical analyses indicate that DELMAR can converge to the unique fixed point and even enable the accurate approximation of original input as DNNs.
△ Less
Submitted 20 May, 2022;
originally announced May 2022.
-
DEMAND: Deep Matrix Approximately Nonlinear Decomposition to Identify Meta, Canonical, and Sub-Spatial Pattern of functional Magnetic Resonance Imaging in the Human Brain
Authors:
Wei Zhang,
Yu Bao
Abstract:
Deep Neural Networks (DNNs) have already become a crucial computational approach to revealing the spatial patterns in the human brain; however, there are three major shortcomings in utilizing DNNs to detect the spatial patterns in functional Magnetic Resonance Signals: 1). It is a fully connected architecture that increases the complexity of network structures that is difficult to optimize and vul…
▽ More
Deep Neural Networks (DNNs) have already become a crucial computational approach to revealing the spatial patterns in the human brain; however, there are three major shortcomings in utilizing DNNs to detect the spatial patterns in functional Magnetic Resonance Signals: 1). It is a fully connected architecture that increases the complexity of network structures that is difficult to optimize and vulnerable to overfitting; 2). The requirement of large training samples results in erasing the individual/minor patterns in feature extraction; 3). The hyperparameters are required to be tuned manually, which is time-consuming. Therefore, we propose a novel deep nonlinear matrix factorization named Deep Matrix Approximately Nonlinear Decomposition (DEMAND) in this work to take advantage of the shallow linear model, e.g., Sparse Dictionary Learning (SDL) and DNNs. At first, the proposed DEMAND employs a non-fully connected and multilayer-stacked architecture that is easier to be optimized compared with canonical DNNs; furthermore, due to the efficient architecture, training DEMAND can avoid overfitting and enables the recognition of individual/minor features based on a small dataset such as an individual data; finally, a novel rank estimator technique is introduced to tune all hyperparameters of DEMAND automatically. Moreover, the proposed DEMAND is validated by four other peer methodologies via real functional Magnetic Resonance Imaging data in the human brain. In short, the validation results demonstrate that DEMAND can reveal the reproducible meta, canonical, and sub-spatial features of the human brain more efficiently than other peer methodologies.
△ Less
Submitted 24 May, 2022; v1 submitted 20 May, 2022;
originally announced May 2022.
-
SADAM: Stochastic Adam, A Stochastic Operator for First-Order Gradient-based Optimizer
Authors:
Wei Zhang,
Yu Bao
Abstract:
In this work, to efficiently help escape the stationary and saddle points, we propose, analyze, and generalize a stochastic strategy performed as an operator for a first-order gradient descent algorithm in order to increase the target accuracy and reduce time consumption. Unlike existing algorithms, the proposed stochastic the strategy does not require any batches and sampling techniques, enabling…
▽ More
In this work, to efficiently help escape the stationary and saddle points, we propose, analyze, and generalize a stochastic strategy performed as an operator for a first-order gradient descent algorithm in order to increase the target accuracy and reduce time consumption. Unlike existing algorithms, the proposed stochastic the strategy does not require any batches and sampling techniques, enabling efficient implementation and maintaining the initial first-order optimizer's convergence rate, but provides an incomparable improvement of target accuracy when optimizing the target functions. In short, the proposed strategy is generalized, applied to Adam, and validated via the decomposition of biomedical signals using Deep Matrix Fitting and another four peer optimizers. The validation results show that the proposed random strategy can be easily generalized for first-order optimizers and efficiently improve the target accuracy.
△ Less
Submitted 20 May, 2022;
originally announced May 2022.
-
DouFu: A Double Fusion Joint Learning Method For Driving Trajectory Representation
Authors:
Han Wang,
Zhou Huang,
Xiao Zhou,
Ganmin Yin,
Yi Bao,
Yi Zhang
Abstract:
Driving trajectory representation learning is of great significance for various location-based services, such as driving pattern mining and route recommendation. However, previous representation generation approaches tend to rarely address three challenges: 1) how to represent the intricate semantic intentions of mobility inexpensively; 2) complex and weak spatial-temporal dependencies due to the…
▽ More
Driving trajectory representation learning is of great significance for various location-based services, such as driving pattern mining and route recommendation. However, previous representation generation approaches tend to rarely address three challenges: 1) how to represent the intricate semantic intentions of mobility inexpensively; 2) complex and weak spatial-temporal dependencies due to the sparsity and heterogeneity of the trajectory data; 3) route selection preferences and their correlation to driving behavior. In this paper, we propose a novel multimodal fusion model, DouFu, for trajectory representation joint learning, which applies multimodal learning and attention fusion module to capture the internal characteristics of trajectories. We first design movement, route, and global features generated from the trajectory data and urban functional zones and then analyze them respectively with the attention encoder or feed forward network. The attention fusion module incorporates route features with movement features to create a better spatial-temporal embedding. With the global semantic feature, DouFu produces a comprehensive embedding for each trajectory. We evaluate representations generated by our method and other baseline models on classification and clustering tasks. Empirical results show that DouFu outperforms other models in most of the learning algorithms like the linear regression and the support vector machine by more than 10%.
△ Less
Submitted 14 October, 2022; v1 submitted 5 May, 2022;
originally announced May 2022.
-
Fast optical transport of ultracold molecules over long distances
Authors:
Yicheng Bao,
Scarlett S. Yu,
Loïc Anderegg,
Sean Burchesky,
Derick Gonzalez-Acevedo,
Eunmi Chae,
Wolfgang Ketterle,
Kang-Kuen Ni,
John M. Doyle
Abstract:
Optically trapped laser-cooled polar molecules hold promise for new science and technology in quantum information and quantum simulation. Large numerical aperture optical access and long trap lifetimes are needed for many studies, but these requirements are challenging to achieve in a magneto-optical trap (MOT) vacuum chamber that is connected to a cryogenic buffer gas beam source, as is the case…
▽ More
Optically trapped laser-cooled polar molecules hold promise for new science and technology in quantum information and quantum simulation. Large numerical aperture optical access and long trap lifetimes are needed for many studies, but these requirements are challenging to achieve in a magneto-optical trap (MOT) vacuum chamber that is connected to a cryogenic buffer gas beam source, as is the case for all molecule laser cooling experiments so far. Long distance transport of molecules greatly eases fulfilling these requirements as molecules are placed into a region separate from the MOT chamber. We realize a fast transport method for ultracold molecules based on an electronically focus-tunable lens combined with an optical lattice. The high transport speed is achieved by the 1D red-detuned optical lattice, which is generated by interference of a focus-tunable laser beam and a focus-fixed laser beam. Efficiency of 48(8)% is realized in the transport of ultracold calcium monofluoride (CaF) molecules over 46 cm distance in 50 ms, with a moderate heating from 32(2) μK to 53(4) μK. Positional stability of the molecular cloud allows for stable loading of an optical tweezer array with single molecules.
△ Less
Submitted 12 May, 2022;
originally announced May 2022.
-
Strong Sign Controllability of Diffusively-Coupled Networks
Authors:
Nam-Jin Park,
Seong-Ho Kwon,
Yoo-Bin Bae,
Byeong-Yeon Kim,
Kevin L. Moore,
Hyo-Sung Ahn
Abstract:
This paper presents several conditions to determine strong sign controllability for diffusively-coupled undirected networks. The strong sign controllability is determined by the sign patterns (positive, negative, zero) of the edges. We first provide the necessary and sufficient conditions for strong sign controllability of basic components, such as path, cycle, and tree. Next, we propose a merging…
▽ More
This paper presents several conditions to determine strong sign controllability for diffusively-coupled undirected networks. The strong sign controllability is determined by the sign patterns (positive, negative, zero) of the edges. We first provide the necessary and sufficient conditions for strong sign controllability of basic components, such as path, cycle, and tree. Next, we propose a merging process to extend the basic componenets to a larger graph based on the conditions of the strong sign controllability. Furthermore, we develop an algorithm of polynomial complexity to find the minimum number of external input nodes while maintaining the strong sign controllability of a network.
△ Less
Submitted 11 May, 2022;
originally announced May 2022.
-
Optimal Lighting Control in Greenhouses Using Bayesian Neural Networks for Sunlight Prediction
Authors:
Shirin Afzali,
Yajie Bao,
Marc W. van Iersel,
Javad Mohammadpour Velni
Abstract:
Controlling the environmental parameters, including light in greenhouses, increases the crop yield; however, the electricity cost of supplemental lighting can be high. Therefore, the importance of applying cost-effective lighting methods arises. In this paper, an optimal supplemental lighting control approach is developed considering a variational inference Bayesian Neural Network (BNN) model for…
▽ More
Controlling the environmental parameters, including light in greenhouses, increases the crop yield; however, the electricity cost of supplemental lighting can be high. Therefore, the importance of applying cost-effective lighting methods arises. In this paper, an optimal supplemental lighting control approach is developed considering a variational inference Bayesian Neural Network (BNN) model for sunlight prediction. The predictive model is validated through testing the model on the historical solar data of a site located at North Carolina ($R^{2}$=0.9971, RMSE=1.8%). The proposed lighting approach is shown to minimize electricity cost by considering the BNN-based sunlight prediction, plant light needs, and variable electricity pricing when solving the underlying optimization problem. For evaluation, the new strategy is compared to: 1) a Markov-based prediction method, which solves the same optimization problem, assuming a Markov model for sunlight prediction; 2) a heuristic method which aims to supply a fixed amount of light. Simulation studies are conducted to examine the electricity cost improvements of the BNN-based approach. The results show that the BNN-based approach reduces cost by (on average) 2.27% and 43.91% compared to the Markov prediction-based method and the heuristic method, respectively, throughout a year.
△ Less
Submitted 7 May, 2022;
originally announced May 2022.
-
A Deep Reinforcement Learning-based Sliding Mode Control Design for Partially-known Nonlinear Systems
Authors:
Sahand Mosharafian,
Shirin Afzali,
Yajie Bao,
Javad Mohammadpour Velni
Abstract:
Presence of model uncertainties creates challenges for model-based control design, and complexity of the control design is further exacerbated when coping with nonlinear systems. This paper presents a sliding mode control (SMC) design approach for nonlinear systems with partially known dynamics by blending data-driven and model-based approaches. First, an SMC is designed for the available (nominal…
▽ More
Presence of model uncertainties creates challenges for model-based control design, and complexity of the control design is further exacerbated when coping with nonlinear systems. This paper presents a sliding mode control (SMC) design approach for nonlinear systems with partially known dynamics by blending data-driven and model-based approaches. First, an SMC is designed for the available (nominal) model of the nonlinear system. The closed-loop state trajectory of the available model is used to build the desired trajectory for the partially known nonlinear system states. Next, a deep policy gradient method is used to cope with unknown parts of the system dynamics and adjust the sliding mode control output to achieve a desired state trajectory. The performance (and viability) of the proposed design approach is finally examined through numerical examples.
△ Less
Submitted 5 May, 2022;
originally announced May 2022.
-
Multi-Granularity Semantic Aware Graph Model for Reducing Position Bias in Emotion-Cause Pair Extraction
Authors:
Yinan Bao,
Qianwen Ma,
Lingwei Wei,
Wei Zhou,
Songlin Hu
Abstract:
The Emotion-Cause Pair Extraction (ECPE) task aims to extract emotions and causes as pairs from documents. We observe that the relative distance distribution of emotions and causes is extremely imbalanced in the typical ECPE dataset. Existing methods have set a fixed size window to capture relations between neighboring clauses. However, they neglect the effective semantic connections between dista…
▽ More
The Emotion-Cause Pair Extraction (ECPE) task aims to extract emotions and causes as pairs from documents. We observe that the relative distance distribution of emotions and causes is extremely imbalanced in the typical ECPE dataset. Existing methods have set a fixed size window to capture relations between neighboring clauses. However, they neglect the effective semantic connections between distant clauses, leading to poor generalization ability towards position-insensitive data. To alleviate the problem, we propose a novel Multi-Granularity Semantic Aware Graph model (MGSAG) to incorporate fine-grained and coarse-grained semantic features jointly, without regard to distance limitation. In particular, we first explore semantic dependencies between clauses and keywords extracted from the document that convey fine-grained semantic features, obtaining keywords enhanced clause representations. Besides, a clause graph is also established to model coarse-grained semantic relations between clauses. Experimental results indicate that MGSAG surpasses the existing state-of-the-art ECPE models. Especially, MGSAG outperforms other models significantly in the condition of position-insensitive data.
△ Less
Submitted 18 August, 2022; v1 submitted 4 May, 2022;
originally announced May 2022.
-
Learning to Split for Automatic Bias Detection
Authors:
Yujia Bao,
Regina Barzilay
Abstract:
Classifiers are biased when trained on biased datasets. As a remedy, we propose Learning to Split (ls), an algorithm for automatic bias detection. Given a dataset with input-label pairs, ls learns to split this dataset so that predictors trained on the training split cannot generalize to the testing split. This performance gap suggests that the testing split is under-represented in the dataset, wh…
▽ More
Classifiers are biased when trained on biased datasets. As a remedy, we propose Learning to Split (ls), an algorithm for automatic bias detection. Given a dataset with input-label pairs, ls learns to split this dataset so that predictors trained on the training split cannot generalize to the testing split. This performance gap suggests that the testing split is under-represented in the dataset, which is a signal of potential bias. Identifying non-generalizable splits is challenging since we have no annotations about the bias. In this work, we show that the prediction correctness of each example in the testing split can be used as a source of weak supervision: generalization performance will drop if we move examples that are predicted correctly away from the testing split, leaving only those that are mis-predicted. ls is task-agnostic and can be applied to any supervised learning problem, ranging from natural language understanding and image classification to molecular property prediction. Empirical results show that ls is able to generate astonishingly challenging splits that correlate with human-identified biases. Moreover, we demonstrate that combining robust learning algorithms (such as group DRO) with splits identified by ls enables automatic de-biasing. Compared to previous state-of-the-art, we substantially improve the worst-group performance (23.4% on average) when the source of biases is unknown during training and validation.
△ Less
Submitted 20 July, 2022; v1 submitted 28 April, 2022;
originally announced April 2022.
-
Spin waves and magnetic exchange Hamiltonian in CrSBr
Authors:
A. Scheie,
M. Ziebel,
D. G. Chica,
Y. J. Bae,
Xiaoping Wang,
A. I. Kolesnikov,
Xiaoyang Zhu,
X. Roy
Abstract:
CrSBr is an air-stable 2D van der Waals semiconducting magnet with great technological promise, but its atomic-scale magnetic interactions -- crucial information for high-frequency switching -- are poorly understood. We present an experimental study to determine the CrSBr magnetic exchange Hamiltonian and bulk magnon spectrum. We confirm the $A$-type antiferromagnetic order using single crystal ne…
▽ More
CrSBr is an air-stable 2D van der Waals semiconducting magnet with great technological promise, but its atomic-scale magnetic interactions -- crucial information for high-frequency switching -- are poorly understood. We present an experimental study to determine the CrSBr magnetic exchange Hamiltonian and bulk magnon spectrum. We confirm the $A$-type antiferromagnetic order using single crystal neutron diffraction. We also measure the magnon dispersions using inelastic neutron scattering and rigorously fit the excitation modes to a spin wave model. The magnon spectrum is well described by an intra-plane ferromagnetic Heisenberg exchange model with seven nearest in-plane exchanges. This fitted exchange Hamiltonian enables theoretical predictions of CrSBr behavior: as one example, we use the fitted Hamiltonian to predict the presence of chiral magnon edge modes with a spin-orbit enhanced CrSBr heterostructure.
△ Less
Submitted 15 July, 2022; v1 submitted 26 April, 2022;
originally announced April 2022.
-
Optimizing Task Placement and Online Scheduling for Distributed GNN Training Acceleration
Authors:
Ziyue Luo,
Yixin Bao,
Chuan Wu
Abstract:
Training Graph Neural Networks (GNN) on large graphs is resource-intensive and time-consuming, mainly due to the large graph data that cannot be fit into the memory of a single machine, but have to be fetched from distributed graph storage and processed on the go. Unlike distributed deep neural network (DNN) training, the bottleneck in distributed GNN training lies largely in large graph data tran…
▽ More
Training Graph Neural Networks (GNN) on large graphs is resource-intensive and time-consuming, mainly due to the large graph data that cannot be fit into the memory of a single machine, but have to be fetched from distributed graph storage and processed on the go. Unlike distributed deep neural network (DNN) training, the bottleneck in distributed GNN training lies largely in large graph data transmission for constructing mini-batches of training samples. Existing solutions often advocate data-computation colocation, and do not work well with limited resources where the colocation is infeasible. The potentials of strategical task placement and optimal scheduling of data transmission and task execution have not been well explored. This paper designs an efficient algorithm framework for task placement and execution scheduling of distributed GNN training, to better resource utilization, improve execution pipelining, and expediting training completion. Our framework consists of two modules: (i) an online scheduling algorithm that schedules the execution of training tasks, and the data transmission plan; and (ii) an exploratory task placement scheme that decides the placement of each training task. We conduct thorough theoretical analysis, testbed experiments and simulation studies, and observe up to 67% training speed-up with our algorithm as compared to representative baselines.
△ Less
Submitted 24 April, 2022;
originally announced April 2022.
-
Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation
Authors:
Jie Liu,
Yanqi Bao,
Guo-Sen Xie,
Huan Xiong,
Jan-Jakob Sonke,
Efstratios Gavves
Abstract:
The key challenge for few-shot semantic segmentation (FSS) is how to tailor a desirable interaction among support and query features and/or their prototypes, under the episodic training scenario. Most existing FSS methods implement such support-query interactions by solely leveraging plain operations - e.g., cosine similarity and feature concatenation - for segmenting the query objects. However, t…
▽ More
The key challenge for few-shot semantic segmentation (FSS) is how to tailor a desirable interaction among support and query features and/or their prototypes, under the episodic training scenario. Most existing FSS methods implement such support-query interactions by solely leveraging plain operations - e.g., cosine similarity and feature concatenation - for segmenting the query objects. However, these interaction approaches usually cannot well capture the intrinsic object details in the query images that are widely encountered in FSS, e.g., if the query object to be segmented has holes and slots, inaccurate segmentation almost always happens. To this end, we propose a dynamic prototype convolution network (DPCN) to fully capture the aforementioned intrinsic details for accurate FSS. Specifically, in DPCN, a dynamic convolution module (DCM) is firstly proposed to generate dynamic kernels from support foreground, then information interaction is achieved by convolution operations over query features using these kernels. Moreover, we equip DPCN with a support activation module (SAM) and a feature filtering module (FFM) to generate pseudo mask and filter out background information for the query images, respectively. SAM and FFM together can mine enriched context information from the query features. Our DPCN is also flexible and efficient under the k-shot FSS setting. Extensive experiments on PASCAL-5i and COCO-20i show that DPCN yields superior performances under both 1-shot and 5-shot settings.
△ Less
Submitted 22 April, 2022;
originally announced April 2022.
-
Search for continuous gravitational wave emission from the Milky Way center in O3 LIGO--Virgo data
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Allocca,
P. A. Altin
, et al. (1645 additional authors not shown)
Abstract:
We present a directed search for continuous gravitational wave (CW) signals emitted by spinning neutron stars located in the inner parsecs of the Galactic Center (GC). Compelling evidence for the presence of a numerous population of neutron stars has been reported in the literature, turning this region into a very interesting place to look for CWs. In this search, data from the full O3 LIGO--Virgo…
▽ More
We present a directed search for continuous gravitational wave (CW) signals emitted by spinning neutron stars located in the inner parsecs of the Galactic Center (GC). Compelling evidence for the presence of a numerous population of neutron stars has been reported in the literature, turning this region into a very interesting place to look for CWs. In this search, data from the full O3 LIGO--Virgo run in the detector frequency band $[10,2000]\rm~Hz$ have been used. No significant detection was found and 95$\%$ confidence level upper limits on the signal strain amplitude were computed, over the full search band, with the deepest limit of about $7.6\times 10^{-26}$ at $\simeq 142\rm~Hz$. These results are significantly more constraining than those reported in previous searches. We use these limits to put constraints on the fiducial neutron star ellipticity and r-mode amplitude. These limits can be also translated into constraints in the black hole mass -- boson mass plane for a hypothetical population of boson clouds around spinning black holes located in the GC.
△ Less
Submitted 9 April, 2022;
originally announced April 2022.
-
$\textit{latent}$-GLAT: Glancing at Latent Variables for Parallel Text Generation
Authors:
Yu Bao,
Hao Zhou,
Shujian Huang,
Dongqi Wang,
Lihua Qian,
Xinyu Dai,
Jiajun Chen,
Lei Li
Abstract:
Recently, parallel text generation has received widespread attention due to its success in generation efficiency. Although many advanced techniques are proposed to improve its generation quality, they still need the help of an autoregressive model for training to overcome the one-to-many multi-modal phenomenon in the dataset, limiting their applications. In this paper, we propose…
▽ More
Recently, parallel text generation has received widespread attention due to its success in generation efficiency. Although many advanced techniques are proposed to improve its generation quality, they still need the help of an autoregressive model for training to overcome the one-to-many multi-modal phenomenon in the dataset, limiting their applications. In this paper, we propose $\textit{latent}$-GLAT, which employs the discrete latent variables to capture word categorical information and invoke an advanced curriculum learning technique, alleviating the multi-modality problem. Experiment results show that our method outperforms strong baselines without the help of an autoregressive model, which further broadens the application scenarios of the parallel decoding paradigm.
△ Less
Submitted 5 April, 2022;
originally announced April 2022.
-
Identification and classification of exfoliated graphene flakes from microscopy images using a hierarchical deep convolutional neural network
Authors:
Soroush Mahjoubi,
Fan Ye,
Yi Bao,
Weina Meng,
Xian Zhang
Abstract:
Identification of the mechanically exfoliated graphene flakes and classification of the thickness is important in the nanomanufacturing of next-generation materials and devices that overcome the bottleneck of Moore's Law. Currently, identification and classification of exfoliated graphene flakes are conducted by human via inspecting the optical microscope images. The existing state-of-the-art auto…
▽ More
Identification of the mechanically exfoliated graphene flakes and classification of the thickness is important in the nanomanufacturing of next-generation materials and devices that overcome the bottleneck of Moore's Law. Currently, identification and classification of exfoliated graphene flakes are conducted by human via inspecting the optical microscope images. The existing state-of-the-art automatic identification by machine learning is not able to accommodate images with different backgrounds while different backgrounds are unavoidable in experiments. This paper presents a deep learning method to automatically identify and classify the thickness of exfoliated graphene flakes on Si/SiO2 substrates from optical microscope images with various settings and background colors. The presented method uses a hierarchical deep convolutional neural network that is capable of learning new images while preserving the knowledge from previous images. The deep learning model was trained and used to classify exfoliated graphene flakes into monolayer (1L), bi-layer (2L), tri-layer (3L), four-to-six-layer (4-6L), seven-to-ten-layer (7-10L), and bulk categories. Compared with existing machine learning methods, the presented method possesses high accuracy and efficiency as well as robustness to the backgrounds and resolutions of images. The results indicated that our deep learning model has accuracy as high as 99% in identifying and classifying exfoliated graphene flakes. This research will shed light on scaled-up manufacturing and characterization of graphene for advanced materials and devices.
△ Less
Submitted 29 March, 2022;
originally announced March 2022.
-
Learning to Mediate Disparities Towards Pragmatic Communication
Authors:
Yuwei Bao,
Sayan Ghosh,
Joyce Chai
Abstract:
Human communication is a collaborative process. Speakers, on top of conveying their own intent, adjust the content and language expressions by taking the listeners into account, including their knowledge background, personalities, and physical capabilities. Towards building AI agents with similar abilities in language communication, we propose Pragmatic Rational Speaker (PRS), a framework extendin…
▽ More
Human communication is a collaborative process. Speakers, on top of conveying their own intent, adjust the content and language expressions by taking the listeners into account, including their knowledge background, personalities, and physical capabilities. Towards building AI agents with similar abilities in language communication, we propose Pragmatic Rational Speaker (PRS), a framework extending Rational Speech Act (RSA). The PRS attempts to learn the speaker-listener disparity and adjust the speech accordingly, by adding a light-weighted disparity adjustment layer into working memory on top of speaker's long-term memory system. By fixing the long-term memory, the PRS only needs to update its working memory to learn and adapt to different types of listeners. To validate our framework, we create a dataset that simulates different types of speaker-listener disparities in the context of referential games. Our empirical results demonstrate that the PRS is able to shift its output towards the language that listener are able to understand, significantly improve the collaborative task outcome.
△ Less
Submitted 25 March, 2022;
originally announced March 2022.
-
3D-printed facet-attached optical elements for beam shaping in optical phased arrays
Authors:
Stefan Singer,
Yilin Xu,
Sebastian Tobias Skacel,
Yiyang Bao,
Heiner Zwickel,
Pascal Maier,
Lukas Freter,
Philipp-Immanuel Dietrich,
Mathias Kaschel,
Christoph Menzel,
Sebastian Randel,
Wolfgang Freude,
Christian Koos
Abstract:
We demonstrate an optical phased-array (OPA) equipped with a 3D-printed facet-attached element for shaping and deflection of the emitted beam. The beam shaper combines freeform refractive surfaces with total-internal-reflection (TIR) mirrors and is in-situ printed to edge-emitting waveguide facets using high-resolution multi-photon lithography, thereby ensuring precise alignment with respect to on…
▽ More
We demonstrate an optical phased-array (OPA) equipped with a 3D-printed facet-attached element for shaping and deflection of the emitted beam. The beam shaper combines freeform refractive surfaces with total-internal-reflection (TIR) mirrors and is in-situ printed to edge-emitting waveguide facets using high-resolution multi-photon lithography, thereby ensuring precise alignment with respect to on-chip waveguide structures. In a proof-of-concept experiment, we achieve a grating-lobe free steering range of $\pm 30°$ and a full-width-halfmaximum (FWHM) beam divergence of approximately $2°$. The concept opens an attractive alternative to currently used grating structures and is applicable to a wide range of integration platforms.
△ Less
Submitted 9 January, 2023; v1 submitted 24 March, 2022;
originally announced March 2022.
-
Analyses of Some Structural Properties on a Class of Hierarchical Scale-free Networks
Authors:
Jia-Bao Liu,
Yan Bao,
Wu-Ting Zheng
Abstract:
Hierarchical networks actually have many applications in the real world. Firstly, we propose a new class of hierarchical networks with scale-free and fractal structure, which are the networks with triangles compared to traditional hierarchical networks. Secondly, we study the precise results of some structural properties to derive small-world effect and scale-free feature. Thirdly, it is found tha…
▽ More
Hierarchical networks actually have many applications in the real world. Firstly, we propose a new class of hierarchical networks with scale-free and fractal structure, which are the networks with triangles compared to traditional hierarchical networks. Secondly, we study the precise results of some structural properties to derive small-world effect and scale-free feature. Thirdly, it is found that the constructed network is sparse through the average degree and density. Fourthly, it is also demonstrated the degree distributions of hub nodes and the bottom nodes are the power law and exponential, respectively. Finally, we prove that clustering coefficient with a definite value z tends to stabilize at a lower bound as t iterates to a certain number, and the average distance of G_{t}^{z} has a increasing relationship along with the value of lnN_{t}.
△ Less
Submitted 23 March, 2022;
originally announced March 2022.
-
Search for Gravitational Waves Associated with Fast Radio Bursts Detected by CHIME/FRB During the LIGO--Virgo Observing Run O3a
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
the CHIME/FRB Collaboration,
:,
R. Abbott,
T. D. Abbott,
F. Acernese,
K. Ackley,
C. Adams,
N. Adhikari,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
A. Allocca
, et al. (1633 additional authors not shown)
Abstract:
We search for gravitational-wave transients associated with fast radio bursts (FRBs) detected by the Canadian Hydrogen Intensity Mapping Experiment Fast Radio Burst Project (CHIME/FRB), during the first part of the third observing run of Advanced LIGO and Advanced Virgo (1 April 2019 15:00 UTC-1 Oct 2019 15:00 UTC). Triggers from 22 FRBs were analyzed with a search that targets compact binary coal…
▽ More
We search for gravitational-wave transients associated with fast radio bursts (FRBs) detected by the Canadian Hydrogen Intensity Mapping Experiment Fast Radio Burst Project (CHIME/FRB), during the first part of the third observing run of Advanced LIGO and Advanced Virgo (1 April 2019 15:00 UTC-1 Oct 2019 15:00 UTC). Triggers from 22 FRBs were analyzed with a search that targets compact binary coalescences with at least one neutron star component. A targeted search for generic gravitational-wave transients was conducted on 40 FRBs. We find no significant evidence for a gravitational-wave association in either search. Given the large uncertainties in the distances of the FRBs inferred from the dispersion measures in our sample, however, this does not conclusively exclude any progenitor models that include emission of a gravitational wave of the types searched for from any of these FRB events. We report $90\%$ confidence lower bounds on the distance to each FRB for a range of gravitational-wave progenitor models. By combining the inferred maximum distance information for each FRB with the sensitivity of the gravitational-wave searches, we set upper limits on the energy emitted through gravitational waves for a range of emission scenarios. We find values of order $10^{51}$-$10^{57}$ erg for a range of different emission models with central gravitational wave frequencies in the range 70-3560 Hz. Finally, we also found no significant coincident detection of gravitational waves with the repeater, FRB 20200120E, which is the closest known extragalactic FRB.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
Performance of the KAGRA detector during the first joint observation with GEO 600 (O3GK)
Authors:
KAGRA Collaboration,
H. Abe,
R. X. Adhikari,
T. Akutsu,
M. Ando,
A. Araya,
N. Aritomi,
H. Asada,
Y. Aso,
S. Bae,
Y. Bae,
R. Bajpai,
S. W. Ballmer,
K. Cannon,
Z. Cao,
E. Capocasa,
M. Chan,
C. Chen,
D. Chen,
K. Chen,
Y. Chen,
C-Y. Chiang,
Y-K. Chu,
J. C. Driggers,
S. E. Dwyer
, et al. (193 additional authors not shown)
Abstract:
KAGRA, the kilometer-scale underground gravitational-wave detector, is located at Kamioka, Japan. In April 2020, an astrophysics observation was performed at the KAGRA detector in combination with the GEO 600 detector; this observation operation is called O3GK. The optical configuration in O3GK is based on a power recycled Fabry-Pérot Michelson interferometer; all the mirrors were set at room temp…
▽ More
KAGRA, the kilometer-scale underground gravitational-wave detector, is located at Kamioka, Japan. In April 2020, an astrophysics observation was performed at the KAGRA detector in combination with the GEO 600 detector; this observation operation is called O3GK. The optical configuration in O3GK is based on a power recycled Fabry-Pérot Michelson interferometer; all the mirrors were set at room temperature. The duty factor of the operation was approximately 53%, and the strain sensitivity was $3\times10^{-22}~/\sqrt{\rm{Hz}}$ at 250 Hz. In addition, the binary-neutron-star (BNS) inspiral range was approximately 0.6 Mpc. The contributions of various noise sources to the sensitivity of O3GK were investigated to understand how the observation range could be improved; this study is called a "noise budget". According to our noise budget, the measured sensitivity could be approximated by adding up the effect of each noise. The sensitivity was dominated by noise from the sensors used for local controls of the vibration isolation systems, acoustic noise, shot noise, and laser frequency noise. Further, other noise sources that did not limit the sensitivity were investigated. This paper provides a detailed account of the KAGRA detector in O3GK including interferometer configuration, status, and noise budget. In addition, strategies for future sensitivity improvements such as hardware upgrades, are discussed.
△ Less
Submitted 14 March, 2022;
originally announced March 2022.
-
Mimicking Mergers: Mistaking Black Hole Captures as Mergers
Authors:
Weichangfeng Guo,
Daniel Williams,
Ik Siong Heng,
Hunter Gabbard,
Yeong-Bok Bae,
Gungwon Kang,
Zong-Hong Zhu
Abstract:
As the number of gravitational wave observations has increased in recent years, the variety of sources has broadened. Here we investigate whether it is possible for the current generation of detectors to distinguish between very short-lived gravitational wave signals from mergers between high-mass black holes, and the signal produced by a close encounter between two black holes which results in gr…
▽ More
As the number of gravitational wave observations has increased in recent years, the variety of sources has broadened. Here we investigate whether it is possible for the current generation of detectors to distinguish between very short-lived gravitational wave signals from mergers between high-mass black holes, and the signal produced by a close encounter between two black holes which results in gravitational capture, and ultimately a merger. We compare the posterior probability distributions produced by analysing simulated signals from both types of progenitor events, both under ideal and realistic scenarios. We show that while, under ideal conditions it is possible to distinguish both progenitors, under more realistic conditions they are indistinguishable. This has important implications for the interpretation of such short signals, and we therefore advocate that these signals be the focus of additional investigation even when satisfactory results have been achieved from standard analyses.
△ Less
Submitted 22 September, 2022; v1 submitted 14 March, 2022;
originally announced March 2022.
-
Varying Coefficient Linear Discriminant Analysis for Dynamic Data
Authors:
Yajie Bao,
Yuyang Liu
Abstract:
Linear discriminant analysis (LDA) is an important classification tool in statistics and machine learning. This paper investigates the varying coefficient LDA model for dynamic data, with Bayes' discriminant direction being a function of some exposure variable to address the heterogeneity. We propose a new least-square estimation method based on the B-spline approximation. The data-driven discrimi…
▽ More
Linear discriminant analysis (LDA) is an important classification tool in statistics and machine learning. This paper investigates the varying coefficient LDA model for dynamic data, with Bayes' discriminant direction being a function of some exposure variable to address the heterogeneity. We propose a new least-square estimation method based on the B-spline approximation. The data-driven discriminant procedure is more computationally efficient than the dynamic linear programming rule \citep{jiang2020dynamic}. We also establish the convergence rates for the corresponding estimation error bound and the excess misclassification risk. The estimation error in $L_2$ distance is optimal for the low-dimensional regime and is near optimal for the high-dimensional regime. Numerical experiments on synthetic data and real data both corroborate the superiority of our proposed classification method.
△ Less
Submitted 10 October, 2022; v1 submitted 12 March, 2022;
originally announced March 2022.
-
Electroweak ALP Searches at a Muon Collider
Authors:
Yunjia Bao,
JiJi Fan,
Lingfeng Li
Abstract:
A high-energy muon collider with center-of-mass energy around and above 10 TeV is also a vector boson fusion (VBF) machine, due to the significant virtual electroweak (EW) gauge boson content of high-energy muon beams. This feature, together with the clean environment, makes it an ideal collider to search for TeV-scale axion-like particles (ALP) coupling to Standard Model EW gauge bosons, which cu…
▽ More
A high-energy muon collider with center-of-mass energy around and above 10 TeV is also a vector boson fusion (VBF) machine, due to the significant virtual electroweak (EW) gauge boson content of high-energy muon beams. This feature, together with the clean environment, makes it an ideal collider to search for TeV-scale axion-like particles (ALP) coupling to Standard Model EW gauge bosons, which current and other future colliders have limited sensitivities to. We present detailed analyses of heavy ALP searches in both the VBF and associated production channels at a muon collider with different running benchmarks. We also show projected constraints on the ALP couplings in the effective field theory, including an operator with its coefficient not determined by the mixed Peccei-Quinn anomaly. We demonstrate that a muon collider could probe new ALP parameter space and push the sensitivities of the couplings between the ALP and EW gauge bosons by one order of magnitude compared to HL-LHC. The projected limits and search strategies for ALPs could also be applied to other types of resonances coupling to EW gauge bosons.
△ Less
Submitted 10 September, 2022; v1 submitted 8 March, 2022;
originally announced March 2022.
-
Transformations in Learned Image Compression from a Modulation Perspective
Authors:
Youneng Bao,
Fangyang Meng,
Wen Tan,
Chao Li,
Yonghong Tian,
Yongsheng Liang
Abstract:
In this paper, a unified transformation method in learned image compression(LIC) is proposed from the perspective of modulation. Firstly, the quantization in LIC is considered as a generalized channel with additive uniform noise. Moreover, the LIC is interpreted as a particular communication system according to the consistency in structures and optimization objectives. Thus, the technology of comm…
▽ More
In this paper, a unified transformation method in learned image compression(LIC) is proposed from the perspective of modulation. Firstly, the quantization in LIC is considered as a generalized channel with additive uniform noise. Moreover, the LIC is interpreted as a particular communication system according to the consistency in structures and optimization objectives. Thus, the technology of communication systems can be applied to guide the design of modules in LIC. Furthermore, a unified transform method based on signal modulation (TSM) is defined. In the view of TSM, the existing transformation methods are mathematically reduced to a linear modulation. A series of transformation methods, e.g. TPM and TJM, are obtained by extending to nonlinear modulation. The experimental results on various datasets and backbone architectures verify that the effectiveness and robustness of the proposed method. More importantly, it further confirms the feasibility of guiding LIC design from a communication perspective. For example, when backbone architecture is hyperprior combining context model, our method achieves 3.52$\%$ BD-rate reduction over GDN on Kodak dataset without increasing complexity.
△ Less
Submitted 12 March, 2024; v1 submitted 4 March, 2022;
originally announced March 2022.
-
First joint observation by the underground gravitational-wave detector, KAGRA, with GEO600
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Allocca,
P. A. Altin
, et al. (1647 additional authors not shown)
Abstract:
We report the results of the first joint observation of the KAGRA detector with GEO600. KAGRA is a cryogenic and underground gravitational-wave detector consisting of a laser interferometer with three-kilometer arms, and located in Kamioka, Gifu, Japan. GEO600 is a British--German laser interferometer with 600 m arms, and located near Hannover, Germany. GEO600 and KAGRA performed a joint observing…
▽ More
We report the results of the first joint observation of the KAGRA detector with GEO600. KAGRA is a cryogenic and underground gravitational-wave detector consisting of a laser interferometer with three-kilometer arms, and located in Kamioka, Gifu, Japan. GEO600 is a British--German laser interferometer with 600 m arms, and located near Hannover, Germany. GEO600 and KAGRA performed a joint observing run from April 7 to 20, 2020. We present the results of the joint analysis of the GEO--KAGRA data for transient gravitational-wave signals, including the coalescence of neutron-star binaries and generic unmodeled transients. We also perform dedicated searches for binary coalescence signals and generic transients associated with gamma-ray burst events observed during the joint run. No gravitational-wave events were identified. We evaluate the minimum detectable amplitude for various types of transient signals and the spacetime volume for which the network is sensitive to binary neutron-star coalescences. We also place lower limits on the distances to the gamma-ray bursts analysed based on the non-detection of an associated gravitational-wave signal for several signal models, including binary coalescences. These analyses demonstrate the feasibility and utility of KAGRA as a member of the global gravitational-wave detector network.
△ Less
Submitted 19 August, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
Measurement-Induced Power-Law Negativity in an Open Monitored Quantum Circuit
Authors:
Zack Weinstein,
Yimu Bao,
Ehud Altman
Abstract:
Generic many-body systems coupled to an environment lose their quantum entanglement due to decoherence and evolve to a mixed state with only classical correlations. Here, we show that measurements can stabilize quantum entanglement within open quantum systems. Specifically, in random unitary circuits with dephasing at the boundary, we find both numerically and analytically that projective measurem…
▽ More
Generic many-body systems coupled to an environment lose their quantum entanglement due to decoherence and evolve to a mixed state with only classical correlations. Here, we show that measurements can stabilize quantum entanglement within open quantum systems. Specifically, in random unitary circuits with dephasing at the boundary, we find both numerically and analytically that projective measurements performed at a small nonvanishing rate results in a steady state with an $L^{1/3}$ power-law scaling entanglement negativity within the system. Using an analytical mapping to a statistical mechanics model of directed polymers in a random environment, we show that the power-law negativity scaling can be understood as Kardar-Parisi-Zhang fluctuations due to the random measurement locations. Further increasing the measurement rate leads to a phase transition into an area-law negativity phase, which is of the same universality as the entanglement transition in monitored random circuits without decoherence.
△ Less
Submitted 3 September, 2022; v1 submitted 25 February, 2022;
originally announced February 2022.