-
Structural Similarity: When to Use Deep Generative Models on Imbalanced Image Dataset Augmentation
Authors:
Chenqi Guo,
Fabian Benitez-Quiroz,
Qianli Feng,
Aleix Martinez
Abstract:
Improving performance on an imbalanced training set is one of the main challenges in modern machine learning. One way to augment and thus re-balance an image dataset is to use existing deep generative models, such as class-conditional Generative Adversarial Networks (cGAN) or Diffusion Models, to synthesize images for each tail class. Our experiments on imbalanced image dataset classification show that the validation accuracy improvement from such a re-balancing method is related to the image similarity between different classes. Thus, to quantify this image dataset class similarity, we propose a measurement called Super-Sub Class Structural Similarity (SSIM-supSubCls) based on Structural Similarity (SSIM). A deep generative model data augmentation classification (GM-augCls) pipeline is also provided to verify that this metric correlates with the accuracy enhancement. We further quantify the relationship between them, discovering that the accuracy improvement decays exponentially with respect to SSIM-supSubCls values.
Submitted 8 March, 2023;
originally announced March 2023.
-
X-Avatar: Expressive Human Avatars
Authors:
Kaiyue Shen,
Chen Guo,
Manuel Kaufmann,
Juan Jose Zarate,
Julien Valentin,
Jie Song,
Otmar Hilliges
Abstract:
We present X-Avatar, a novel avatar model that captures the full expressiveness of digital humans to bring about life-like experiences in telepresence, AR/VR and beyond. Our method models bodies, hands, facial expressions and appearance in a holistic fashion and can be learned from either full 3D scans or RGB-D data. To achieve this, we propose a part-aware learned forward skinning module that can be driven by the parameter space of SMPL-X, allowing for expressive animation of X-Avatars. To efficiently learn the neural shape and deformation fields, we propose novel part-aware sampling and initialization strategies. This leads to higher fidelity results, especially for smaller body parts while maintaining efficient training despite increased number of articulated bones. To capture the appearance of the avatar with high-frequency details, we extend the geometry and deformation fields with a texture network that is conditioned on pose, facial expression, geometry and the normals of the deformed surface. We show experimentally that our method outperforms strong baselines in both data domains both quantitatively and qualitatively on the animation task. To facilitate future research on expressive avatars we contribute a new dataset, called X-Humans, containing 233 sequences of high-quality textured scans from 20 participants, totalling 35,500 data frames.
Submitted 9 March, 2023; v1 submitted 8 March, 2023;
originally announced March 2023.
-
JUNO sensitivity to $^7$Be, $pep$, and CNO solar neutrinos
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta
, et al. (592 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented levels of precision. In this paper, we provide an estimation of the JUNO sensitivity to 7Be, pep, and CNO solar neutrinos that can be obtained via a spectral analysis above the 0.45 MeV threshold. This study is performed assuming different scenarios of the liquid scintillator radiopurity, ranging from the most optimistic one corresponding to the radiopurity levels obtained by the Borexino experiment, up to the minimum requirements needed to perform the neutrino mass ordering determination with reactor antineutrinos - the main goal of JUNO. Our study shows that in most scenarios, JUNO will be able to improve the current best measurements on 7Be, pep, and CNO solar neutrino fluxes. We also perform a study on the JUNO capability to detect periodical time variations in the solar neutrino flux, such as the day-night modulation induced by neutrino flavor regeneration in Earth, and the modulations induced by temperature changes driven by helioseismic waves.
Submitted 7 March, 2023;
originally announced March 2023.
-
Towards practical and massively parallel quantum computing emulation for quantum chemistry
Authors:
Honghui Shang,
Yi Fan,
Li Shen,
Chu Guo,
Jie Liu,
Xiaohui Duan,
Fang Li,
Zhenyu Li
Abstract:
Quantum computing is moving beyond its early stage and seeking commercial applications in chemical and biomedical sciences. In the current noisy intermediate-scale quantum computing era, quantum resources are too scarce to support these explorations. Therefore, it is valuable to emulate quantum computing on classical computers for developing quantum algorithms and validating quantum hardware. However, existing simulators mostly suffer from the memory bottleneck, so developing approaches for large-scale quantum chemistry calculations remains challenging. Here we demonstrate a high-performance and massively parallel variational quantum eigensolver (VQE) simulator based on matrix product states, combined with embedding theory for solving large-scale quantum computing emulation for quantum chemistry on HPC platforms. We apply this method to study the torsional barrier of ethane and the quantification of the protein-ligand interactions. Our largest simulation reaches $1000$ qubits, and a performance of $216.9$ PFLOPS is achieved on a new Sunway supercomputer, which sets the state-of-the-art for quantum computing emulation for quantum chemistry.
Submitted 7 March, 2023;
originally announced March 2023.
-
Fine-Grained Classification with Noisy Labels
Authors:
Qi Wei,
Lei Feng,
Haoliang Sun,
Ren Wang,
Chenhui Guo,
Yilong Yin
Abstract:
Learning with noisy labels (LNL) aims to ensure model generalization given a label-corrupted training set. In this work, we investigate a rarely studied scenario of LNL on fine-grained datasets (LNL-FG), which is more practical and challenging as large inter-class ambiguities among fine-grained classes cause more noisy labels. We empirically show that existing methods that work well for LNL fail to achieve satisfying performance for LNL-FG, raising the practical need for effective solutions for LNL-FG. To this end, we propose a novel framework called stochastic noise-tolerated supervised contrastive learning (SNSCL) that confronts label noise by encouraging distinguishable representation. Specifically, we design a noise-tolerated supervised contrastive learning loss that incorporates a weight-aware mechanism for noisy label correction and selectively updating momentum queue lists. By this mechanism, we mitigate the effects of noisy anchors and avoid inserting noisy labels into the momentum-updated queue. Besides, to avoid manually-defined augmentation strategies in contrastive learning, we propose an efficient stochastic module that samples feature embeddings from a generated distribution, which can also enhance the representation ability of deep models. SNSCL is general and compatible with prevailing robust LNL strategies to improve their performance for LNL-FG. Extensive experiments demonstrate the effectiveness of SNSCL.
Submitted 4 March, 2023;
originally announced March 2023.
-
A note to "Radial limits of quasiregular Local Homeomorphisms"
Authors:
Chang-Yu Guo,
Yi Xuan
Abstract:
In this short note, we consider quasiregular local homeomorphisms on uniform domains. We prove that such mappings always can be extended to some boundary points along John curves, which extends the corresponding result of Rajala [Amer. J. Math. 2008].
Submitted 1 March, 2023;
originally announced March 2023.
-
Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching
Authors:
Zheng Li,
Caili Guo,
Xin Wang,
Zerun Feng,
Zhongtian Du
Abstract:
Recently, a series of Image-Text Matching (ITM) methods achieve impressive performance. However, we observe that most existing ITM models suffer from gradients vanishing at the beginning of training, which makes these models prone to falling into local minima. Most ITM models adopt triplet loss with Hard Negative mining (HN) as the optimization objective. We find that optimizing an ITM model using only the hard negative samples can easily lead to gradient vanishing. In this paper, we derive the condition under which the gradient vanishes during training. When the difference between the positive pair similarity and the negative pair similarity is close to 0, the gradients on both the image and text encoders will approach 0. To alleviate the gradient vanishing problem, we propose a Selectively Hard Negative Mining (SelHN) strategy, which chooses whether to mine hard negative samples according to the gradient vanishing condition. SelHN can be plug-and-play applied to existing ITM models to give them better training behavior. To further ensure the back-propagation of gradients, we construct a Residual Visual Semantic Embedding model with SelHN, denoted as RVSE++. Extensive experiments on two ITM benchmarks demonstrate the strength of RVSE++, achieving state-of-the-art performance.
Submitted 28 February, 2023;
originally announced March 2023.
-
CLICKER: Attention-Based Cross-Lingual Commonsense Knowledge Transfer
Authors:
Ruolin Su,
Zhongkai Sun,
Sixing Lu,
Chengyuan Ma,
Chenlei Guo
Abstract:
Recent advances in cross-lingual commonsense reasoning (CSR) are facilitated by the development of multilingual pre-trained models (mPTMs). While mPTMs show the potential to encode commonsense knowledge for different languages, transferring commonsense knowledge learned in large-scale English corpus to other languages is challenging. To address this problem, we propose the attention-based Cross-LIngual Commonsense Knowledge transfER (CLICKER) framework, which minimizes the performance gaps between English and non-English languages in commonsense question-answering tasks. CLICKER effectively improves commonsense reasoning for non-English languages by differentiating non-commonsense knowledge from commonsense knowledge. Experimental results on public benchmarks demonstrate that CLICKER achieves remarkable improvements in the cross-lingual CSR task for languages other than English.
Submitted 25 February, 2023;
originally announced February 2023.
-
LightTS: Lightweight Time Series Classification with Adaptive Ensemble Distillation -- Extended Version
Authors:
David Campos,
Miao Zhang,
Bin Yang,
Tung Kieu,
Chenjuan Guo,
Christian S. Jensen
Abstract:
Due to the sweeping digitalization of processes, increasingly vast amounts of time series data are being produced. Accurate classification of such time series facilitates decision making in multiple domains. State-of-the-art classification accuracy is often achieved by ensemble learning where results are synthesized from multiple base models. This characteristic implies that ensemble learning needs substantial computing resources, preventing their use in resource-limited environments, such as in edge devices. To extend the applicability of ensemble learning, we propose the LightTS framework that compresses large ensembles into lightweight models while ensuring competitive accuracy. First, we propose adaptive ensemble distillation that assigns adaptive weights to different base models such that their varying classification capabilities contribute purposefully to the training of the lightweight model. Second, we propose means of identifying Pareto optimal settings w.r.t. model accuracy and model size, thus enabling users with a space budget to select the most accurate lightweight model. We report on experiments using 128 real-world time series sets and different types of base models that justify key decisions in the design of LightTS and provide evidence that LightTS is able to outperform competitors.
Submitted 24 February, 2023;
originally announced February 2023.
-
Embedding Fourier for Ultra-High-Definition Low-Light Image Enhancement
Authors:
Chongyi Li,
Chun-Le Guo,
Man Zhou,
Zhexin Liang,
Shangchen Zhou,
Ruicheng Feng,
Chen Change Loy
Abstract:
Ultra-High-Definition (UHD) photo has gradually become the standard configuration in advanced imaging devices. The new standard unveils many issues in existing approaches for low-light image enhancement (LLIE), especially in dealing with the intricate issue of joint luminance enhancement and noise removal while remaining efficient. Unlike existing methods that address the problem in the spatial domain, we propose a new solution, UHDFour, that embeds Fourier transform into a cascaded network. Our approach is motivated by a few unique characteristics in the Fourier domain: 1) most luminance information concentrates on amplitudes while noise is closely related to phases, and 2) a high-resolution image and its low-resolution version share similar amplitude patterns. Through embedding Fourier into our network, the amplitude and phase of a low-light image are separately processed to avoid amplifying noise when enhancing luminance. Besides, UHDFour is scalable to UHD images by implementing amplitude and phase enhancement under the low-resolution regime and then adjusting the high-resolution scale with few computations. We also contribute the first real UHD LLIE dataset, UHD-LL, that contains 2,150 low-noise/normal-clear 4K image pairs with diverse darkness and noise levels captured in different scenarios. With this dataset, we systematically analyze the performance of existing LLIE methods for processing UHD images and demonstrate the advantage of our solution. We believe our new framework, coupled with the dataset, would push the frontier of LLIE towards UHD. The code and dataset are available at https://li-chongyi.github.io/UHDFour.
Submitted 23 February, 2023;
originally announced February 2023.
-
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition
Authors:
Chen Guo,
Tianjian Jiang,
Xu Chen,
Jie Song,
Otmar Hilliges
Abstract:
We present Vid2Avatar, a method to learn human avatars from monocular in-the-wild videos. Reconstructing humans that move naturally from monocular in-the-wild videos is difficult. Solving it requires accurately separating humans from arbitrary backgrounds. Moreover, it requires reconstructing detailed 3D surface from short video sequences, making it even more challenging. Despite these challenges, our method does not require any groundtruth supervision or priors extracted from large datasets of clothed human scans, nor do we rely on any external segmentation modules. Instead, it solves the tasks of scene decomposition and surface reconstruction directly in 3D by modeling both the human and the background in the scene jointly, parameterized via two separate neural fields. Specifically, we define a temporally consistent human representation in canonical space and formulate a global optimization over the background model, the canonical human shape and texture, and per-frame human pose parameters. A coarse-to-fine sampling strategy for volume rendering and novel objectives are introduced for a clean separation of dynamic human and static background, yielding detailed and robust 3D human geometry reconstructions. We evaluate our methods on publicly available datasets and show improvements over prior art.
Submitted 22 February, 2023;
originally announced February 2023.
-
KG-ECO: Knowledge Graph Enhanced Entity Correction for Query Rewriting
Authors:
Jinglun Cai,
Mingda Li,
Ziyan Jiang,
Eunah Cho,
Zheng Chen,
Yang Liu,
Xing Fan,
Chenlei Guo
Abstract:
Query Rewriting (QR) plays a critical role in large-scale dialogue systems for reducing frictions. When there is an entity error, it imposes extra challenges for a dialogue system to produce satisfactory responses. In this work, we propose KG-ECO: Knowledge Graph enhanced Entity COrrection for query rewriting, an entity correction system with corrupt entity span detection and entity retrieval/re-ranking functionalities. To boost the model performance, we incorporate Knowledge Graph (KG) to provide entity structural information (neighboring entities encoded by graph neural networks) and textual information (KG entity descriptions encoded by RoBERTa). Experimental results show that our approach yields a clear performance gain over two baselines: utterance level QR and entity correction without utilizing KG information. The proposed system is particularly effective for few-shot learning cases where target entities are rarely seen in training or there is a KG relation between the target entity and other contextual entities in the query.
Submitted 22 February, 2023; v1 submitted 21 February, 2023;
originally announced February 2023.
-
Path Integral Method for Pricing Proportional Step Double-Barrier Option with Time Dependent Parameters
Authors:
Qi Chen,
Chao Guo
Abstract:
Path integral method in quantum mechanics provides a new thinking for barrier option pricing. For proportional double-barrier step (PDBS) options, the option price changing process is analogous to a particle moving in a finite symmetric square potential well. We have derived the pricing kernel of PDBS options with time dependent interest rate and volatility. Numerical results of option price as a function of underlying asset price are shown as well. Path integral method can be easily generalized to the pricing of PDBS options with curved boundaries.
Submitted 15 February, 2023;
originally announced February 2023.
-
Accumulation of scale-free localized states induced by local non-Hermiticity
Authors:
Cui-Xian Guo,
Xueliang Wang,
Haiping Hu,
Shu Chen
Abstract:
The bulk states of Hermitian systems are believed insensitive to local Hermitian impurities or perturbations except for a few impurity-induced bound states. Thus, it is important to ask whether local non-Hermiticity can cause drastic changes to the original Hermitian systems. Here we address this issue affirmatively and present exact solutions for the double chain model with local non-Hermitian terms possessing parity-time ($\mathcal{PT}$) symmetry. Induced by the non-Hermiticity, the system undergoes a sequence of $\mathcal{PT}$-symmetry breakings, after which the eigenenergies appear in complex conjugate pairs. The associated extended bulk states then become scale-free localized and unidirectionally accumulated around the impurity. There exist mobility edges separating the residual extended states until a full scale-free localization of all eigenstates. Further increasing the non-Hermiticity counter-intuitively brings the system to a $\mathcal{PT}$-restoration regime with fully real spectra except for a pair of complex bound states. We demonstrate that the local non-Hermiticity generated scale-free localization is a general phenomenon and can even survive the quasiperiodic disorder. Our results indicate that the bulk properties of the original Hermitian system can be globally reshaped by local non-Hermiticity.
Submitted 30 April, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Page curves and Entanglement Islands for the Step-Function Vaidya Model of Evaporating Black Holes
Authors:
Chang-Zhong Guo,
Wen-Cong Gan,
Fu-Wen Shu
Abstract:
It was proposed recently that the fine-grained entropy of the Hawking radiation can be expressed by the semiclassical island formula, which reproduces the unitary Page curve. In this paper, we choose the "in" vacuum state and apply the quantum extremal surface construction to study the Page curve for the step-function Vaidya model of evaporating black holes in four dimensions, which is produced by the spherical null shells. Metrics of the three regions of this spacetime are obtained. In addition, the entanglement islands for the step-function Vaidya model of evaporating black holes at very late times are studied. When cutoff surface $A$ is located in Minkowski region III with $u_A < u_H$ at very late times, we find that the location of the boundary of island $\partial I$ depends on the value of $8M-v_A+v_I$. Specifically, $\partial I$ is inside, at or outside the horizon when $8M-v_A+v_I$ is less than, equal to or larger than zero respectively. Moreover, when cutoff surface $A$ is located in Minkowski region III with $u_A > u_H$ after the black hole evaporates completely, we find that entanglement island still exists and $\partial I$ is located on an equal-time Cauchy surface of the observer $A$ when $r_{(A)}^2 \geq 64 G_N \kappa c$.
Submitted 8 May, 2023; v1 submitted 5 February, 2023;
originally announced February 2023.
-
Deep Joint Source-Channel Coding for Wireless Image Transmission with Semantic Importance
Authors:
Qizheng Sun,
Caili Guo,
Yang Yang,
Jiujiu Chen,
Rui Tang,
Chuanhong Liu
Abstract:
The sixth-generation mobile communication system proposes the vision of smart interconnection of everything, which requires accomplishing communication tasks while ensuring the performance of intelligent tasks. A joint source-channel coding method based on semantic importance is proposed, which aims at preserving semantic information during wireless image transmission and thereby boosting the performance of intelligent tasks for images at the receiver. Specifically, we first propose a semantic importance weight calculation method, which is based on the gradient of the intelligent task's perception results with respect to the features. Then, we design the semantic loss function by using semantic weights to weight the features. Finally, we train the deep joint source-channel coding network using the semantic loss function. Experiment results demonstrate that the proposed method achieves up to 57.7% and 9.1% improvement in terms of the intelligent task's performance compared with the source-channel separation coding method and the deep source-channel joint coding method without considering semantics at the same compression rate and signal-to-noise ratio, respectively.
Submitted 4 February, 2023;
originally announced February 2023.
-
Chromatic aberrations correction of attosecond high-order harmonic beams by flat-top spatial shaping of the fundamental beam
Authors:
K. Veyrinas,
M. Plach,
J. Peschel,
M. Hoflund,
F. Catoire,
C. Valentin,
P. Smorenburg,
H. Dacasa,
S. Maclot,
C. Guo,
H. Wikmark,
A. Zair,
V. Strelkov,
C. Picot,
C. Arnold,
P. Eng-Johnsson,
A. L'Huillier,
E. Mevel,
E. Constant
Abstract:
Attosecond pulses created by high-order harmonic generation in gases often exhibit strong chromatic aberrations, arising from the broad bandwidth and wavelength-dependent nonlinear light-matter interaction. When the driving laser intensity varies spatially, as for Gaussian driving beams, the apparent source position of the harmonics differs significantly from one order to the next, thus affecting the achievable intensity and duration of the attosecond pulses when they are focused on a target. We show that these chromatic aberrations can be reduced by spatially shaping the fundamental beam to generate high-order harmonics with a driver having a flat-top profile inside the gas medium. By measuring both the intensity profile and wavefront for each harmonic in a plane, we access the extreme ultra-violet (XUV) beam properties and investigate these properties near focus. We observe that controlling chromatic aberrations by flat-top spatial shaping strongly reduces the variation of the XUV spectrum on the beam axis during propagation and, in return, the longitudinal sensitivity of both the temporal profiles and the temporal shifts of the focused attosecond pulses.
Submitted 26 January, 2023;
originally announced January 2023.
-
Ultra-stable and versatile high-energy resolution setup for attosecond photoelectron spectroscopy
Authors:
Sizuo Luo,
Robin Weissenbilder,
Hugo Laurell,
Mattias Ammitzböll,
Vénus Poulain,
David Busto,
Lana Neoričić,
Chen Guo,
Shiyang Zhong,
David Kroon,
Richard J Squibb,
Raimund Feifel,
Mathieu Gisselbrecht,
Anne L'Huillier,
Cord L Arnold
Abstract:
Attosecond photoelectron spectroscopy is often performed with interferometric experimental setups that require outstanding stability. We demonstrate and characterize in detail an actively stabilized, versatile, high spectral resolution attosecond beamline. The active-stabilization system can remain ultra-stable for several hours with an RMS stability of 13 as and a total pump-probe delay scanning range of $\sim$400 fs. A tunable femtosecond laser source to drive high-order harmonic generation allows for precisely addressing atomic and molecular resonances. Furthermore, the interferometer includes a spectral shaper in 4f-geometry in the probe arm as well as a tunable bandpass filter in the pump arm, which offer additional high flexibility in terms of tunability as well as narrowband or polychromatic probe pulses. We show that spectral phase measurements of photoelectron wavepackets with the rainbow RABBIT technique (reconstruction of attosecond beating by two photon transitions) with narrowband probe pulses can significantly improve the photoelectron energy resolution. In this setup, the temporal-spectral resolution of photoelectron spectroscopy can reach a new level of accuracy and precision.
Submitted 21 January, 2023;
originally announced January 2023.
-
Research of radon diffusion behavior in liquid scintillator
Authors:
Z. F. Xu,
C. Guo,
J. C. Liu,
Y. P. Zhang,
P. Zhang,
C. G. Yang,
Q. Tang,
Y. Liu,
C. Li,
T. Y. Guan
Abstract:
Radon and its daughters are an important source of background in low-background liquid scintillator (LS) detectors. Studying the diffusion behaviour of radon in the LS contributes to the analysis of this background. Methodologies and devices for measuring the diffusion coefficient and solubility of radon in materials are developed and described. The radon diffusion coefficient of the LS was measured for the first time, and the solubility coefficient was also obtained. In addition, the radon diffusion coefficient of a polyolefine film was measured and found to be consistent with data in the literature, which verifies the reliability of the diffusion device.
Submitted 28 January, 2023; v1 submitted 17 January, 2023;
originally announced January 2023.
-
Block belief propagation algorithm for two-dimensional tensor networks
Authors:
Chu Guo,
Dario Poletti,
Itai Arad
Abstract:
Belief propagation is a well-studied algorithm for approximating local marginals of multivariate probability distributions over complex networks, while tensor network states are powerful tools for quantum and classical many-body problems. Building on a recent connection between the belief propagation algorithm and the problem of tensor network contraction, we propose a block belief propagation algorithm for contracting two-dimensional tensor networks and approximating the ground state of 2D systems. The advantages of our method are three-fold: 1) the same algorithm works for both finite and infinite systems; 2) it allows natural and efficient parallelization; 3) its flexibility allows it to handle different unit cells. As applications, we use our algorithm to study the 2D Heisenberg and transverse Ising models, and show that the accuracy of the method is on par with state-of-the-art results.
Submitted 6 September, 2023; v1 submitted 14 January, 2023;
originally announced January 2023.
-
A real neural network state for quantum chemistry
Authors:
Yangjun Wu,
Xiansong Xu,
Dario Poletti,
Yi Fan,
Chu Guo,
Honghui Shang
Abstract:
The restricted Boltzmann machine (RBM) has been successfully applied to solve the many-electron Schrödinger equation. In this work we propose a single-layer fully connected neural network adapted from RBM and apply it to study ab initio quantum chemistry problems. Our contribution is two-fold: 1) our neural network only uses real numbers to represent the real electronic wave function, while we obtain comparable precision to RBM for various prototypical molecules; 2) we show that the knowledge of the Hartree-Fock reference state can be used to systematically accelerate the convergence of the variational Monte Carlo algorithm as well as to increase the precision of the final energy.
Submitted 9 January, 2023;
originally announced January 2023.
-
System upgrade for $μ$Bq/m$^3$ level $^{222}$Rn concentration measurement
Authors:
Y. Liu,
Y. P. Zhang,
J. C. Liu,
C. Guo,
C. G. Yang,
P. Zhang,
Q. Tang,
Z. F. Xu,
C. Li,
T. Y. Guan,
S. B. Wang
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO), a 20 kton multipurpose underground liquid scintillator detector, was proposed with the determination of the neutrino mass hierarchy as its primary physics goal. The central detector will be submerged in a water Cherenkov detector to lower the background from the environment and cosmic muons. Radon is one of the primary background sources. Nitrogen will be used in several sub-systems, and a highly sensitive radon detector has to be developed to measure its radon concentration. A system has been developed based on $^{222}$Rn enrichment with activated carbon and $^{222}$Rn detection by electrostatic collection. This paper presents the details of a $μ$Bq/m$^3$ level $^{222}$Rn concentration measurement system and describes how the adsorption coefficient was measured and how the temperature, flow rate, and $^{222}$Rn concentration affect the adsorption coefficient.
Submitted 24 September, 2023; v1 submitted 3 January, 2023;
originally announced January 2023.
-
Provable Robust Saliency-based Explanations
Authors:
Chao Chen,
Chenghua Guo,
Guixiang Ma,
Ming Zeng,
Xi Zhang,
Sihong Xie
Abstract:
Robust explanations of machine learning models are critical to establishing human trust in the models. The top-$k$ intersection is widely used to evaluate the robustness of explanations. However, most existing attack and defense strategies are based on $\ell_p$ norms, thus creating a mismatch between the evaluation and optimization objectives. To this end, we define explanation thickness for measuring the ranking stability of the top-$k$ salient features, and design the R2ET algorithm based on a novel tractable surrogate to maximize the thickness and stabilize the top salient features efficiently. Theoretically, we prove a connection between R2ET and adversarial training; using a novel multi-objective optimization formulation and a generalization error bound, we further prove that the surrogate objective can improve both the numerical and statistical stability of the explanations. Experiments with a wide spectrum of network architectures and data modalities demonstrate that R2ET attains higher explanation robustness under stealthy attacks while retaining model accuracy.
Submitted 8 July, 2023; v1 submitted 28 December, 2022;
originally announced December 2022.
-
Developing a single phase liquid argon detector with SiPM readout
Authors:
L. Wang,
Y. Lei,
T. A. Wang,
C. Guo,
K. K. Zhao,
X. H. Liang,
S. B. Wang,
Y. D. Chen
Abstract:
Liquid argon is used as a target material in several current and planned experiments for direct dark matter searches and neutrino detection. SiPMs are becoming the standard for scintillator detectors because of their advantages over traditional PMTs. In this paper, we developed a single-phase liquid argon detector using eight 1 $\times$ 1 inch$^2$ Hamamatsu S14161-6050HS 4 $\times$ 4 SiPM arrays. The directly measured light yield is 25.7 $\pm$ 1.6 photo-electrons per keV, which corresponds to 12.8 $\pm$ 0.8 photo-electrons primarily generated by the argon scintillation. The rest is contributed by the crosstalk and afterpulsing of the SiPMs. In addition, we provide an experimental method to estimate the effect of crosstalk and afterpulsing on light yield using dark noise data. Finally, we quantify the relationship between the light yield and the decay time of the slow component of a liquid argon detector.
Submitted 1 January, 2023; v1 submitted 26 December, 2022;
originally announced December 2022.
-
Tightening Quadratic Convex Relaxations for the AC Optimal Transmission Switching Problem
Authors:
Cheng Guo,
Harsha Nagarajan,
Merve Bodur
Abstract:
The Alternating Current Optimal Transmission Switching (ACOTS) problem incorporates line switching decisions into the fundamental AC optimal power flow (ACOPF) problem. The advantages of the ACOTS problem are well-known in terms of reducing the operational cost and improving system reliability. ACOTS optimization models contain discrete variables and nonlinear, non-convex structures, which make it difficult to solve. We derive strengthened quadratic convex (QC) relaxations for ACOTS by combining several methodologies recently developed in the ACOPF literature. First, we relax the ACOTS model with the on/off QC relaxation, which has been empirically observed to be both tight and computationally efficient in approximating the ACOPF problem. Further, we tighten this relaxation by using strong linearization with extreme-point representation, and by adding several types of new valid inequalities. In particular, we derive a novel kind of "on/off cycle-based polynomial constraints", by taking advantage of the network structure. Those constraints are linearized using convex-hull representations and implemented in an efficient "branch-and-cut" framework. We also tighten the relaxation using the optimization-based bound tightening algorithm. Our extensive numerical experiments on medium-scale PGLib instances show that, compared with the state-of-the-art formulations, our strengthening techniques are able to improve the quality of ACOTS relaxations on many of the PGLib instances, with some being substantial improvements.
Submitted 22 December, 2022;
originally announced December 2022.
-
Reactor neutrino physics potentials of cryogenic pure-CsI crystal
Authors:
L. Wang,
G. d. Li,
Z. Y. Yu,
X. H. Liang,
T. A. Wang,
F. Liu,
X. L. Sun,
C. Guo,
X. Zhang,
L. Yu,
Y. D. Chen
Abstract:
This paper presents a world-leading scintillation light yield among inorganic crystals, measured from a 0.5 kg pure-CsI detector operated at 77 Kelvin. Scintillation photons were detected by two 2-inch Hamamatsu SiPM arrays equipped with cryogenic front-end electronics. Benefiting from the light yield enhancement of pure-CsI at low temperatures and the high photon detection efficiency of SiPM, a light yield of 30.1 photoelectrons per keV energy deposit was obtained for X-rays and $γ$-rays with energies from 5.9 keV to 59.6 keV. Instrumental and physical effects in the light yield measurement are carefully analyzed. This is the first stable cryogenic operation of a kg-scale pure-CsI crystal read out by SiPM arrays at liquid nitrogen temperatures for several days. The world-leading light yield opens the door to the use of pure-CsI crystals in several fields, particularly in detecting the coherent elastic neutrino-nucleus scattering of reactor neutrinos. The potential of using pure-CsI crystals in neutrino physics is discussed in the paper.
Submitted 16 April, 2024; v1 submitted 22 December, 2022;
originally announced December 2022.
-
A Pattern Discovery Approach to Multivariate Time Series Forecasting
Authors:
Yunyao Cheng,
Chenjuan Guo,
Kaixuan Chen,
Kai Zhao,
Bin Yang,
Jiandong Xie,
Christian S. Jensen,
Feiteng Huang,
Kai Zheng
Abstract:
Multivariate time series forecasting constitutes important functionality in cyber-physical systems, whose prediction accuracy can be improved significantly by capturing temporal and multivariate correlations among multiple time series. State-of-the-art deep learning methods fail to construct models for full time series because model complexity grows exponentially with time series length. Rather, these methods construct local temporal and multivariate correlations within subsequences, but fail to capture correlations among subsequences, which significantly affects their forecasting accuracy. To capture the temporal and multivariate correlations among subsequences, we design a pattern discovery model that constructs correlations via diverse pattern functions. Whereas the traditional pattern discovery method uses shared and fixed pattern functions that ignore the diversity across time series, we propose a novel pattern discovery method that can automatically capture diverse and complex time series patterns. We also propose a learnable correlation matrix that enables the model to capture distinct correlations among multiple time series. Extensive experiments show that our model achieves state-of-the-art prediction accuracy.
Submitted 20 December, 2022;
originally announced December 2022.
-
Information Bottleneck-Inspired Type Based Multiple Access for Remote Estimation in IoT Systems
Authors:
Meiyi Zhu,
Chunyan Feng,
Caili Guo,
Nan Jiang,
Osvaldo Simeone
Abstract:
Type-based multiple access (TBMA) is a semantics-aware multiple access protocol for remote inference. In TBMA, codewords are reused across transmitting sensors, with each codeword being assigned to a different observation value. Existing TBMA protocols are based on fixed shared codebooks and on conventional maximum-likelihood or Bayesian decoders, which require knowledge of the distributions of observations and channels. In this letter, we propose a novel design principle for TBMA based on the information bottleneck (IB). In the proposed IB-TBMA protocol, the shared codebook is jointly optimized with a decoder based on artificial neural networks (ANNs), so as to adapt to source, observations, and channel statistics based on data only. We also introduce the Compressed IB-TBMA (CIB-TBMA) protocol, which improves IB-TBMA by enabling a reduction in the number of codewords via an IB-inspired clustering phase. Numerical results demonstrate the importance of a joint design of codebook and neural decoder, and validate the benefits of codebook compression.
Submitted 5 April, 2023; v1 submitted 19 December, 2022;
originally announced December 2022.
-
Biomedical image analysis competitions: The state of current participation practice
Authors:
Matthias Eisenmann,
Annika Reinke,
Vivienn Weru,
Minu Dietlinde Tizabi,
Fabian Isensee,
Tim J. Adler,
Patrick Godau,
Veronika Cheplygina,
Michal Kozubek,
Sharib Ali,
Anubha Gupta,
Jan Kybic,
Alison Noble,
Carlos Ortiz de Solórzano,
Samiksha Pachade,
Caroline Petitjean,
Daniel Sage,
Donglai Wei,
Elizabeth Wilden,
Deepak Alapatt,
Vincent Andrearczyk,
Ujjwal Baid,
Spyridon Bakas,
Niranjan Balu,
Sophia Bano
, et al. (331 additional authors not shown)
Abstract:
The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% of challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants, and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.
Submitted 12 September, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
JUNO Sensitivity on Proton Decay $p\to \bar{ν} K^+$ Searches
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Antonio Bergnoli,
Thilo Birkenfeld,
Sylvie Blin
, et al. (586 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO) is a large liquid scintillator detector designed to explore many topics in fundamental physics. In this paper, the potential of searching for proton decay in the $p\to \bar{ν} K^+$ mode with JUNO is investigated. The kaon and its decay particles feature a clear three-fold coincidence signature that results in a high efficiency for identification. Moreover, the excellent energy resolution of JUNO makes it possible to suppress the sizable background caused by other delayed signals. Based on these advantages, the detection efficiency for the proton decay via $p\to \bar{ν} K^+$ is 36.9% with a background level of 0.2 events after 10 years of data taking. The estimated sensitivity based on 200 kton-years exposure is $9.6 \times 10^{33}$ years, competitive with the current best limits on the proton lifetime in this channel.
Submitted 26 October, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Experimental quantum computational chemistry with optimised unitary coupled cluster ansatz
Authors:
Shaojun Guo,
Jinzhao Sun,
Haoran Qian,
Ming Gong,
Yukun Zhang,
Fusheng Chen,
Yangsen Ye,
Yulin Wu,
Sirui Cao,
Kun Liu,
Chen Zha,
Chong Ying,
Qingling Zhu,
He-Liang Huang,
Youwei Zhao,
Shaowei Li,
Shiyu Wang,
Jiale Yu,
Daojin Fan,
Dachao Wu,
Hong Su,
Hui Deng,
Hao Rong,
Yuan Li,
Kaili Zhang
, et al. (13 additional authors not shown)
Abstract:
Quantum computational chemistry has emerged as an important application of quantum computing. Hybrid quantum-classical computing methods, such as variational quantum eigensolvers (VQE), have been designed as promising solutions to quantum chemistry problems, yet challenges due to theoretical complexity and experimental imperfections hinder progress in achieving reliable and accurate results. Experimental works for solving electronic structures are consequently still restricted to nonscalable (hardware efficient) or classically simulable (Hartree-Fock) ansatz, or limited to a few qubits with large errors. The experimental realisation of scalable and high-precision quantum chemistry simulation remains elusive. Here, we address the critical challenges associated with solving molecular electronic structures using noisy quantum processors. Our protocol presents significant improvements in the circuit depth and running time, key metrics for chemistry simulation. Through systematic hardware enhancements and the integration of error mitigation techniques, we push forward the limit of experimental quantum computational chemistry and successfully scale up the implementation of VQE with an optimised unitary coupled-cluster ansatz to 12 qubits. We produce high-precision results of the ground-state energy for molecules with error suppression by around two orders of magnitude. We achieve chemical accuracy for H$_2$ at all bond distances and LiH at small bond distances in the experiment, even beyond the two recent concurrent works. Our work demonstrates a feasible path towards a scalable solution to electronic structure calculation, validating the key technological features and identifying future challenges for this goal.
Submitted 17 June, 2024; v1 submitted 15 December, 2022;
originally announced December 2022.
-
U2Net: A General Framework with Spatial-Spectral-Integrated Double U-Net for Image Fusion
Authors:
Siran Peng,
Chenhao Guo,
Xiao Wu,
Liang-Jian Deng
Abstract:
In image fusion tasks, images obtained from different sources exhibit distinct properties. Consequently, treating them uniformly with a single-branch network can lead to inadequate feature extraction. Additionally, numerous works have demonstrated that multi-scaled networks capture information more sufficiently than single-scaled models in pixel-level computer vision problems. Considering these factors, we propose U2Net, a spatial-spectral-integrated double U-shape network for image fusion. The U2Net utilizes a spatial U-Net and a spectral U-Net to extract spatial details and spectral characteristics, which allows for the discriminative and hierarchical learning of features from diverse images. In contrast to most previous works that merely employ concatenation to merge spatial and spectral information, this paper introduces a novel spatial-spectral integration structure called S2Block, which combines feature maps from different sources in a logical and effective way. We conduct a series of experiments on two image fusion tasks, including remote sensing pansharpening and hyperspectral image super-resolution (HISR). The U2Net outperforms representative state-of-the-art (SOTA) approaches in both quantitative and qualitative evaluations, demonstrating the superiority of our method. The code is available at https://github.com/PSRben/U2Net.
Submitted 2 October, 2023; v1 submitted 13 December, 2022;
originally announced December 2022.
-
BeautyREC: Robust, Efficient, and Content-preserving Makeup Transfer
Authors:
Qixin Yan,
Chunle Guo,
Jixin Zhao,
Yuekun Dai,
Chen Change Loy,
Chongyi Li
Abstract:
In this work, we propose a Robust, Efficient, and Component-specific makeup transfer method (abbreviated as BeautyREC). Departing from prior methods that leverage global attention, simply concatenate features, or implicitly manipulate features in latent space, we propose a component-specific correspondence to directly transfer the makeup style of a reference image to the corresponding components (e.g., skin, lips, eyes) of a source image, enabling elaborate and accurate local makeup transfer. As an auxiliary, the long-range visual dependencies of Transformer are introduced for effective global makeup transfer. Instead of the commonly used cycle structure that is complex and unstable, we employ a content consistency loss coupled with a content encoder to implement efficient single-path makeup transfer. The key insights of this study are modeling component-specific correspondence for local makeup transfer, capturing long-range dependencies for global makeup transfer, and enabling efficient makeup transfer via a single-path structure. We also contribute BeautyFace, a makeup transfer dataset to supplement existing datasets. This dataset contains 3,000 faces, covering more diverse makeup styles, face poses, and races. Each face has an annotated parsing map. Extensive experiments demonstrate the effectiveness of our method against state-of-the-art methods. Moreover, our method is appealing because it has only 1M parameters yet outperforms the state-of-the-art methods (BeautyGAN: 8.43M, PSGAN: 12.62M, SCGAN: 15.30M, CPM: 9.24M, SSAT: 10.48M).
Submitted 12 December, 2022;
originally announced December 2022.
-
Quantum State Tomography Inspired by Language Modeling
Authors:
Lu Zhong,
Chu Guo,
Xiaoting Wang
Abstract:
Quantum state tomography is an elementary tool to fully characterize an unknown quantum state. As quantum hardware scales up in size, standard quantum state tomography becomes increasingly challenging due to its exponentially growing complexity. In this work, we propose a scalable solution by considering state tomography as a language modeling task, where the unknown quantum state is treated as an unknown language, the correlation of the quantum state is interpreted as the semantic information specific to this language, and the measurement outcomes are simply the text instances generated from the language. Based on a customized transformer model from language modeling, we demonstrate that our method can accurately reconstruct prototypical pure and mixed quantum states using fewer samples than state-of-the-art methods. More importantly, our method can reconstruct a class of similar states simultaneously, in contrast to existing neural network methods that need to train a model for each unknown state.
Submitted 9 December, 2022;
originally announced December 2022.
-
Validating quantum-supremacy experiments with exact and fast tensor network contraction
Authors:
Yong Liu,
Yaojian Chen,
Chu Guo,
Jiawei Song,
Xinmin Shi,
Lin Gan,
Wenzhao Wu,
Wei Wu,
Haohuan Fu,
Xin Liu,
Dexun Chen,
Zhifeng Zhao,
Guangwen Yang,
Jiangang Gao
Abstract:
The quantum supremacy experiment, such as Google Sycamore [Nature 574, 505 (2019)], poses a great challenge for classical verification due to the exponentially increasing compute cost. Using a new-generation Sunway supercomputer within $8.5$ days, we provide a direct verification by computing three million exact amplitudes for the experimentally generated bitstrings, obtaining an XEB fidelity of $0.191\%$ (the estimated value is $0.224\%$). The leap of simulation capability is built on a multiple-amplitude tensor network contraction algorithm which systematically exploits the "classical advantage" (the inherent "store-and-compute" operation mode of von Neumann machines) of current supercomputers, and a fused tensor network contraction algorithm which drastically increases the compute efficiency on heterogeneous architectures. Our method has a far-reaching impact in solving quantum many-body problems, statistical problems as well as combinatorial optimization problems.
Submitted 16 January, 2024; v1 submitted 9 December, 2022;
originally announced December 2022.
-
GAUCHE: A Library for Gaussian Processes in Chemistry
Authors:
Ryan-Rhys Griffiths,
Leo Klarner,
Henry B. Moss,
Aditya Ravuri,
Sang Truong,
Samuel Stanton,
Gary Tom,
Bojana Rankovic,
Yuanqi Du,
Arian Jamasb,
Aryan Deshwal,
Julius Schwartz,
Austin Tripp,
Gregory Kell,
Simon Frieder,
Anthony Bourached,
Alex Chan,
Jacob Moss,
Chengzhi Guo,
Johannes Durholt,
Saudamini Chaurasia,
Felix Strieth-Kalthoff,
Alpha A. Lee,
Bingqing Cheng,
Alán Aspuru-Guzik
, et al. (2 additional authors not shown)
Abstract:
We introduce GAUCHE, a library for GAUssian processes in CHEmistry. Gaussian processes have long been a cornerstone of probabilistic machine learning, affording particular advantages for uncertainty quantification and Bayesian optimisation. Extending Gaussian processes to chemical representations, however, is nontrivial, necessitating kernels defined over structured inputs such as graphs, strings and bit vectors. By defining such kernels in GAUCHE, we seek to open the door to powerful tools for uncertainty quantification and Bayesian optimisation in chemistry. Motivated by scenarios frequently encountered in experimental chemistry, we showcase applications for GAUCHE in molecular discovery and chemical reaction optimisation. The codebase is made available at https://github.com/leojklarner/gauche
Submitted 21 February, 2023; v1 submitted 6 December, 2022;
originally announced December 2022.
-
AutoPINN: When AutoML Meets Physics-Informed Neural Networks
Authors:
Xinle Wu,
Dalin Zhang,
Miao Zhang,
Chenjuan Guo,
Shuai Zhao,
Yi Zhang,
Huai Wang,
Bin Yang
Abstract:
Physics-Informed Neural Networks (PINNs) have recently been proposed to solve scientific and engineering problems, where physical laws are introduced into neural networks as prior knowledge. With the embedded physical laws, PINNs enable the estimation of critical parameters, which are unobservable via physical tools, through observable variables. For example, Power Electronic Converters (PECs) are essential building blocks for the green energy transition. PINNs have been applied to estimate the capacitance, which is unobservable during PEC operations, using current and voltage, which can be observed easily during operations. The estimated capacitance facilitates self-diagnostics of PECs. Existing PINNs are often manually designed, which is time-consuming and may lead to suboptimal performance due to a large number of design choices for neural network architectures and hyperparameters. In addition, PINNs are often deployed on different physical devices, e.g., PECs, with limited and varying resources. Therefore, it requires designing different PINN models under different resource constraints, making it an even more challenging task for manual design. To contend with the challenges, we propose Automated Physics-Informed Neural Networks (AutoPINN), a framework that enables the automated design of PINNs by combining AutoML and PINNs. Specifically, we first tailor a search space that allows finding high-accuracy PINNs for PEC internal parameter estimation. We then propose a resource-aware search strategy to explore the search space to find the best PINN model under different resource constraints. We experimentally demonstrate that AutoPINN is able to find more accurate PINN models than human-designed, state-of-the-art PINN models using fewer resources.
Submitted 7 December, 2022;
originally announced December 2022.
-
Concealed Object Detection for Passive Millimeter-Wave Security Imaging Based on Task-Aligned Detection Transformer
Authors:
Cheng Guo,
Fei Hu,
Yan Hu
Abstract:
Passive millimeter-wave (PMMW) imaging is a technique with significant potential for human security screening. Several popular object detection networks have been used for PMMW images. However, restricted by the low resolution and high noise of PMMW images, PMMW hidden object detection based on deep learning usually suffers from low accuracy and low classification confidence. To tackle these problems, this paper proposes a Task-Aligned Detection Transformer network, named PMMW-DETR. In the first stage, a Denoising Coarse-to-Fine Transformer (DCFT) backbone is designed to extract long- and short-range features at different scales. In the second stage, we propose the Query Selection module to introduce learned spatial features into the network as prior knowledge, which enhances the semantic perception capability of the network. In the third stage, aiming to improve the classification performance, we employ a Task-Aligned Dual-Head block to decouple the classification and regression tasks. Based on our self-developed PMMW security screening dataset, experimental results, including a comparison with State-Of-The-Art (SOTA) methods and an ablation study, demonstrate that PMMW-DETR obtains higher accuracy and classification confidence than previous works and is robust to low-quality PMMW images.
Submitted 7 July, 2023; v1 submitted 1 December, 2022;
originally announced December 2022.
-
Joint Neural Architecture and Hyperparameter Search for Correlated Time Series Forecasting
Authors:
Xinle Wu,
Dalin Zhang,
Miao Zhang,
Chenjuan Guo,
Bin Yang,
Christian S. Jensen
Abstract:
Sensors in cyber-physical systems often capture interconnected processes and thus emit correlated time series (CTS), the forecasting of which enables important applications. The key to successful CTS forecasting is to uncover the temporal dynamics of time series and the spatial correlations among time series. Deep learning-based solutions exhibit impressive performance at discerning these aspects. In particular, automated CTS forecasting, where the design of an optimal deep learning architecture is automated, enables forecasting accuracy that surpasses what has been achieved by manual approaches. However, automated CTS solutions remain in their infancy and are only able to find optimal architectures for predefined hyperparameters and scale poorly to large-scale CTS. To overcome these limitations, we propose SEARCH, a joint, scalable framework, to automatically devise effective CTS forecasting models. Specifically, we encode each candidate architecture and accompanying hyperparameters into a joint graph representation. We introduce an efficient Architecture-Hyperparameter Comparator (AHC) to rank all architecture-hyperparameter pairs, and we then further evaluate the top-ranked pairs to select a final result. Extensive experiments on six benchmark datasets demonstrate that SEARCH not only eliminates manual efforts but also is capable of better performance than manually designed and existing automatically designed CTS models. In addition, it shows excellent scalability to large CTS.
Submitted 27 February, 2023; v1 submitted 29 November, 2022;
originally announced November 2022.
-
SLLEN: Semantic-aware Low-light Image Enhancement Network
Authors:
Mingye Ju,
Chuheng Chen,
Charles A. Guo,
Jinshan Pan,
Jinhui Tang,
Dacheng Tao
Abstract:
How to effectively explore semantic features is vital for low-light image enhancement (LLE). Existing methods usually utilize semantic features drawn only from the output of a high-level semantic segmentation (SS) network. However, if the output is not accurately estimated, it affects the high-level semantic feature (HSF) extraction, which in turn interferes with LLE. To this end, we develop a simple and effective semantic-aware LLE network (SLLEN) composed of an LLE main-network (LLEmN) and an SS auxiliary-network (SSaN). In SLLEN, LLEmN integrates the random intermediate embedding feature (IEF), i.e., the information extracted from the intermediate layer of SSaN, together with the HSF into a unified framework for better LLE. SSaN is designed to perform SS and provide the HSF and IEF. Moreover, thanks to a shared encoder between LLEmN and SSaN, we further propose an alternating training mechanism to facilitate the collaboration between them. Unlike currently available approaches, the proposed SLLEN is able to fully leverage the semantic information, e.g., IEF, HSF, and the SS dataset, to assist LLE, thereby leading to more promising enhancement performance. Comparisons between the proposed SLLEN and other state-of-the-art techniques demonstrate the superiority of SLLEN with respect to LLE quality over all the comparable alternatives.
Submitted 15 May, 2023; v1 submitted 21 November, 2022;
originally announced November 2022.
-
LHDR: HDR Reconstruction for Legacy Content using a Lightweight DNN
Authors:
Cheng Guo,
Xiuhua Jiang
Abstract:
High dynamic range (HDR) images are widely used in graphics and photography due to the rich information they contain. Recently, the community has started using deep neural networks (DNNs) to reconstruct standard dynamic range (SDR) images into HDR. Despite the superiority of current DNN-based methods, their application scenarios are still limited: (1) heavy models impede real-time processing, and (2) they are inapplicable to legacy SDR content with more degradation types. Therefore, we propose a lightweight DNN-based method trained to tackle legacy SDR. For better design, we reform the problem modeling and emphasize the degradation model. Experiments show that our method reaches appealing performance with minimal computational cost compared with others.
Submitted 21 November, 2022;
originally announced November 2022.
-
Sideband Cooling of a Trapped Ion in Strong Sideband Coupling Regime
Authors:
Shuo Zhang,
Zhuo-Peng Huang,
Tian-Ci Tian,
Zheng-Yang Wu,
Jian-Qi Zhang,
Wan-Su Bao,
Chu Guo
Abstract:
Conventional theoretical studies on the ground-state laser cooling of a trapped ion have mostly focused on the weak sideband coupling (WSC) regime, where the cooling rate is inversely proportional to the linewidth of the excited state. In a recent work [New J. Phys. 23, 023018 (2021)], we proposed a theoretical framework to study the ground-state cooling of a trapped ion in the strong sideband coupling (SSC) regime, under the assumption of a vanishing carrier transition. Here we extend this analysis to more general situations with nonvanishing carrier transitions, where we show that by properly tuning the coupling lasers a cooling rate proportional to the linewidth can be achieved. Our theoretical predictions closely agree with the corresponding exact solutions in the SSC regime and provide important theoretical guidance for sideband cooling experiments.
Submitted 16 November, 2022;
originally announced November 2022.
-
Near-Term Quantum Computing Techniques: Variational Quantum Algorithms, Error Mitigation, Circuit Compilation, Benchmarking and Classical Simulation
Authors:
He-Liang Huang,
Xiao-Yue Xu,
Chu Guo,
Guojing Tian,
Shi-Jie Wei,
Xiaoming Sun,
Wan-Su Bao,
Gui-Lu Long
Abstract:
Quantum computing is a game-changing technology for global academia, research centers and industries including computational science, mathematics, finance, pharmaceuticals, materials science, chemistry and cryptography. Although it has seen a major boost in the last decade, we are still a long way from reaching the maturity of a full-fledged quantum computer. That said, we will be in the Noisy Intermediate-Scale Quantum (NISQ) era for a long time, working on quantum computing systems with dozens or even thousands of qubits. An outstanding challenge, then, is to come up with an application that can reliably carry out a nontrivial task of interest on near-term quantum devices with non-negligible quantum noise. To address this challenge, several near-term quantum computing techniques, including variational quantum algorithms, error mitigation, quantum circuit compilation and benchmarking protocols, have been proposed to characterize and mitigate errors, and to implement algorithms with a certain resistance to noise, so as to enhance the capabilities of near-term quantum devices and explore the boundaries of their ability to realize useful applications. Moreover, the development of near-term quantum devices is inseparable from efficient classical simulation, which plays a vital role in quantum algorithm design and verification, error-tolerant verification and other applications. This review provides a thorough introduction to these near-term quantum computing techniques, reports on their progress, and finally discusses their future prospects, which we hope will motivate researchers to undertake additional studies in this field.
Submitted 27 December, 2022; v1 submitted 16 November, 2022;
originally announced November 2022.
-
Differentiable matrix product states for simulating variational quantum computational chemistry
Authors:
Chu Guo,
Yi Fan,
Zhiqian Xu,
Honghui Shang
Abstract:
Quantum computing is believed to be the ultimate solution for quantum chemistry problems. Before the advent of large-scale, fully fault-tolerant quantum computers, the variational quantum eigensolver (VQE) is a promising heuristic quantum algorithm to solve real-world quantum chemistry problems on near-term noisy quantum computers. Here we propose a highly parallelizable classical simulator for VQE based on the matrix product state representation of the quantum state, which significantly extends the simulation range of existing simulators. Our simulator seamlessly integrates the quantum circuit evolution into the classical auto-differentiation framework, so the gradients can be computed efficiently, as in a classical deep neural network, with a scaling that is independent of the number of variational parameters. As applications, we use our simulator to study commonly used small molecules such as HF, HCl, LiH and H$_2$O, as well as larger molecules CO$_2$, BeH$_2$ and H$_4$ with up to $40$ qubits. The favorable scaling of our simulator against the number of qubits and the number of parameters could make it an ideal testing ground for near-term quantum algorithms and a perfect benchmarking baseline for upcoming large-scale VQE experiments on noisy quantum computers.
Submitted 30 November, 2023; v1 submitted 15 November, 2022;
originally announced November 2022.
-
Observation of size-dependent boundary effects in non-Hermitian electric circuits
Authors:
Luhong Su,
Cui-Xian Guo,
Yongliang Wang,
Li Li,
Xinhui Ruan,
Yanjing Du,
Shu Chen,
Dongning Zheng
Abstract:
Non-Hermitian systems with the non-Hermitian skin effect (NHSE) are very sensitive to the imposed boundary conditions and lattice size, which leads to size-dependent non-Hermitian skin effects. Here, we report the experimental observation of the NHSE with different boundary conditions and different lattice sizes in a unidirectional hopping model based on a circuit platform. The circuit admittance spectra and corresponding eigenstates are very sensitive to the presence of the boundary. Meanwhile, our experimental results show how the lattice size and boundary terms together affect the strength of the NHSE. Therefore, our electric circuit provides a good platform to observe size-dependent boundary effects in non-Hermitian systems.
Submitted 3 December, 2022; v1 submitted 14 November, 2022;
originally announced November 2022.
-
Optimizing Trigger-Level Track Reconstruction for Sensitivity to Exotic Signatures
Authors:
K. F. Di Petrillo,
J. N. Farr,
C. Guo,
T. R. Holmes,
J. Nelson,
K. Pachal
Abstract:
Many compelling beyond the Standard Model scenarios predict signals that result in unconventional charged particle trajectories. Signatures for which unusual tracks are the most conspicuous feature of the event pose significant challenges for experiments at the Large Hadron Collider (LHC), particularly for the trigger. This article presents a study of track-based triggers for a representative set of long-lived and unconventional signatures at the upcoming High Luminosity LHC, as well as resulting recommendations for the target parameters of a hardware-based tracking system. Scenarios studied include large multiplicities of low momentum tracks produced in a soft-unclustered-energy-pattern model, displaced leptons and anomalous prompt tracks predicted in a Supersymmetry model with long-lived staus, and displaced hadrons predicted in a Higgs portal scenario with long-lived scalars.
Submitted 12 January, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Privacy-Aware Compression for Federated Learning Through Numerical Mechanism Design
Authors:
Chuan Guo,
Kamalika Chaudhuri,
Pierre Stock,
Mike Rabbat
Abstract:
In private federated learning (FL), a server aggregates differentially private updates from a large number of clients in order to train a machine learning model. The main challenge in this setting is balancing privacy with both classification accuracy of the learnt model as well as the number of bits communicated between the clients and server. Prior work has achieved a good trade-off by designing a privacy-aware compression mechanism, called the minimum variance unbiased (MVU) mechanism, that numerically solves an optimization problem to determine the parameters of the mechanism. This paper builds upon it by introducing a new interpolation procedure in the numerical design process that allows for a far more efficient privacy analysis. The result is the new Interpolated MVU mechanism that is more scalable, has a better privacy-utility trade-off, and provides SOTA results on communication-efficient private FL on a variety of datasets.
Submitted 9 August, 2023; v1 submitted 7 November, 2022;
originally announced November 2022.
-
Exceptional Non-Abelian Topology in Multiband Non-Hermitian Systems
Authors:
Cui-Xian Guo,
Shu Chen,
Kun Ding,
Haiping Hu
Abstract:
Defective spectral degeneracy, known as exceptional point (EP), lies at the heart of various intriguing phenomena in optics, acoustics, and other nonconservative systems. Despite extensive studies in the past two decades, the \textit{collective} behaviors (e.g., annihilation, coalescence, braiding, etc.) involving multiple exceptional points or lines and their interplay have been rarely understood. Here we put forward a universal non-Abelian conservation rule governing these collective behaviors in generic multiband non-Hermitian systems and uncover several counterintuitive phenomena. We demonstrate that two EPs with opposite charges (even the pairwise created) do not necessarily annihilate, depending on how they approach each other. Furthermore, we unveil that the conservation rule imposes strict constraints on the permissible exceptional-line configurations. It excludes structures like Hopf link yet permits novel staggered rings composed of noncommutative exceptional lines. These intriguing phenomena are illustrated by concrete models which could be readily implemented in platforms like coupled acoustic cavities, optical waveguides, and ring resonators. Our findings lay the cornerstone for a comprehensive understanding of the exceptional non-Abelian topology and shed light on the versatile manipulations and applications based on exceptional degeneracies in nonconservative systems.
Submitted 11 April, 2023; v1 submitted 30 October, 2022;
originally announced October 2022.
-
Characterization of two SiPM arrays from Hamamatsu and Onsemi for liquid argon detector
Authors:
T. A. Wang,
C. Guo,
X. H. Liang,
L. Wang,
M. Y. Guan,
C. G. Yang,
J. C. Liu,
F. Y. Lin
Abstract:
The silicon photomultiplier (SiPM), a new type of photosensor, is considered a substitute for the traditional photomultiplier tube (PMT) in the next generation of dark matter and neutrino detectors, especially in noble gas detectors like liquid argon. However, the design of compact SiPM arrays and their cryogenic electronics that can work in liquid argon is barely developed. Thus, two candidate SiPM arrays from Hamamatsu and Onsemi were selected to verify the feasibility and effectiveness of the design. In this work, we successfully developed a cryogenic electronics read-out system that connects and works with 1-inch 4$\times$4 SiPM arrays at 87 K. The power dissipation of the amplifiers is less than 10 $μ$W/mm$^2$. Furthermore, multiple significant characteristics of both types of SiPM arrays were measured at liquid argon temperature, such as dark count rate (DCR), breakdown voltage (V${_{bd}}$), single photoelectron (SPE) performance, signal-to-noise ratio (SNR) and correlated signal probability.
Submitted 28 October, 2022;
originally announced October 2022.
-
Computing the extremal nonnegative solutions of the M-tensor equation with a nonnegative right side vector
Authors:
Chun-Hua Guo
Abstract:
We consider the tensor equation whose coefficient tensor is a nonsingular M-tensor and whose right side vector is nonnegative. Such a tensor equation may have a large number of nonnegative solutions. It is already known that the tensor equation has a maximal nonnegative solution and a minimal nonnegative solution (called extremal solutions collectively). However, the existing proofs do not show how the extremal solutions can be computed. The existing numerical methods can find one of the nonnegative solutions, without knowing whether the computed solution is an extremal solution. In this paper, we present new proofs for the existence of extremal solutions. Our proofs are much shorter than existing ones and more importantly they give numerical methods that can compute the extremal solutions. Linear convergence of these numerical methods is also proved under mild assumptions. Some of our discussions also allow the coefficient tensor to be a Z-tensor or allow the right side vector to have some negative elements.
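As a rough illustration of how a minimal nonnegative solution can be computed, the sketch below uses a classical monotone fixed-point iteration; it is not taken from the paper and may differ from the methods the paper proposes. It assumes an order-3 M-tensor written as A = sI - B, with B entrywise nonnegative and s larger than the spectral radius of B, so that the equation A x^2 = b reads s x_i^2 - sum_{j,k} B[i][j][k] x_j x_k = b_i. Starting from x = 0, each update can only increase the iterate, and every iterate stays below any nonnegative solution, so the limit is the minimal one. The names `minimal_solution` and `residual` are hypothetical.

```python
def minimal_solution(B, b, s, tol=1e-12, max_iter=10_000):
    """Fixed-point iteration x_i <- sqrt((sum_jk B[i][j][k] x_j x_k + b_i) / s).

    Assumes B >= 0 entrywise, b >= 0, and s > spectral radius of B
    (order-3 tensor, so the update takes a square root).
    """
    n = len(b)
    x = [0.0] * n  # starting at zero keeps the iterates below every nonnegative solution
    for _ in range(max_iter):
        new_x = []
        for i in range(n):
            bx = sum(B[i][j][k] * x[j] * x[k] for j in range(n) for k in range(n))
            new_x.append(((bx + b[i]) / s) ** 0.5)
        if max(abs(u - v) for u, v in zip(new_x, x)) < tol:
            return new_x
        x = new_x
    return x


def residual(B, b, s, x):
    """Componentwise residual of s * x_i^2 - (B x^2)_i - b_i."""
    n = len(b)
    return [
        s * x[i] ** 2
        - sum(B[i][j][k] * x[j] * x[k] for j in range(n) for k in range(n))
        - b[i]
        for i in range(n)
    ]


# Small illustrative instance: every entry of B is 0.5, so each "row sum" is 2 < s = 4.
B = [[[0.5, 0.5], [0.5, 0.5]], [[0.5, 0.5], [0.5, 0.5]]]
b = [1.0, 2.0]
x = minimal_solution(B, b, s=4.0)
print(x, residual(B, b, 4.0, x))
```

The row-sum bound used to pick s is only a convenient sufficient condition for this toy instance; for a general tensor the nonsingularity of A would need to be checked separately.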
Submitted 27 October, 2022;
originally announced October 2022.