-
Automatic Meta-Path Discovery for Effective Graph-Based Recommendation
Authors:
Wentao Ning,
Reynold Cheng,
Jiajun Shen,
Nur Al Hasan Haldar,
Ben Kao,
Xiao Yan,
Nan Huo,
Wai Kit Lam,
Tian Li,
Bo Tang
Abstract:
Heterogeneous Information Networks (HINs) are labeled graphs that depict relationships among different types of entities (e.g., users, movies and directors). For HINs, meta-path-based recommenders (MPRs) utilize meta-paths (i.e., abstract paths consisting of node and link types) to predict user preference, and have attracted a lot of attention due to their explainability and performance. We observe that the performance of MPRs is highly sensitive to the meta-paths they use, but existing works manually select the meta-paths from many possible ones. Thus, to discover effective meta-paths automatically, we propose the Reinforcement learning-based Meta-path Selection (RMS) framework. Specifically, we define a vector encoding for meta-paths and design a policy network to extend meta-paths. The policy network is trained based on the results of downstream recommendation tasks and an early stopping approximation strategy is proposed to speed up training. RMS is a general model, and it can work with all existing MPRs. We also propose a new MPR called RMS-HRec, which uses an attention mechanism to aggregate information from the meta-paths. We conduct extensive experiments on real datasets. Compared with the manually selected meta-paths, the meta-paths identified by RMS consistently improve recommendation quality. Moreover, RMS-HRec outperforms state-of-the-art recommender systems by an average of 7% in hit ratio. The codes and datasets are available on https://github.com/Stevenn9981/RMS-HRec.
Submitted 7 September, 2022; v1 submitted 23 December, 2021;
originally announced December 2021.
-
Theory of Harmonic Hall Responses of Spin-Torque Driven Antiferromagnets
Authors:
Hantao Zhang,
Ran Cheng
Abstract:
Harmonic analysis is a powerful tool to characterize and quantify current-induced torques acting on magnetic materials, but so far it remains an open question in studying antiferromagnets. Here we formulate a general theory of harmonic Hall responses of collinear antiferromagnets driven by current-induced torques including both field-like and damping-like components. By scanning a magnetic field of variable strength in three orthogonal planes, we are able to distinguish the contributions from field-like torque, damping-like torque, and concomitant thermal effects by analyzing the second harmonic signals in the Hall voltage. The analytical expressions of the first and second harmonics as functions of the magnetic field direction and strength are confirmed by numerical simulations with good agreement. We demonstrate our predictions in two prototype antiferromagnets, $α$-Fe$_{2}$O$_{3}$ and NiO, providing direct and general guidance to current and future experiments.
Submitted 5 May, 2022; v1 submitted 23 December, 2021;
originally announced December 2021.
-
Quantifying Spin-Orbit Torques in Antiferromagnet/Heavy Metal Heterostructures
Authors:
Egecan Cogulu,
Hantao Zhang,
Nahuel N. Statuto,
Yang Cheng,
Fengyuan Yang,
Ran Cheng,
Andrew D. Kent
Abstract:
The effect of spin currents on the magnetic order of insulating antiferromagnets (AFMs) is of fundamental interest and can enable new applications. Toward this goal, characterizing the spin-orbit torques (SOT) associated with AFM/heavy metal (HM) interfaces is important. Here we report the full angular dependence of the harmonic Hall voltages in a predominantly easy-plane AFM, epitaxial c-axis oriented $α$-Fe$_2$O$_3$ films, with an interface to Pt. By modeling the harmonic Hall signals together with the $α$-Fe$_2$O$_3$ magnetic parameters, we determine the amplitudes of field-like and damping-like SOT. Out-of-plane field scans are shown to be essential to determining the damping-like component of the torques. In contrast to ferromagnetic/heavy metal heterostructures, our results demonstrate that the field-like torques are significantly larger than the damping-like torques, which we correlate with the presence of a large imaginary component of the interface spin-mixing conductance. Our work demonstrates a direct way of characterizing SOT in AFM/HM heterostructures.
Submitted 22 December, 2021;
originally announced December 2021.
-
Data-Driven Outage Restoration Time Prediction via Transfer Learning with Cluster Ensembles
Authors:
Dingwei Wang,
Yuxuan Yuan,
Rui Cheng,
Zhaoyu Wang
Abstract:
This paper develops a data-driven approach to accurately predict the restoration time of outages under different scales and factors. To achieve the goal, the proposed method consists of three stages. First, given the unprecedented amount of data collected by utilities, a sparse dictionary-based ensemble spectral clustering (SDESC) method is proposed to decompose historical outage datasets, which enjoys good computational efficiency and scalability. Specifically, each outage sample is represented by a linear combination of a small number of selected dictionary samples using a density-based method. Then, the dictionary-based representation is utilized to perform the spectral analysis to group the data samples with similar features into the same subsets. In the second stage, a knowledge-transfer-added restoration time prediction model is trained for each subset by combining weather information and outage-related features. The transfer learning technology is introduced with the aim of dealing with the underestimation problem caused by data imbalance in different subsets, thus improving the model performance. Furthermore, to connect unseen outages with the learned outage subsets, a t-distributed stochastic neighbor embedding-based strategy is applied. The proposed method fully builds on and is also tested on a large real-world outage dataset from a utility provider with a time span of six consecutive years. The numerical results validate that our method has high prediction accuracy while showing good stability against real-world data limitations.
Submitted 21 December, 2021;
originally announced December 2021.
-
Data Efficient Language-supervised Zero-shot Recognition with Optimal Transport Distillation
Authors:
Bichen Wu,
Ruizhe Cheng,
Peizhao Zhang,
Tianren Gao,
Peter Vajda,
Joseph E. Gonzalez
Abstract:
Traditional computer vision models are trained to predict a fixed set of predefined categories. Recently, natural language has been shown to be a broader and richer source of supervision that provides finer descriptions to visual concepts than supervised "gold" labels. Previous works, such as CLIP, use InfoNCE loss to train a model to predict the pairing between images and text captions. CLIP, however, is data hungry and requires more than 400M image-text pairs for training. The inefficiency can be partially attributed to the fact that the image-text pairs are noisy. To address this, we propose OTTER (Optimal TransporT distillation for Efficient zero-shot Recognition), which uses online entropic optimal transport to find a soft image-text match as labels for contrastive learning. Based on pretrained image and text encoders, models trained with OTTER achieve strong performance with only 3M image text pairs. Compared with InfoNCE loss, label smoothing, and knowledge distillation, OTTER consistently outperforms these baselines in zero shot evaluation on Google Open Images (19,958 classes) and multi-labeled ImageNet 10K (10032 classes) from Tencent ML-Images. Over 42 evaluations on 7 different dataset/architecture settings x 6 metrics, OTTER outperforms (32) or ties (2) all baselines in 34 of them.
Submitted 17 December, 2023; v1 submitted 17 December, 2021;
originally announced December 2021.
-
Femtosecond Pulse Generation via an Integrated Electro-Optic Time Lens
Authors:
Mengjie Yu,
Christian Reimer,
David Barton,
Prashanta Kharel,
Rebecca Cheng,
Lingyan He,
Linbo Shao,
Di Zhu,
Yaowen Hu,
Hannah R. Grant,
Leif Johansson,
Yoshitomo Okawachi,
Alexander L. Gaeta,
Mian Zhang,
Marko Lončar
Abstract:
Integrated femtosecond pulse and frequency comb sources are critical components for a wide range of applications. The leading approaches for on-chip pulse generation rely on mode locking inside microresonators with either third-order nonlinearity or with semiconductor gain. These approaches, however, are limited in noise performance, wavelength tunability and repetition rates. Alternatively, sub-picosecond pulses can be synthesized without mode-locking, by modulating a continuous-wave (CW) single-frequency laser using a cascade of electro-optic (EO) modulators. This method is particularly attractive due to its simplicity, robustness, and frequency-agility but has been realized only on a tabletop using multiple discrete EO modulators and requiring optical amplifiers (to overcome large insertion losses), microwave amplifiers, and phase shifters. Here we demonstrate a chip-scale femtosecond pulse source implemented on an integrated lithium niobate (LN) photonic platform, using cascaded low-loss electro-optic amplitude and phase modulators and a chirped Bragg grating, forming a time-lens system. The device is driven by a CW distributed feedback (DFB) chip laser and controlled by a single CW microwave source without the need for any stabilization or locking. We measure femtosecond pulse trains (520 fs duration) with a 30-GHz repetition rate, flat-top optical spectra with a 10-dB optical bandwidth of 12.6 nm, individual comb-line powers above 0.1 milliwatt, and pulse energies of 0.54 picojoule. Our results represent a tunable, robust and low-cost integrated pulsed light source with CW-to-pulse conversion efficiencies an order of magnitude higher than achieved with previous integrated sources. Our pulse generator can find applications from ultrafast optical measurement to networks of distributed quantum computers.
Submitted 16 December, 2021;
originally announced December 2021.
-
Progressive Graph Convolution Network for EEG Emotion Recognition
Authors:
Yijin Zhou,
Fu Li,
Yang Li,
Youshuo Ji,
Guangming Shi,
Wenming Zheng,
Lijian Zhang,
Yuanfang Chen,
Rui Cheng
Abstract:
Studies in the area of neuroscience have revealed the relationship between emotional patterns and brain functional regions, demonstrating that dynamic relationships between different brain regions are an essential factor affecting emotion recognition determined through electroencephalography (EEG). Moreover, in EEG emotion recognition, we can observe that clearer boundaries exist between coarse-grained emotions than those between fine-grained emotions, based on the same EEG data; this indicates the concurrence of large coarse- and small fine-grained emotion variations. Thus, the progressive classification process from coarse- to fine-grained categories may be helpful for EEG emotion recognition. Consequently, in this study, we propose a progressive graph convolution network (PGCN) for capturing this inherent characteristic in EEG emotional signals and progressively learning the discriminative EEG features. To fit different EEG patterns, we constructed a dual-graph module to characterize the intrinsic relationship between different EEG channels, containing the dynamic functional connections and static spatial proximity information of brain regions from neuroscience research. Moreover, motivated by the observation of the relationship between coarse- and fine-grained emotions, we adopt a dual-head module that enables the PGCN to progressively learn more discriminative EEG features, from coarse-grained (easy) to fine-grained categories (difficult), referring to the hierarchical characteristic of emotion. To verify the performance of our model, extensive experiments were conducted on two public datasets: SEED-IV and multi-modal physiological emotion database (MPED).
Submitted 13 December, 2021;
originally announced December 2021.
-
High-efficiency and broadband electro-optic frequency combs enabled by coupled micro-resonators
Authors:
Yaowen Hu,
Mengjie Yu,
Brandon Buscaino,
Neil Sinclair,
Di Zhu,
Rebecca Cheng,
Amirhassan Shams-Ansari,
Linbo Shao,
Mian Zhang,
Joseph M. Kahn,
Marko Loncar
Abstract:
Developments in integrated photonics have led to stable, compact, and broadband comb generators that support a wide range of applications. Current on-chip comb generators, however, are still limited by low optical pump-to-comb conversion efficiencies. Here, we demonstrate an integrated electro-optic frequency comb with a conversion efficiency of 30% and an optical bandwidth of 132 nm, featuring a 100-times higher conversion efficiency and 2.2-times broader optical bandwidth compared with previous state-of-the-art integrated electro-optic combs. We further show that, enabled by the high efficiency, the device acts as an on-chip femtosecond pulse source (336 fs pulse duration), which is important for applications in nonlinear optics, sensing, and computing. As an example, in the ultra-fast and high-power regime, we demonstrate the observation of a combined EO-χ^(3) nonlinear frequency comb. Our device paves the way for practical optical frequency comb generators enabling energy-efficient computing, communication, and metrology, and provides a platform to investigate new regimes of optical physics that simultaneously involve multiple nonlinearities.
Submitted 16 December, 2021; v1 submitted 29 November, 2021;
originally announced November 2021.
-
Quantum mechanics of fermion confined to a curved surface in Foldy-Wouthuysen representation
Authors:
Hao Zhao,
Yong-Long Wang,
Cheng-Zhi Ye,
Run Cheng,
Guo-Hua Liang,
Hui Liu
Abstract:
In the Foldy-Wouthuysen representation, we deduce the effective quantum mechanics for a particle confined to a curved surface by using the thin-layer quantization scheme. We find that a spin effect is caused by the confining potential as a result of the relativistic correction in the non-relativistic limit. Furthermore, the spin connection that appears on the curved surface, which depends on curvature, contributes a Zeeman-like gap in the relativistic correction term. In addition, the confining potential also induces a curvature-independent energy shift, which arises from the zitterbewegung effect. As an example, we apply the effective Hamiltonian to a torus surface, for which we obtain, as expected, the spin effects related to the confining potential. These results directly demonstrate the scaling of the noncommutativity of the non-relativistic limit and the thin-layer quantization formalism.
Submitted 28 November, 2021;
originally announced November 2021.
-
A novel time delay estimation algorithm of acoustic pyrometry for furnace
Authors:
Qi Liu,
Bin Zhou,
Jianyong Zhang,
Ruixue Cheng
Abstract:
Acoustic pyrometry is a non-contact measurement technology for monitoring furnace combustion reactions, diagnosing energy loss due to incomplete combustion and ensuring safe production. The accuracy of time of flight (TOF) estimation in acoustic pyrometry directly affects the reliability of furnace temperature measurement. This paper presents a novel TOF (i.e. time delay) estimation algorithm based on a digital lock-in filtering (DLF) algorithm. In this research, the time-frequency relationship between the first harmonic of the acoustic signal and the moment at which the characteristic frequency is applied is established through digital lock-in and low-pass filtering techniques. An accurate estimate of the TOF is obtained by extracting and comparing the temporal relationship of the characteristic frequency occurrence between the received and source acoustic signals. The computational error analysis indicates that the accuracy of the proposed algorithm is better than that of the classical generalized cross-correlation (GCC) algorithm, and that the computational effort is reduced to half of that required by GCC. With this method, temperature measurement in furnaces can be improved in terms of computational effort and accuracy, which are vital parameters in furnace combustion control. The method offers a new approach to time delay estimation in acoustic pyrometry for furnaces.
Submitted 23 March, 2022; v1 submitted 24 November, 2021;
originally announced November 2021.
-
Identifying Axion Insulator by Quantized Magnetoelectric Effect in Antiferromagnetic ${\mathrm{MnBi}}_{2}{\mathrm{Te}}_{4}$ Tunnel Junction
Authors:
Yu-Hang Li,
Ran Cheng
Abstract:
Intrinsic magnetic topological insulator ${\mathrm{MnBi}}_{2}{\mathrm{Te}}_{4}$ is believed to be an axion insulator in its antiferromagnetic ground state. However, direct identification of axion insulators remains experimentally elusive because the observed vanishing Hall resistance, while indicating the onset of the axion field, is inadequate to distinguish the system from a trivial normal insulator. Using numerical Green's functions, we theoretically demonstrate the quantized magnetoelectric current in a tunnel junction of atomically thin ${\mathrm{MnBi}}_{2}{\mathrm{Te}}_{4}$ sandwiched between two contacts, which is a smoking-gun signal that unambiguously confirms antiferromagnetic ${\mathrm{MnBi}}_{2}{\mathrm{Te}}_{4}$ to be an axion insulator. Our predictions can be verified directly by experiments.
Submitted 20 March, 2023; v1 submitted 22 November, 2021;
originally announced November 2021.
-
Electrically-pumped high-power laser transmitter integrated on thin-film lithium niobate
Authors:
Amirhassan Shams-Ansari,
Dylan Renaud,
Rebecca Cheng,
Linbo Shao,
Lingyan He,
Di Zhu,
Mengjie Yu,
Hannah R. Grant,
Leif Johansson,
Mian Zhang,
Marko Loncar
Abstract:
Integrated thin-film lithium niobate (TFLN) photonics has emerged as a promising platform for the realization of high-performance chip-scale optical systems. Of particular importance are TFLN electro-optic modulators featuring high linearity, low driving voltage and low propagation loss. However, a fully integrated system requires the integration of high-power, low-noise, and narrow-linewidth lasers on the TFLN chip. Here we achieve this goal and demonstrate integrated high-power lasers on the TFLN platform with up to 60 mW of optical power in the waveguides. We use this platform to realize a high-power transmitter consisting of an electrically-pumped laser integrated with a 50 GHz modulator.
Submitted 25 November, 2021; v1 submitted 16 November, 2021;
originally announced November 2021.
-
On the geometry of the multiplier space of $\ell_A^p$
Authors:
Raymond Cheng,
Christopher Felder
Abstract:
For $p \in (1,\infty)\setminus \{2\}$, some properties of the space $\mathscr{M}_p$ of multipliers on $\ell^p_A$ are derived. In particular, the failure of the weak parallelogram laws and the Pythagorean inequalities is demonstrated for $\mathscr{M}_p$. It is also shown that the extremal multipliers on the $\ell^p_A$ spaces are exactly the monomials, in stark contrast to the $p=2$ case.
Submitted 3 November, 2021;
originally announced November 2021.
-
On Joint Learning for Solving Placement and Routing in Chip Design
Authors:
Ruoyu Cheng,
Junchi Yan
Abstract:
Owing to its advantages in GPU acceleration and reduced dependency on human experts, machine learning has emerged as a tool for solving the placement and routing problems, two critical steps in the modern chip design flow. As the field is still in its early stage, there are fundamental open issues: scalability, reward design, and the end-to-end learning paradigm. To achieve end-to-end placement learning, we first propose a joint learning method termed DeepPlace for the placement of macros and standard cells, which integrates reinforcement learning with a gradient-based optimization scheme. To further bridge placement with the subsequent routing task, we also develop a joint learning approach via reinforcement learning to fulfill both macro placement and routing, which is called DeepPR. One key design in our (reinforcement) learning paradigm involves a multi-view embedding model to encode both global graph-level and local node-level information of the input macros. Moreover, random network distillation is used to encourage exploration. Experiments on public chip design benchmarks show that our method can effectively learn from experience and also provides intermediate placement for the subsequent standard cell placement, within a few hours of training.
Submitted 26 December, 2021; v1 submitted 30 October, 2021;
originally announced November 2021.
-
Analyzing Photovoltaic's Impact on Conservation Voltage Reduction in Distribution Networks
Authors:
Rui Cheng,
Zhaoyu Wang,
Yifei Guo,
Fankun Bu
Abstract:
Conservation voltage reduction (CVR) has been widely implemented in distribution networks and helped utilities effectively reduce energy and peak load. However, the increasing penetration level of solar photovoltaic (PV) has affected voltage profiles and the performance of CVR. It remains an outstanding question how CVR and solar PV interact with each other. Understanding this interaction is important for utilities in implementing CVR and assessing its performance. This paper studies the impact of solar PV on CVR in a real distribution system in the Midwest U.S. using comprehensive simulations. We have considered various PV allocations and penetration levels, as well as different inverter control modes according to IEEE Std 1547-2018. Three metrics are used to quantify the impact of solar PV on CVR: voltages at the substation, voltage distribution across the network, and energy consumption reduction due to CVR. The results show that the allocations of solar PV have the most significant effect on the CVR performance, where a dispersed allocation of solar PV will help flatten voltage profile and achieve deeper voltage reductions at the substation, less energy consumption and line losses.
Submitted 27 October, 2021;
originally announced October 2021.
-
R4: A Framework for Route Representation and Route Recommendation
Authors:
Ran Cheng,
Chao Chen,
Longfei Xu,
Shen Li,
Lei Wang,
Hengbin Cui,
Kaikui Liu,
Xiaolong Li
Abstract:
Route recommendation is significant in navigation service. Two major challenges for route recommendation are route representation and user representation. Different from items that can be identified by unique IDs in traditional recommendation, routes are combinations of links (i.e., a road segment and its following action like turning left) and the number of combinations could be close to infinite. Besides, the representation of a route changes under different scenarios. These facts result in severe sparsity of routes, which increases the difficulty of route representation. Moreover, link attribute deficiencies and errors affect preciseness of route representation. Because of the sparsity of routes, the interaction data between users and routes are also sparse. This makes it not easy to acquire user representation from historical user-item interactions as traditional recommendations do. To address these issues, we propose a novel learning framework R4. In R4, we design a sparse & dense network to obtain representations of routes. The sparse unit learns link ID embeddings and aggregates them to represent a route, which captures implicit route characteristics and subsequently alleviates problems caused by link attribute deficiencies and errors. The dense unit extracts implicit local features of routes from link attributes. For user representation, we utilize a series of historical navigation to extract user preference. R4 achieves remarkable performance in both offline and online experiments.
Submitted 24 October, 2021; v1 submitted 20 October, 2021;
originally announced October 2021.
-
A Two-layer Approach for Estimating Behind-the-Meter PV Generation Using Smart Meter Data
Authors:
Fankun Bu,
Rui Cheng,
Zhaoyu Wang
Abstract:
As the cost of the residential solar system decreases, rooftop photovoltaic (PV) has been widely integrated into distribution systems. Most rooftop PV systems are installed behind-the-meter (BTM), i.e., only the net demand is metered, while the native demand and PV generation are not separately recorded. Under this condition, the PV generation and native demand are invisible to utilities, which brings challenges for optimal distribution system operation and expansion. In this paper, we have come up with a novel two-layer approach to disaggregate the unknown PV generation and native demand from the known hourly net demand data recorded by smart meters: 1) At the aggregate level, the proposed approach separates the total PV generation and native demand time series from the total net demand time series for customers with PVs. 2) At the customer level, the separated aggregate-level PV generation is allocated to individual PVs. These two layers leverage the spatial correlations of native demand and PV generation, respectively. One primary advantage of our proposed approach is that it is more independent and practical compared to previous works because it does not require PV array parameters, meteorological data and previously recorded solar power exemplars. We have verified our proposed approach using real native demand and PV generation data.
Submitted 13 March, 2022; v1 submitted 14 October, 2021;
originally announced October 2021.
-
Manipulating Ferrimagnets by Fields and Currents
Authors:
Mingda Guo,
Hantao Zhang,
Ran Cheng
Abstract:
Ferrimagnets (FIMs) can function as high-frequency antiferromagnets while being as easy to detect as ferromagnets, offering unique opportunities for ultrafast device applications. While the physical behavior of FIMs near the compensation point has been widely studied, a generic understanding of FIMs in which the ratio of sublattice spins can vary freely between the ferromagnetic and antiferromagnetic limits is still lacking. Here we investigate the physical properties of a model two-sublattice FIM manipulated by static magnetic fields and current-induced torques. By continuously varying the ratio of sublattice spins, we clarify how the dynamical chiral modes in an FIM are intrinsically connected to their ferro- and antiferromagnetic counterparts, which reveals unique features not visible near the compensation point. In particular, we find that current-induced torques can trigger spontaneous oscillation of the terahertz exchange mode. Compared with its realization in antiferromagnets, a spin-torque oscillator using FIMs not only has a reduced threshold current density but also can be self-stabilized, obviating the need for dynamic feedback.
Submitted 8 February, 2022; v1 submitted 12 October, 2021;
originally announced October 2021.
-
Accelerating Multi-Objective Neural Architecture Search by Random-Weight Evaluation
Authors:
Shengran Hu,
Ran Cheng,
Cheng He,
Zhichao Lu,
Jing Wang,
Miao Zhang
Abstract:
For the goal of automated design of high-performance deep convolutional neural networks (CNNs), Neural Architecture Search (NAS) methodology is becoming increasingly important for both academia and industry. Due to the costly stochastic gradient descent (SGD) training of CNNs for performance evaluation, most existing NAS methods are computationally expensive for real-world deployments. To address this issue, we first introduce a new performance estimation metric, named Random-Weight Evaluation (RWE), to quantify the quality of CNNs in a cost-efficient manner. Instead of fully training the entire CNN, the RWE only trains its last layer and leaves the remaining layers with randomly initialized weights, which results in a single network evaluation in seconds. Second, a complexity metric is adopted for multi-objective NAS to balance the model size and performance. Overall, our proposed method obtains a set of efficient models with state-of-the-art performance in two real-world search spaces. Then the results obtained on the CIFAR-10 dataset are transferred to the ImageNet dataset to validate the practicality of the proposed algorithm. Moreover, ablation studies on NAS-Bench-301 datasets reveal the effectiveness of the proposed RWE in estimating the performance compared with existing methods.
Submitted 8 October, 2021;
originally announced October 2021.
-
Revisiting Self-Training for Few-Shot Learning of Language Model
Authors:
Yiming Chen,
Yan Zhang,
Chen Zhang,
Grandee Lee,
Ran Cheng,
Haizhou Li
Abstract:
As unlabeled data carry rich task-relevant information, they have proven useful for few-shot learning of language models. The question is how to effectively make use of such data. In this work, we revisit the self-training technique for language model fine-tuning and present a state-of-the-art prompt-based few-shot learner, SFLM. Given two views of a text sample via weak and strong augmentation techniques, SFLM generates a pseudo label on the weakly augmented version. Then, the model predicts the same pseudo label when fine-tuned with the strongly augmented version. This simple approach is shown to outperform other state-of-the-art supervised and semi-supervised counterparts on six sentence classification and six sentence-pair classification benchmarking tasks. In addition, SFLM relies on only a small amount of in-domain unlabeled data. We conduct a comprehensive analysis to demonstrate the robustness of our proposed approach under various settings, including augmentation techniques, model scale, and few-shot knowledge transfer across tasks.
Submitted 4 October, 2021;
originally announced October 2021.
-
An Optimal Resource Allocator of Elastic Training for Deep Learning Jobs on Cloud
Authors:
Liang Hu,
Jiangcheng Zhu,
Zirui Zhou,
Ruiqing Cheng,
Xiaolong Bai,
Yong Zhang
Abstract:
Cloud training platforms, such as Amazon Web Services and Huawei Cloud, provide users with computational resources to train their deep learning jobs. Elastic training is a service embedded in cloud training platforms that dynamically scales up or down the resources allocated to a job. The core technique of an elastic training system is to best allocate limited resources among heterogeneous jobs in terms of shorter queueing delay and higher training efficiency. This paper presents an optimal resource allocator for elastic training systems that leverages a mixed-integer programming (MIP) model to maximize the training progress of deep learning jobs. We take advantage of the real-world job data obtained from ModelArts, the deep learning training platform of Huawei Cloud, and conduct simulation experiments to compare the optimal resource allocator with a greedy one as a benchmark. Numerical results show that the proposed allocator can reduce queueing time by up to 32% and improve training efficiency by up to 24% relative to the greedy resource allocator, thereby greatly improving user experience with Huawei ModelArts and potentially enabling the realization of higher profits for the product. Also, the optimal resource allocator is fast in decision-making, taking merely 0.4 seconds on average.
Submitted 7 September, 2021;
originally announced September 2021.
-
GP-S3Net: Graph-based Panoptic Sparse Semantic Segmentation Network
Authors:
Ryan Razani,
Ran Cheng,
Enxu Li,
Ehsan Taghavi,
Yuan Ren,
Liu Bingbing
Abstract:
Panoptic segmentation, as an integrated task of both static environmental understanding and dynamic object identification, has recently begun to receive broad research interest. In this paper, we propose a new computationally efficient LiDAR-based panoptic segmentation framework, called GP-S3Net. GP-S3Net is a proposal-free approach in which no object proposals are needed to identify the objects, in contrast to conventional two-stage panoptic systems, where a detection network is incorporated for capturing instance information. Our new design consists of a novel instance-level network to process the semantic results by constructing a graph convolutional network to identify objects (foreground), which are later fused with the background classes. Through the fine-grained clusters of the foreground objects from the semantic segmentation backbone, over-segmentation priors are generated and subsequently processed by 3D sparse convolution to embed each cluster. Each cluster is treated as a node in the graph and its corresponding embedding is used as its node feature. Then a GCNN predicts whether edges exist between each cluster pair. We utilize the instance label to generate ground truth edge labels for each constructed graph in order to supervise the learning. Extensive experiments demonstrate that GP-S3Net outperforms the current state-of-the-art approaches by a significant margin across available datasets such as nuScenes and SemanticPOSS, ranking first on the competitive public SemanticKITTI leaderboard upon publication.
Submitted 18 August, 2021;
originally announced August 2021.
-
End-to-End Dense Video Captioning with Parallel Decoding
Authors:
Teng Wang,
Ruimao Zhang,
Zhichao Lu,
Feng Zheng,
Ran Cheng,
Ping Luo
Abstract:
Dense video captioning aims to generate multiple associated captions with their temporal locations from the video. Previous methods follow a sophisticated "localize-then-describe" scheme, which heavily relies on numerous hand-crafted components. In this paper, we propose a simple yet effective framework for end-to-end dense video captioning with parallel decoding (PDVC), by formulating the dense caption generation as a set prediction task. In practice, through stacking a newly proposed event counter on the top of a transformer decoder, the PDVC precisely segments the video into a number of event pieces under the holistic understanding of the video content, which effectively increases the coherence and readability of predicted captions. Compared with prior art, the PDVC has several appealing advantages: (1) Without relying on heuristic non-maximum suppression or a recurrent event sequence selection network to remove redundancy, PDVC directly produces an event set with an appropriate size; (2) In contrast to adopting the two-stage scheme, we feed the enhanced representations of event queries into the localization head and caption head in parallel, making these two sub-tasks deeply interrelated and mutually promoted through the optimization; (3) Without bells and whistles, extensive experiments on ActivityNet Captions and YouCook2 show that PDVC is capable of producing high-quality captioning results, surpassing the state-of-the-art two-stage methods when its localization accuracy is on par with them. Code is available at https://github.com/ttengwang/PDVC.
Submitted 17 November, 2021; v1 submitted 17 August, 2021;
originally announced August 2021.
-
FaPN: Feature-aligned Pyramid Network for Dense Image Prediction
Authors:
Shihua Huang,
Zhichao Lu,
Ran Cheng,
Cheng He
Abstract:
Recent advancements in deep neural networks have led to remarkable leaps forward in dense image prediction. However, the issue of feature alignment remains neglected by most existing approaches for simplicity. Direct pixel addition between upsampled and local features leads to feature maps with misaligned contexts that, in turn, translate to misclassifications in prediction, especially on object boundaries. In this paper, we propose a feature alignment module that learns transformation offsets of pixels to contextually align upsampled higher-level features, and a feature selection module that emphasizes the lower-level features with rich spatial details. We then integrate these two modules in a top-down pyramidal architecture and present the Feature-aligned Pyramid Network (FaPN). Extensive experimental evaluations on four dense prediction tasks and four datasets have demonstrated the efficacy of FaPN, yielding an overall improvement of 1.2 - 2.6 points in AP / mIoU over FPN when paired with Faster / Mask R-CNN. In particular, our FaPN achieves the state-of-the-art of 56.7% mIoU on ADE20K when integrated within Mask-Former. The code is available from https://github.com/EMI-Group/FaPN.
Submitted 17 August, 2021; v1 submitted 16 August, 2021;
originally announced August 2021.
-
Photoacoustic Silk Scaffolds for Neural stimulation and Regeneration
Authors:
Nan Zheng,
Vincent Fitzpatrick,
Ran Cheng,
Linli Shi,
David L. Kaplan,
Chen Yang
Abstract:
Neural interfaces using biocompatible scaffolds provide crucial properties for the functional repair of nerve injuries and neurodegenerative diseases, including cell adhesion, structural support, and mass transport. Neural stimulation has also been found to be effective in promoting neural regeneration. This work provides a new strategy to integrate photoacoustic (PA) neural stimulation into hydrogel scaffolds using a nanocomposite hydrogel approach. Specifically, polyethylene glycol (PEG)-functionalized carbon nanotubes (CNT), highly efficient photoacoustic agents, are embedded into silk fibroin to form biocompatible and soft photoacoustic materials. We show that these photoacoustic functional scaffolds enable non-genetic activation of neurons with a spatial precision defined by the area of light illumination, promoting neuron regeneration. These CNT/silk scaffolds offered reliable and repeatable photoacoustic neural stimulation. 94% of photoacoustic stimulated neurons exhibit a fluorescence change larger than 10% in calcium imaging in the light illuminated area. The on-demand photoacoustic stimulation increased neurite outgrowth by 1.74-fold in a dorsal root ganglion model, when compared to the unstimulated group. We also confirmed that photoacoustic neural stimulation promoted neurite outgrowth by impacting the brain-derived neurotrophic factor (BDNF) pathway. As a multifunctional neural scaffold, CNT/silk scaffolds demonstrated non-genetic PA neural stimulation functions and promoted neurite outgrowth, providing a new method for non-pharmacological neural regeneration.
Submitted 27 June, 2021;
originally announced June 2021.
-
Existence of curves with constant geodesic curvature in a Riemannian 2-sphere
Authors:
Da Rong Cheng,
Xin Zhou
Abstract:
We prove the existence of immersed closed curves of constant geodesic curvature in an arbitrary Riemannian 2-sphere for almost every prescribed curvature. To achieve this, we develop a min-max scheme for a weighted length functional.
Submitted 23 June, 2021;
originally announced June 2021.
-
Protein-Polymer Mixtures in the Colloid Limit: Aggregation, Sedimentation and Crystallization
Authors:
Rui Cheng,
Jingwen Li,
Ioatzin Ríos de Anda,
Thomas W. C. Taylor,
Malcolm A. Faers,
J. L. Ross Anderson,
Annela M. Seddon,
C. Patrick Royall
Abstract:
While proteins have been treated as particles with a spherically symmetric interaction, in reality the situation is rather more complex. A simple step towards higher complexity is to treat the proteins as non-spherical particles, and that is the approach we pursue here. We investigate the phase behavior of enhanced green fluorescent protein (eGFP) under the addition of a non-adsorbing polymer, polyethylene glycol (PEG). From small angle x-ray scattering we infer that the eGFP undergoes dimerization and we treat the dimers as spherocylinders with aspect ratio $L/D-1 = 1.05$. Despite the complex nature of the proteins, we find that the phase behavior is similar to that of hard spherocylinders with ideal polymer depletant, exhibiting aggregation and, in a small region of the phase diagram, crystallization. By comparing our measurements of the onset of aggregation with predictions for hard colloids and ideal polymers [S.V. Savenko and M. Dijkstra, J. Chem. Phys. 124, 234902 (2006) and F. Lo Verso et al., Phys. Rev. E 73, 061407 (2006)] we find good agreement, which suggests that the behavior of eGFP is consistent with that of hard spherocylinders with ideal polymer.
Submitted 16 June, 2021;
originally announced June 2021.
-
Multi-Label Few-Shot Learning for Aspect Category Detection
Authors:
Mengting Hu,
Shiwan Zhao,
Honglei Guo,
Chao Xue,
Hang Gao,
Tiegang Gao,
Renhong Cheng,
Zhong Su
Abstract:
Aspect category detection (ACD) in sentiment analysis aims to identify the aspect categories mentioned in a sentence. In this paper, we formulate ACD in the few-shot learning scenario. However, existing few-shot learning approaches mainly focus on single-label predictions. These methods cannot work well for the ACD task since a sentence may contain multiple aspect categories. Therefore, we propose a multi-label few-shot learning method based on the prototypical network. To alleviate the noise, we design two effective attention mechanisms. The support-set attention aims to extract better prototypes by removing irrelevant aspects. The query-set attention computes multiple prototype-specific representations for each query instance, which are then used to compute accurate distances with the corresponding prototypes. To achieve multi-label inference, we further learn a dynamic threshold per instance by a policy network. Extensive experimental results on three datasets demonstrate that the proposed method significantly outperforms strong baselines.
Submitted 28 May, 2021;
originally announced May 2021.
-
Data-Efficient Language-Supervised Zero-Shot Learning with Self-Distillation
Authors:
Ruizhe Cheng,
Bichen Wu,
Peizhao Zhang,
Peter Vajda,
Joseph E. Gonzalez
Abstract:
Traditional computer vision models are trained to predict a fixed set of predefined categories. Recently, natural language has been shown to be a broader and richer source of supervision that provides finer descriptions of visual concepts than supervised "gold" labels. Previous works, such as CLIP, use a simple pretraining task of predicting the pairings between images and text captions. CLIP, however, is data-hungry and requires more than 400M image-text pairs for training. We propose a data-efficient contrastive distillation method that uses soft labels to learn from noisy image-text pairs. Our model transfers knowledge from pretrained image and sentence encoders and achieves strong performance with only 3M image-text pairs, 133x fewer than CLIP. Our method exceeds the previous SoTA of general zero-shot learning on ImageNet 21k+1k by 73% relatively with a ResNet50 image encoder and DeCLUTR text encoder. We also beat CLIP by 10.5% relatively on zero-shot evaluation on Google Open Images (19,958 classes).
Submitted 18 April, 2021;
originally announced April 2021.
-
Zeros of optimal polynomial approximants in $\ell^p_{A}$
Authors:
Raymond Cheng,
William T. Ross,
Daniel Seco
Abstract:
The study of inner and cyclic functions in $\ell^p_A$ spaces requires a better understanding of the zeros of the so-called optimal polynomial approximants.
We determine that a point of the complex plane is the zero of an optimal polynomial approximant for some element of $\ell^p_A$ if and only if it lies outside of a closed disk (centered at the origin) of a particular radius which depends on the value of $p$. We find the value of this radius for $p\neq 2$. In addition, for each positive integer $d$ there is a polynomial $f_d$ of degree at most $d$ that minimizes the modulus of the root of its optimal linear polynomial approximant. We develop a method for finding these extremal functions $f_d$ and discuss their properties. The method involves the Lagrange multiplier method and a resulting dynamical system.
Submitted 16 April, 2021;
originally announced April 2021.
-
Spin Pumping of an Easy-Plane Antiferromagnet Enhanced by Dzyaloshinskii-Moriya Interaction
Authors:
Hailong Wang,
Yuxuan Xiao,
Mingda Guo,
Eric Lee-Wong,
Gerald Q. Yan,
Ran Cheng,
Chunhui Rita Du
Abstract:
Recently, antiferromagnets have received revived interest due to their significant potential for developing next-generation ultrafast magnetic storage. Here we report dc spin pumping by the acoustic resonant mode in a canted easy-plane antiferromagnet α-Fe2O3 enabled by the Dzyaloshinskii-Moriya interaction. Systematic angle and frequency dependent measurements demonstrate that the observed spin pumping signals arise from resonance-induced spin injection and inverse spin Hall effect in α-Fe2O3/metal heterostructures, mimicking the behavior of spin pumping in conventional ferromagnet/nonmagnet systems. The pure spin current nature is further corroborated by reversal of the polarity of spin pumping signals when the spin detector is switched from platinum to tungsten which has an opposite sign of the spin Hall angle. Our results highlight the potential opportunities offered by the low-frequency acoustic resonant mode in canted easy-plane antiferromagnets for developing next-generation, functional spintronic devices.
Submitted 8 June, 2021; v1 submitted 5 April, 2021;
originally announced April 2021.
-
An Online Feedback-Based Linearized Power Flow Model for Unbalanced Distribution Networks
Authors:
Rui Cheng,
Zhaoyu Wang,
Yifei Guo
Abstract:
The non-linearity and non-convexity of power flow models and the phase coupling challenge the analysis and optimization of unbalanced distribution networks. To tackle the challenges, this paper proposes an online feedback-based linearized power flow model for unbalanced distribution networks with both wye-connected and delta-connected loads. The online feedback-based linearized model is grounded on the first-order Taylor expansion of the branch flow model, and updates the model parameters via online feedback by leveraging the instantaneous measured voltages and load consumption at the previous time step. Its closed-loop nature can asymptotically mitigate the model mismatch, thus lending itself to a good performance. In addition, exploiting the connection structure of unbalanced radial distribution networks, we provide a unified matrix-vector compact form of the proposed linearized power flow model. Finally, the numerical tests on the IEEE 123-bus test system validate the effectiveness and superiority of the proposed model. A simple optimal power flow case is also provided to illustrate the application of the online feedback-based linearized model.
Submitted 25 July, 2021; v1 submitted 27 March, 2021;
originally announced March 2021.
-
Lite-HDSeg: LiDAR Semantic Segmentation Using Lite Harmonic Dense Convolutions
Authors:
Ryan Razani,
Ran Cheng,
Ehsan Taghavi,
Liu Bingbing
Abstract:
Autonomous driving vehicles and robotic systems rely on accurate perception of their surroundings. Scene understanding is one of the crucial components of perception modules. Among all available sensors, LiDARs are one of the essential sensing modalities of autonomous driving systems due to their active sensing nature with high resolution of sensor readings. Accurate and fast semantic segmentation methods are needed to fully utilize LiDAR sensors for scene understanding. In this paper, we present Lite-HDSeg, a novel real-time convolutional neural network for semantic segmentation of full 3D LiDAR point clouds. Lite-HDSeg can achieve the best accuracy vs. computational complexity trade-off on the SemanticKITTI benchmark and is designed on the basis of a new encoder-decoder architecture with light-weight harmonic dense convolutions as its core. Moreover, we introduce ICM, an improved global contextual module to capture multi-scale contextual features, and MCSPN, a multi-class Spatial Propagation Network to further refine the semantic boundaries. Our experimental results show that the proposed method outperforms state-of-the-art semantic segmentation approaches that can run in real time, and is thus suitable for robotic and autonomous driving applications.
Submitted 16 March, 2021;
originally announced March 2021.
-
S3Net: 3D LiDAR Sparse Semantic Segmentation Network
Authors:
Ran Cheng,
Ryan Razani,
Yuan Ren,
Liu Bingbing
Abstract:
Semantic segmentation is a crucial component in the perception systems of many applications, such as robotics and autonomous driving, that rely on accurate environmental perception and understanding. In the literature, several approaches have been introduced for the LiDAR semantic segmentation task, such as projection-based (range-view or birds-eye-view) and voxel-based approaches. However, they either abandon the valuable 3D topology and geometric relations and suffer from information loss introduced in the projection process, or are inefficient. Therefore, there is a need for accurate models capable of processing the 3D driving-scene point cloud in 3D space. In this paper, we propose S3Net, a novel convolutional neural network for LiDAR point cloud semantic segmentation. It adopts an encoder-decoder backbone that consists of a Sparse Intra-channel Attention Module (SIntraAM) and a Sparse Inter-channel Attention Module (SInterAM) to emphasize the fine details both within each feature map and among nearby feature maps. To extract the global contexts in deeper layers, we introduce a Sparse Residual Tower based upon sparse convolution that suits the varying sparsity of LiDAR point clouds. In addition, a geo-aware anisotropic loss is leveraged to emphasize the semantic boundaries and penalize the noise within each predicted region, leading to a robust prediction. Our experimental results show that the proposed method leads to a large improvement (12%) compared to its baseline counterpart (MinkNet42) on the SemanticKITTI test set and achieves state-of-the-art mIoU accuracy among semantic segmentation approaches.
Submitted 15 March, 2021;
originally announced March 2021.
-
Spin Nernst Effect of Antiferromagnetic Magnons in the Presence of Spin Diffusion
Authors:
Hantao Zhang,
Ran Cheng
Abstract:
The magnon spin Nernst effect was recently proposed as an intrinsic effect in antiferromagnets, where spin diffusion and boundary spin transmission have been ignored. However, diffusion processes are essential to convert a bulk spin current into boundary spin accumulation, which determines the spin injection rate into detectors through imperfect transmission. We formulate a diffusive theory to describe the detection of the magnon spin Nernst effect with boundary conditions reflecting real device geometry. Thanks to the spin diffusion effect, the output signals in both electronic and optical detection grow rapidly with increasing system size in the transverse dimension and eventually saturate. Counterintuitively, the measurable signals are even functions of the magnetic field, making optical detection more favorable than electronic detection.
Submitted 21 September, 2021; v1 submitted 11 March, 2021;
originally announced March 2021.
-
Limits of Probabilistic Safety Guarantees when Considering Human Uncertainty
Authors:
Richard Cheng,
Richard M. Murray,
Joel W. Burdick
Abstract:
When autonomous robots interact with humans, such as during autonomous driving, explicit safety guarantees are crucial in order to avoid potentially life-threatening accidents. Many data-driven methods have explored learning probabilistic bounds over human agents' trajectories (i.e. confidence tubes that contain trajectories with probability $δ$), which can then be used to guarantee safety with probability $1-δ$. However, almost all existing works consider $δ\geq 0.001$. The purpose of this paper is to argue that (1) in safety-critical applications, it is necessary to provide safety guarantees with $δ< 10^{-8}$, and (2) current learning-based methods are ill-equipped to compute accurate confidence bounds at such low $δ$. Using human driving data (from the highD dataset), as well as synthetically generated data, we show that current uncertainty models use inaccurate distributional assumptions to describe human behavior and/or require infeasible amounts of data to accurately learn confidence bounds for $δ\leq 10^{-8}$. These two issues result in unreliable confidence bounds, which can have dangerous implications if deployed on safety-critical systems.
Submitted 24 March, 2021; v1 submitted 4 March, 2021;
originally announced March 2021.
-
Unbounded negativity on rational surfaces in positive characteristic
Authors:
Raymond Cheng,
Remy van Dobben de Bruyn
Abstract:
We give explicit blowups of the projective plane in positive characteristic that contain smooth rational curves of arbitrarily negative self-intersection, showing that the Bounded Negativity Conjecture fails even for rational surfaces in positive characteristic.
Submitted 2 March, 2021;
originally announced March 2021.
-
Integrated photonics on thin-film lithium niobate
Authors:
Di Zhu,
Linbo Shao,
Mengjie Yu,
Rebecca Cheng,
Boris Desiatov,
C. J. Xin,
Yaowen Hu,
Jeffrey Holzgrafe,
Soumya Ghosh,
Amirhassan Shams-Ansari,
Eric Puma,
Neil Sinclair,
Christian Reimer,
Mian Zhang,
Marko Lončar
Abstract:
Lithium niobate (LN), an outstanding and versatile material, has influenced our daily life for decades: from enabling high-speed optical communications that form the backbone of the Internet to realizing radio-frequency filtering used in our cell phones. This half-century-old material is currently embracing a revolution in thin-film LN integrated photonics. The success of manufacturing wafer-scale, high-quality, thin films of LN on insulator (LNOI), accompanied by breakthroughs in nanofabrication techniques, has made high-performance integrated nanophotonic components possible. With rapid development in the past few years, some of these thin-film LN devices, such as optical modulators and nonlinear wavelength converters, have already outperformed their legacy counterparts realized in bulk LN crystals. Furthermore, the nanophotonic integration has enabled ultra-low-loss resonators in LN, which have unlocked many novel applications such as optical frequency combs and quantum transducers. In this Review, we cover -- from basic principles to the state of the art -- the diverse aspects of integrated thin-film LN photonics, including the materials, basic passive components, and various active devices based on electro-optics, all-optical nonlinearities, and acousto-optics. We also identify challenges that this platform is currently facing and point out future opportunities. The field of integrated LNOI photonics is advancing rapidly and is poised to make critical impacts on a broad range of applications in communication, signal processing, and quantum information.
Submitted 23 February, 2021;
originally announced February 2021.
-
(AF)2-S3Net: Attentive Feature Fusion with Adaptive Feature Selection for Sparse Semantic Segmentation Network
Authors:
Ran Cheng,
Ryan Razani,
Ehsan Taghavi,
Enxu Li,
Bingbing Liu
Abstract:
Autonomous robotic systems and self-driving cars rely on accurate perception of their surroundings, as the safety of the passengers and pedestrians is the top priority. Semantic segmentation is one of the essential components of environmental perception that provides semantic information of the scene. Recently, several methods have been introduced for 3D LiDAR semantic segmentation. While they can lead to improved performance, they are either afflicted by high computational complexity, and are therefore inefficient, or lack fine details of smaller instances. To alleviate this problem, we propose AF2-S3Net, an end-to-end encoder-decoder CNN for 3D LiDAR semantic segmentation. We present a novel multi-branch attentive feature fusion module in the encoder and a unique adaptive feature selection module with feature map re-weighting in the decoder. Our AF2-S3Net fuses voxel-based learning and point-based learning into a single framework to effectively process the large 3D scene. Our experimental results show that the proposed method outperforms the state-of-the-art approaches on the large-scale SemanticKITTI benchmark, ranking 1st on the competitive public leaderboard upon publication.
Submitted 8 February, 2021;
originally announced February 2021.
-
Hierarchical Ranking for Answer Selection
Authors:
Hang Gao,
Mengting Hu,
Renhong Cheng,
Tiegang Gao
Abstract:
Answer selection is the task of choosing the positive answers from a pool of candidate answers for a given question. In this paper, we propose a novel strategy for answer selection, called hierarchical ranking. We introduce three levels of ranking: point-level ranking, pair-level ranking, and list-level ranking. They formulate their optimization objectives by employing supervisory information from different perspectives to achieve the same goal of ranking candidate answers. Therefore, the three levels of ranking are related and can promote each other. We take the well-performing compare-aggregate model as the backbone and explore three schemes to implement the idea of applying the hierarchical rankings jointly: the scheme under the Multi-Task Learning (MTL) strategy, the Ranking Integration (RI) scheme, and the Progressive Ranking Integration (PRI) scheme. Experimental results on two public datasets, WikiQA and TREC-QA, demonstrate that the proposed hierarchical ranking is effective. Our method achieves state-of-the-art (non-BERT) performance on both TREC-QA and WikiQA.
Submitted 1 February, 2021;
originally announced February 2021.
-
The Physics of Accretion Discs, Winds And Jets in Tidal Disruption Events
Authors:
Jane Lixin Dai,
Giuseppe Lodato,
Roseanne M. Cheng
Abstract:
Accretion onto black holes is an efficient mechanism for converting the gas mass-energy into energetic outputs such as radiation, winds and jets. Tidal disruption events, in which stars are tidally torn apart and then accreted onto supermassive black holes, offer unique opportunities for studying the accretion physics as well as the wind and jet launching physics across different accretion regimes. In this review, we systematically describe and discuss the models that have been developed to study the accretion flows and jets in tidal disruption events. A good knowledge of this physics is not only needed for understanding the emissions of the observed events, but also crucial for probing the general relativistic space-time around black holes and the demographics of supermassive black holes via tidal disruption events.
Submitted 13 January, 2021;
originally announced January 2021.
-
Bidirectional electro-optic conversion reaching 1% efficiency with thin-film lithium niobate
Authors:
Yuntao Xu,
Ayed Al Sayem,
Linran Fan,
Sihao Wang,
Risheng Cheng,
Chang-Ling Zou,
Wei Fu,
Likai Yang,
Mingrui Xu,
Hong X. Tang
Abstract:
Superconducting cavity electro-optics (EO) presents a promising route to coherently convert microwave and optical photons and distribute quantum entanglement between superconducting circuits over long distances through an optical network. High EO conversion efficiency demands transduction materials with a strong Pockels effect and excellent optical transparency. Thin-film Lithium Niobate (TFLN) offers these desired characteristics; however, it has so far delivered only unidirectional conversion with efficiencies on the order of $10^{-5}$, largely limited by its prominent photorefractive (PR) effect at cryogenic temperatures. Here we show that, by mitigating the PR effect and the associated charge-screening effect, the device's conversion efficiency can be enhanced by orders of magnitude while maintaining stable cryogenic operation, thus allowing a demonstration of conversion bidirectionality and accurate quantification of on-chip efficiency. With the optimized monolithic integrated superconducting EO device based on a TFLN-on-sapphire substrate, an on-chip conversion efficiency of 1.02% (internal efficiency, 15.2%) is realized. Our demonstration indicates that, with further device improvement, it is feasible for TFLN to approach unity internal conversion efficiency.
Submitted 31 December, 2020; v1 submitted 29 December, 2020;
originally announced December 2020.
-
Existence of constant mean curvature 2-spheres in Riemannian 3-spheres
Authors:
Da Rong Cheng,
Xin Zhou
Abstract:
We prove the existence of branched immersed constant mean curvature 2-spheres in an arbitrary Riemannian 3-sphere for almost every prescribed mean curvature, and moreover for all prescribed mean curvatures when the 3-sphere is positively curved. To achieve this, we develop a min-max scheme for a weighted Dirichlet energy functional. There are three main ingredients in our approach: a bi-harmonic approximation procedure to obtain compactness of the new functional, a derivative estimate of the min-max values to gain energy upper bounds for min-max sequences for almost every choice of mean curvature, and a Morse index estimate to obtain another uniform energy bound required to reach the remaining constant mean curvatures in the presence of positive curvature.
Submitted 22 October, 2021; v1 submitted 24 December, 2020;
originally announced December 2020.
-
Field-free deterministic switching of a perpendicularly polarized magnet using unconventional spin-orbit torques in WTe2
Authors:
I-Hsuan Kao,
Ryan Muzzio,
Hantao Zhang,
Menglin Zhu,
Jacob Gobbo,
Daniel Weber,
Rahul Rao,
Jiahan Li,
James H. Edgar,
Joshua E. Goldberger,
Jiaqiang Yan,
David G. Mandrus,
Jinwoo Hwang,
Ran Cheng,
Jyoti Katoch,
Simranjeet Singh
Abstract:
Spin-orbit torque (SOT) driven deterministic control of the magnetization state of a magnet with perpendicular magnetic anisotropy (PMA) is key to next-generation spintronic applications, including non-volatile, ultrafast, and energy-efficient data storage devices. However, field-free deterministic switching of perpendicular magnetization remains a challenge because it requires an out-of-plane anti-damping torque, which is not allowed in conventional spin source materials such as heavy metals (HM) and topological insulators due to the system's symmetry. The exploitation of low-crystal symmetries in emergent quantum materials offers a unique approach to achieve SOTs with unconventional forms. Here, we report the first experimental realization of field-free deterministic magnetic switching of a perpendicularly polarized van der Waals (vdW) magnet employing an out-of-plane anti-damping SOT generated in layered WTe2, which is a low-crystal symmetry quantum material. The numerical simulations confirm that the out-of-plane anti-damping torque in WTe2 is responsible for the observed magnetization switching in the perpendicular direction.
Submitted 22 December, 2020;
originally announced December 2020.
-
S3CNet: A Sparse Semantic Scene Completion Network for LiDAR Point Clouds
Authors:
Ran Cheng,
Christopher Agia,
Yuan Ren,
Xinhai Li,
Liu Bingbing
Abstract:
With the increasing reliance of self-driving and similar robotic systems on robust 3D vision, the processing of LiDAR scans with deep convolutional neural networks has become a trend in academia and industry alike. Prior attempts at the challenging Semantic Scene Completion task - which entails the inference of dense 3D structure and associated semantic labels from "sparse" representations - have been, to a degree, successful in small indoor scenes when provided with dense point clouds or dense depth maps often fused with semantic segmentation maps from RGB images. However, the performance of these systems drops drastically when applied to large outdoor scenes characterized by dynamic and exponentially sparser conditions. Likewise, processing of the entire sparse volume becomes infeasible due to memory limitations, and workarounds introduce computational inefficiency as practitioners are forced to divide the overall volume into multiple equal segments and infer on each individually, rendering real-time performance impossible. In this work, we formulate a method that subsumes the sparsity of large-scale environments and present S3CNet, a sparse convolution based neural network that predicts the semantically completed scene from a single, unified LiDAR point cloud. We show that our proposed method outperforms all counterparts on the 3D task, achieving state-of-the-art results on the SemanticKITTI benchmark. Furthermore, we propose a 2D variant of S3CNet with a multi-view fusion strategy to complement our 3D network, providing robustness to occlusions and extreme sparsity in distant regions. We conduct experiments for the 2D semantic scene completion task and compare the results of our sparse 2D network against several leading LiDAR segmentation models adapted for bird's eye view segmentation on two open-source datasets.
Submitted 16 December, 2020;
originally announced December 2020.
-
Multi-objective Neural Architecture Search with Almost No Training
Authors:
Shengran Hu,
Ran Cheng,
Cheng He,
Zhichao Lu
Abstract:
In the recent past, neural architecture search (NAS) has attracted increasing attention from both academia and industry. Despite the steady stream of impressive empirical results, most existing NAS algorithms are computationally prohibitive to execute due to the costly iterations of stochastic gradient descent (SGD) training. In this work, we propose an effective alternative, dubbed Random-Weight Evaluation (RWE), to rapidly estimate the performance of network architectures. By training only the last linear classification layer, RWE reduces the computational cost of evaluating an architecture from hours to seconds. When integrated within an evolutionary multi-objective algorithm, RWE obtains a set of efficient architectures with state-of-the-art performance on CIFAR-10 with less than two hours of searching on a single GPU card. Ablation studies on rank-order correlations and transfer learning experiments to ImageNet have further validated the effectiveness of RWE.
Submitted 27 November, 2020;
originally announced November 2020.
-
Ground-state Pulsed Cavity Electro-optics for Microwave-to-optical Conversion
Authors:
Wei Fu,
Mingrui Xu,
Xianwen Liu,
Chang-Ling Zou,
Changchun Zhong,
Xu Han,
Mohan Shen,
Yuntao Xu,
Risheng Cheng,
Sihao Wang,
Liang Jiang,
Hong X. Tang
Abstract:
In the development of quantum microwave-to-optical (MO) converters, excessive noise induced by the parametric optical drive remains a major challenge at millikelvin temperatures. Here we study the extraneous noise added to an electro-optic transducer in its quantum ground state under an intense pulsed optical excitation. The integrated electro-optic transducer leverages the inherent Pockels effect of aluminum nitride microrings, flip-chip bonded to a superconducting resonator. Applying a pulsed optical drive with peak power exceeding the cooling power of the dilution refrigerator at its base temperature, we observe efficient bi-directional MO conversion, with near-ground state microwave thermal excitation ($\bar{n}_\mathrm{e}=0.09\pm0.06$). A time-evolution study reveals that the residual thermal excitation is dominated by the superconductor absorption of stray light scattered off the chip-fiber interface. Our results shed light on suppressing microwave noise in a cavity electro-optic system under intense optical drive, which is an essential step towards quantum state transduction between microwave and optical frequencies.
Submitted 21 October, 2020;
originally announced October 2020.
-
Spin fluctuations in quantized transport of magnetic topological insulators
Authors:
Yu-Hang Li,
Ran Cheng
Abstract:
In magnetic topological insulators, quantized electronic transport is intertwined with spontaneous magnetic ordering, as magnetization controls band gaps, and hence band topology, through the exchange interaction. We show that considering the exchange gaps at the mean-field level is inadequate to predict phase transitions between electronic states of distinct topology. Thermal spin fluctuations disturbing the magnetization can act as frozen disorder that strongly scatters electrons, reducing the onset temperature of quantized transport appreciably even in the absence of structural impurities. This effect, which has hitherto been overlooked, provides an alternative explanation of recent experiments on intrinsic magnetic topological insulators.
Submitted 1 October, 2020;
originally announced October 2020.
-
Magnon Thermal Edelstein Effect Detected by Inverse Spin Hall Effect
Authors:
Hantao Zhang,
Ran Cheng
Abstract:
In an easy-plane antiferromagnet with the Dzyaloshinskii-Moriya interaction (DMI), magnons are subject to an effective spin-momentum locking. An in-plane temperature gradient can generate interfacial accumulation of magnons with a specified polarization, realizing the magnon thermal Edelstein effect. We theoretically investigate the injection and detection of this thermally-driven spin polarization in an adjacent heavy metal with strong spin Hall effect. We find that the inverse spin Hall voltage depends monotonically on both temperature and the DMI but non-monotonically on the hard-axis anisotropy. Counterintuitively, the magnon thermal Edelstein effect is an even function of a magnetic field applied along the Néel vector.
Submitted 30 November, 2020; v1 submitted 22 September, 2020;
originally announced September 2020.
-
RelativeNAS: Relative Neural Architecture Search via Slow-Fast Learning
Authors:
Hao Tan,
Ran Cheng,
Shihua Huang,
Cheng He,
Changxiao Qiu,
Fan Yang,
Ping Luo
Abstract:
Despite the remarkable successes of Convolutional Neural Networks (CNNs) in computer vision, it is time-consuming and error-prone to manually design a CNN. Among various Neural Architecture Search (NAS) methods that aim to automate the design of high-performance CNNs, differentiable NAS and population-based NAS are attracting increasing interest due to their unique characteristics. To benefit from the merits while overcoming the deficiencies of both, this work proposes a novel NAS method, RelativeNAS. As the key to efficient search, RelativeNAS performs joint learning between fast-learners (i.e. networks with relatively higher accuracy) and slow-learners in a pairwise manner. Moreover, since RelativeNAS only requires low-fidelity performance estimation to distinguish each pair of fast-learner and slow-learner, it saves certain computation costs for training the candidate architectures. The proposed RelativeNAS brings several unique advantages: (1) it achieves state-of-the-art performance on ImageNet with a top-1 error rate of 24.88%, i.e. outperforming DARTS and AmoebaNet-B by 1.82% and 1.12% respectively; (2) it spends only nine hours with a single 1080Ti GPU to obtain the discovered cells, i.e. 3.75x and 7875x faster than DARTS and AmoebaNet respectively; (3) it shows that the discovered cells obtained on CIFAR-10 can be directly transferred to object detection, semantic segmentation, and keypoint detection, yielding competitive results of 73.1% mAP on PASCAL VOC, 78.7% mIoU on Cityscapes, and 68.5% AP on MSCOCO, respectively. The implementation of RelativeNAS is available at https://github.com/EMI-Group/RelativeNAS
Submitted 13 July, 2021; v1 submitted 14 September, 2020;
originally announced September 2020.