A design of human-like robust AI machines in object identification

Abstract: This is a perspective paper inspired by the study of the Turing Test proposed by A.M. Turing (23 June 1912 - 7 June 1954) in 1950. Following one important implication of the Turing Test for enabling a machine with a human-like behavior or performance, we define human-like robustness (HLR) for AI machines. The new definition aims to enforce AI machines with HLR, including evaluating them in terms of HLR. A specific task is discussed only on object identification, because it is the most common task for every person in daily life. Similar to the perspective, or design, position by Turing, we provide a solution of how to achieve HLR AI machines without constructing them and conducting real experiments. The solution should consist of three important features in the machines. The first feature of HLR machines is to utilize common sense from humans for realizing a causal inference. The second feature is to make a decision from a semantic space for having interpretations to the decision. The third feature is to include a "human-in-the-loop" setting for advancing HLR machines. We show an "identification game" using the proposed design of HLR machines. The present paper shows an attempt to learn and explore further from the Turing Test towards the design of human-like AI machines.

Submitted 6 January, 2021; originally announced January 2021.

Comments: 6 pages, 6 figures

arXiv:2101.00828 [pdf, other]

Transformer-based Conditional Variational Autoencoder for Controllable Story Generation

Authors: Le Fang, Tao Zeng, Chaochun Liu, Liefeng Bo, Wen Dong, Changyou Chen

Abstract: We investigate large-scale latent variable models (LVMs) for neural story generation -- an under-explored application for open-domain long text -- with objectives in two threads: generation effectiveness and controllability. LVMs, especially the variational autoencoder (VAE), have achieved both effective and controllable generation through exploiting flexible distributional latent representations. Recently, Transformers and their variants have achieved remarkable effectiveness without explicit latent representation learning, and thus lack satisfactory controllability in generation. In this paper, we advocate reviving latent variable modeling, essentially the power of representation learning, in the era of Transformers to enhance controllability without hurting state-of-the-art generation effectiveness. Specifically, we integrate latent representation vectors with a Transformer-based pre-trained architecture to build a conditional variational autoencoder (CVAE). Model components such as encoder, decoder and the variational posterior are all built on top of pre-trained language models -- GPT2 specifically in this paper. Experiments demonstrate state-of-the-art conditional generation ability of our model, as well as its excellent representation learning capability and controllability.

Submitted 8 July, 2021; v1 submitted 4 January, 2021; originally announced January 2021.

arXiv:2101.00822 [pdf, other]

Outline to Story: Fine-grained Controllable Story Generation from Cascaded Events

Authors: Le Fang, Tao Zeng, Chaochun Liu, Liefeng Bo, Wen Dong, Changyou Chen

Abstract: Large-scale pretrained language models have shown thrilling generation capabilities, especially when they generate consistent long text in thousands of words with ease. However, users of these models can only control the prefix of sentences or certain global aspects of generated text. It is challenging to simultaneously achieve fine-grained controllability and preserve the state-of-the-art unconditional text generation capability. In this paper, we first propose a new task named "Outline to Story" (O2S) as a test bed for fine-grained controllable generation of long text, which generates a multi-paragraph story from cascaded events, i.e. a sequence of outline events that guide subsequent paragraph generation. We then create dedicated datasets for future benchmarks, built by state-of-the-art keyword extraction techniques. Finally, we propose an extremely simple yet strong baseline method for the O2S task, which fine-tunes pre-trained language models on augmented sequences of outline-story pairs with a simple language modeling objective. Our method does not introduce any new parameters or perform any architecture modification, except several special tokens as delimiters to build augmented sequences. Extensive experiments on various datasets demonstrate state-of-the-art conditional story generation performance with our model, achieving better fine-grained controllability and user flexibility. To our knowledge, our paper is among the first to propose a model and to create datasets for the task of "outline to story". Our work also instantiates research interest of fine-grained controllable generation of open-domain long text, where controlling inputs are represented by short text.

Submitted 4 January, 2021; originally announced January 2021.

arXiv:2012.13228 [pdf, ps, other]

doi 10.1093/mnras/staa2448

Insight-HXMT observations of Swift J0243.6+6124: the evolution of RMS pulse fractions at super-Eddington luminosity

Authors: P. J. Wang, L. D. Kong, S. Zhang, Y. P. Chen, S. N. Zhang, J. L. Qu, L. Ji, L. Tao, M. Y. Ge, F. J. Lu, L. Chen, L. M. Song, T. P. Li, Y. P. Xu, X. L. Cao, Y. Chen, C. Z. Liu, Q. C. Bu, C. Cai, Z. Chang, G. Chen, T. X. Chen, Y. B. Chen, W. Cui, W. W. Cui, et al. (95 additional authors not shown)

Abstract: Based on Insight-HXMT data, we report on the pulse fraction evolution during the 2017-2018 outburst of the newly discovered first Galactic ultraluminous X-ray source (ULX) Swift J0243.6+6124. The pulse fractions of 19 observation pairs selected in the rising and fading phases with similar luminosity are investigated. The results show a general trend of the pulse fraction increasing with luminosity and energy at super-critical luminosity. However, the relative strength of the pulsation between each pair evolves strongly with luminosity. The pulse fraction in the rising phase is larger at luminosity below $7.71\times10^{38}$~erg~s$^{-1}$, but smaller above it. The transition luminosity is found to be energy independent. Such a phenomenon is confirmed for the first time by Insight-HXMT observations, and we speculate that it may be related to the radiation-pressure-dominated accretion disk.

Submitted 24 December, 2020; originally announced December 2020.

arXiv:2012.02621 [pdf, other]

Effective Label Propagation for Discriminative Semi-Supervised Domain Adaptation

Authors: Zhiyong Huang, Kekai Sheng, Weiming Dong, Xing Mei, Chongyang Ma, Feiyue Huang, Dengwen Zhou, Changsheng Xu

Abstract: Semi-supervised domain adaptation (SSDA) methods have demonstrated great potential in large-scale image classification tasks when massive labeled data are available in the source domain but very few labeled samples are provided in the target domain. Existing solutions usually focus on feature alignment between the two domains while paying little attention to the discrimination capability of learned representations in the target domain. In this paper, we present a novel and effective method, namely Effective Label Propagation (ELP), to tackle this problem by using effective inter-domain and intra-domain semantic information propagation. For inter-domain propagation, we propose a new cycle discrepancy loss to encourage consistency of semantic information between the two domains. For intra-domain propagation, we propose an effective self-training strategy to mitigate the noise in pseudo-labeled target domain data and improve the feature discriminability in the target domain. As a general method, our ELP can be easily applied to various domain adaptation approaches and can facilitate their feature discrimination in the target domain. Experiments on Office-Home and DomainNet benchmarks show ELP consistently improves the classification accuracy of mainstream SSDA methods by 2%~3%. Additionally, ELP improves the performance of UDA methods (81.5% vs 86.1%), based on UDA experiments on the VisDA-2017 benchmark. Our source code and pre-trained models will be released soon.

Submitted 4 December, 2020; originally announced December 2020.

arXiv:2011.02658 [pdf, other]

Compositional Scalable Object SLAM

Authors: Akash Sharma, Wei Dong, Michael Kaess

Abstract: We present a fast, scalable, and accurate Simultaneous Localization and Mapping (SLAM) system that represents indoor scenes as a graph of objects. Leveraging the observation that artificial environments are structured and occupied by recognizable objects, we show that a compositional scalable object mapping formulation is amenable to a robust SLAM solution for drift-free large scale indoor reconstruction. To achieve this, we propose a novel semantically assisted data association strategy that obtains unambiguous persistent object landmarks, and a 2.5D compositional rendering method that enables reliable frame-to-model RGB-D tracking. Consequently, we deliver an optimized online implementation that can run at near frame rate with a single graphics card, and provide a comprehensive evaluation against state of the art baselines. An open source implementation will be provided at https://placeholder.

Submitted 4 November, 2020; originally announced November 2020.

Comments: Submitted to the 2021 IEEE International Conference on Robotics and Automation (ICRA) 7 pages, 7 figures

arXiv:2010.14050 [pdf, other]

Convergence Analysis for Computation of Coupled Advection-Diffusion-Reaction Problems

Authors: W. B. Dong, H. S. Tang, Y. J. Liu

Abstract: A study is presented on the convergence of the computation of coupled advection-diffusion-reaction equations. In the computation, the equations with different coefficients and even types are assigned in two subdomains, and Schwarz iteration is made between the equations when marching from a time level to the next one. The analysis starts with the linear systems resulting from the full discretization of the equations by explicit schemes. Conditions for convergence are derived, and its speedup and the effects of difference in the equations are discussed. Then, it proceeds to an implicit scheme, and a recursive expression for convergence speed is derived. An optimal interface condition for the Schwarz iteration is obtained, and it leads to "perfect convergence", that is, convergence within two iterations. Furthermore, the methods and analyses are extended to the coupling of the viscous Burgers equations. Numerical experiments indicate that the conclusions, such as the "perfect convergence", drawn in the linear situations may remain in the Burgers equations' computation.

Submitted 2 April, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

Comments: Revision mainly on description/discussion, results unchanged

arXiv:2010.13917 [pdf, other]

doi 10.1016/j.cnsns.2021.105729

An exploratory study on machine learning to couple numerical solutions of partial differential equations

Authors: H. S. Tang, L. Li, M. Grossberg, Y. J. Liu, Y. M. Jia, S. S. Li, W. B. Dong

Abstract: As further progress in the accurate and efficient computation of coupled partial differential equations (PDEs) becomes increasingly difficult, it has become highly desirable to develop new methods for such computation. Departing from conventional approaches, this short communication paper explores a computational paradigm that couples numerical solutions of PDEs via machine-learning (ML) based methods, together with a preliminary study on the paradigm. In particular, it solves PDEs in subdomains as in a conventional approach but develops and trains artificial neural networks (ANN) to couple the PDEs' solutions at their interfaces, leading to solutions to the PDEs in the whole domains. The concepts and algorithms for the ML coupling are discussed using coupled Poisson equations and coupled advection-diffusion equations. Preliminary numerical examples illustrate the feasibility and performance of the ML coupling. Although preliminary, the results of this exploratory study indicate that the ML paradigm is promising and deserves further research.

Submitted 26 October, 2020; originally announced October 2020.

arXiv:2010.08300 [pdf]

Interpretable Disease Prediction based on Reinforcement Path Reasoning over Knowledge Graphs

Authors: Zhoujian Sun, Wei Dong, Jinlong Shi, Zhengxing Huang

Abstract: Objective: To combine medical knowledge and medical data to interpretably predict the risk of disease. Methods: We formulated the disease prediction task as a random walk along a knowledge graph (KG). Specifically, we build a KG to record relationships between diseases and risk factors according to validated medical knowledge. Then, a mathematical object walks along the KG. It starts walking at a patient entity, which connects to the KG based on the patient's current diseases or risk factors, and stops at a disease entity, which represents the predicted disease. The trajectory generated by the object represents an interpretable disease progression path of the given patient. The dynamics of the object are controlled by a policy-based reinforcement learning (RL) module, which is trained on electronic health records (EHRs). Experiments: We utilized two real-world EHR datasets to evaluate the performance of our model. In the disease prediction task, our model achieves 0.743 and 0.639 in terms of macro area under the curve (AUC) in predicting 53 circulation system diseases in the two datasets, respectively. This performance is comparable to the commonly used machine learning (ML) models in medical research. In qualitative analysis, our clinical collaborator reviewed the disease progression paths generated by our model and advocated their interpretability and reliability. Conclusion: Experimental results validate the proposed model in interpretably evaluating and optimizing disease prediction. Significance: Our work contributes to leveraging the potential of medical knowledge and medical data jointly for interpretable prediction tasks.

Submitted 6 January, 2023; v1 submitted 16 October, 2020; originally announced October 2020.

Comments: 10 pages, 5 figures

arXiv:2010.04977 [pdf, other]

doi 10.1109/TMECH.2021.3060511

An Active Sense and Avoid System for Flying Robots in Dynamic Environments

Authors: Gang Chen, Wei Dong, Xinjun Sheng, Xiangyang Zhu, Han Ding

Abstract: This paper investigates a novel active-sensing-based obstacle avoidance paradigm for flying robots in dynamic environments. Instead of fusing multiple sensors to enlarge the field of view (FOV), we introduce an alternative approach that utilizes a stereo camera with an independent rotational DOF to sense the obstacles actively. In particular, the sensing direction is planned heuristically by multiple objectives, including tracking dynamic obstacles, observing the heading direction, and exploring the previously unseen area. With the sensing result, a flight path is then planned based on real-time sampling and uncertainty-aware collision checking in the state space, which constitutes an active sense and avoid (ASAA) system. Experiments in both simulation and the real world demonstrate that this system can well cope with dynamic obstacles and abrupt goal direction changes. Since only one stereo camera is utilized, this system provides a low-cost and effective approach to overcome the FOV limitation in visual navigation.

Submitted 17 February, 2021; v1 submitted 10 October, 2020; originally announced October 2020.

Comments: Accepted by IEEE Transactions on Mechatronics on 27 Jan 2021

arXiv:2009.08003 [pdf, other]

Arbitrary Video Style Transfer via Multi-Channel Correlation

Authors: Yingying Deng, Fan Tang, Weiming Dong, Haibin Huang, Chongyang Ma, Changsheng Xu

Abstract: Video style transfer is getting more attention in the AI community for its numerous applications such as augmented reality and animation productions. Compared with traditional image style transfer, performing this task on video presents new challenges: how to effectively generate satisfactory stylized results for any specified style, and maintain temporal coherence across frames at the same time. Towards this end, we propose Multi-Channel Correction network (MCCNet), which can be trained to fuse the exemplar style features and input content features for efficient style transfer while naturally maintaining the coherence of input videos. Specifically, MCCNet works directly on the feature space of style and content domain where it learns to rearrange and fuse style features based on their similarity with content features. The outputs generated by MCC are features containing the desired style patterns which can further be decoded into images with vivid style textures. Moreover, MCCNet is also designed to explicitly align the features to input, which ensures the output maintains the content structures as well as the temporal continuity. To further improve the performance of MCCNet under complex light conditions, we also introduce the illumination loss during training. Qualitative and quantitative evaluations demonstrate that MCCNet performs well in both arbitrary video and image style transfer tasks.

Submitted 19 January, 2021; v1 submitted 16 September, 2020; originally announced September 2020.

arXiv:2009.06254 [pdf, other]

doi 10.1109/JSTSP.2020.3037516

Accurate and Lightweight Image Super-Resolution with Model-Guided Deep Unfolding Network

Authors: Qian Ning, Weisheng Dong, Guangming Shi, Leida Li, Xin Li

Abstract: Deep neural network (DNN) based methods have achieved great success in single image super-resolution (SISR). However, existing state-of-the-art SISR techniques are designed like black boxes lacking transparency and interpretability. Moreover, the improvement in visual quality is often at the price of increased model complexity due to black-box design. In this paper, we present and advocate an explainable approach toward SISR named model-guided deep unfolding network (MoG-DUN). Aiming to break the coherence barrier, we opt to work with a well-established image prior named nonlocal auto-regressive model and use it to guide our DNN design. By integrating deep denoising and nonlocal regularization as trainable modules within a deep learning framework, we can unfold the iterative process of model-based SISR into a multi-stage concatenation of building blocks with three interconnected modules (denoising, nonlocal-AR, and reconstruction). The design of all three modules leverages the latest advances including dense/skip connections as well as fast nonlocal implementation. In addition to explainability, MoG-DUN is accurate (producing fewer aliasing artifacts), computationally efficient (with reduced model parameters), and versatile (capable of handling multiple degradations). The superiority of the proposed MoG-DUN method to existing state-of-the-art image SR methods including RCAN, SRMDNF, and SRFBN is substantiated by extensive experiments on several popular datasets and various degradation scenarios.

Submitted 21 November, 2020; v1 submitted 14 September, 2020; originally announced September 2020.

Comments: Image Super-resolution, in IEEE Journal of Selected Topics in Signal Processing

arXiv:2008.11832 [pdf, other]

doi 10.1145/3295500.3356147

Adaptive Neural Network-Based Approximation to Accelerate Eulerian Fluid Simulation

Authors: Wenqian Dong, Jie Liu, Zhen Xie, Dong Li

Abstract: The Eulerian fluid simulation is an important HPC application. The neural network has been applied to accelerate it. The current methods that accelerate the fluid simulation with neural networks lack flexibility and generalization. In this paper, we tackle the above limitation and aim to enhance the applicability of neural networks in the Eulerian fluid simulation. We introduce Smartfluidnet, a framework that automates model generation and application. Given an existing neural network as input, Smartfluidnet generates multiple neural networks before the simulation to meet the execution time and simulation quality requirement. During the simulation, Smartfluidnet dynamically switches the neural networks to make the best efforts to reach the user requirement on simulation quality. Evaluating with 20,480 input problems, we show that Smartfluidnet achieves 1.46x and 590x speedup compared with a state-of-the-art neural network model and the original fluid simulation respectively on an NVIDIA Titan X Pascal GPU, while providing better simulation quality than the state-of-the-art model.

Submitted 26 August, 2020; originally announced August 2020.

arXiv:2008.11827 [pdf, other]

Smart-PGSim: Using Neural Network to Accelerate AC-OPF Power Grid Simulation

Authors: Wenqian Dong, Zhen Xie, Gokcen Kestor, Dong Li

Abstract: The optimal power flow (OPF) problem is one of the most important optimization problems for the operation of the power grid. It calculates the optimum scheduling of the committed generation units. In this paper, we develop a neural network approach to accelerating the alternating current optimal power flow (AC-OPF) problem by generating an intelligent initial solution. The high quality of the initial solution and guidance of other outputs generated by the neural network enable faster convergence to the solution without losing optimality of the final solution as computed by traditional methods. Smart-PGSim generates a novel multitask-learning neural network model to accelerate the AC-OPF simulation. Smart-PGSim also imposes the physical constraints of the simulation on the neural network automatically. Smart-PGSim brings an average of 49.2% performance improvement (up to 91%), computed over 10,000 problem simulations, with respect to the original AC-OPF implementation, without losing the optimality of the final solution.

Submitted 26 August, 2020; originally announced August 2020.

arXiv:2008.01797 [pdf, ps, other]

doi 10.3847/2041-8213/abac05

Insight-HXMT firm detection of the highest energy fundamental cyclotron resonance scattering feature in the spectrum of GRO J1008-57

Authors: M. Y. Ge, L. Ji, S. N. Zhang, A. Santangelo, C. Z. Liu, V. Doroshenko, R. Staubert, J. L. Qu, S. Zhang, F. J. Lu, L. M. Song, T. P. Li, L. Tao, Y. P. Xu, X. L. Cao, Y. Chen, Q. C. Bu, C. Cai, Z. Chang, G. Chen, L. Chen, T. X. Chen, Y. B. Chen, Y. P. Chen, W. Cui, et al. (99 additional authors not shown)

Abstract: We report on the observation of the accreting pulsar GRO J1008-57 performed by Insight-HXMT at the peak of the source's 2017 outburst. Pulsations are detected with a spin period of 93.283(1) s. The pulse profile shows double peaks at soft X-rays, and only one peak above 20 keV. The spectrum is well described by the phenomenological models of X-ray pulsars. A cyclotron resonant scattering feature is detected with very high statistical significance at a centroid energy of $E_{\rm cyc}=90.32_{-0.28}^{+0.32}$ keV, for the reference continuum and line models, HIGHECUT and GABS respectively. Detection is very robust with respect to different continuum models. The line energy is significantly higher than what was suggested by previous observations, which provided very marginal evidence for the line. This establishes a new record for the centroid energy of a fundamental cyclotron resonant scattering feature observed in accreting pulsars. We also discuss the accretion regime of the source during the Insight-HXMT observation.

Submitted 4 August, 2020; originally announced August 2020.

Comments: 8 pages, 3 figures, accepted for publication in ApJL

arXiv:2006.03978 [pdf, other]

Stable and Efficient Policy Evaluation

Authors: Daoming Lyu, Bo Liu, Matthieu Geist, Wen Dong, Saad Biaz, Qi Wang

Abstract: Policy evaluation algorithms are essential to reinforcement learning due to their ability to predict the performance of a policy. However, there are two long-standing issues lying in this prediction problem that need to be tackled: off-policy stability and on-policy efficiency. The conventional temporal difference (TD) algorithm is known to perform very well in the on-policy setting, yet is not off-policy stable. On the other hand, the gradient TD and emphatic TD algorithms are off-policy stable, but are not on-policy efficient. This paper introduces novel algorithms that are both off-policy stable and on-policy efficient by using the oblique projection method. The empirical experimental results on various domains validate the effectiveness of the proposed approach.

Submitted 27 December, 2021; v1 submitted 6 June, 2020; originally announced June 2020.

Comments: IEEE Transactions on Neural Networks and Learning Systems (IEEE-TNNLS). arXiv admin note: text overlap with arXiv:1704.05147

arXiv:2006.01431 [pdf, other]

Distribution Aligned Multimodal and Multi-Domain Image Stylization

Authors: Minxuan Lin, Fan Tang, Weiming Dong, Xiao Li, Chongyang Ma, Changsheng Xu

Abstract: Multimodal and multi-domain stylization are two important problems in the field of image style transfer. Currently, there are few methods that can perform both multimodal and multi-domain stylization simultaneously. In this paper, we propose a unified framework for multimodal and multi-domain style transfer with the support of both exemplar-based reference and randomly sampled guidance. The key component of our method is a novel style distribution alignment module that eliminates the explicit distribution gaps between various style domains and reduces the risk of mode collapse. The multimodal diversity is ensured by either guidance from multiple images or random style code, while the multi-domain controllability is directly achieved by using a domain label. We validate our proposed framework on painting style transfer with a variety of different artistic styles and genres. Qualitative and quantitative comparisons with state-of-the-art methods demonstrate that our method can generate high-quality results of multi-domain styles and multimodal instances with reference style guidance or random sampled style.

Submitted 2 June, 2020; originally announced June 2020.

arXiv:2005.13219 [pdf, other]

Arbitrary Style Transfer via Multi-Adaptation Network

Authors: Yingying Deng, Fan Tang, Weiming Dong, Wen Sun, Feiyue Huang, Changsheng Xu

Abstract: Arbitrary style transfer is a significant topic with research value and application prospects. A desired style transfer, given a content image and referenced style painting, would render the content image with the color tone and vivid stroke patterns of the style painting while synchronously maintaining the detailed content structure information. Style transfer approaches would initially learn content and style representations of the content and style references and then generate the stylized images guided by these representations. In this paper, we propose the multi-adaptation network which involves two self-adaptation (SA) modules and one co-adaptation (CA) module: the SA modules adaptively disentangle the content and style representations, i.e., the content SA module uses position-wise self-attention to enhance content representation and the style SA module uses channel-wise self-attention to enhance style representation; the CA module rearranges the distribution of style representation based on content representation distribution by calculating the local similarity between the disentangled content and style features in a non-local fashion. Moreover, a new disentanglement loss function enables our network to extract main style patterns and exact content structures to adapt to various input images, respectively. Various qualitative and quantitative experiments demonstrate that the proposed multi-adaptation network leads to better results than the state-of-the-art style transfer methods.

Submitted 16 August, 2020; v1 submitted 27 May, 2020; originally announced May 2020.

arXiv:2005.11071 [pdf, ps, other]

doi 10.1038/s41550-021-01302-6

HXMT Identification of a non-thermal X-ray burst from SGR J1935+2154 and with FRB 200428

Authors: C. K. Li, L. Lin, S. L. Xiong, M. Y. Ge, X. B. Li, T. P. Li, F. J. Lu, S. N. Zhang, Y. L. Tuo, Y. Nang, B. Zhang, S. Xiao, Y. Chen, L. M. Song, Y. P. Xu, C. Z. Liu, S. M. Jia, X. L. Cao, J. L. Qu, S. Zhang, Y. D. Gu, J. Y. Liao, X. F. Zhao, Y. Tan, J. Y. Nie, et al. (96 additional authors not shown)

Abstract: Fast radio bursts (FRBs) are short pulses observed in the radio band from cosmological distances. One class of models invokes soft gamma-ray repeaters (SGRs), or magnetars, as the sources of FRBs. Some radio pulses have been observed from some magnetars; however, no FRB-like events had been detected in association with any magnetar burst, including one giant flare. Recently, a pair of FRB-like bursts (FRB 200428 hereafter) separated by milliseconds (ms) were detected from the general direction of the Galactic magnetar SGR J1935+2154. Here we report the detection of a non-thermal X-ray burst in the 1-250 keV energy band with the Insight-HXMT satellite, which we identify as emitted from SGR J1935+2154. The burst showed two hard peaks with a separation of 34 ms, broadly consistent with that of the two bursts in FRB 200428. The delay time between the double radio and X-ray peaks is about 8.57 s, fully consistent with the dispersion delay of FRB 200428. We thus identify the non-thermal X-ray burst as associated with FRB 200428, whose high-energy counterpart is the two hard peaks in X-ray. Our results suggest that the non-thermal X-ray burst and FRB 200428 share the same physical origin in an explosive event from SGR J1935+2154.

Submitted 6 April, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

Comments: 24 pages, 9 figures, 6 tables; initial submission to a journal on May 9th, 2020. Significant changes include updated localization and detailed spectral evolution of the X-ray burst, and better determination of the two narrow X-ray peaks corresponding to the two radio pulses. Conclusions are strengthened. Nature Astronomy online on Feb. 18, 2021

Journal ref: https://www.nature.com/articles/s41550-021-01302-6, Nature Astronomy online on Feb. 18, 2021

arXiv:2005.09973 [pdf, other]

Dynamic Refinement Network for Oriented and Densely Packed Object Detection

Authors: Xingjia Pan, Yuqiang Ren, Kekai Sheng, Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu

Abstract: Object detection has achieved remarkable progress in the past decade. However, the detection of oriented and densely packed objects remains challenging for the following inherent reasons: (1) receptive fields of neurons are all axis-aligned and of the same shape, whereas objects are usually of diverse shapes and align along various directions; (2) detection models are typically trained with generic knowledge and may not generalize well to handle specific objects at test time; (3) the limited dataset hinders the development of this task. To resolve the first two issues, we present a dynamic refinement network that consists of two novel components, i.e., a feature selection module (FSM) and a dynamic refinement head (DRH). Our FSM enables neurons to adjust receptive fields in accordance with the shapes and orientations of target objects, whereas the DRH empowers our model to refine the prediction dynamically in an object-aware manner. To address the limited availability of related benchmarks, we collect an extensive and fully annotated dataset, namely, SKU110K-R, which is relabeled with oriented bounding boxes based on SKU110K. We perform quantitative evaluations on several publicly available benchmarks including DOTA, HRSC2016, SKU110K, and our own SKU110K-R dataset. Experimental results show that our method achieves consistent and substantial gains compared with baseline approaches. The code and dataset are available at https://github.com/Anymake/DRN_CVPR2020.

Submitted 10 June, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

Comments: Accepted by CVPR 2020 as Oral

arXiv:2004.13307 [pdf, other]

Insight-HXMT insight into switch of the accretion mode: the case of the X-ray pulsar 4U 1901+03

Authors: Y. L. Tuo, L. Ji, S. S. Tsygankov, T. Mihara, L. M. Song, M. Y. Ge, A. Nabizadeh, L. Tao, J. L. Qu, Y. Zhang, S. Zhang, S. N. Zhang, Q. C. Bu, L. Chen, Y. P. Xu, X. L. Cao, Y. Chen, C. Z. Liu, C. Cai, Z. Chang, G. Chen, T. X. Chen, Y. B. Chen, Y. P. Chen, W. Cui, et al. (98 additional authors not shown)

Abstract: We use the Insight-HXMT data collected during the 2019 outburst from X-ray pulsar 4U 1901+03 to complement the orbital parameters reported by Fermi/GBM. Using Insight-HXMT, we examine the correlation between the derivative of the intrinsic spin frequency and bolometric flux based on accretion torque models. It was found that the pulse profiles significantly evolve during the outburst. The existence of two types of profile pattern discovered in the Insight-HXMT data indicates that this source experienced a transition between a super-critical and a sub-critical accretion regime during its 2019 outburst. Based on the evolution of the pulse profiles and the torque model, we derive the distance to 4U 1901+03 as $12.4\pm0.2$ kpc.

Submitted 28 April, 2020; originally announced April 2020.

Comments: 8 pages, 5 figures, accepted by JHEAP

arXiv:2004.12946 [pdf, ps, other]

doi 10.3847/1538-4357/ab8db4

The evolution of the broadband temporal features observed in the black-hole transient MAXI J1820+070 with Insight-HXMT

Authors: Yanan Wang, Long Ji, S. N. Zhang, Mariano Méndez, J. L. Qu, Pierre Maggi, M. Y. Ge, Erlin Qiao, L. Tao, S. Zhang, Diego Altamirano, L. Zhang, X. Ma, F. J. Lu, T. P. Li, Y. Huang, S. J. Zheng, Y. P. Chen, Z. Chang, Y. L. Tuo, C. Gungor, L. M. Song, Y. P. Xu, X. L. Cao, Y. Chen, et al. (96 additional authors not shown)

Abstract: We study the evolution of the temporal properties of MAXI J1820+070 during the 2018 outburst in its hard state from MJD 58190 to 58289 with Insight-HXMT in a broad energy band 1-150 keV. We find different behaviors of the hardness ratio, the fractional rms and time lag before and after MJD 58257, suggesting a transition occurred at around this point. The observed time lags between the soft photons in the 1-5 keV band and the hard photons in higher energy bands, up to 150 keV, are frequency-dependent: the time lags in the low-frequency range, 2-10 mHz, are both soft and hard lags with a timescale of dozens of seconds but without a clear trend along the outburst; the time lags in the high-frequency range, 1-10 Hz, are only hard lags with a timescale of tens of milliseconds, and they first increase until around MJD 58257 and decrease after this date. The high-frequency time lags are significantly correlated with the photon index derived from the fit to the quasi-simultaneous NICER spectrum in the 1-10 keV band. This result is qualitatively consistent with a model in which the high-frequency time lags are produced by Comptonization in a jet.

Submitted 27 April, 2020; originally announced April 2020.

Comments: Accepted for publication in ApJ

arXiv:2004.11540 [pdf, other]

Deep Global Registration

Authors: Christopher Choy, Wei Dong, Vladlen Koltun

Abstract: We present Deep Global Registration, a differentiable framework for pairwise registration of real-world 3D scans. Deep global registration is based on three modules: a 6-dimensional convolutional network for correspondence confidence prediction, a differentiable Weighted Procrustes algorithm for closed-form pose estimation, and a robust gradient-based SE(3) optimizer for pose refinement. Experiments demonstrate that our approach outperforms state-of-the-art methods, both learning-based and classical, on real-world data.

Submitted 8 May, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

Comments: Accepted for CVPR'20 oral presentation
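
Note: the closed-form pose estimation step named in the abstract above can be illustrated with the standard weighted Procrustes (weighted Kabsch) solution. The sketch below is plain NumPy under stated assumptions: it is not the authors' differentiable implementation, and the function name, the synthetic point sets, and the random weights standing in for predicted correspondence confidences are all hypothetical.

```python
import numpy as np

def weighted_procrustes(src, dst, weights):
    """Closed-form rigid alignment: find R, t minimizing
    sum_i w_i * ||R @ src[i] + t - dst[i]||^2.

    src, dst: (N, 3) arrays of corresponding points; weights: (N,), non-negative.
    """
    w = weights / weights.sum()
    src_mean = w @ src                      # weighted centroids
    dst_mean = w @ dst
    src_c = src - src_mean
    dst_c = dst - dst_mean
    cov = (src_c * w[:, None]).T @ dst_c    # 3x3 weighted cross-covariance
    u, _, vt = np.linalg.svd(cov)
    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    d = np.sign(np.linalg.det(vt.T @ u.T))
    rot = vt.T @ np.diag([1.0, 1.0, d]) @ u.T
    trans = dst_mean - rot @ src_mean
    return rot, trans

# Synthetic check: recover a known rotation about z and a known translation.
rng = np.random.default_rng(0)
src = rng.normal(size=(50, 3))
angle = 0.3
true_rot = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                     [np.sin(angle),  np.cos(angle), 0.0],
                     [0.0, 0.0, 1.0]])
true_trans = np.array([0.5, -1.0, 2.0])
dst = src @ true_rot.T + true_trans
weights = rng.uniform(0.1, 1.0, size=50)    # stand-in for predicted confidences
rot, trans = weighted_procrustes(src, dst, weights)
```

With noise-free correspondences the recovered `rot` and `trans` match the ground truth to numerical precision. With outliers, low weights reduce their influence on the estimate.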

arXiv:2004.05508 [pdf, other]

MetaIQA: Deep Meta-learning for No-Reference Image Quality Assessment

Authors: Hancheng Zhu, Leida Li, Jinjian Wu, Weisheng Dong, Guangming Shi

Abstract: Recently, increasing interest has been drawn to exploiting deep convolutional neural networks (DCNNs) for no-reference image quality assessment (NR-IQA). Despite the notable success achieved, there is a broad consensus that training DCNNs relies heavily on massive annotated data. Unfortunately, IQA is a typical small-sample problem. Therefore, most of the existing DCNN-based IQA metrics operate based on pre-trained networks. However, these pre-trained networks are not designed for the IQA task, leading to generalization problems when evaluating different types of distortions. With this motivation, this paper presents a no-reference IQA metric based on deep meta-learning. The underlying idea is to learn the meta-knowledge shared by humans when evaluating the quality of images with various distortions, which can then be adapted to unknown distortions easily. Specifically, we first collect a number of NR-IQA tasks for different distortions. Then meta-learning is adopted to learn the prior knowledge shared by diversified distortions. Finally, the quality prior model is fine-tuned on a target NR-IQA task to quickly obtain the quality model. Extensive experiments demonstrate that the proposed metric outperforms the state of the art by a large margin. Furthermore, the meta-model learned from synthetic distortions can also be easily generalized to authentic distortions, which is highly desired in real-world applications of IQA metrics.

Submitted 11 April, 2020; originally announced April 2020.

arXiv:2004.00791 [pdf, ps, other]

doi 10.3847/1538-4357/ab8db6

Discovery of delayed spin-up behavior following two large glitches in the Crab pulsar, and the statistics of such processes

Authors: M. Y. Ge, S. N. Zhang, F. J. Lu, T. P. Li, J. P. Yuan, X. P. Zheng, Y. Huang, S. J. Zheng, Y. P. Chen, Z. Chang, Y. L. Tuo, Q. Cheng, C. Güngör, L. M. Song, Y. P. Xu, X. L. Cao, Y. Chen, C. Z. Liu, S. Zhang, J. L. Qu, Q. C. Bu, C. Cai, G. Chen, L. Chen, M. Z. Chen, et al. (111 additional authors not shown)

Abstract: Glitches correspond to sudden jumps of rotation frequency ($ν$) and its derivative ($\dot{ν}$) of pulsars, the origin of which is not yet well understood, partly because the jump processes of most glitches are not well time-resolved. There are three large glitches of the Crab pulsar, detected in 1989, 1996 and 2017, which were found to have delayed spin-up processes before the normal recovery processes. Here we report two additional glitches of the Crab pulsar that occurred in 2004 and 2011 for which we discovered delayed spin-up processes, and present refined parameters of the largest glitch, which occurred in 2017. The initial rising time of the glitch is determined as $<0.48$ hour. We also carried out a statistical study of these five glitches with observed spin-up processes. The two glitches that occurred in 2004 and 2011 have delayed spin-up time scales ($τ_{1}$) of $1.7\pm0.8$\,days and $1.6\pm0.4$\,days, respectively. We find that the $Δν$ vs. $|Δ\dot{ν}|$ relation of these five glitches is similar to those with no detected delayed spin-up process, indicating that they are similar to the others in nature except that they have larger amplitudes. For these five glitches, the amplitudes of the delayed spin-up process ($|Δν_{\rm d1}|$) and recovery process ($Δν_{\rm d2}$), their time scales ($τ_{1}$, $τ_{2}$), and permanent changes in spin frequency ($Δν_{\rm p}$) and total frequency step ($Δν_{\rm g}$) have positive correlations. From these correlations, we suggest that the delayed spin-up processes are common for all glitches, but are too short and thus difficult to detect for most glitches.

Submitted 1 April, 2020; originally announced April 2020.

Comments: 25 pages, 8 figures

arXiv:2003.10889 [pdf, ps, other]

doi 10.1051/0004-6361/202037797

A search for prompt gamma-ray counterparts to fast radio bursts in the Insight-HXMT data

Authors: C. Guidorzi, M. Marongiu, R. Martone, L. Nicastro, S. L. Xiong, J. Y. Liao, G. Li, S. N. Zhang, L. Amati, F. Frontera, M. Orlandini, P. Rosati, E. Virgilli, S. Zhang, Q. C. Bu, C. Cai, X. L. Cao, Z. Chang, G. Chen, L. Chen, T. X. Chen, Y. B. Chen, Y. P. Chen, W. Cui, W. W. Cui, et al. (98 additional authors not shown)

Abstract: No robust detection of prompt electromagnetic counterparts to fast radio bursts (FRBs) has yet been obtained, in spite of several multi-wavelength searches carried out so far. Specifically, X/gamma-ray counterparts are predicted by some models. We planned on searching for prompt gamma-ray counterparts in the Insight-Hard X-ray Modulation Telescope (Insight-HXMT) data, taking advantage of the unique combination of large effective area in the keV-MeV energy range and of sub-ms time resolution. We selected 39 FRBs that were promptly visible from the High-Energy (HE) instrument aboard Insight-HXMT. After calculating the expected arrival times at the location of the spacecraft, we searched for a significant excess in both individual and cumulative time profiles over a wide range of time resolutions, from several seconds down to sub-ms scales. Using the dispersion measures in excess of the Galactic terms, we estimated the upper limits on the redshifts. No convincing signal was found and for each FRB we constrained the gamma-ray isotropic-equivalent luminosity and the released energy as a function of emission timescale. For the nearest FRB source, the periodic repeater FRB180916.J0158+65, we find $L_{γ,iso}<5.5\times 10^{47}$ erg/s over 1 s, whereas $L_{γ,iso}<10^{49}-10^{51}$ erg/s for the bulk of FRBs. The same values scale up by a factor of ~100 for a ms-long emission. Even on a timescale comparable with that of the radio pulse itself no keV-MeV emission is observed. A systematic association with either long or short GRBs is ruled out with high confidence, except for subluminous events, as is the case for core-collapse of massive stars (long) or binary neutron star mergers (short) viewed off axis. Only giant flares from extra-galactic magnetars at least ten times more energetic than Galactic siblings are ruled out for the nearest FRB.

Submitted 24 March, 2020; originally announced March 2020.

Comments: 15 pages, 3 figures, 6 tables, accepted by A&A

Journal ref: A&A 637, A69 (2020)

arXiv:2002.11261 [pdf, other]

Multi-Attribute Guided Painting Generation

Authors: Minxuan Lin, Yingying Deng, Fan Tang, Weiming Dong, Changsheng Xu

Abstract: Controllable painting generation plays a pivotal role in image stylization. Currently, control over style transfer is limited to exemplar-based reference or random one-hot vector guidance. Few works focus on decoupling the intrinsic properties of a painting, e.g., artist, genre and period, as control conditions. Under this circumstance, we propose a novel framework that adopts multiple attributes from the painting to control the stylized results. An asymmetrical cycle structure is used to preserve fidelity, together with style-preserving and attribute regression losses that keep the unique distinction of colors and textures between domains. Several qualitative and quantitative results demonstrate the effect of combining multiple attributes and show satisfactory performance.

Submitted 25 February, 2020; originally announced February 2020.

arXiv:2002.08919 [pdf, ps, other]

doi 10.1093/mnras/staa569

Switches between accretion structures during flares in 4U 1901+03

Authors: L. Ji, L. Ducci, A. Santangelo, S. Zhang, V. Suleimanov, S. Tsygankov, V. Doroshenko, A. Nabizadeh, S. N. Zhang, M. Y. Ge, L. Tao, Q. C. Bu, J. L. Qu, F. J. Lu, L. Chen, L. M. Song, T. P. Li, Y. P. Xu, X. L. Cao, Y. Chen, C. Z. Liu, C. Cai, Z. Chang, G. Chen, T. X. Chen, et al. (98 additional authors not shown)

Abstract: We report on our analysis of the 2019 outburst of the X-ray accreting pulsar 4U 1901+03 observed with Insight-HXMT and NICER. Both spectra and pulse profiles evolve significantly in the decaying phase of the outburst. Dozens of flares are observed throughout the outburst. They are more frequent and brighter at the outburst peak. We find that the flares, which have a duration from tens to hundreds of seconds, are generally brighter than the persistent emission by a factor of $\sim$1.5. The pulse profile shape during the flares can be significantly different from that of the persistent emission. In particular, a phase shift is clearly observed in many cases. We interpret these findings as direct evidence of changes of the pulsed beam pattern, due to transitions between the sub- and super-critical accretion regimes on a short time scale. We also observe that at comparable luminosities the flares' pulse profiles are rather similar to those of the persistent emission. This indicates that the accretion on the polar cap of the neutron star is mainly determined by the luminosity, i.e., the mass accretion rate.

Submitted 20 February, 2020; originally announced February 2020.

Comments: 11 pages, 8 figures, accepted for publication in MNRAS

arXiv:2002.01599 [pdf, other]

Unsupervised Community Detection with a Potts Model Hamiltonian, an Efficient Algorithmic Solution, and Applications in Digital Pathology

Authors: Brendon Lutnick, Wen Dong, Zohar Nussinov, Pinaki Sarder

Abstract: Unsupervised segmentation of large images using a Potts model Hamiltonian is unique in that segmentation is governed by a resolution parameter which scales the sensitivity to small clusters. Here, the input image is first modeled as a graph, which is then segmented by minimizing a Hamiltonian cost function defined on the graph and the respective segments. However, there exists no closed-form solution of this optimization, and using previous iterative algorithmic solution techniques, the problem scales quadratically in the input length. Therefore, while Potts model segmentation gives accurate segmentation, it is grossly underutilized as an unsupervised learning technique. We propose a fast statistical down-sampling of input image pixels based on the respective color features, and a new iterative method to minimize the Potts model energy considering the pixel-to-segment relationship. This method is generalizable and can be extended for image pixel texture features as well as spatial features. We demonstrate that this new method is highly efficient, and outperforms existing methods for Potts model based image segmentation. We demonstrate the application of our method in medical microscopy image segmentation; particularly, in segmenting renal glomerular micro-environment in renal pathology. Our method is not limited to image segmentation, and can be extended to any image/data segmentation/clustering task for arbitrary datasets with discrete features.

Submitted 4 February, 2020; originally announced February 2020.

Comments: 46 pages, 19 Figures
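
Note: the role of the resolution parameter described in the abstract above can be seen on a toy graph. The abstract does not state the exact Hamiltonian, so the sketch below assumes one form commonly used for Potts-model community detection on unweighted graphs: connected pairs in the same community lower the energy by 1 and unconnected pairs in the same community raise it by `gamma`. The function name and the six-node graph are hypothetical.

```python
import numpy as np

def potts_energy(adjacency, labels, gamma=1.0):
    """Energy H = -sum_{i<j} [A_ij - gamma * (1 - A_ij)] * delta(label_i, label_j).

    Assumed form for illustration; gamma is the resolution parameter.
    """
    adjacency = np.asarray(adjacency, dtype=float)
    labels = np.asarray(labels)
    same = labels[:, None] == labels[None, :]                  # delta(label_i, label_j)
    upper = np.triu(np.ones_like(adjacency, dtype=bool), k=1)  # count each pair once
    pair_score = adjacency - gamma * (1.0 - adjacency)
    return -np.sum(pair_score[same & upper])

# Two triangles (nodes 0-2 and 3-5) joined by the single edge 2-3.
adj = np.zeros((6, 6))
for i, j in [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]:
    adj[i, j] = adj[j, i] = 1.0

split = [0, 0, 0, 1, 1, 1]     # one community per triangle
merged = [0, 0, 0, 0, 0, 0]    # everything in one community

e_split = potts_energy(adj, split, gamma=1.0)
e_merged = potts_energy(adj, merged, gamma=1.0)
e_merged_coarse = potts_energy(adj, merged, gamma=0.0)
```

At `gamma = 1` the two-triangle partition has the lower energy, while at `gamma = 0` merging everything is favored, which is the sense in which the parameter sets the sensitivity to small clusters.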

arXiv:2002.01480 [pdf, other]

doi 10.1088/1367-2630/ab9bc0

Precise high-fidelity electron-nuclear spin entangling gates in NV centers via hybrid dynamical decoupling sequences

Authors: Wenzheng Dong, F. A. Calderon-Vargas, Sophia E. Economou

Abstract: Color centers in solids, such as the nitrogen-vacancy center in diamond, offer well-protected and well-controlled localized electron spins that can be employed in various quantum technologies. Moreover, the long coherence time of the surrounding spinful nuclei can enable a robust quantum register controlled through the color center. We design pulse sequence protocols that drive the electron spin to generate robust entangling gates with these nuclear memory qubits. We find that, compared to using Carr-Purcell-Meiboom-Gill (CPMG) alone, the Uhrig decoupling sequence and hybrid protocols composed of CPMG and Uhrig sequences improve these entangling gates in terms of fidelity, spin control range, and spin selectivity. We provide analytical expressions for the sequence protocols and also show numerically the efficacy of our method on nitrogen-vacancy centers in diamond. Our results are broadly applicable to color centers weakly coupled to a small number of nuclear spin qubits.

Submitted 8 August, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

Comments: 22 pages, 15 figures

Journal ref: New J. Phys. 22 073059 (2020)
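
Note: the two decoupling sequences compared in the abstract above differ only in when the π pulses are applied. The sketch below computes the textbook pulse times for an n-pulse CPMG sequence (equally spaced) and an n-pulse Uhrig sequence (sine-squared spacing) over a total duration T. The function names are hypothetical, and the paper's hybrid protocols are not reproduced because the abstract does not specify them.

```python
import numpy as np

def cpmg_times(n_pulses, total_time):
    """Equally spaced pi-pulse times: t_j = T * (2j - 1) / (2n), j = 1..n."""
    j = np.arange(1, n_pulses + 1)
    return total_time * (2 * j - 1) / (2 * n_pulses)

def uhrig_times(n_pulses, total_time):
    """Uhrig pi-pulse times: t_j = T * sin^2(pi * j / (2n + 2)), j = 1..n."""
    j = np.arange(1, n_pulses + 1)
    return total_time * np.sin(np.pi * j / (2 * n_pulses + 2)) ** 2

cpmg = cpmg_times(4, 1.0)     # pulses at 1/8, 3/8, 5/8, 7/8 of the duration
uhrig = uhrig_times(4, 1.0)   # pulses bunched toward the start and end
```

For one or two pulses the two sequences coincide; they differ from three pulses onward, where the Uhrig pulses are no longer equally spaced.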

arXiv:2001.11584 [pdf, other]

doi 10.1109/TIP.2021.3050673

Ellipse R-CNN: Learning to Infer Elliptical Object from Clustering and Occlusion

Authors: Wenbo Dong, Pravakar Roy, Cheng Peng, Volkan Isler

Abstract: Images of heavily occluded objects in cluttered scenes, such as fruit clusters in trees, are hard to segment. To further retrieve the 3D size and 6D pose of each individual object in such cases, bounding boxes are not reliable from multiple views since only a small portion of the object's geometry is captured. We introduce the first CNN-based ellipse detector, called Ellipse R-CNN, to represent and infer occluded objects as ellipses. We first propose a robust and compact ellipse regression based on the Mask R-CNN architecture for elliptical object detection. Our method can infer the parameters of multiple elliptical objects even when they are occluded by other neighboring objects. For better occlusion handling, we exploit refined feature regions for the regression stage, and integrate the U-Net structure for learning different occlusion patterns to compute the final detection score. The correctness of ellipse regression is validated through experiments performed on synthetic data of clustered ellipses. We further quantitatively and qualitatively demonstrate that our approach outperforms the state-of-the-art model (i.e., Mask R-CNN followed by ellipse fitting) and its three variants on both synthetic and real datasets of occluded and clustered elliptical objects.

Submitted 14 November, 2020; v1 submitted 30 January, 2020; originally announced January 2020.

Comments: 18 pages, 20 figures, 7 tables

arXiv:2001.06637 [pdf, ps, other]

Joint Analysis of Energy and RMS Spectra from MAXI J1535-571 with Insight-HXMT

Authors: L. D. Kong, S. Zhang, Y. P. Chen, L. Ji, S. N. Zhang, Y. R. Yang, L. Tao, X. Ma, J. L. Qu, F. J. Lu, Q. C. Bu, L. Chen, L. M. Song, T. P. Li, Y. P. Xu, X. L. Cao, Y. Chen, C. Z. Liu, C. Cai, Z. Chang, G. Chen, T. X. Chen, Y. B. Chen, W. Cui, W. W. Cui, et al. (94 additional authors not shown)

Abstract: A new black hole X-ray binary (BHXRB) MAXI J1535-571 was discovered by MAXI during its outburst in 2017. Using observations taken by the first Chinese X-ray satellite, the Hard X-ray Modulation Telescope (dubbed Insight-HXMT), we perform a joint spectral analysis (2-150 keV) in both energy and time domains. The energy spectra provide the essential input for probing the intrinsic Quasi-Periodic Oscillation (QPO) fractional rms spectra (FRS). Our results show that during the intermediate state, the energy spectra are in general consistent with those reported by Swift/XRT and NuSTAR. However, the QPO FRS become harder and the FRS residuals may suggest the presence of either an additional power-law component in the energy spectrum or a turn-over in the intrinsic QPO FRS at high energies.

Submitted 18 January, 2020; originally announced January 2020.

arXiv:2001.02039 [pdf]

doi 10.1002/adma.202002014

Broad-Spectral-Range Sustainability and Controllable Excitation of Hyperbolic Phonon Polaritons in $α$-MoO3

Authors: Weikang Dong, Ruishi Qi, Tiansheng Liu, Yi Li, Ning Li, Ze Hua, Zirui Gao, Shuyuan Zhang, Kaihui Liu, Jiandong Guo, Peng Gao

Abstract: Hyperbolic phonon polaritons (HPhPs) in orthorhombic-phase molybdenum trioxide ($α$-MoO3) show in-plane hyperbolicity, great wavelength compression and ultra-long lifetime, therefore holding great potential in nanophotonic applications. However, its polaritonic response in the far-infrared (FIR) range has long remained unexplored due to challenges in experimental characterization. Here, using monochromated electron energy loss spectroscopy (EELS) in a scanning transmission electron microscope (STEM), we probe HPhPs in $α$-MoO3 in both mid-infrared (MIR) and FIR frequencies and correlate their behaviors with microstructures and orientations. We find that low structural symmetry leads to various phonon modes and multiple Reststrahlen bands (RBs) over a broad spectral range (over 70 meV) and in different directions (55-63 meV and 119-125 meV along the b axis, 68-106 meV along the c axis, 101-121 meV along the a axis). These HPhPs can be selectively excited by controlling the direction of swift electrons. These findings provide new opportunities in nanophotonic and optoelectronic applications such as directed light propagation, hyperlenses and heat transfer.

Submitted 24 November, 2020; v1 submitted 14 November, 2019; originally announced January 2020.

Journal ref: Advanced Materials, 2020, 32, 2002014

arXiv:1912.08542 [pdf, other]

Diagnostic of the spectral properties of Aquila X-1 by Insight-HXMT snapshots during the early propeller phase

Authors: C. Güngör, M. Y. Ge, S. Zhang, A. Santangelo, S. N. Zhang, F. J. Lu, Y. Zhang, Y. P. Chen, L. Tao, Y. J. Yang, Q. C. Bu, C. Cai, X. L. Cao, Z. Chang, G. Chen, L. Chen, T. X. Chen, Y. Chen, Y. B. Chen, W. Cui, W. W. Cui, J. K. Deng, Y. W. Dong, Y. Y. Du, M. X. Fu , et al. (88 additional authors not shown)

Abstract: We study the 2018 outburst of Aql X-1 via the Monitor of All-sky X-ray Image (MAXI) data. We show that the outburst starting in February 2018 is a member of the short-low class in terms of outburst duration and peak count rate, although the outburst morphology is slightly different from the other fast-rise-exponential-decay (FRED) type outbursts, with a milder rising stage. We study the partial accretion in the weak propeller stage of Aql X-1 via the MAXI data of the 2018 outburst. We report on the spectral analysis of 3 observations of Aquila X-1 obtained by the Insight Hard X-ray Modulation Telescope (Insight-HXMT) during the late decay stage of the 2018 outburst. We discuss that the data taken by Insight-HXMT were obtained just after the transition to the weak propeller stage. Our analysis shows the necessity of a Comptonization component to account for an electron cloud that partly up-scatters the photons.

Submitted 18 December, 2019; originally announced December 2019.

Comments: 8 pages, 4 figures, accepted for publication in JHEAp

arXiv:1911.11419 [pdf, other]

Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning

Authors: Kekai Sheng, Weiming Dong, Menglei Chai, Guohui Wang, Peng Zhou, Feiyue Huang, Bao-Gang Hu, Rongrong Ji, Chongyang Ma

Abstract: Visual aesthetic assessment has been an active research field for decades. Although latest methods have achieved promising performance on benchmark datasets, they typically rely on a large number of manual annotations including both aesthetic labels and related image attributes. In this paper, we revisit the problem of image aesthetic assessment from the self-supervised feature learning perspective. Our motivation is that a suitable feature representation for image aesthetic assessment should be able to distinguish different expert-designed image manipulations, which have close relationships with negative aesthetic effects. To this end, we design two novel pretext tasks to identify the types and parameters of editing operations applied to synthetic instances. The features from our pretext tasks are then adapted for a one-layer linear classifier to evaluate the performance in terms of binary aesthetic classification. We conduct extensive quantitative experiments on three benchmark datasets and demonstrate that our approach can faithfully extract aesthetics-aware features and outperform alternative pretext schemes. Moreover, we achieve comparable results to state-of-the-art supervised methods that use 10 million labels from ImageNet.

Submitted 26 November, 2019; originally announced November 2019.

Comments: AAAI Conference on Artificial Intelligence, 2020, accepted

Journal ref: Proceedings of AAAI Conference on Artificial Intelligence 2020

arXiv:1910.08382 [pdf, ps, other]

$Insight$-HXMT study of the timing properties of Sco X-1

Authors: S. M. Jia, Q. C. Bu, J. L. Qu, F. J. Lu, S. N. Zhang, Y. Huang, X. Ma, L. Tao, G. C. Xiao, W. Zhang, L. Chen, L. M. Song, S. Zhang, T. B. Li, Y. P. Xu, X. L. Cao, Y. Chen, C. Z. Liu, C. Cai, Z. Chang, G. Chen, T. X. Chen, Y. B. Chen, Y. P. Chen, W. Cui , et al. (85 additional authors not shown)

Abstract: We present a detailed timing study of the brightest persistent X-ray source Sco X-1 using the data collected by the Hard X-ray Modulation Telescope ($Insight$-HXMT) from July 2017 to August 2018. A complete $Z$-track hardness-intensity diagram (HID) is obtained. The normal branch oscillations (NBOs) at $\sim$ 6 Hz in the lower part of the normal branch (NB) and the flare branch oscillations (FBOs) at $\sim$ 16 Hz in the beginning part of the flaring branch (FB) are found in observations with the Low Energy X-ray Telescope (LE) and the Medium Energy X-ray Telescope (ME) of $Insight$-HXMT, while the horizontal branch oscillations (HBOs) at $\sim$ 40 Hz and the kilohertz quasi-periodic oscillations (kHz QPOs) at $\sim$ 800 Hz are found simultaneously up to 60 keV for the first time on the horizontal branch (HB) by the High Energy X-ray Telescope (HE) and ME. We find that for all types of the observed QPOs, the centroid frequencies are independent of energy, while the root mean square (rms) increases with energy; the centroid frequencies of both the HBOs and kHz QPOs increase along the $Z$-track from the top to the bottom of the HB; and the NBOs show soft phase lags increasing with energy. A continuous QPO transition from the FB to NB in $\sim$ 200 s is also detected. Our results indicate that the non-thermal emission is the origin of all types of QPOs, the innermost region of the accretion disk is non-thermal in nature, and the corona is nonhomogeneous geometrically.

Submitted 18 October, 2019; originally announced October 2019.

arXiv:1910.08220 [pdf, ps, other]

Insight-HXMT observation on 4U 1608-52: evolving spectral properties of a bright type-I X-ray burst

Authors: Y. P. Chen, S. Zhang, S. N. Zhang, L. Ji, L. D. Kong, A. Santangelo, J. L. Qu, F. J. Lu, T. P. Li, L. M. Song, Y. P. Xu, X. L. Cao, Y. Chen, C. Z. Liu, Q. C. Bu, C. Cai, Z. Chang, G. Chen, L. Chen, T. X. Chen, Y. B. Chen, W. Cui, W. W. Cui, J. K. Deng, Y. W. Dong , et al. (87 additional authors not shown)

Abstract: Evidence for the influence of thermonuclear (type-I) X-ray bursts upon the surrounding environments in neutron star low-mass X-ray binaries (LMXB) was detected previously via spectral and timing analyses. Benefitting from a broad energy coverage of Insight-HXMT, we analyze one photospheric radius expansion (PRE) burst, and find an emission excess at soft X-rays. Our spectral analysis shows that such an excess is not likely relevant to the disk reflection induced by the burst emission and can be attributed to an enhanced pre-burst/persistent emission. We find that the burst and enhanced persistent emissions sum up to exceed Eddington luminosity by $\sim$ 40 percent. We speculate that the enhanced emission is from a region beyond the PRE radius, or through the Comptonization of the corona.

Submitted 17 October, 2019; originally announced October 2019.

Comments: accepted by JHEA (Journal of High Energy Astrophysics)

arXiv:1910.06320 [pdf, ps, other]

doi 10.3847/2041-8213/aadc0e

Insight-HXMT observations of 4U 1636-536: Corona cooling revealed with single short type-I X-ray burst

Authors: Y. P. Chen, S. Zhang, S. N. Zhang, L. Ji, L. D. Kong, X. L. Cao, Z. Chang, G. Chen, L. Chen, T. X. Chen, Y. Chen, Y. B. Chen, W. Cui, W. W. Cui, J. K. Deng, Y. W. Dong, Y. Y. Du, M. X. Fu, G. H. Gao, H. Gao, M. Gao, M. Y. Ge, Y. D. Gu, J. Guan, C. C. Guo , et al. (87 additional authors not shown)

Abstract: Corona cooling was detected previously from stacking a series of short type-I bursts that occurred during the low/hard state of atoll outburst. Type-I bursts are hence regarded as sharp probe to our better understanding on the basic property of the corona. The launch of the first Chinese X-ray satellite Insight-HXMT has large detection area at hard X-rays which provide almost unique chance to move further in this research field. We report the first detection of the corona cooling by Insight-HXMT from single short type-I burst showing up during flare of 4U 1636-536. This type-I X-ray burst has a duration of $\sim$13 seconds and hard X-ray shortage is detected with significance 6.2 $σ$ in 40-70 keV. A cross-correlation analysis between the lightcurves of soft and hard X-ray band shows that the corona shortage lags the burst emission by 1.6 $\pm$ 1.2 s. These results are consistent with those derived previously from stacking a large amount of bursts detected by RXTE/PCA within a series of flares of 4U 1636-536. Moreover, the broad bandwidth of Insight-HXMT allows as well for the first time to infer the burst influence upon the continuum spectrum via performing the spectral fitting of the burst, which ends up with the finding that hard X-ray shortage appears at around 40 keV in the continuum spectrum. These results suggest that the evolution of the corona along with the outburst/flare of NS XRB may be traced via looking into a series of embedded type-I bursts by using Insight-HXMT.

Submitted 15 October, 2019; v1 submitted 11 October, 2019; originally announced October 2019.

Comments: published in 2018, ApJL,864, L30

arXiv:1910.05758 [pdf, other]

Learning to Navigate from Simulation via Spatial and Semantic Information Synthesis with Noise Model Embedding

Authors: Gang Chen, Hongzhe Yu, Wei Dong, Xinjun Sheng, Xiangyang Zhu, Han Ding

Abstract: While training an end-to-end navigation network in the real world is usually of high cost, simulation provides a safe and cheap environment in this training stage. However, training neural network models in simulation brings up the problem of how to effectively transfer the model from simulation to the real world (sim-to-real). In this work, we regard the environment representation as a crucial element in this transfer process and propose a visual information pyramid (VIP) model to systematically investigate a practical environment representation. A novel representation composed of spatial and semantic information synthesis is then established accordingly, where noise model embedding is particularly considered. To explore the effectiveness of this representation, we compared the performance with representations popularly used in the literature in both simulated and real-world scenarios. Results suggest that our environment representation stands out. Furthermore, an analysis on the feature map is implemented to investigate the effectiveness through inner reaction, which could be irradiative for future researches on end-to-end navigation.

Submitted 11 November, 2019; v1 submitted 13 October, 2019; originally announced October 2019.

Comments: 10 pages, 11 figures

arXiv:1910.04955 [pdf]

The High Energy X-ray telescope (HE) onboard the Insight-HXMT astronomy satellite

Authors: C. Z. Liu, Y. F. Zhang, X. F. Li, X. F. Lu, Z. Chang, Z. W. Li, A. Z. Zhang, Y. J. Jin, H. M. Yu, Z. Zhang, M. X. Fu, Y. B. Chen, J. F. Ji, Y. P. Xu, J. K. Deng, R. C. Shang, G. Q. Liu, F. J. Lu, S. N. Zhang, Y. W. Dong, T. P. Li, M. Wu, Y. G. Li, H. Y. Wang, B. B. Wu , et al. (8 additional authors not shown)

Abstract: The Insight-Hard X-ray Modulation Telescope (Insight-HXMT) is a broad band X-ray and gamma-ray (1-3000 keV) astronomy satellite. The High Energy X-ray telescope (HE) is one of its three main telescopes. The main detector plane of HE is composed of 18 NaI(Tl)/CsI(Na) phoswich detectors, where NaI(Tl) serves as primary detector to measure ~ 20-250 keV photons incident from the field of view (FOV) defined by the collimators, and CsI(Na) is used as an active shield detector to NaI(Tl) by pulse shape discrimination. CsI(Na) is also used as an omnidirectional gamma-ray monitor. The HE collimators have a diverse FOV: 1.1° × 5.7° (15 units), 5.7° × 5.7° (2 units) and blocked (1 unit), thus the combined FOV of HE is about 5.7° × 5.7°. Each HE detector has a diameter of 190 mm, resulting in the total geometrical area of about 5100 cm$^2$. The energy resolution is ~15% at 60 keV. The timing accuracy is better than 10 μs and dead-time for each detector is less than 10 μs. HE is devoted to observe the spectra and temporal variability of X-ray sources in the 20-250 keV band either by pointing observations for known sources or scanning observations to unveil new sources, and to monitor the gamma-ray sky in 0.2-3 MeV. This paper presents the design and performance of the HE instruments. Results of the on-ground calibration experiments are also reported.

Submitted 10 October, 2019; originally announced October 2019.

Comments: Accepted by SCIENCE CHINA Physics, Mechanics & Astronomy

arXiv:1910.03955 [pdf, ps, other]

doi 10.1093/mnras/stz2745

Timing analysis of 2S 1417-624 observed with NICER and Insight-HXMT

Authors: L. Ji, V. Doroshenko, A. Santangelo, C. Gungor, S. Zhang, L. Ducci, S. -N. Zhang, M. -Y. Ge, L. J. Qu, Y. P. Chen, Q. C. Bu, X. L. Cao, Z. Chang, G. Chen, L. Chen, T. X. Chen, Y. Chen, Y. B. Chen, W. Cui, W. W. Cui, J. K. Deng, Y. W. Dong, Y. Y. Du, M. X. Fu, G. H. Gao , et al. (91 additional authors not shown)

Abstract: We present a study of timing properties of the accreting pulsar 2S 1417-624 observed during its 2018 outburst, based on Swift/BAT, Fermi/GBM, Insight-HXMT and NICER observations. We report a dramatic change of the pulse profiles with luminosity. The morphology of the profile in the range 0.2-10.0 keV switches from double to triple peaks at $\sim 2.5 \times 10^{37} D_{10}^2\ \mathrm{erg\ s^{-1}}$ and from triple to quadruple peaks at $\sim 7 \times 10^{37} D_{10}^2\ \mathrm{erg\ s^{-1}}$. The profile at high energies (25-100 keV) shows significant evolution as well. We explain this phenomenon according to existing theoretical models. We argue that the first change is related to the transition from the sub- to the super-critical accretion regime, while the second is related to the transition of the accretion disc from the gas-dominated to the radiation pressure-dominated state. Considering also the spin-up due to the accretion torque, this interpretation allows us to estimate the magnetic field self-consistently at $\sim 7\times 10^{12}$ G.

Submitted 9 October, 2019; originally announced October 2019.

Comments: 7 pages, 4 figures, 1 table, accepted for publication in MNRAS

arXiv:1910.02393 [pdf, ps, other]

Constant cyclotron line energy in Hercules X-1 -- Joint Insight-HXMT and NuSTAR observations

Authors: G. C. Xiao, L. Ji, R. Staubert, M. Y. Ge, S. Zhang, S. N. Zhang, A. Santangelo, L. Ducci, J. Y. Liao, C. C. Guo, X. B. Li, W. Zhang, J. L. Qu, F. J. Lu, T. P. Li, L. M. Song, Y. P. Xu, Q. C. Bu, C. Cai, X. L. Cao, Z. Chang, G. Chen, L. Chen, T. X. Chen, Y. B. Chen , et al. (91 additional authors not shown)

Abstract: The long-term evolution of the centroid energy of the CRSF in Her X-1 is still a mystery. We report a new measurement from a campaign between Insight-HXMT and NuSTAR performed in February 2018. Generally, the two satellites show well consistent results of timing and spectral properties. The joint spectral analysis confirms that the previously observed long decay phase has ended, and that the line energy instead keeps constant around 37.5 keV after flux correction.

Submitted 6 October, 2019; originally announced October 2019.

arXiv:1909.12614 [pdf, other]

doi 10.1093/mnras/stz2879

Hot disk of the Swift J0243.6+6124 revealed by Insight-HXMT

Authors: V. Doroshenko, S. N. Zhang, A. Santangelo, L. Ji, S. Tsygankov, A. Mushtukov, L. J. Qu, S. Zhang, M. Y. Ge, Y. P. Chen, Q. C. Bu, X. L. Cao, Z. Chang, G. Chen, L. Chen, T. X. Chen, Y. Chen, Y. B. Chen, W. Cui, W. W. Cui, J. K. Deng, Y. W. Dong, Y. Y. Du, M. X. Fu, G. H. Gao , et al. (92 additional authors not shown)

Abstract: We report on analysis of observations of the bright transient X-ray pulsar Swift J0243.6+6124 obtained during its 2017-2018 giant outburst with Insight-HXMT, NuSTAR, and Swift observatories. We focus on the discovery of a sharp state transition of the timing and spectral properties of the source at super-Eddington accretion rates, which we associate with the transition of the accretion disk to a radiation pressure dominated (RPD) state, the first ever directly observed for a magnetized neutron star. This transition occurs at slightly higher luminosity compared to the already reported transition of the source from sub- to super-critical accretion regime associated with the onset of an accretion column. We argue that this scenario can only be realized for a comparatively weakly magnetized neutron star, not dissimilar to other ultra-luminous X-ray pulsars (ULPs), which accrete at similar rates. Further evidence for this conclusion is provided by the non-detection of the transition to the propeller state in quiescence, which strongly implies a compact magnetosphere and thus rules out magnetar-like fields.

Submitted 27 September, 2019; originally announced September 2019.

Comments: Submitted to MNRAS

arXiv:1908.11527 [pdf, other]

Implicit Deep Latent Variable Models for Text Generation

Authors: Le Fang, Chunyuan Li, Jianfeng Gao, Wen Dong, Changyou Chen

Abstract: Deep latent variable models (LVM) such as variational auto-encoder (VAE) have recently played an important role in text generation. One key factor is the exploitation of smooth latent structures to guide the generation. However, the representation power of VAEs is limited due to two reasons: (1) the Gaussian assumption is often made on the variational posteriors; and meanwhile (2) a notorious "posterior collapse" issue occurs. In this paper, we advocate sample-based representations of variational distributions for natural language, leading to implicit latent features, which can provide flexible representation power compared with Gaussian-based posteriors. We further develop an LVM to directly match the aggregated posterior to the prior. It can be viewed as a natural extension of VAEs with a regularization of maximizing mutual information, mitigating the "posterior collapse" issue. We demonstrate the effectiveness and versatility of our models in various text generation scenarios, including language modeling, unaligned style transfer, and dialog response generation. The source code to reproduce our experimental results is available on GitHub.

Submitted 27 November, 2019; v1 submitted 30 August, 2019; originally announced August 2019.

Comments: 13 pages, 8 Tables, 1 Figure, Accepted at 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019)

arXiv:1908.11078 [pdf, other]

Document Hashing with Mixture-Prior Generative Models

Authors: Wei Dong, Qinliang Su, Dinghan Shen, Changyou Chen

Abstract: Hashing is promising for large-scale information retrieval tasks thanks to the efficiency of distance evaluation between binary codes. Generative hashing is often used to generate hashing codes in an unsupervised way. However, existing generative hashing methods only considered the use of simple priors, like Gaussian and Bernoulli priors, which limits these methods to further improve their performance. In this paper, two mixture-prior generative models are proposed, under the objective to produce high-quality hashing codes for documents. Specifically, a Gaussian mixture prior is first imposed onto the variational auto-encoder (VAE), followed by a separate step to cast the continuous latent representation of VAE into binary code. To avoid the performance loss caused by the separate casting, a model using a Bernoulli mixture prior is further developed, in which an end-to-end training is admitted by resorting to the straight-through (ST) discrete gradient estimator. Experimental results on several benchmark datasets demonstrate that the proposed methods, especially the one using Bernoulli mixture priors, consistently outperform existing ones by a substantial margin.

Submitted 29 August, 2019; originally announced August 2019.

Comments: 10 pages, 8 figures, to appear at EMNLP-IJCNLP 2019

arXiv:1908.03599 [pdf, ps, other]

Second order estimates for complex Hessian equations on Hermitian manifolds

Authors: Weisong Dong, Chang Li

Abstract: We derive second order estimates for $χ$-plurisubharmonic solutions of complex Hessian equations with right hand sides depending on gradients on compact Hermitian manifolds.

Submitted 26 August, 2019; v1 submitted 9 August, 2019; originally announced August 2019.

Comments: 16 pages, v2: added a Theorem (Theorem 1.3); added references; corrected typos

MSC Class: 35J15; 53C55; 58J05; 35B45

arXiv:1908.01922 [pdf, ps, other]

doi 10.3847/1538-4365/ab3718

In-orbit demonstration of X-ray pulsar navigation with the Insight-HXMT satellite

Authors: S. J. Zheng, S. N. Zhang, F. J. Lu, W. B. Wang, Y. Gao, T. P. Li, L. M. Song, M. Y. Ge, D. W. Han, Y. Chen, Y. P. Xu, X. L. Cao, C. Z. Liu, S. Zhang, J. L. Qu, Z. Chang, G. Chen, L. Chen, T. X. Chen, Y. B. Chen, Y. P. Chen, W. Cui, W. W. Cui, J. K. Deng, Y. W. Dong , et al. (91 additional authors not shown)

Abstract: In this work, we report the in-orbit demonstration of X-ray pulsar navigation with Insight-Hard X-ray Modulation Telescope (Insight-HXMT), which was launched on Jun. 15th, 2017. The new pulsar navigation method 'Significance Enhancement of Pulse-profile with Orbit-dynamics' (SEPO) is adopted to determine the orbit with observations of only one pulsar. In this test, the Crab pulsar is chosen and observed by Insight-HXMT from Aug. 31st to Sept. 5th in 2017. Using the 5-day-long observation data, the orbit of Insight-HXMT is determined successfully with the three telescopes onboard - High Energy X-ray Telescope (HE), Medium Energy X-ray Telescope (ME) and Low Energy X-ray Telescope (LE) - respectively. Combining all the data, the position and velocity of the Insight-HXMT are pinpointed to within 10 km (3 sigma) and 10 m/s (3 sigma), respectively.

Submitted 5 August, 2019; originally announced August 2019.

Comments: Accepted by the Astrophysical Journal Supplement

arXiv:1907.02788 [pdf, other]

Incremental Concept Learning via Online Generative Memory Recall

Authors: Huaiyu Li, Weiming Dong, Bao-Gang Hu

Abstract: The ability to learn more and more concepts over time from incrementally arriving data is essential for the development of a life-long learning system. However, deep neural networks often suffer from forgetting previously learned concepts when continually learning new concepts, which is known as catastrophic forgetting problem. The main reason for catastrophic forgetting is that the past concept data is not available and neural weights are changed during incrementally learning new concepts. In this paper, we propose a pseudo-rehearsal based class incremental learning approach to make neural networks capable of continually learning new concepts. We use a conditional generative adversarial network to consolidate old concepts memory and recall pseudo samples during learning new concepts and a balanced online memory recall strategy is to maximally maintain old memories. And we design a comprehensible incremental concept learning network as well as a concept contrastive loss to alleviate the magnitude of neural weights change. We evaluate the proposed approach on MNIST, Fashion-MNIST and SVHN datasets and compare with other rehearsal based approaches. The extensive experiments demonstrate the effectiveness of our approach.

Submitted 5 July, 2019; originally announced July 2019.

arXiv:1906.05093 [pdf, other]

doi 10.1109/TITS.2021.3094758

Optimizing city-scale traffic through modeling observations of vehicle movements

Authors: Fan Yang, Alina Vereshchaka, Bruno Lepri, Wen Dong

Abstract: The capability of traffic-information systems to sense the movement of millions of users and offer trip plans through mobile phones has enabled a new way of optimizing city traffic dynamics, turning transportation big data into insights and actions in a closed-loop and evaluating this approach in the real world. Existing research has applied dynamic Bayesian networks and deep neural networks to make traffic predictions from floating car data, utilized dynamic programming and simulation approaches to identify how people normally travel with dynamic traffic assignment for policy research, and introduced Markov decision processes and reinforcement learning to optimally control traffic signals. However, none of these works utilized floating car data to suggest departure times and route choices in order to optimize city traffic dynamics. In this paper, we present a study showing that floating car data can lead to lower average trip time, higher on-time arrival ratio, and higher Charypar-Nagel score compared with how people normally travel. The study is based on optimizing a partially observable discrete-time decision process and is evaluated in one synthesized scenario, one partly synthesized scenario, and three real-world scenarios. This study points to the potential of a "living lab" approach where we learn, predict, and optimize behaviors in the real world.

Submitted 15 July, 2021; v1 submitted 12 June, 2019; originally announced June 2019.

arXiv:1906.00240 [pdf]

Lung cancer screening with low-dose CT scans using a deep learning approach

Authors: Jason L. Causey, Yuanfang Guan, Wei Dong, Karl Walker, Jake A. Qualls, Fred Prior, Xiuzhen Huang

Abstract: Lung cancer is the leading cause of cancer deaths. Early detection through low-dose computed tomography (CT) screening has been shown to significantly reduce mortality but suffers from a high false positive rate that leads to unnecessary diagnostic procedures. Quantitative image analysis coupled to deep learning techniques has the potential to reduce this false positive rate. We conducted a computational analysis of 1449 low-dose CT studies drawn from the National Lung Screening Trial (NLST) cohort. We applied to this cohort our newly developed algorithm, DeepScreener, which is based on a novel deep learning approach. The algorithm, after the training process using about 3000 CT studies, does not require lung nodule annotations to conduct cancer prediction. The algorithm uses consecutive slices and multi-task features to determine whether a nodule is likely to be cancer, and a spatial pyramid to detect nodules at different scales. We find that the algorithm can predict a patient's cancer status from a volumetric lung CT image with high accuracy (78.2%, with area under the Receiver Operating Characteristic curve (AUC) of 0.858). Our preliminary framework ranked 16th of 1972 teams (top 1%) in the Data Science Bowl 2017 (DSB2017) competition, based on the challenge datasets. We report here the application of DeepScreener on an independent NLST test set. This study indicates that the deep learning approach has the potential to significantly reduce the false positive rate in lung cancer screening with low-dose CT scans.

Submitted 1 June, 2019; originally announced June 2019.

Comments: 6 figures

Showing 201–250 of 360 results for author: Dong, W