Showing 151–200 of 553 results for author: Xiang, Y

  1. arXiv:2212.13860  [pdf]

    cs.CL cs.DL

    Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain

    Authors: Chengzhi Zhang, Yi Xiang, Wenke Hao, Zhicheng Li, Yuchen Qian, Yuzhuo Wang

    Abstract: Future work sentences (FWS) are the particular sentences in academic papers that contain the author's description of their proposed follow-up research direction. This paper presents methods to automatically extract FWS from academic papers and classify them according to the different future directions embodied in the paper's content. FWS recognition methods will enable subsequent researchers to lo…

    Submitted 28 December, 2022; originally announced December 2022.

  2. arXiv:2212.10746  [pdf, other]

    cs.CV

    SLGTformer: An Attention-Based Approach to Sign Language Recognition

    Authors: Neil Song, Yu Xiang

    Abstract: Sign language is the preferred method of communication of deaf or mute people, but similar to any language, it is difficult to learn and represents a significant barrier for those who are hard of hearing or unable to speak. A person's entire frontal appearance dictates and conveys specific meaning. However, this frontal appearance can be quantified as a temporal sequence of human body pose, leadin…

    Submitted 22 December, 2022; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 12 pages, 3 figures

  3. DOSnet as a Non-Black-Box PDE Solver: When Deep Learning Meets Operator Splitting

    Authors: Yuan Lan, Zhen Li, Jie Sun, Yang Xiang

    Abstract: Deep neural networks (DNNs) recently emerged as a promising tool for analyzing and solving complex differential equations arising in science and engineering applications. Alternative to traditional numerical schemes, learning-based solvers utilize the representation power of DNNs to approximate the input-output relations in an automated manner. However, the lack of physics-in-the-loop often makes…

    Submitted 11 December, 2022; originally announced December 2022.

  4. arXiv:2211.16703  [pdf, other]

    cs.DC cs.AI

    An Efficient Split Fine-tuning Framework for Edge and Cloud Collaborative Learning

    Authors: Shaohuai Shi, Qing Yang, Yang Xiang, Shuhan Qi, Xuan Wang

    Abstract: To enable the pre-trained models to be fine-tuned with local data on edge devices without sharing data with the cloud, we design an efficient split fine-tuning (SFT) framework for edge and cloud collaborative learning. We propose three novel techniques in this framework. First, we propose a matrix decomposition-based method to compress the intermediate output of a neural network to reduce the comm…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 7 pages

  5. arXiv:2211.16059  [pdf, ps, other]

    stat.ME cs.LG eess.SP eess.SY

    On Large-Scale Multiple Testing Over Networks: An Asymptotic Approach

    Authors: Mehrdad Pournaderi, Yu Xiang

    Abstract: This work concerns developing communication- and computation-efficient methods for large-scale multiple testing over networks, which is of interest to many practical applications. We take an asymptotic approach and propose two methods, proportion-matching and greedy aggregation, tailored to distributed settings. The proportion-matching method achieves the global BH performance yet only requires a…

    Submitted 16 March, 2024; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Published in the IEEE Transactions on Signal and Information Processing over Networks

  6. arXiv:2211.11679  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Mean Shift Mask Transformer for Unseen Object Instance Segmentation

    Authors: Yangxiao Lu, Yuqiao Chen, Nicholas Ruozzi, Yu Xiang

    Abstract: Segmenting unseen objects from images is a critical perception skill that a robot needs to acquire. In robot manipulation, it can facilitate a robot to grasp and manipulate unseen objects. Mean shift clustering is a widely used method for image segmentation tasks. However, the traditional mean shift clustering algorithm is not differentiable, making it difficult to integrate it into an end-to-end…

    Submitted 21 September, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Add pixel confidence maps

  7. arXiv:2211.09166  [pdf, other]

    eess.AS cs.SD

    A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training

    Authors: Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen

    Abstract: This paper focuses on leveraging deep representation learning (DRL) for speech enhancement (SE). In general, the performance of the deep neural network (DNN) is heavily dependent on the learning of data representation. However, the DRL's importance is often ignored in many DNN-based SE algorithms. To obtain a higher quality enhanced speech, we propose a two-stage DRL-based SE method through advers…

    Submitted 27 September, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing

  8. arXiv:2211.03885  [pdf, other]

    cs.CV eess.IV

    Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Ziyao Yi, Yan Xiang, Zibin Liu, Shaoqing Li, Keming Shi, Dehui Kong, Ke Xu, Minsu Kwon, Yaqi Wu, Jiesi Zheng, Zhihao Fan, Xun Wu, Feng Zhang, Albert No, Minhyeok Cho, Zewen Chen, Xiaze Zhang, Ran Li, et al. (13 additional authors not shown)

    Abstract: The role of mobile cameras increased dramatically over the past few years, leading to more and more research in automatic image quality enhancement and RAW photo processing. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based image signal processing (ISP) pipeline replacing the standard mobile ISPs that can run on modern smartphone GPUs using TensorFlow Lite. Th…

    Submitted 7 November, 2022; originally announced November 2022.

  9. arXiv:2211.00235  [pdf, other]

    cs.DC

    Efficient AlphaFold2 Training using Parallel Evoformer and Branch Parallelism

    Authors: Guoxia Wang, Zhihua Wu, Xiaomin Fang, Yingfei Xiang, Yiqun Liu, Dianhai Yu, Yanjun Ma

    Abstract: The accuracy of AlphaFold2, a frontier end-to-end structure prediction system, is already close to that of the experimental determination techniques. Due to the complex model architecture and large memory consumption, it requires lots of computational resources and time to train AlphaFold2 from scratch. Efficient AlphaFold2 training could accelerate the development of life science. In this paper,…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2207.05477

  10. arXiv:2210.17408  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    Accelerating Diffusion Models via Pre-segmentation Diffusion Sampling for Medical Image Segmentation

    Authors: Xutao Guo, Yanwu Yang, Chenfei Ye, Shang Lu, Yang Xiang, Ting Ma

    Abstract: Based on the Denoising Diffusion Probabilistic Model (DDPM), medical image segmentation can be described as a conditional image generation task, which allows to compute pixel-wise uncertainty maps of the segmentation and allows an implicit ensemble of segmentations to boost the segmentation performance. However, DDPM requires many iterative denoising steps to generate segmentations from Gaussian n…

    Submitted 26 October, 2022; originally announced October 2022.

  11. arXiv:2210.13721  [pdf, other]

    eess.IV cs.CV cs.LG

    Multi-modal Dynamic Graph Network: Coupling Structural and Functional Connectome for Disease Diagnosis and Classification

    Authors: Yanwu Yang, Xutao Guo, Zhikai Chang, Chenfei Ye, Yang Xiang, Ting Ma

    Abstract: Multi-modal neuroimaging technology has greatly facilitated the efficiency and diagnosis accuracy, which provides complementary information in discovering objective disease biomarkers. Conventional deep learning methods, e.g. convolutional neural networks, overlook relationships between nodes and fail to capture topological properties in graphs. Graph neural networks have been proven to be of gre…

    Submitted 24 October, 2022; originally announced October 2022.

  12. arXiv:2210.11834  [pdf, other]

    cs.LG stat.ML

    Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles

    Authors: Yuxuan Han, Jialin Zeng, Yang Wang, Yang Xiang, Jiheng Zhang

    Abstract: We study the stochastic contextual bandit with knapsacks (CBwK) problem, where each action, taken upon a context, not only leads to a random reward but also costs a random resource consumption in a vector form. The challenge is to maximize the total reward without violating the budget for each resource. We study this problem under a general realizability setting where the expected reward and expec…

    Submitted 22 February, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: AISTATS2023

  13. RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control

    Authors: Yanfei Xiang, Xin Wang, Shu Hu, Bin Zhu, Xiaomeng Huang, Xi Wu, Siwei Lyu

    Abstract: Reinforcement learning is applied to solve actual complex tasks from high-dimensional, sensory inputs. The last decade has developed a long list of reinforcement learning algorithms. Recent progress benefits from deep learning for raw sensory signal representation. One question naturally arises: how well do they perform concerning different robotic manipulation tasks? Benchmarks use objective perf…

    Submitted 7 March, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 8 pages, 3 figures, 2 tables; update code's link

    ACM Class: I.2.9

  14. arXiv:2210.10978  [pdf, other]

    cs.CR

    A Comprehensive Survey on Edge Data Integrity Verification: Fundamentals and Future Trends

    Authors: Yao Zhao, Youyang Qu, Yong Xiang, Longxiang Gao

    Abstract: Recent advances in edge computing have pushed cloud-based data caching services to the edge; however, such emerging edge storage comes with numerous challenging and unique security issues. One of them is the problem of edge data integrity verification (EDIV) which coordinates multiple participants (e.g., data owners and edge nodes) to inspect whether data cached on edge is authentic. To date, various…

    Submitted 19 October, 2022; originally announced October 2022.

  15. arXiv:2210.04435  [pdf, other]

    cs.RO cs.AI eess.SY

    Creating a Dynamic Quadrupedal Robotic Goalkeeper with Reinforcement Learning

    Authors: Xiaoyu Huang, Zhongyu Li, Yanzhen Xiang, Yiming Ni, Yufeng Chi, Yunhao Li, Lizhi Yang, Xue Bin Peng, Koushil Sreenath

    Abstract: We present a reinforcement learning (RL) framework that enables quadrupedal robots to perform soccer goalkeeping tasks in the real world. Soccer goalkeeping using quadrupeds is a challenging problem that combines highly dynamic locomotion with precise and fast non-prehensile object (ball) manipulation. The robot needs to react to and intercept a potentially flying ball using dynamic locomotion ma…

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: First two authors contributed equally. Accompanying video is at https://youtu.be/iX6OgG67-ZQ

  16. arXiv:2210.03301  [pdf, other]

    eess.IV cs.CV cs.LG

    GOLLIC: Learning Global Context beyond Patches for Lossless High-Resolution Image Compression

    Authors: Yuan Lan, Liang Qin, Zhaoyi Sun, Yang Xiang, Jie Sun

    Abstract: Neural-network-based approaches recently emerged in the field of data compression and have already led to significant progress in image compression, especially in achieving a higher compression ratio. In the lossless image compression scenario, however, existing methods often struggle to learn a probability model of full-size high-resolution images due to the limitation of the computation source.…

    Submitted 6 October, 2022; originally announced October 2022.

  17. arXiv:2210.02555  [pdf, ps, other]

    eess.SP stat.ML

    Sample-and-Forward: Communication-Efficient Control of the False Discovery Rate in Networks

    Authors: Mehrdad Pournaderi, Yu Xiang

    Abstract: This work concerns controlling the false discovery rate (FDR) in networks under communication constraints. We present sample-and-forward, a flexible and communication-efficient version of the Benjamini-Hochberg (BH) procedure for multihop networks with general topologies. Our method evidences that the nodes in a network do not need to communicate p-values to each other to achieve a decent statisti…

    Submitted 15 May, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted to the 2023 IEEE International Symposium on Information Theory (ISIT)

  18. Physical interpretation of nonlocal quantum correlation through local description of subsystems

    Authors: Tanumoy Pramanik, Xiaojiong Chen, Yu Xiang, Xudong Li, Jun Mao, Jueming Bao, Yaohao Deng, Tianxiang Dai, Bo Tang, Yan Yang, Zhihua Li, Qihuang Gong, Qiongyi He, Jianwei Wang

    Abstract: Characterization and categorization of quantum correlations are both fundamentally and practically important in quantum information science. Although quantum correlations such as non-separability, steerability, and non-locality can be characterized by different theoretical models in different scenarios with either known (trusted) or unknown (untrusted) knowledge of the associated systems, such cha…

    Submitted 1 October, 2022; originally announced October 2022.

    Comments: 13 pages, 10 figures. Comments are welcome

    Journal ref: Sci Rep 12, 16400 (2022)

  19. arXiv:2209.12642  [pdf]

    eess.SY

    Design of Automatic Driving Safety Level and Positioning Accuracy

    Authors: Tiantian Tang, Hao Xu, Chengcheng Wu, Sijie Lye, Yan Xiang

    Abstract: Autonomous driving is a hot research topic in the frontier of science and technology. Technology companies and traditional car companies are developing and designing autonomous driving technology from two different directions. Based on the automatic driving classification standard and ISO safety level, combined with the number of traffic accidents and death data in China, and referring to the risk…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: in Chinese language

  20. arXiv:2209.11715  [pdf, other]

    cs.CR cs.AI

    The "Beatrix" Resurrections: Robust Backdoor Detection via Gram Matrices

    Authors: Wanlun Ma, Derui Wang, Ruoxi Sun, Minhui Xue, Sheng Wen, Yang Xiang

    Abstract: Deep Neural Networks (DNNs) are susceptible to backdoor attacks during training. The model corrupted in this way functions normally, but when triggered by certain patterns in the input, produces a predefined target label. Existing defenses usually rely on the assumption of the universal backdoor setting in which poisoned samples share the same uniform trigger. However, recent advanced backdoor att…

    Submitted 18 December, 2022; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 18 pages, 23 figures. Accepted to NDSS 2023. Camera-ready version. Code availability: https://github.com/wanlunsec/Beatrix

  21. arXiv:2209.08933  [pdf, ps, other]

    eess.IV cs.CV

    Estimating Brain Age with Global and Local Dependencies

    Authors: Yanwu Yang, Xutao Guo, Zhikai Chang, Chenfei Ye, Yang Xiang, Haiyan Lv, Ting Ma

    Abstract: The brain age has been proven to be a phenotype of relevance to cognitive performance and brain disease. Achieving accurate brain age prediction is an essential prerequisite for optimizing the predicted brain-age difference as a biomarker. As a comprehensive biological characteristic, the brain age is hard to be exploited accurately with models using feature engineering and local processing such a…

    Submitted 19 September, 2022; originally announced September 2022.

  22. arXiv:2209.00514  [pdf, other]

    cs.LG physics.chem-ph

    Efficient Chemical Space Exploration Using Active Learning Based on Marginalized Graph Kernel: an Application for Predicting the Thermodynamic Properties of Alkanes with Molecular Simulation

    Authors: Yan Xiang, Yu-Hang Tang, Zheng Gong, Hongyi Liu, Liang Wu, Guang Lin, Huai Sun

    Abstract: We introduce an explorative active learning (AL) algorithm based on Gaussian process regression and marginalized graph kernel (GPR-MGK) to explore chemical space with minimum cost. Using high-throughput molecular dynamics simulation to generate data and graph neural network (GNN) to predict, we constructed an active learning molecular simulation framework for thermodynamic property prediction. In…

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: 9 pages, 5 figures

  23. arXiv:2208.10027  [pdf, other]

    stat.ME cs.LG

    Learning Invariant Representations under General Interventions on the Response

    Authors: Kang Du, Yu Xiang

    Abstract: It has become increasingly common nowadays to collect observations of feature and response pairs from different environments. As a consequence, one has to apply learned predictors to data with a different distribution due to distribution shifts. One principled approach is to adopt the structural causal models to describe training and test models, following the invariance principle which says that…

    Submitted 30 October, 2023; v1 submitted 21 August, 2022; originally announced August 2022.

    Comments: Accepted to the IEEE Journal on Selected Areas in Information Theory. Special Issue: Causality: Fundamental Limits and Applications

  24. arXiv:2208.09804  [pdf, ps, other]

    cond-mat.supr-con cond-mat.str-el

    Electronic Correlations and Evolution of the Charge-Density Wave in the Kagome Metals $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, Cs)

    Authors: Xiaoxiang Zhou, Yongkai Li, Xinwei Fan, Jiahao Hao, Ying Xiang, Zhe Liu, Yaomin Dai, Zhiwei Wang, Yugui Yao, Hai-Hu Wen

    Abstract: The kagome metals $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, Cs) have attracted enormous interest as they exhibit intertwined charge-density wave (CDW) and superconductivity. The alkali-metal dependence of these characteristics contains pivotal information about the CDW and its interplay with superconductivity. Here, we report optical studies of $A$V$_{3}$Sb$_{5}$ across the whole family. With increasing al…

    Submitted 21 August, 2022; originally announced August 2022.

    Comments: 6 pages, 4 figures. Comments are welcome and appreciated

  25. An Efficient and Reliable Asynchronous Federated Learning Scheme for Smart Public Transportation

    Authors: Chenhao Xu, Youyang Qu, Tom H. Luan, Peter W. Eklund, Yong Xiang, Longxiang Gao

    Abstract: Since the traffic conditions change over time, machine learning models that predict traffic flows must be updated continuously and efficiently in smart public transportation. Federated learning (FL) is a distributed machine learning scheme that allows buses to receive model updates without waiting for model training on the cloud. However, FL is vulnerable to poisoning or DDoS attacks since buses t…

    Submitted 26 December, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

  26. arXiv:2208.00183  [pdf, other]

    cs.CV

    Few-shot Single-view 3D Reconstruction with Memory Prior Contrastive Network

    Authors: Zhen Xing, Yijiang Chen, Zhixin Ling, Xiangdong Zhou, Yu Xiang

    Abstract: 3D reconstruction of novel categories based on few-shot learning is appealing in real-world applications and attracts increasing research interests. Previous approaches mainly focus on how to design shape prior models for different categories. Their performance on unseen categories is not very competitive. In this paper, we present a Memory Prior Contrastive Network (MPCN) that can store shape pri…

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: Accepted by ECCV 2022

  27. arXiv:2207.13921  [pdf, other]

    q-bio.BM cs.AI cs.LG q-bio.QM

    HelixFold-Single: MSA-free Protein Structure Prediction by Using Protein Language Model as an Alternative

    Authors: Xiaomin Fang, Fan Wang, Lihang Liu, Jingzhou He, Dayong Lin, Yingfei Xiang, Xiaonan Zhang, Hua Wu, Hui Li, Le Song

    Abstract: AI-based protein structure prediction pipelines, such as AlphaFold2, have achieved near-experimental accuracy. These advanced pipelines mainly rely on Multiple Sequence Alignments (MSAs) as inputs to learn the co-evolution information from the homologous sequences. Nonetheless, searching MSAs from protein databases is time-consuming, usually taking dozens of minutes. Consequently, we attempt to ex…

    Submitted 21 February, 2023; v1 submitted 28 July, 2022; originally announced July 2022.

    Journal ref: Nature Machine Intelligence, 2023

  28. arXiv:2207.13342  [pdf, other]

    quant-ph

    Quantum Steering: Practical Challenges and Perspectives

    Authors: Yu Xiang, Shuming Cheng, Qihuang Gong, Zbigniew Ficek, Qiongyi He

    Abstract: Einstein-Podolsky-Rosen (EPR) steering or quantum steering describes the "spooky-action-at-a-distance" that one party is able to remotely alter the states of the other if they share a certain entangled state. Generally, it admits an operational interpretation as the task of verifying entanglement without trust in the steering party's devices, making it lying intermediate between Bell nonlocality a…

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: PRX Quantum accepted

  29. Characterizing Multipartite Non-Gaussian Entanglement for Three-Mode Spontaneous Parametric Down-Conversion Process

    Authors: Mingsheng Tian, Yu Xiang, Feng-Xiao Sun, Matteo Fadel, Qiongyi He

    Abstract: Very recently, strongly non-Gaussian states have been observed via a direct three-mode spontaneous parametric down-conversion in a superconducting cavity [Phys. Rev. X 10, 011011 (2020)]. The created multi-photon non-Gaussian correlations are attractive and useful for various quantum information tasks. However, how to detect and classify multipartite non-Gaussian entanglement has not yet been comp…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: 10 pages, 4 figures

    Journal ref: Phys. Rev. Applied 18, 024065 (2022)

  30. arXiv:2207.05477  [pdf, other]

    cs.DC cs.LG q-bio.BM

    HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle

    Authors: Guoxia Wang, Xiaomin Fang, Zhihua Wu, Yiqun Liu, Yang Xue, Yingfei Xiang, Dianhai Yu, Fan Wang, Yanjun Ma

    Abstract: Accurate protein structure prediction can significantly accelerate the development of life science. The accuracy of AlphaFold2, a frontier end-to-end structure prediction system, is already close to that of the experimental determination techniques. Due to the complex model architecture and large memory consumption, it requires lots of computational resources and time to implement the training and…

    Submitted 13 July, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

  31. arXiv:2207.04579  [pdf, ps, other]

    astro-ph.SR physics.space-ph

    Reconfiguration and eruption of a solar filament by magnetic reconnection with an emerging magnetic field

    Authors: Leping Li, Hardi Peter, Lakshmi Pradeep Chitta, Hongqiang Song, Zhe Xu, Yongyuan Xiang

    Abstract: Both observations and simulations suggest that the solar filament eruption is closely related to magnetic flux emergence. It is thought that the eruption is triggered by magnetic reconnection between the filament and the emerging flux. However, the details of such a reconnection are rarely presented. In this study, we report the detailed reconnection between a filament and its nearby emerging fiel…

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: 15 pages, 7 figures, accepted for publication in ApJ

  32. arXiv:2207.04434  [pdf, other]

    cs.CR cs.CV

    Hiding Your Signals: A Security Analysis of PPG-based Biometric Authentication

    Authors: Lin Li, Chao Chen, Lei Pan, Yonghang Tai, Jun Zhang, Yang Xiang

    Abstract: Recently, physiological signal-based biometric systems have received wide attention. Unlike traditional biometric features, physiological signals cannot be easily compromised (usually unobservable to human eyes). Photoplethysmography (PPG) signal is easy to measure, making it more attractive than many other physiological signals for biometric authentication. However, with the advent of remote PPG…

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  33. arXiv:2207.03333  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    FewSOL: A Dataset for Few-Shot Object Learning in Robotic Environments

    Authors: Jishnu Jaykumar P, Yu-Wei Chao, Yu Xiang

    Abstract: We introduce the Few-Shot Object Learning (FewSOL) dataset for object recognition with a few images per object. We captured 336 real-world objects with 9 RGB-D images per object from different views. Object segmentation masks, object poses and object attributes are provided. In addition, synthetic images generated using 330 3D object models are used to augment the dataset. We investigated (i) few-…

    Submitted 5 March, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

  34. arXiv:2207.02959  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    NeuralGrasps: Learning Implicit Representations for Grasps of Multiple Robotic Hands

    Authors: Ninad Khargonkar, Neil Song, Zesheng Xu, Balakrishnan Prabhakaran, Yu Xiang

    Abstract: We introduce a neural implicit representation for grasps of objects from multiple robotic hands. Different grasps across multiple robotic hands are encoded into a shared latent space. Each latent vector is learned to decode to the 3D shape of an object and the 3D shape of a robotic hand in a grasping pose in terms of the signed distance functions of the two 3D shapes. In addition, the distance met…

    Submitted 6 July, 2022; originally announced July 2022.

  35. Multi-scale Attentive Image De-raining Networks via Neural Architecture Search

    Authors: Lei Cai, Yuli Fu, Wanliang Huo, Youjun Xiang, Tao Zhu, Ying Zhang, Huanqiang Zeng, Delu Zeng

    Abstract: Multi-scale architectures and attention modules have shown effectiveness in many deep learning-based image de-raining methods. However, manually designing and integrating these two components into a neural network requires a bulk of labor and extensive expertise. In this article, a high-performance multi-scale attentive neural architecture search (MANAS) framework is technically developed for imag…

    Submitted 4 April, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

    Journal ref: IEEE Transactions on Circuits and Systems for Video Technology, vol.33, no.2, pp.618-633 September 2022

  36. arXiv:2207.00268  [pdf, ps, other]

    astro-ph.IM eess.IV

    High-resolution Solar Image Reconstruction Based on Non-rigid Alignment

    Authors: Hui Liu, Zhenyu Jin, Yongyuan Xiang, Kaifan Ji

    Abstract: Suppressing the interference of atmospheric turbulence and obtaining observation data with a high spatial resolution is an issue to be solved urgently for ground observations. One way to solve this problem is to perform a statistical reconstruction of short-exposure speckle images. Combining the rapidity of Shift-Add and the accuracy of speckle masking, this paper proposes a novel reconstruction a…

    Submitted 1 July, 2022; originally announced July 2022.

  37. arXiv:2206.14362  [pdf, other]

    cs.IT eess.SP stat.ME

    Lower Bounds on the Error Probability for Invariant Causal Prediction

    Authors: Austin Goddard, Yu Xiang, Ilya Soloveychik

    Abstract: It is common practice to collect observations of feature and response pairs from different environments. A natural question is how to identify features that have consistent prediction power across environments. The invariant causal prediction framework proposes to approach this problem through invariance, assuming a linear model that is invariant under different environments. In this work, we make…

    Submitted 29 June, 2022; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: Accepted to the 2022 IEEE International Workshop on Machine Learning for Signal Processing (MLSP)

  38. arXiv:2206.12254  [pdf, other]

    cs.LG cs.CE

    A Manifold-based Airfoil Geometric-feature Extraction and Discrepant Data Fusion Learning Method

    Authors: Yu Xiang, Guangbo Zhang, Liwei Hu, Jun Zhang, Wenyong Wang

    Abstract: Geometrical shape of airfoils, together with the corresponding flight conditions, are crucial factors for aerodynamic performances prediction. The obtained airfoils geometrical features in most existing approaches (e.g., geometrical parameters extraction, polynomial description and deep learning) are in Euclidean space. State-of-the-art studies showed that curves or surfaces of an airfoil formed a…

    Submitted 23 June, 2022; originally announced June 2022.

  39. arXiv:2206.10736  [pdf]

    cs.LG cs.AI q-fin.CP q-fin.TR

    Imitate then Transcend: Multi-Agent Optimal Execution with Dual-Window Denoise PPO

    Authors: Jin Fang, Jiacheng Weng, Yi Xiang, Xinwen Zhang

    Abstract: A novel framework for solving the optimal execution and placement problems using reinforcement learning (RL) with imitation was proposed. The RL agents trained from the proposed framework consistently outperformed the industry benchmark time-weighted average price (TWAP) strategy in execution cost and showed great generalization across out-of-sample trading dates and tickers. The impressive perfor…

    Submitted 21 June, 2022; originally announced June 2022.

  40. arXiv:2206.07257  [pdf, other]

    astro-ph.SR astro-ph.IM cs.LG

    Investigation of stellar magnetic activity using variational autoencoder based on low-resolution spectroscopic survey

    Authors: Yue Xiang, Shenghong Gu, Dongtao Cao

    Abstract: We apply the variational autoencoder (VAE) to the LAMOST-K2 low-resolution spectra to detect the magnetic activity of the stars in the K2 field. After the training on the spectra of the selected inactive stars, the VAE model can efficiently generate the synthetic reference templates needed by the spectral subtraction procedure, without knowing any stellar parameters. Then we detect the peculiar sp…

    Submitted 6 July, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 13 pages, 19 figures, accepted for publication in MNRAS. Table 1 is available on Zenodo at https://doi.org/10.5281/zenodo.6802956 and the code can be found on GitHub at https://github.com/xylib/vae-for-spectroscopic-survey

  41. arXiv:2205.14421  [pdf, ps, other]

    math.NA cs.LG math.OC

    Approximation of Functionals by Neural Network without Curse of Dimensionality

    Authors: Yahong Yang, Yang Xiang

    Abstract: In this paper, we establish a neural network to approximate functionals, which are maps from infinite dimensional spaces to finite dimensional spaces. The approximation error of the neural network is $O(1/\sqrt{m})$ where $m$ is the size of networks, which overcomes the curse of dimensionality. The key idea of the approximation is to define a Barron spectral space of functionals.

    Submitted 18 October, 2022; v1 submitted 28 May, 2022; originally announced May 2022.

    Journal ref: J. Mach. Learn., 1 (2022), pp. 342-372

  42. arXiv:2205.09747  [pdf, other]

    cs.RO cs.CV

    HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers

    Authors: Yu-Wei Chao, Chris Paxton, Yu Xiang, Wei Yang, Balakumar Sundaralingam, Tao Chen, Adithyavairavan Murali, Maya Cakmak, Dieter Fox

    Abstract: We introduce a new simulation benchmark "HandoverSim" for human-to-robot object handovers. To simulate the giver's motion, we leverage a recent motion capture dataset of hand grasping of objects. We create training and evaluation environments for the receiver with standardized protocols and metrics. We analyze the performance of a set of baselines and show a correlation with a real-world evaluatio…

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted to ICRA 2022

  43. arXiv:2205.09470  [pdf, other]

    cs.LG cs.AI cs.DC

    Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters

    Authors: Yang Xiang, Zhihua Wu, Weibao Gong, Siyu Ding, Xianjie Mo, Yuang Liu, Shuohuan Wang, Peng Liu, Yongshuai Hou, Long Li, Bin Wang, Shaohuai Shi, Yaqian Han, Yue Yu, Ge Li, Yu Sun, Yanjun Ma, Dianhai Yu

    Abstract: The ever-growing model size and scale of compute have attracted increasing interests in training deep learning models over multiple nodes. However, when it comes to training on cloud clusters, especially across remote clusters, huge challenges are faced. In this work, we introduce a general framework, Nebula-I, for collaboratively training deep learning models over remote heterogeneous clusters, t…

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 20 pages, 10 figures, technical report

  44. arXiv:2205.09162  [pdf, ps, other]

    stat.ME cs.LG

    An Invariant Matching Property for Distribution Generalization under Intervened Response

    Authors: Kang Du, Yu Xiang

    Abstract: The task of distribution generalization concerns making reliable prediction of a response in unseen environments. The structural causal models are shown to be useful to model distribution changes through intervention. Motivated by the fundamental invariance principle, it is often assumed that the conditional distribution of the response given its predictors remains the same across environments. Ho…

    Submitted 10 June, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted to the European Signal Processing Conference (EUSIPCO) 2022

  45. arXiv:2205.07186  [pdf, other]

    cond-mat.mtrl-sci

    Stochastic Continuum Models for High-Entropy Alloys with Short-range Order

    Authors: Yahong Yang, Luchan Zhang, Yang Xiang

    Abstract: High entropy alloys (HEAs) are a class of novel materials that exhibit superb engineering properties. It has been demonstrated by extensive experiments and first principles/atomistic simulations that short-range order in the atomic level randomness strongly influences the properties of HEAs. In this paper, we derive stochastic continuum models for HEAs with short-range order from atomistic models.…

    Submitted 15 May, 2022; originally announced May 2022.

  46. arXiv:2205.07051  [pdf]

    physics.optics physics.app-ph

    High-speed graphene-silicon-graphene waveguide PDs with high photo-to-dark-current ratio and large linear dynamic range

    Authors: Jingshu Guo, Chaoyue Liu, Laiwen Yu, Hengtai Xiang, Yuluan Xiang, Daoxin Dai

    Abstract: Two-dimensional materials (2DMs) meet the demand of broadband and low-cost photodetection on silicon for many applications. Currently, it is still very challenging to realize excellent silicon-2DM PDs. Here we demonstrate graphene-silicon-graphene waveguide PDs operating at the wavelength-bands of 1.55 μm and 2 μm, showing the potential for large-scale integration. For the fabricated PDs, the meas…

    Submitted 14 May, 2022; originally announced May 2022.

  47. arXiv:2205.05581  [pdf, other]

    eess.AS cs.SD

    A deep representation learning speech enhancement method using $β$-VAE

    Authors: Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen

    Abstract: In previous work, we proposed a variational autoencoder-based (VAE) Bayesian permutation training speech enhancement (SE) method (PVAE) which indicated that the SE performance of the traditional deep neural network-based (DNN) method could be improved by deep representation learning (DRL). Based on our previous work, we in this paper propose to use $β$-VAE to further improve PVAE's ability of repr…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Submitted to EUSIPCO

  48. arXiv:2204.13847  [pdf]

    cs.LG cs.AI

    CATNet: Cross-event Attention-based Time-aware Network for Medical Event Prediction

    Authors: Sicen Liu, Xiaolong Wang, Yang Xiang, Hui Xu, Hui Wang, Buzhou Tang

    Abstract: Medical event prediction (MEP) is a fundamental task in the medical domain, which needs to predict medical events, including medications, diagnosis codes, laboratory tests, procedures, outcomes, and so on, according to historical medical records. The task is challenging as medical data is a type of complex time series data with heterogeneous and temporal irregular characteristics. Many machine lea…

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: 15 pages, 4 figures

  49. arXiv:2204.13325  [pdf, ps, other]

    math.AP

    Weak solutions to an initial-boundary value problem for a continuum equation of motion of grain boundaries

    Authors: Peicheng Zhu, Lei Yu, Yang Xiang

    Abstract: We investigate an initial-(periodic-)boundary value problem for a continuum equation, which is a model for motion of grain boundaries based on the underlying microscopic mechanisms of line defects (disconnections) and integrated the effects of a diverse range of thermodynamic driving forces. We first prove the global-in-time existence and uniqueness of weak solution to this initial-boundary value…

    Submitted 28 April, 2022; originally announced April 2022.

  50. Experimental demonstration of remotely creating Wigner negativity via quantum steering

    Authors: Shuheng Liu, Dongmei Han, Na Wang, Yu Xiang, Fengxiao Sun, Meihong Wang, Zhongzhong Qin, Qihuang Gong, Xiaolong Su, Qiongyi He

    Abstract: Non-Gaussian states with Wigner negativity are of particular interest in quantum technology due to their potential applications in quantum computing and quantum metrology. However, how to create such states at a remote location remains a challenge, which is important for efficiently distributing quantum resource between distant nodes in a network. Here, we experimentally prepare optical non-Gaussi…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Phys. Rev. Lett. (Accepted)

    Journal ref: Phys. Rev. Lett. 128, 200401 (2022)