Skip to main content

Showing 1–50 of 106 results for author: Dou, D

  1. arXiv:2403.19178  [pdf, other

    cs.CR cs.AI cs.DC cs.LG

    Enhancing Trust and Privacy in Distributed Networks: A Comprehensive Survey on Blockchain-based Federated Learning

    Authors: Ji Liu, Chunlu Chen, Yu Li, Lin Sun, Yulun Song, Jingbo Zhou, Bo Jing, Dejing Dou

    Abstract: While centralized servers pose a risk of being a single point of failure, decentralized approaches like blockchain offer a compelling solution by implementing a consensus mechanism among multiple entities. Merging distributed computing with cryptographic techniques, decentralized technologies introduce a novel computing paradigm. Blockchain ensures secure, transparent, and tamper-proof data manage… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 25 pages, accepted by KAIS 2024

  2. arXiv:2401.00272  [pdf, other

    cs.IR

    Dual-space Hierarchical Learning for Goal-guided Conversational Recommendation

    Authors: Can Chen, Hao Liu, Zeming Liu, Xue Liu, Dejing Dou

    Abstract: Proactively and naturally guiding the dialog from the non-recommendation context (e.g., Chit-chat) to the recommendation scenario (e.g., Music) is crucial for the Conversational Recommender System (CRS). Prior studies mainly focus on planning the next dialog goal~(e.g., chat on a movie star) conditioned on the previous dialog. However, we find the dialog goals can be simultaneously observed at dif… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted by Neurocomputing

  3. arXiv:2312.15186  [pdf, other

    cs.DC cs.AI cs.LG

    Efficient Asynchronous Federated Learning with Sparsification and Quantization

    Authors: Juncheng Jia, Ji Liu, Chendi Zhou, Hao Tian, Mianxiong Dong, Dejing Dou

    Abstract: While data is distributed in multiple edge devices, Federated Learning (FL) is attracting more and more attention to collaboratively train a machine learning model without transferring raw data. FL generally exploits a parameter server and a large number of edge devices during the whole process of the model training, while several devices are selected in each round. However, straggler devices may… ▽ More

    Submitted 6 January, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: To appear in Concurrency and Computation: Practice and Experience (CCPE), 21 pages

  4. arXiv:2312.10935  [pdf, other

    cs.DC cs.AI cs.LG

    AEDFL: Efficient Asynchronous Decentralized Federated Learning with Heterogeneous Devices

    Authors: Ji Liu, Tianshi Che, Yang Zhou, Ruoming Jin, Huaiyu Dai, Dejing Dou, Patrick Valduriez

    Abstract: Federated Learning (FL) has achieved significant achievements recently, enabling collaborative model training on distributed data over edge devices. Iterative gradient or model exchanges between devices and the centralized server in the standard FL paradigm suffer from severe efficiency bottlenecks on the server. While enabling collaborative training without a central server, existing decentralize… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: To appear in SDM 2024, 15 pages

  5. arXiv:2312.08975  [pdf, other

    cs.CV cs.AI cs.DC

    On Mask-based Image Set Desensitization with Recognition Support

    Authors: Qilong Li, Ji Liu, Yifan Sun, Chongsheng Zhang, Dejing Dou

    Abstract: In recent years, Deep Neural Networks (DNN) have emerged as a practical method for image recognition. The raw data, which contain sensitive information, are generally exploited within the training process. However, when the training process is outsourced to a third-party organization, the raw data should be desensitized before being transferred to protect sensitive information. Although masks are… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: To appear in Applied Intelligence (APIN), 1-26 pages

  6. arXiv:2312.05770  [pdf, other

    cs.DC

    FedASMU: Efficient Asynchronous Federated Learning with Dynamic Staleness-aware Model Update

    Authors: Ji Liu, Juncheng Jia, Tianshi Che, Chao Huo, Jiaxiang Ren, Yang Zhou, Huaiyu Dai, Dejing Dou

    Abstract: As a promising approach to deal with distributed data, Federated Learning (FL) achieves major advancements in recent years. FL enables collaborative model training by exploiting the raw data dispersed in multiple edge devices. However, the data is generally non-independent and identically distributed, i.e., statistical heterogeneity, and the edge devices significantly differ in terms of both compu… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 18 pages, to appear in AAAI 2024

  7. arXiv:2310.15080  [pdf, other

    cs.LG cs.CL cs.DC

    Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization

    Authors: Tianshi Che, Ji Liu, Yang Zhou, Jiaxiang Ren, Jiwen Zhou, Victor S. Sheng, Huaiyu Dai, Dejing Dou

    Abstract: Federated learning (FL) is a promising paradigm to enable collaborative model training with decentralized data. However, the training process of Large Language Models (LLMs) generally incurs the update of significant parameters, which limits the applicability of FL techniques to tackle the LLMs in real scenarios. Prompt tuning can significantly reduce the number of parameters to update, but it eit… ▽ More

    Submitted 11 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 18 pages, accepted by EMNLP 2023

  8. MUSCLE: Multi-task Self-supervised Continual Learning to Pre-train Deep Models for X-ray Images of Multiple Body Parts

    Authors: Weibin Liao, Haoyi Xiong, Qingzhong Wang, Yan Mo, Xuhong Li, Yi Liu, Zeyu Chen, Siyu Huang, Dejing Dou

    Abstract: While self-supervised learning (SSL) algorithms have been widely used to pre-train deep models, few efforts [11] have been done to improve representation learning of X-ray image analysis with SSL pre-trained models. In this work, we study a novel self-supervised pre-training pipeline, namely Multi-task Self-super-vised Continual Learning (MUSCLE), for multiple medical imaging tasks, such as classi… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: accepted by Medical Image Computing and Computer Assisted Intervention (MICCAI) 2022

  9. arXiv:2306.03387  [pdf, other

    cs.AI

    ColdNAS: Search to Modulate for User Cold-Start Recommendation

    Authors: Shiguang Wu, Yaqing Wang, Qinghe Jing, Daxiang Dong, Dejing Dou, Quanming Yao

    Abstract: Making personalized recommendation for cold-start users, who only have a few interaction histories, is a challenging problem in recommendation systems. Recent works leverage hypernetworks to directly map user interaction histories to user-specific parameters, which are then used to modulate predictor by feature-wise linear modulation function. These works obtain the state-of-the-art performance. H… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  10. arXiv:2304.07775  [pdf, other

    cs.CV cs.MM

    Robust Cross-Modal Knowledge Distillation for Unconstrained Videos

    Authors: Wenke Xia, Xingjian Li, Andong Deng, Haoyi Xiong, Dejing Dou, Di Hu

    Abstract: Cross-modal distillation has been widely used to transfer knowledge across different modalities, enriching the representation of the target unimodal one. Recent studies highly relate the temporal synchronization between vision and sound to the semantic consistency for cross-modal distillation. However, such semantic consistency from the synchronization is hard to guarantee in unconstrained videos,… ▽ More

    Submitted 27 April, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

  11. arXiv:2304.00844  [pdf, other

    cs.CV eess.IV

    Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising

    Authors: Miaoyu Li, Ji Liu, Ying Fu, Yulun Zhang, Dejing Dou

    Abstract: Denoising is a crucial step for hyperspectral image (HSI) applications. Though witnessing the great power of deep learning, existing HSI denoising methods suffer from limitations in capturing the non-local self-similarity. Transformers have shown potential in capturing long-range dependencies, but few attempts have been made with specifically designed Transformer to model the spatial and spectral… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  12. arXiv:2304.00320  [pdf, other

    cs.LG cs.AI

    Doubly Stochastic Models: Learning with Unbiased Label Noises and Inference Stability

    Authors: Haoyi Xiong, Xuhong Li, Boyang Yu, Zhanxing Zhu, Dongrui Wu, Dejing Dou

    Abstract: Random label noises (or observational noises) widely exist in practical machine learning settings. While previous studies primarily focus on the affects of label noises to the performance of learning, our work intends to investigate the implicit regularization effects of the label noises, under mini-batch sampling settings of stochastic gradient descent (SGD), with assumptions that label noises ar… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: The complete manuscript of our previous submission to ICLR'21 (https://openreview.net/forum?id=g4szfsQUdy3). This manuscript was major done in 2021. We gave try to some venues but unfortunately haven't made it accepted yet

  13. arXiv:2303.04574  [pdf, other

    cs.DC

    Distributed and Deep Vertical Federated Learning with Big Data

    Authors: Ji Liu, Xuehai Zhou, Lei Mo, Shilei Ji, Yuan Liao, Zheng Li, Qin Gu, Dejing Dou

    Abstract: In recent years, data are typically distributed in multiple organizations while the data security is becoming increasingly important. Federated Learning (FL), which enables multiple parties to collaboratively train a model without exchanging the raw data, has attracted more and more attention. Based on the distribution of data, FL can be realized in three scenarios, i.e., horizontal, vertical, and… ▽ More

    Submitted 10 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: To appear in CCPE (Concurrency and Computation: Practice and Experience)

  14. arXiv:2302.12688  [pdf, other

    cs.CV cs.AI cs.LG

    Video4MRI: An Empirical Study on Brain Magnetic Resonance Image Analytics with CNN-based Video Classification Frameworks

    Authors: Yuxuan Zhang, Qingzhong Wang, Jiang Bian, Yi Liu, Yanwu Xu, Dejing Dou, Haoyi Xiong

    Abstract: To address the problem of medical image recognition, computer vision techniques like convolutional neural networks (CNN) are frequently used. Recently, 3D CNN-based models dominate the field of magnetic resonance image (MRI) analytics. Due to the high similarity between MRI data and videos, we conduct extensive empirical studies on video recognition techniques for MRI classification to answer the… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted by IEEE ISBI'23

  15. arXiv:2301.03028  [pdf, other

    cs.LG

    Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement

    Authors: Yan Li, Xinjiang Lu, Yaqing Wang, Dejing Dou

    Abstract: Time series forecasting has been a widely explored task of great importance in many applications. However, it is common that real-world time series data are recorded in a short time period, which results in a big gap between the deep model and the limited and noisy time series. In this work, we propose to address the time series forecasting problem with generative modeling and propose a bidirectio… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

  16. arXiv:2301.02071  [pdf, other

    cs.CL cs.AI

    Towards Table-to-Text Generation with Pretrained Language Model: A Table Structure Understanding and Text Deliberating Approach

    Authors: Miao Chen, Xinjiang Lu, Tong Xu, Yanyan Li, Jingbo Zhou, Dejing Dou, Hui Xiong

    Abstract: Although remarkable progress on the neural table-to-text methods has been made, the generalization issues hinder the applicability of these models due to the limited source tables. Large-scale pretrained language models sound like a promising solution to tackle such issues. However, how to effectively bridge the gap between the structured table and the text input by fully leveraging table informat… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

  17. arXiv:2301.02068  [pdf, other

    cs.LG cs.AI

    Towards Long-Term Time-Series Forecasting: Feature, Pattern, and Distribution

    Authors: Yan Li, Xinjiang Lu, Haoyi Xiong, Jian Tang, Jiantao Su, Bo Jin, Dejing Dou

    Abstract: Long-term time-series forecasting (LTTF) has become a pressing demand in many applications, such as wind power supply planning. Transformer models have been adopted to deliver high prediction capacity because of the high computational self-attention mechanism. Though one could lower the complexity of Transformers by inducing the sparsity in point-wise self-attentions for LTTF, the limited informat… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

  18. Temporal Output Discrepancy for Loss Estimation-based Active Learning

    Authors: Siyu Huang, Tianyang Wang, Haoyi Xiong, Bihan Wen, Jun Huan, Dejing Dou

    Abstract: While deep learning succeeds in a wide range of tasks, it highly depends on the massive collection of annotated data which is expensive and time-consuming. To lower the cost of data annotation, active learning has been proposed to interactively query an oracle to annotate a small proportion of informative samples in an unlabeled dataset. Inspired by the fact that the samples with higher loss are u… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted for IEEE Transactions on Neural Networks and Learning Systems, 2022. Journal extension of ICCV 2021 [arXiv:2107.14153]

  19. arXiv:2212.09321  [pdf, other

    cs.CV cs.LG

    Learning from Training Dynamics: Identifying Mislabeled Data Beyond Manually Designed Features

    Authors: Qingrui Jia, Xuhong Li, Lei Yu, Jiang Bian, Penghao Zhao, Shupeng Li, Haoyi Xiong, Dejing Dou

    Abstract: While mislabeled or ambiguously-labeled samples in the training set could negatively affect the performance of deep models, diagnosing the dataset and identifying mislabeled samples helps to improve the generalization power. Training dynamics, i.e., the traces left by iterations of optimization algorithms, have recently been proved to be effective to localize mislabeled samples with hand-crafted f… ▽ More

    Submitted 20 December, 2022; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: AAAI23 accepted Conference Paper

  20. arXiv:2211.14633  [pdf, other

    cs.LG cs.CV cs.SI

    A Contextual Master-Slave Framework on Urban Region Graph for Urban Village Detection

    Authors: Congxi Xiao, Jingbo Zhou, Jizhou Huang, Hengshu Zhu, Tong Xu, Dejing Dou, Hui Xiong

    Abstract: Urban villages (UVs) refer to the underdeveloped informal settlement falling behind the rapid urbanization in a city. Since there are high levels of social inequality and social risks in these UVs, it is critical for city managers to discover all UVs for making appropriate renovation policies. Existing approaches to detecting UVs are labor-intensive or have not fully addressed the unique challenge… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

  21. arXiv:2211.13430  [pdf, other

    cs.DC cs.AI cs.LG

    Multi-Job Intelligent Scheduling with Cross-Device Federated Learning

    Authors: Ji Liu, Juncheng Jia, Beichen Ma, Chendi Zhou, Jingbo Zhou, Yang Zhou, Huaiyu Dai, Dejing Dou

    Abstract: Recent years have witnessed a large amount of decentralized data in various (edge) devices of end-users, while the decentralized data aggregation remains complicated for machine learning jobs because of regulations and laws. As a practical approach to handling decentralized data, Federated Learning (FL) enables collaborative global machine learning model training without sharing sensitive raw data… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: To appear in TPDS; 22 pages, 17 figures, 8 tables. arXiv admin note: substantial text overlap with arXiv:2112.05928

  22. Textual Data Augmentation for Patient Outcomes Prediction

    Authors: Qiuhao Lu, Dejing Dou, Thien Huu Nguyen

    Abstract: Deep learning models have demonstrated superior performance in various healthcare applications. However, the major limitation of these deep models is usually the lack of high-quality training data due to the private and sensitive nature of this field. In this study, we propose a novel textual data augmentation method to generate artificial clinical notes in patients' Electronic Health Records (EHR… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

    Comments: BIBM 2021

  23. arXiv:2208.04360  [pdf, other

    cs.LG eess.SP

    SDWPF: A Dataset for Spatial Dynamic Wind Power Forecasting Challenge at KDD Cup 2022

    Authors: Jingbo Zhou, Xinjiang Lu, Yixiong Xiao, Jiantao Su, Junfu Lyu, Yanjun Ma, Dejing Dou

    Abstract: The variability of wind power supply can present substantial challenges to incorporating wind power into a grid system. Thus, Wind Power Forecasting (WPF) has been widely recognized as one of the most critical issues in wind power integration and operation. There has been an explosion of studies on wind power forecasting problems in the past decades. Nevertheless, how to well handle the WPF proble… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

  24. arXiv:2207.12730  [pdf, other

    cs.CV cs.LG

    P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos

    Authors: Jiang Bian, Xuhong Li, Tao Wang, Qingzhong Wang, Jun Huang, Chen Liu, Jun Zhao, Feixiang Lu, Dejing Dou, Haoyi Xiong

    Abstract: While deep learning has been widely used for video analytics, such as video classification and action detection, dense action detection with fast-moving subjects from sports videos is still challenging. In this work, we release yet another sports video benchmark \TheName{} for \emph{\underline{P}}ing \emph{\underline{P}}ong-\emph{\underline{A}}ction detection, which consists of 2,721 video clips c… ▽ More

    Submitted 26 March, 2024; v1 submitted 26 July, 2022; originally announced July 2022.

  25. arXiv:2207.07223  [pdf, ps, other

    cs.LG

    Accelerated Federated Learning with Decoupled Adaptive Optimization

    Authors: Jiayin Jin, Jiaxiang Ren, Yang Zhou, Lingjuan Lyu, Ji Liu, Dejing Dou

    Abstract: The federated learning (FL) framework enables edge clients to collaboratively learn a shared inference model while keeping privacy of training data on clients. Recently, many heuristics efforts have been made to generalize centralized adaptive optimization methods, such as SGDM, Adam, AdaGrad, etc., to federated settings for improving convergence and accuracy. However, there is still a paucity of… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Report number: 01

    Journal ref: ICML 2022

  26. arXiv:2207.06667  [pdf, other

    cs.DC cs.AI cs.LG

    Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources

    Authors: Ji Liu, Daxiang Dong, Xi Wang, An Qin, Xingjian Li, Patrick Valduriez, Dejing Dou, Dianhai Yu

    Abstract: Although more layers and more parameters generally improve the accuracy of the models, such big models generally have high computational complexity and require big memory, which exceed the capacity of small devices for inference and incurs long training time. In addition, it is difficult to afford long training time and inference time of big models even in high performance servers, as well. As an… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: To appear in Concurrency and Computation: Practice and Experience, 16 pages, 7 figures, 5 tables

  27. Distilling Ensemble of Explanations for Weakly-Supervised Pre-Training of Image Segmentation Models

    Authors: Xuhong Li, Haoyi Xiong, Yi Liu, Dingfu Zhou, Zeyu Chen, Yaqing Wang, Dejing Dou

    Abstract: While fine-tuning pre-trained networks has become a popular way to train image segmentation models, such backbone networks for image segmentation are frequently pre-trained using image classification source datasets, e.g., ImageNet. Though image classification datasets could provide the backbone networks with rich visual features and discriminative ability, they are incapable of fully pre-training… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted by Machine Learning

  28. arXiv:2207.01190  [pdf, other

    cs.LG

    Pareto Optimization for Active Learning under Out-of-Distribution Data Scenarios

    Authors: Xueying Zhan, Zeyu Dai, Qingzhong Wang, Qing Li, Haoyi Xiong, Dejing Dou, Antoni B. Chan

    Abstract: Pool-based Active Learning (AL) has achieved great success in minimizing labeling cost by sequentially selecting informative unlabeled samples from a large unlabeled data pool and querying their labels from oracle/annotators. However, existing AL sampling strategies might not work well in out-of-distribution (OOD) data scenarios, where the unlabeled data pool contains some data samples that do not… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  29. arXiv:2206.10546  [pdf, other

    cs.DC cs.AI

    FedHiSyn: A Hierarchical Synchronous Federated Learning Framework for Resource and Data Heterogeneity

    Authors: Guanghao Li, Yue Hu, Miao Zhang, Ji Liu, Quanjun Yin, Yong Peng, Dejing Dou

    Abstract: Federated Learning (FL) enables training a global model without sharing the decentralized raw data stored on multiple devices to protect data privacy. Due to the diverse capacity of the devices, FL frameworks struggle to tackle the problems of straggler effects and outdated models. In addition, the data heterogeneity incurs severe accuracy degradation of the global model in the FL training process… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 10 pages, to appear in ICPP'2022

  30. Improving Pre-trained Language Model Fine-tuning with Noise Stability Regularization

    Authors: Hang Hua, Xingjian Li, Dejing Dou, Cheng-Zhong Xu, Jiebo Luo

    Abstract: The advent of large-scale pre-trained language models has contributed greatly to the recent progress in natural language processing. Many state-of-the-art language models are first trained on a large text corpus and then fine-tuned on downstream tasks. Despite its recent success and wide adoption, fine-tuning a pre-trained language model often suffers from overfitting, which leads to poor generali… ▽ More

    Submitted 8 November, 2023; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: Accepted by TNNLS

  31. arXiv:2206.01038  [pdf, other

    cs.CV cs.AI cs.MM

    A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications

    Authors: Fei Wu, Qingzhong Wang, Jian Bian, Haoyi Xiong, Ning Ding, Feixiang Lu, Jun Cheng, Dejing Dou

    Abstract: To understand human behaviors, action recognition based on videos is a common approach. Compared with image-based action recognition, videos provide much more information. Reducing the ambiguity of actions and in the last decade, many works focused on datasets, novel models and learning approaches have improved video action recognition to a higher level. However, there are challenges and unsolved… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 26 pages. The toolbox is available at https://github.com/PaddlePaddle/PaddleVideo

  32. arXiv:2205.13359  [pdf, other

    cs.LG

    Feature Forgetting in Continual Representation Learning

    Authors: Xiao Zhang, Dejing Dou, Ji Wu

    Abstract: In continual and lifelong learning, good representation learning can help increase performance and reduce sample complexity when learning new tasks. There is evidence that representations do not suffer from "catastrophic forgetting" even in plain continual learning, but little further fact is known about its characteristics. In this paper, we aim to gain more understanding about representation lea… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

  33. A Simple yet Effective Framework for Active Learning to Rank

    Authors: Qingzhong Wang, Haifang Li, Haoyi Xiong, Wen Wang, Jiang Bian, Yu Lu, Shuaiqiang Wang, Zhicong Cheng, Dejing Dou, Dawei Yin

    Abstract: While China has become the biggest online market in the world with around 1 billion internet users, Baidu runs the world largest Chinese search engine serving more than hundreds of millions of daily active users and responding billions queries per day. To handle the diverse query requests from users at web-scale, Baidu has done tremendous efforts in understanding users' queries, retrieve relevant… ▽ More

    Submitted 13 February, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: This paper is accepted to Machine Intelligence Research and a short version is presented in NeurIPS 2022 Workshop on Human in the Loop Learning

  34. arXiv:2204.11536  [pdf, other

    cs.DC cs.AI

    FedDUAP: Federated Learning with Dynamic Update and Adaptive Pruning Using Shared Data on the Server

    Authors: Hong Zhang, Ji Liu, Juncheng Jia, Yang Zhou, Huaiyu Dai, Dejing Dou

    Abstract: Despite achieving remarkable performance, Federated Learning (FL) suffers from two critical challenges, i.e., limited computational resources and low training efficiency. In this paper, we propose a novel FL framework, i.e., FedDUAP, with two original contributions, to exploit the insensitive data on the server and the decentralized data in edge devices to further improve the training efficiency.… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: To appear in IJCAI, 7 pages, 4 figures, 2 tables

  35. arXiv:2204.04213  [pdf, other

    cs.LG cs.AI q-bio.QM

    Structure-aware Protein Self-supervised Learning

    Authors: Can Chen, Jingbo Zhou, Fan Wang, Xue Liu, Dejing Dou

    Abstract: Protein representation learning methods have shown great potential to yield useful representation for many downstream tasks, especially on protein classification. Moreover, a few recent studies have shown great promise in addressing insufficient labels of proteins with self-supervised learning methods. However, existing protein language models are usually pretrained on protein sequences without co… ▽ More

    Submitted 8 April, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted by Bioinformatics; 7 pages 4 figures

  36. arXiv:2203.13450  [pdf, other

    cs.LG

    A Comparative Survey of Deep Active Learning

    Authors: Xueying Zhan, Qingzhong Wang, Kuan-hao Huang, Haoyi Xiong, Dejing Dou, Antoni B. Chan

    Abstract: While deep learning (DL) is data-hungry and usually relies on extensive labeled data to deliver good performance, Active Learning (AL) reduces labeling costs by selecting a small proportion of samples from unlabeled data for labeling and training. Therefore, Deep Active Learning (DAL) has risen as a feasible solution for maximizing model performance under a limited labeling cost/budget in recent y… ▽ More

    Submitted 19 July, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: 24 pages

  37. arXiv:2203.00959  [pdf, other

    cs.CV

    Learning Moving-Object Tracking with FMCW LiDAR

    Authors: Yi Gu, Hongzhi Cheng, Kafeng Wang, Dejing Dou, Chengzhong Xu, Hui Kong

    Abstract: In this paper, we propose a learning-based moving-object tracking method utilizing our newly developed LiDAR sensor, Frequency Modulated Continuous Wave (FMCW) LiDAR. Compared with most existing commercial LiDAR sensors, our FMCW LiDAR can provide additional Doppler velocity information to each 3D point of the point clouds. Benefiting from this, we can generate instance labels as ground truth in a… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: Submitted to IROS 2022

  38. arXiv:2201.02510  [pdf, other

    cs.CL cs.AI

    Predicting Patient Readmission Risk from Medical Text via Knowledge Graph Enhanced Multiview Graph Convolution

    Authors: Qiuhao Lu, Thien Huu Nguyen, Dejing Dou

    Abstract: Unplanned intensive care unit (ICU) readmission rate is an important metric for evaluating the quality of hospital care. Efficient and accurate prediction of ICU readmission risk can not only help prevent patients from inappropriate discharge and potential dangers, but also reduce associated costs of healthcare. In this paper, we propose a new method that uses medical text of Electronic Health Rec… ▽ More

    Submitted 18 December, 2021; originally announced January 2022.

    Comments: SIGIR 2021

  39. arXiv:2112.07980  [pdf, other

    cs.DC

    Data Placement for Multi-Tenant Data Federation on the Cloud

    Authors: Ji Liu, Lei Mo, Sijia Yang, Jingbo Zhou, Shilei Ji, Haoyi Xiong, Dejing Dou

    Abstract: Due to privacy concerns of users and law enforcement in data security and privacy, it becomes more and more difficult to share data among organizations. Data federation brings new opportunities to the data-related cooperation among organizations by providing abstract data interfaces. With the development of cloud computing, organizations store data on the cloud to achieve elasticity and scalabilit… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 15 pages, 8 figures, 4 tables

  40. arXiv:2112.05928  [pdf, other

    cs.DC cs.AI eess.SY

    Efficient Device Scheduling with Multi-Job Federated Learning

    Authors: Chendi Zhou, Ji Liu, Juncheng Jia, Jingbo Zhou, Yang Zhou, Huaiyu Dai, Dejing Dou

    Abstract: Recent years have witnessed a large amount of decentralized data in multiple (edge) devices of end-users, while the aggregation of the decentralized data remains difficult for machine learning jobs due to laws or regulations. Federated Learning (FL) emerges as an effective approach to handling decentralized data without sharing the sensitive raw data, while collaboratively training global machine… ▽ More

    Submitted 15 December, 2021; v1 submitted 11 December, 2021; originally announced December 2021.

    Comments: 14 pages, 7 figures, 6 tables

  41. arXiv:2111.10635  [pdf, other

    cs.DC cs.AI cs.LG eess.SY

    HeterPS: Distributed Deep Learning With Reinforcement Learning Based Scheduling in Heterogeneous Environments

    Authors: Ji Liu, Zhihua Wu, Dianhai Yu, Yanjun Ma, Danlei Feng, Minxu Zhang, Xinxuan Wu, Xuefeng Yao, Dejing Dou

    Abstract: Deep neural networks (DNNs) exploit many layers and a large number of parameters to achieve excellent performance. The training process of DNN models generally handles large-scale input data with many sparse features, which incurs high Input/Output (IO) cost, while some layers are compute-intensive. The training process generally exploits distributed computing resources to reduce training time. In… ▽ More

    Submitted 7 June, 2023; v1 submitted 20 November, 2021; originally announced November 2021.

    Comments: 14 pages, 11 figures, 2 tables; To appear in Future Generation Computer Systems (FGCS)

  42. arXiv:2111.00180  [pdf, other

    cs.CL cs.AI

    Hierarchical Heterogeneous Graph Representation Learning for Short Text Classification

    Authors: Yaqing Wang, Song Wang, Quanming Yao, Dejing Dou

    Abstract: Short text classification is a fundamental task in natural language processing. It is hard due to the lack of context information and labeled data in practice. In this paper, we propose a new method called SHINE, which is based on graph neural network (GNN), for short text classification. First, we model the short text dataset as a hierarchical heterogeneous graph consisting of word-level componen… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: Accepted to EMNLP 2021

  43. arXiv:2111.00056  [pdf, other

    cs.CV cs.IT cs.LG

    Generalized Data Weighting via Class-level Gradient Manipulation

    Authors: Can Chen, Shuhao Zheng, Xi Chen, Erqun Dong, Xue Liu, Hao Liu, Dejing Dou

    Abstract: Label noise and class imbalance are two major issues coexisting in real-world datasets. To alleviate the two issues, state-of-the-art methods reweight each instance by leveraging a small amount of clean and unbiased data. Yet, these methods overlook class-level information within each instance, which can be further utilized to improve performance. To this end, in this paper, we propose Generalized… ▽ More

    Submitted 29 October, 2021; originally announced November 2021.

    Comments: 17 pages, 8 figures, accepted by NeurIPS 2021 for a poster session, camera-ready version, initial submission to arXiv

  44. SenseMag: Enabling Low-Cost Traffic Monitoring using Non-invasive Magnetic Sensing

    Authors: Kafeng Wang, Haoyi Xiong, Jie Zhang, Hongyang Chen, Dejing Dou, Cheng-Zhong Xu

    Abstract: The operation and management of intelligent transportation systems (ITS), such as traffic monitoring, relies on real-time data aggregation of vehicular traffic information, including vehicular types (e.g., cars, trucks, and buses), in the critical roads and highways. While traditional approaches based on vehicular-embedded GPS sensors or camera networks would either invade drivers' privacy or requ… ▽ More

    Submitted 24 October, 2021; originally announced October 2021.

    Comments: Accepted by IEEE Internet of Things Journal

  45. arXiv:2110.03273  [pdf, other

    cs.LG stat.ML

    AgFlow: Fast Model Selection of Penalized PCA via Implicit Regularization Effects of Gradient Flow

    Authors: Haiyan Jiang, Haoyi Xiong, Dongrui Wu, Ji Liu, Dejing Dou

    Abstract: Principal component analysis (PCA) has been widely used as an effective technique for feature extraction and dimension reduction. In the High Dimension Low Sample Size (HDLSS) setting, one may prefer modified principal components, with penalized loadings, and automated penalty selection by implementing model selection among these different models with varying penalties. The earlier work [1, 2] has… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: accepted by Machine Learning

  46. arXiv:2110.02863  [pdf, other

    cs.LG cs.AI cs.CV

    Exploring the Common Principal Subspace of Deep Features in Neural Networks

    Authors: Haoran Liu, Haoyi Xiong, Yaqing Wang, Haozhe An, Dongrui Wu, Dejing Dou

    Abstract: We find that different Deep Neural Networks (DNNs) trained with the same dataset share a common principal subspace in latent spaces, no matter in which architectures (e.g., Convolutional Neural Networks (CNNs), Multi-Layer Preceptors (MLPs) and Autoencoders (AEs)) the DNNs were built or even whether labels have been used in training (e.g., supervised, unsupervised, and self-supervised learning). S… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: Main Text with Appendix, accepted by Machine Learning

  47. arXiv:2109.11730  [pdf, other

    cs.LG

    GeomGCL: Geometric Graph Contrastive Learning for Molecular Property Prediction

    Authors: Shuangli Li, Jingbo Zhou, Tong Xu, Dejing Dou, Hui Xiong

    Abstract: Recently many efforts have been devoted to applying graph neural networks (GNNs) to molecular property prediction which is a fundamental task for computational drug and material discovery. One of major obstacles to hinder the successful prediction of molecule property by GNNs is the scarcity of labeled data. Though graph contrastive learning (GCL) methods have achieved extraordinary performance wi… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

  48. arXiv:2109.00707  [pdf, ps, other

    cs.LG

    Cross-Model Consensus of Explanations and Beyond for Image Classification Models: An Empirical Study

    Authors: Xuhong Li, Haoyi Xiong, Siyu Huang, Shilei Ji, Dejing Dou

    Abstract: Existing interpretation algorithms have found that, even deep models make the same and right predictions on the same image, they might rely on different sets of input features for classification. However, among these sets of features, some common features might be used by the majority of models. In this paper, we are wondering what are the common features used by various models for classification… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

  49. arXiv:2107.14153  [pdf, other

    cs.CV cs.LG

    Semi-Supervised Active Learning with Temporal Output Discrepancy

    Authors: Siyu Huang, Tianyang Wang, Haoyi Xiong, Jun Huan, Dejing Dou

    Abstract: While deep learning succeeds in a wide range of tasks, it highly depends on the massive collection of annotated data which is expensive and time-consuming. To lower the cost of data annotation, active learning has been proposed to interactively query an oracle to annotate a small proportion of informative samples in an unlabeled dataset. Inspired by the fact that the samples with higher loss are u… ▽ More

    Submitted 29 July, 2021; originally announced July 2021.

    Comments: ICCV 2021. Code is available at https://github.com/siyuhuang/TOD

  50. arXiv:2107.10670  [pdf, other

    q-bio.QM cs.LG

    Structure-aware Interactive Graph Neural Networks for the Prediction of Protein-Ligand Binding Affinity

    Authors: Shuangli Li, Jingbo Zhou, Tong Xu, Liang Huang, Fan Wang, Haoyi Xiong, Weili Huang, Dejing Dou, Hui Xiong

    Abstract: Drug discovery often relies on the successful prediction of protein-ligand binding affinity. Recent advances have shown great promise in applying graph neural networks (GNNs) for better affinity prediction by learning the representations of protein-ligand complexes. However, existing solutions usually treat protein-ligand complexes as topological graph data, thus the biomolecular structural inform… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: 11 pages, 8 figures, Accepted by KDD 2021 (Research Track)