Skip to main content

Showing 1–50 of 173 results for author: Dong, M

  1. arXiv:2406.07011  [pdf, ps, other

    cs.CR

    Breaking Free: Efficient Multi-Party Private Set Union Without Non-Collusion Assumptions

    Authors: Minglang Dong, Yu Chen, Cong Zhang, Yujie Bai

    Abstract: Multi-party private set union (MPSU) protocol enables $m$ $(m > 2)$ parties, each holding a set, to collectively compute the union of their sets without revealing any additional information to other parties. There are two main categories of MPSU protocols: The first builds on public-key techniques. All existing works in this category involve a super-linear number of public-key operations, resultin… ▽ More

    Submitted 1 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2405.14174  [pdf, other

    cs.CV

    Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model

    Authors: Yuheng Shi, Minjing Dong, Chang Xu

    Abstract: Despite the significant achievements of Vision Transformers (ViTs) in various vision tasks, they are constrained by the quadratic complexity. Recently, State Space Models (SSMs) have garnered widespread attention due to their global receptive field and linear complexity with respect to the input length, demonstrating substantial potential across fields including natural language processing and com… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2405.07527  [pdf, other

    cs.LG cs.AI

    Train Faster, Perform Better: Modular Adaptive Training in Over-Parameterized Models

    Authors: Yubin Shi, Yixuan Chen, Mingzhi Dong, Xiaochen Yang, Dongsheng Li, Yujiang Wang, Robert P. Dick, Qin Lv, Yingying Zhao, Fan Yang, Tun Lu, Ning Gu, Li Shang

    Abstract: Despite their prevalence in deep-learning communities, over-parameterized models convey high demands of computational costs for proper training. This work studies the fine-grained, modular-level learning dynamics of over-parameterized models to attain a more efficient and fruitful training strategy. Empirical evidence reveals that when scaling down into network modules, such as heads in self-atten… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted at NeurIPS 2023

  4. arXiv:2405.00527  [pdf, other

    cs.DB

    ChatBI: Towards Natural Language to Complex Business Intelligence SQL

    Authors: Jinqing Lian, Xinyi Liu, Yingxia Shao, Yang Dong, Ming Wang, Zhang Wei, Tianqi Wan, Ming Dong, Hailin Yan

    Abstract: The Natural Language to SQL (NL2SQL) technology provides non-expert users who are unfamiliar with databases the opportunity to use SQL for data analysis.Converting Natural Language to Business Intelligence (NL2BI) is a popular practical scenario for NL2SQL in actual production systems. Compared to NL2SQL, NL2BI introduces more challenges. In this paper, we propose ChatBI, a comprehensive and eff… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  5. arXiv:2404.02106  [pdf, other

    cs.CV cs.CE

    Neural Ordinary Differential Equation based Sequential Image Registration for Dynamic Characterization

    Authors: Yifan Wu, Mengjin Dong, Rohit Jena, Chen Qin, James C. Gee

    Abstract: Deformable image registration (DIR) is crucial in medical image analysis, enabling the exploration of biological dynamics such as organ motions and longitudinal changes in imaging. Leveraging Neural Ordinary Differential Equations (ODE) for registration, this extension work discusses how this framework can aid in the characterization of sequential biological processes. Utilizing the Neural ODE's a… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Journal extension of NODEO: A Neural Ordinary Differential Equation Based Optimization Framework for Deformable Image Registration, CVPR 2022

  6. arXiv:2403.10927  [pdf, ps, other

    cs.IT cs.LG

    Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC

    Authors: Yang Huang, Miaomiao Dong, Yijie Mao, Wenqiang Liu, Zhen Gao

    Abstract: Utilizing unmanned aerial vehicles (UAVs) with edge server to assist terrestrial mobile edge computing (MEC) has attracted tremendous attention. Nevertheless, state-of-the-art schemes based on deterministic optimizations or single-objective reinforcement learning (RL) cannot reduce the backlog of task bits and simultaneously improve energy efficiency in highly dynamic network environments, where t… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted for publication in the IEEE Transactions on Vehicular Technology

  7. arXiv:2403.10002  [pdf, ps, other

    cs.IT eess.SP

    Fast Group Scheduling for Downlink Large-Scale Multi-Group Multicast Beamforming

    Authors: Chong Zhang, Min Dong, Ben Liang, Ali Afana, Yahia Ahmed

    Abstract: Next-generation wireless networks need to handle massive user access effectively. This paper addresses the problem of joint group scheduling and multicast beamforming for downlink transmission with many active user groups. Aiming to maximize the minimum user throughput, we propose a three-phase approach to tackle this difficult joint optimization problem efficiently. In Phase 1, we utilize the opt… ▽ More

    Submitted 24 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 13 pages, 8 figures

  8. arXiv:2403.08492  [pdf, other

    cs.CL

    Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking

    Authors: Ming Dong, Yujing Chen, Miao Zhang, Hao Sun, Tingting He

    Abstract: Chinese Spell Checking (CSC) is a widely used technology, which plays a vital role in speech to text (STT) and optical character recognition (OCR). Most of the existing CSC approaches relying on BERT architecture achieve excellent performance. However, limited by the scale of the foundation model, BERT-based method does not work well in few-shot scenarios, showing certain limitations in practical… ▽ More

    Submitted 7 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  9. arXiv:2403.08484  [pdf, other

    cs.CL

    Data-oriented Dynamic Fine-tuning Parameter Selection Strategy for FISH Mask based Efficient Fine-tuning

    Authors: Ming Dong, Kang Xue, Bolong Zheng, Tingting He

    Abstract: In view of the huge number of parameters of Large language models (LLMs) , tuning all parameters is very costly, and accordingly fine-tuning specific parameters is more sensible. Most of parameter efficient fine-tuning (PEFT) concentrate on parameter selection strategies, such as additive method, selective method and reparametrization-based method. However, there are few methods that consider the… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  10. arXiv:2403.01450  [pdf, other

    cs.RO

    Collision-Free Robot Navigation in Crowded Environments using Learning based Convex Model Predictive Control

    Authors: Zhuanglei Wen, Mingze Dong, Xiai Chen

    Abstract: Navigating robots safely and efficiently in crowded and complex environments remains a significant challenge. However, due to the dynamic and intricate nature of these settings, planning efficient and collision-free paths for robots to track is particularly difficult. In this paper, we uniquely bridge the robot's perception, decision-making and control processes by utilizing the convex obstacle-fr… ▽ More

    Submitted 14 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  11. arXiv:2402.15721  [pdf, other

    cs.AI cs.CL

    Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models

    Authors: Chaoya Jiang, Wei Ye, Mengfan Dong, Hongrui Jia, Haiyang Xu, Ming Yan, Ji Zhang, Shikun Zhang

    Abstract: Large Vision Language Models exhibit remarkable capabilities but struggle with hallucinations inconsistencies between images and their descriptions. Previous hallucination evaluation studies on LVLMs have identified hallucinations in terms of objects, attributes, and relations but overlooked complex hallucinations that create an entire narrative around a fictional entity. In this paper, we introdu… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  12. arXiv:2402.14140  [pdf, other

    cs.CR

    QuantTM: Business-Centric Threat Quantification for Risk Management and Cyber Resilience

    Authors: Jan von der Assen, Muriel F. Franco, Muyao Dong, Burkhard Stiller

    Abstract: Threat modeling has emerged as a key process for understanding relevant threats within businesses. However, understanding the importance of threat events is rarely driven by the business incorporating the system. Furthermore, prioritization of threat events often occurs based on abstract and qualitative scoring. While such scores enable prioritization, they do not allow the results to be easily in… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  13. arXiv:2401.17585  [pdf, other

    cs.CL cs.AI cs.LG stat.ME

    Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks

    Authors: Wenyue Hua, Jiang Guo, Mingwen Dong, Henghui Zhu, Patrick Ng, Zhiguo Wang

    Abstract: Current approaches of knowledge editing struggle to effectively propagate updates to interconnected facts. In this work, we delve into the barriers that hinder the appropriate propagation of updated knowledge within these models for accurate reasoning. To support our analysis, we introduce a novel reasoning-based benchmark -- ReCoE (Reasoning-based Counterfactual Editing dataset) -- which covers s… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 22 pages, 14 figures, 5 tables

  14. arXiv:2401.00594  [pdf, ps, other

    eess.SP cs.IT

    Efficient Design for Multi-user Downlink Beamforming with Reconfigurable Intelligent Surface

    Authors: Mohammad Ebrahimi, Min Dong

    Abstract: This paper considers downlink multi-user transmission facilitated by a reconfigurable intelligent surface (RIS). First, focusing on the multi-group multicast beamforming scenario, we develop a fast and scalable algorithm for the joint base station (BS) and RIS beamforming optimization to minimize the transmit power subject to the user quality-of-service (QoS) constraints. By exploring the structur… ▽ More

    Submitted 29 February, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: 13 pages, 10 figures

  15. arXiv:2312.16902  [pdf, other

    cs.CV

    Joint Learning for Scattered Point Cloud Understanding with Hierarchical Self-Distillation

    Authors: Kaiyue Zhou, Ming Dong, Peiyuan Zhi, Shengjin Wang

    Abstract: Numerous point-cloud understanding techniques focus on whole entities and have succeeded in obtaining satisfactory results and limited sparsity tolerance. However, these methods are generally sensitive to incomplete point clouds that are scanned with flaws or large gaps. To address this issue, in this paper, we propose an end-to-end architecture that compensates for and identifies partial point cl… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Currently under review. Previously submitted to AAAI and got frustrated. Decisions: 1x weak reject, 2x weak accept, and 1 accept

  16. arXiv:2312.15186  [pdf, other

    cs.DC cs.AI cs.LG

    Efficient Asynchronous Federated Learning with Sparsification and Quantization

    Authors: Juncheng Jia, Ji Liu, Chendi Zhou, Hao Tian, Mianxiong Dong, Dejing Dou

    Abstract: While data is distributed in multiple edge devices, Federated Learning (FL) is attracting more and more attention to collaboratively train a machine learning model without transferring raw data. FL generally exploits a parameter server and a large number of edge devices during the whole process of the model training, while several devices are selected in each round. However, straggler devices may… ▽ More

    Submitted 6 January, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: To appear in Concurrency and Computation: Practice and Experience (CCPE), 21 pages

  17. arXiv:2312.13424  [pdf, ps, other

    cs.IT eess.SP

    Multi-Model Wireless Federated Learning with Downlink Beamforming

    Authors: Chong Zhang, Min Dong, Ben Liang, Ali Afana, Yahia Ahmed

    Abstract: This paper studies the design of wireless federated learning (FL) for simultaneously training multiple machine learning models. We consider round robin device-model assignment and downlink beamforming for concurrent multiple model updates. After formulating the joint downlink-uplink transmission process, we derive the per-model global update expression over communication rounds, capturing the effe… ▽ More

    Submitted 14 January, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 6 pages, 4 figures. Accepted by IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

  18. arXiv:2312.12358  [pdf, other

    cs.IT eess.SP

    Localization and Discrete Beamforming with a Large Reconfigurable Intelligent Surface

    Authors: Baojia Luo, Yili Deng, Miaomiao Dong, Zhongyi Huang, Xiang Chen, Wei Han, Bo Bai

    Abstract: In millimeter-wave (mmWave) cellular systems, reconfigurable intelligent surfaces (RISs) are foreseeably deployed with a large number of reflecting elements to achieve high beamforming gains. The large-sized RIS will make radio links fall in the near-field localization regime with spatial non-stationarity issues. Moreover, the discrete phase restriction on the RIS reflection coefficient incurs exp… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 13 pages

  19. arXiv:2312.06968  [pdf, other

    cs.CV

    Hallucination Augmented Contrastive Learning for Multimodal Large Language Model

    Authors: Chaoya Jiang, Haiyang Xu, Mengfan Dong, Jiaxing Chen, Wei Ye, Ming Yan, Qinghao Ye, Ji Zhang, Fei Huang, Shikun Zhang

    Abstract: Multi-modal large language models (MLLMs) have been shown to efficiently integrate natural language with visual information to handle multi-modal tasks. However, MLLMs still face a fundamental limitation of hallucinations, where they tend to generate erroneous or fabricated information. In this paper, we address hallucinations in MLLMs from a novel perspective of representation learning. We first… ▽ More

    Submitted 23 February, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  20. arXiv:2312.05219  [pdf, other

    cs.CV

    Enhancing Facial Classification and Recognition using 3D Facial Models and Deep Learning

    Authors: Houting Li, Mengxuan Dong, Lok Ming Lui

    Abstract: Accurate analysis and classification of facial attributes are essential in various applications, from human-computer interaction to security systems. In this work, a novel approach to enhance facial classification and recognition tasks through the integration of 3D facial models with deep learning methods was proposed. We extract the most useful information for various tasks using the 3D Facial Mo… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:1903.08527 by other authors

  21. arXiv:2311.18251  [pdf, other

    cs.HC

    Can Large Language Models Be Good Companions? An LLM-Based Eyewear System with Conversational Common Ground

    Authors: Zhenyu Xu, Hailin Xu, Zhouyang Lu, Yingying Zhao, Rui Zhu, Yujiang Wang, Mingzhi Dong, Yuhu Chang, Qin Lv, Robert P. Dick, Fan Yang, Tun Lu, Ning Gu, Li Shang

    Abstract: Developing chatbots as personal companions has long been a goal of artificial intelligence researchers. Recent advances in Large Language Models (LLMs) have delivered a practical solution for endowing chatbots with anthropomorphic language capabilities. However, it takes more than LLMs to enable chatbots that can act as companions. Humans use their understanding of individual personalities to driv… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 36 pages, 25 figures, Under review at ACM IMWUT

  22. arXiv:2310.18619  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Dense Retrieval as Indirect Supervision for Large-space Decision Making

    Authors: Nan Xu, Fei Wang, Mingtao Dong, Muhao Chen

    Abstract: Many discriminative natural language understanding (NLU) tasks have large label spaces. Learning such a process of large-space decision making is particularly challenging due to the lack of training instances per label and the difficulty of selection among many fine-grained labels. Inspired by dense retrieval methods for passage finding in open-domain QA, we propose a reformulation of large-space… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (Findings)

  23. arXiv:2310.14784  [pdf, other

    cs.LG cs.AI

    An Efficient Imbalance-Aware Federated Learning Approach for Wearable Healthcare with Autoregressive Ratio Observation

    Authors: Wenhao Yan, He Li, Kaoru Ota, Mianxiong Dong

    Abstract: Widely available healthcare services are now getting popular because of advancements in wearable sensing techniques and mobile edge computing. People's health information is collected by edge devices such as smartphones and wearable bands for further analysis on servers, then send back suggestions and alerts for abnormal conditions. The recent emergence of federated learning allows users to train… ▽ More

    Submitted 30 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: submitted to IEEE OJCS in Oct. 2023, under review

  24. arXiv:2310.07143  [pdf, other

    cs.LG

    Imitation Learning from Purified Demonstration

    Authors: Yunke Wang, Minjing Dong, Bo Du, Chang Xu

    Abstract: Imitation learning has emerged as a promising approach for addressing sequential decision-making problems, with the assumption that expert demonstrations are optimal. However, in real-world scenarios, expert demonstrations are often imperfect, leading to challenges in effectively applying imitation learning. While existing research has focused on optimizing with imperfect demonstrations, the train… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  25. arXiv:2310.03142  [pdf, ps, other

    cs.IT

    Design and Optimization of Heterogeneous Coded Distributed Computing with Nonuniform File Popularity

    Authors: Yong Deng, Min Dong

    Abstract: This paper studies MapReduce-based heterogeneous coded distributed computing (CDC) where, besides different computing capabilities at workers, input files to be accessed by computing jobs have nonuniform popularity. We propose a file placement strategy that can handle an arbitrary number of input files. Furthermore, we design a nested coded shuffling strategy that can efficiently manage the nonuni… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 15 pages, 7 figures, 3 tables

  26. arXiv:2309.16207  [pdf, other

    cs.CV

    Parameter-Saving Adversarial Training: Reinforcing Multi-Perturbation Robustness via Hypernetworks

    Authors: Huihui Gong, Minjing Dong, Siqi Ma, Seyit Camtepe, Surya Nepal, Chang Xu

    Abstract: Adversarial training serves as one of the most popular and effective methods to defend against adversarial perturbations. However, most defense mechanisms only consider a single type of perturbation while various attack methods might be adopted to perform stronger adversarial attacks against the deployed model in real-world scenarios, e.g., $\ell_2$ or $\ell_\infty$. Defending against various atta… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 9 pages, 2 figures

  27. arXiv:2309.11133  [pdf, other

    cs.CV

    Shape Anchor Guided Holistic Indoor Scene Understanding

    Authors: Mingyue Dong, Linxi Huan, Hanjiang Xiong, Shuhan Shen, Xianwei Zheng

    Abstract: This paper proposes a shape anchor guided learning strategy (AncLearn) for robust holistic indoor scene understanding. We observe that the search space constructed by current methods for proposal feature grouping and instance point sampling often introduces massive noise to instance detection and mesh reconstruction. Accordingly, we develop AncLearn to generate anchors that dynamically fit instanc… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  28. arXiv:2309.09480  [pdf, other

    cs.CV

    Stealthy Physical Masked Face Recognition Attack via Adversarial Style Optimization

    Authors: Huihui Gong, Minjing Dong, Siqi Ma, Seyit Camtepe, Surya Nepal, Chang Xu

    Abstract: Deep neural networks (DNNs) have achieved state-of-the-art performance on face recognition (FR) tasks in the last decade. In real scenarios, the deployment of DNNs requires taking various face accessories into consideration, like glasses, hats, and masks. In the COVID-19 pandemic era, wearing face masks is one of the most effective ways to defend against the novel coronavirus. However, DNNs are kn… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 11 pages, 7 figures

  29. arXiv:2309.07581  [pdf, ps, other

    cs.AR

    A Survey of Graph Pre-processing Methods: From Algorithmic to Hardware Perspectives

    Authors: Zhengyang Lv, Mingyu Yan, Xin Liu, Mengyao Dong, Xiaochun Ye, Dongrui Fan, Ninghui Sun

    Abstract: Graph-related applications have experienced significant growth in academia and industry, driven by the powerful representation capabilities of graph. However, efficiently executing these applications faces various challenges, such as load imbalance, random memory access, etc. To address these challenges, researchers have proposed various acceleration systems, including software frameworks and hard… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  30. arXiv:2308.11838  [pdf, other

    cs.LG cs.AI stat.ML

    A Benchmark Study on Calibration

    Authors: Linwei Tao, Younan Zhu, Haolan Guo, Minjing Dong, Chang Xu

    Abstract: Deep neural networks are increasingly utilized in various machine learning tasks. However, as these models grow in complexity, they often face calibration issues, despite enhanced prediction accuracy. Many studies have endeavored to improve calibration performance through the use of specific loss functions, data preprocessing and training frameworks. Yet, investigations into calibration properties… ▽ More

    Submitted 22 March, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: ICLR 2024 poster

  31. arXiv:2307.13209  [pdf

    cs.RO cs.AI

    Gait Cycle-Inspired Learning Strategy for Continuous Prediction of Knee Joint Trajectory from sEMG

    Authors: Xueming Fu, Hao Zheng, Luyan Liu, Wenjuan Zhong, Haowen Liu, Wenxuan Xiong, Yuyang Zhang, Yifeng Chen, Dong Wei, Mingjie Dong, Yefeng Zheng, Mingming Zhang

    Abstract: Predicting lower limb motion intent is vital for controlling exoskeleton robots and prosthetic limbs. Surface electromyography (sEMG) attracts increasing attention in recent years as it enables ahead-of-time prediction of motion intentions before actual movement. However, the estimation performance of human joint trajectory remains a challenging problem due to the inter- and intra-subject variatio… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

  32. arXiv:2307.10653  [pdf, other

    cs.LG

    Refining the Optimization Target for Automatic Univariate Time Series Anomaly Detection in Monitoring Services

    Authors: Manqing Dong, Zhanxiang Zhao, Yitong Geng, Wentao Li, Wei Wang, Huai Jiang

    Abstract: Time series anomaly detection is crucial for industrial monitoring services that handle a large volume of data, aiming to ensure reliability and optimize system performance. Existing methods often require extensive labeled resources and manual parameter selection, highlighting the need for automation. This paper proposes a comprehensive framework for automatic parameter optimization in time series… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted by 2023 IJCAI Workshop

  33. arXiv:2307.07919  [pdf, other

    cs.AI

    Neural Architecture Retrieval

    Authors: Xiaohuan Pei, Yanxi Li, Minjing Dong, Chang Xu

    Abstract: With the increasing number of new neural architecture designs and substantial existing neural architectures, it becomes difficult for the researchers to situate their contributions compared with existing neural architectures or establish the connections between their designs and other relevant ones. To discover similar neural architectures in an efficient and automatic manner, we define a new prob… ▽ More

    Submitted 17 March, 2024; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: ICLR 2024

  34. arXiv:2307.00315  [pdf, ps, other

    cs.IT

    Joint Downlink-Uplink Beamforming for Wireless Multi-Antenna Federated Learning

    Authors: Chong Zhang, Min Dong, Ben Liang, Ali Afana, Yahia Ahmed

    Abstract: We study joint downlink-uplink beamforming design for wireless federated learning (FL) with a multi-antenna base station. Considering analog transmission over noisy channels and uplink over-the-air aggregation, we derive the global model update expression over communication rounds. We then obtain an upper bound on the expected global loss function, capturing the downlink and uplink beamforming and… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 8 pages, 3 figures. Accepted by International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt), 2023

  35. arXiv:2306.03730  [pdf, other

    eess.IV cs.CV

    Modality-Agnostic Learning for Medical Image Segmentation Using Multi-modality Self-distillation

    Authors: Qisheng He, Nicholas Summerfield, Ming Dong, Carri Glide-Hurst

    Abstract: Medical image segmentation of tumors and organs at risk is a time-consuming yet critical process in the clinic that utilizes multi-modality imaging (e.g, different acquisitions, data types, and sequences) to increase segmentation precision. In this paper, we propose a novel framework, Modality-Agnostic learning through Multi-modality Self-dist-illation (MAG-MS), to investigate the impact of input… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  36. arXiv:2306.03271  [pdf, other

    eess.IV cs.CV

    Dual self-distillation of U-shaped networks for 3D medical image segmentation

    Authors: Soumyanil Banerjee, Ming Dong, Carri Glide-Hurst

    Abstract: U-shaped networks and its variants have demonstrated exceptional results for medical image segmentation. In this paper, we propose a novel dual self-distillation (DSD) framework for U-shaped networks for 3D medical image segmentation. DSD distills knowledge from the ground-truth segmentation labels to the decoder layers and also between the encoder and decoder layers of a single U-shaped network.… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 12 pages, 5 figures, 3 tables

  37. arXiv:2305.16265  [pdf, other

    cs.CL

    UNITE: A Unified Benchmark for Text-to-SQL Evaluation

    Authors: Wuwei Lan, Zhiguo Wang, Anuj Chauhan, Henghui Zhu, Alexander Li, Jiang Guo, Sheng Zhang, Chung-Wei Hang, Joseph Lilien, Yiqun Hu, Lin Pan, Mingwen Dong, Jun Wang, Jiarong Jiang, Stephen Ash, Vittorio Castelli, Patrick Ng, Bing Xiang

    Abstract: A practical text-to-SQL system should generalize well on a wide variety of natural language questions, unseen database schemas, and novel SQL query structures. To comprehensively evaluate text-to-SQL systems, we introduce a UNIfied benchmark for Text-to-SQL Evaluation (UNITE). It is composed of publicly available text-to-SQL datasets, containing natural language questions from more than 12 domains… ▽ More

    Submitted 14 July, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 5 pages

  38. Wital: A COTS WiFi Devices Based Vital Signs Monitoring System Using NLOS Sensing Model

    Authors: Xiang Zhang, Yu Gu, Huan Yan, Yantong Wang, Mianxiong Dong, Kaoru Ota, Fuji Ren, Yusheng Ji

    Abstract: Vital sign (breathing and heartbeat) monitoring is essential for patient care and sleep disease prevention. Most current solutions are based on wearable sensors or cameras; however, the former could affect sleep quality, while the latter often present privacy concerns. To address these shortcomings, we propose Wital, a contactless vital sign monitoring system based on low-cost and widespread comme… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by IEEE THMS

    Journal ref: IEEE Transactions on Human-Machine Systems,2023

  39. arXiv:2305.13665  [pdf, other

    cs.CV cs.AI

    Dual Focal Loss for Calibration

    Authors: Linwei Tao, Minjing Dong, Chang Xu

    Abstract: The use of deep neural networks in real-world applications require well-calibrated networks with confidence scores that accurately reflect the actual probability. However, it has been found that these networks often provide over-confident predictions, which leads to poor calibration. Recent efforts have sought to address this issue by focal loss to reduce over-confidence, but this approach can als… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: ICML 2023 Accept

  40. arXiv:2305.13591  [pdf, other

    cs.RO

    A Single Multi-Task Deep Neural Network with a Multi-Scale Feature Aggregation Mechanism for Manipulation Relationship Reasoning in Robotic Grasping

    Authors: Mingshuai Dong, Yuxuan Bai, Shimin Wei, Xiuli Yu

    Abstract: Grasping specific objects in complex and irregularly stacked scenes is still challenging for robotics. Because the robot is not only required to identify the object's grasping posture but also needs to reason the manipulation relationship between the objects. In this paper, we propose a manipulation relationship reasoning network with a multi-scale feature aggregation (MSFA) mechanism for robot gr… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  41. arXiv:2305.09381  [pdf, other

    cs.MM

    AMD: Autoregressive Motion Diffusion

    Authors: Bo Han, Hao Peng, Minjing Dong, Yi Ren, Yixuan Shen, Chang Xu

    Abstract: Human motion generation aims to produce plausible human motion sequences according to various conditional inputs, such as text or audio. Despite the feasibility of existing methods in generating motion based on short prompts and simple motion patterns, they encounter difficulties when dealing with long prompts or complex motions. The challenges are two-fold: 1) the scarcity of human motion-capture… ▽ More

    Submitted 26 December, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: accepted by AAAI2024

  42. arXiv:2305.08348  [pdf, other

    cs.CL

    Coreference-aware Double-channel Attention Network for Multi-party Dialogue Reading Comprehension

    Authors: Yanling Li, Bowei Zou, Yifan Fan, Mengxing Dong, Yu Hong

    Abstract: We tackle Multi-party Dialogue Reading Comprehension (abbr., MDRC). MDRC stands for an extractive reading comprehension task grounded on a batch of dialogues among multiple interlocutors. It is challenging due to the requirement of understanding cross-utterance contexts and relationships in a multi-turn multi-party conversation. Previous studies have made great efforts on the utterance profiling o… ▽ More

    Submitted 22 May, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: IJCNN2023

  43. arXiv:2305.03711  [pdf, other

    cs.LG cs.CY

    Medical records condensation: a roadmap towards healthcare data democratisation

    Authors: Yujiang Wang, Anshul Thakur, Mingzhi Dong, Pingchuan Ma, Stavros Petridis, Li Shang, Tingting Zhu, David A. Clifton

    Abstract: The prevalence of artificial intelligence (AI) has envisioned an era of healthcare democratisation that promises every stakeholder a new and better way of life. However, the advancement of clinical AI research is significantly hurdled by the dearth of data democratisation in healthcare. To truly democratise data for AI studies, challenges are two-fold: 1. the sensitive information in clinical data… ▽ More

    Submitted 8 January, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  44. arXiv:2305.01670  [pdf

    cs.HC cs.AI cs.CY

    Fears about AI-mediated communication are grounded in different expectations for one's own versus others' use

    Authors: Zoe A. Purcell, Mengchen Dong, Anne-Marie Nussberger, Nils Köbis, Maurice Jakesch

    Abstract: The rapid development of AI-mediated communication technologies (AICTs), which are digital tools that use AI to augment interpersonal messages, has raised concerns about the future of interpersonal trust and prompted discussions about disclosure and uptake. This paper contributes to this discussion by assessing perceptions about the acceptability and use of open and secret AICTs for oneself and ot… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  45. arXiv:2304.04673  [pdf

    q-bio.NC cs.AI

    Regional Deep Atrophy: a Self-Supervised Learning Method to Automatically Identify Regions Associated With Alzheimer's Disease Progression From Longitudinal MRI

    Authors: Mengjin Dong, Long Xie, Sandhitsu R. Das, Jiancong Wang, Laura E. M. Wisse, Robin deFlores, David A. Wolk, Paul A. Yushkevich

    Abstract: Longitudinal assessment of brain atrophy, particularly in the hippocampus, is a well-studied biomarker for neurodegenerative diseases, such as Alzheimer's disease (AD). In clinical trials, estimation of brain progressive rates can be applied to track therapeutic efficacy of disease modifying treatments. However, most state-of-the-art measurements calculate changes directly by segmentation and/or d… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Submitted to NeuroImage for review

  46. RGB-T Tracking Based on Mixed Attention

    Authors: Yang Luo, Xiqing Guo, Mingtao Dong, Jin Yu

    Abstract: RGB-T tracking involves the use of images from both visible and thermal modalities. The primary objective is to adaptively leverage the relatively dominant modality in varying conditions to achieve more robust tracking compared to single-modality tracking. An RGB-T tracker based on mixed attention mechanism to achieve complementary fusion of modalities (referred to as MACFT) is proposed in this pa… ▽ More

    Submitted 17 April, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: 14 pages, 10 figures

    Journal ref: Sensors 23, no. 14: 6609 (2023)

  47. Decentralized Caching under Nonuniform File Popularity and Size: Memory-Rate Tradeoff Characterization

    Authors: Yong Deng, Min Dong

    Abstract: This paper aims to characterize the memory-rate tradeoff for decentralized caching under nonuniform file popularity and size. We consider a recently proposed decentralized modified coded caching scheme (D-MCCS) and formulate the cache placement optimization problem to minimize the average rate for the D-MCCS. To solve this challenging non-convex optimization problem, we first propose a successive… ▽ More

    Submitted 26 June, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 16 pages, 7 figures, 7 tables. Accepted to IEEE/ACM Transactions on Networking

  48. Beamforming and Device Selection Design in Federated Learning with Over-the-air Aggregation

    Authors: Faeze Moradi Kalarde, Min Dong, Ben Liang, Yahia A. Eldemerdash Ahmed, Ho Ting Cheng

    Abstract: Federated learning (FL) with over-the-air computation can efficiently utilize the communication bandwidth but is susceptible to analog aggregation error. Excluding those devices with weak channel conditions can reduce the aggregation error, but it also limits the amount of local training data for FL, which can reduce the training convergence rate. In this work, we jointly design uplink receiver be… ▽ More

    Submitted 6 March, 2024; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: 12 pages, 8 figures

  49. arXiv:2302.08934  [pdf, other

    cs.IT eess.SP

    Active RIS Aided ISAC Systems: Beamforming Design and Performance Analysis

    Authors: Zhiyuan Yu, Hong Ren, Cunhua Pan, Gui Zhou, Boshi Wang, Mianxiong Dong, Jiangzhou Wang

    Abstract: This paper considers an active reconfigurable intelligent surface (RIS)-aided integrated sensing and communication (ISAC) system. We aim to maximize radar signal-to-interference-plus-noise-ratio (SINR) by jointly optimizing the beamforming matrix at the dual-function radar-communication (DFRC) base station (BS) and the reflecting coefficients at the active RIS subject to the quality of service (Qo… ▽ More

    Submitted 3 February, 2024; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: 17 pages,11 figures, accepted by IEEE TCOM.The manuscript has been revised to correct several typographical errors

  50. arXiv:2302.06245  [pdf, other

    cs.LG

    Calibrating a Deep Neural Network with Its Predecessors

    Authors: Linwei Tao, Minjing Dong, Daochang Liu, Changming Sun, Chang Xu

    Abstract: Confidence calibration - the process to calibrate the output probability distribution of neural networks - is essential for safety-critical applications of such networks. Recent works verify the link between mis-calibration and overfitting. However, early stopping, as a well-known technique to mitigate overfitting, fails to calibrate networks. In this work, we study the limitions of early stopping… ▽ More

    Submitted 23 May, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: IJCAI 2023 Accept