Showing 1–50 of 92 results for author: Xiao, D

  1. DuMapNet: An End-to-End Vectorization System for City-Scale Lane-Level Map Generation

    Authors: Deguo Xia, Weiming Zhang, Xiyan Liu, Wei Zhang, Chenting Gong, Jizhou Huang, Mengmeng Yang, Diange Yang

    Abstract: Generating city-scale lane-level maps faces significant challenges due to the intricate urban environments, such as blurred or absent lane markings. Additionally, a standard lane-level map requires a comprehensive organization of lane groupings, encompassing lane direction, style, boundary, and topology, yet has not been thoroughly examined in prior research. These obstacles result in labor-intens…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD 2024, camera-ready version

  2. arXiv:2406.10185  [pdf, other]

    cs.CV

    Detecting and Evaluating Medical Hallucinations in Large Vision Language Models

    Authors: Jiawei Chen, Dingkang Yang, Tong Wu, Yue Jiang, Xiaolu Hou, Mingcheng Li, Shunli Wang, Dongling Xiao, Ke Li, Lihua Zhang

    Abstract: Large Vision Language Models (LVLMs) are increasingly integral to healthcare applications, including medical visual question answering and imaging report generation. While these models inherit the robust capabilities of foundational Large Language Models (LLMs), they also inherit susceptibility to hallucinations, a significant concern in high-stakes medical contexts where the margin for error is mi…

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2405.19266  [pdf, other]

    cs.CL

    PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications

    Authors: Dingkang Yang, Jinjie Wei, Dongling Xiao, Shunli Wang, Tong Wu, Gang Li, Mingcheng Li, Shuaibing Wang, Jiawei Chen, Yue Jiang, Qingyao Xu, Ke Li, Peng Zhai, Lihua Zhang

    Abstract: Developing intelligent pediatric consultation systems offers promising prospects for improving diagnostic efficiency, especially in China, where healthcare resources are scarce. Despite recent advances in Large Language Models (LLMs) for Chinese medicine, their performance is sub-optimal in pediatric applications due to inadequate instruction data and vulnerable training procedures. To address the…

    Submitted 3 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: A Technical Report on a Chinese Medical Large Language Model

  4. arXiv:2405.17193  [pdf, other]

    cs.GR

    Anisotropic Gauss Reconstruction for Unoriented Point Clouds

    Authors: Yueji Ma, Dong Xiao, Zuoqiang Shi, Bin Wang

    Abstract: Unoriented surface reconstructions based on the Gauss formula have attracted much attention due to their elegant mathematical formulation and excellent performance. However, the isotropic characteristics of the formulation limit their capacity to leverage the anisotropic information within the point cloud. In this work, we propose a novel anisotropic formulation by introducing a convection term in…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 17 pages; 14 figures

  5. arXiv:2405.13080  [pdf, other]

    cs.CR cs.LG

    EmInspector: Combating Backdoor Attacks in Federated Self-Supervised Learning Through Embedding Inspection

    Authors: Yuwen Qian, Shuchi Wu, Kang Wei, Ming Ding, Di Xiao, Tao Xiang, Chuan Ma, Song Guo

    Abstract: Federated self-supervised learning (FSSL) has recently emerged as a promising paradigm that enables the exploitation of clients' vast amounts of unlabeled data while preserving data privacy. While FSSL offers advantages, its susceptibility to backdoor attacks, a concern identified in traditional federated supervised learning (FSL), has not been investigated. To fill the research gap, we undertake…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 18 pages, 12 figures

  6. arXiv:2405.08553  [pdf, other]

    cs.LG cs.CL

    Improving Transformers with Dynamically Composable Multi-Head Attention

    Authors: Da Xiao, Qingye Meng, Shengping Li, Xingyuan Yuan

    Abstract: Multi-Head Attention (MHA) is a key component of the Transformer. In MHA, attention heads work independently, causing problems such as a low-rank bottleneck of attention score matrices and head redundancy. We propose Dynamically Composable Multi-Head Attention (DCMHA), a parameter- and computation-efficient attention architecture that tackles the shortcomings of MHA and increases the expressive power of…

    Submitted 4 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted to the 41st International Conference on Machine Learning (ICML'24 oral)

  7. arXiv:2404.17398  [pdf, other]

    stat.ML cs.LG

    Online Policy Learning and Inference by Matrix Completion

    Authors: Congyuan Duan, Jingyang Li, Dong Xia

    Abstract: Making online decisions can be challenging when features are sparse and orthogonal to historical ones, especially when the optimal policy is learned through collaborative filtering. We formulate the problem as a matrix completion bandit (MCB), where the expected reward under each arm is characterized by an unknown low-rank matrix. The $ε$-greedy bandit and the online gradient descent algorithm are…

    Submitted 26 April, 2024; originally announced April 2024.

  8. arXiv:2403.05025  [pdf, other]

    cs.AI

    Towards Multimodal Human Intention Understanding Debiasing via Subject-Deconfounding

    Authors: Dingkang Yang, Dongling Xiao, Ke Li, Yuzheng Wang, Zhaoyu Chen, Jinjie Wei, Lihua Zhang

    Abstract: Multimodal intention understanding (MIU) is an indispensable component of human expression analysis (e.g., sentiment or humor) from heterogeneous modalities, including visual postures, linguistic contents, and acoustic behaviors. Existing works invariably focus on designing sophisticated structures or fusion strategies to achieve impressive improvements. Unfortunately, they all suffer from the sub…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 14 pages

  9. arXiv:2403.05023  [pdf, other]

    cs.CL cs.CV

    Towards Multimodal Sentiment Analysis Debiasing via Bias Purification

    Authors: Dingkang Yang, Mingcheng Li, Dongling Xiao, Yang Liu, Kun Yang, Zhaoyu Chen, Yuzheng Wang, Peng Zhai, Ke Li, Lihua Zhang

    Abstract: Multimodal Sentiment Analysis (MSA) aims to understand human intentions by integrating emotion-related clues from diverse modalities, such as visual, language, and audio. Unfortunately, the current MSA task invariably suffers from unplanned dataset biases, particularly multimodal utterance-level label bias and word-level context bias. These harmful biases potentially mislead models to focus on sta…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 14 pages

  10. arXiv:2403.03599  [pdf, other]

    cs.LG

    Learning Invariant Representations of Graph Neural Networks via Cluster Generalization

    Authors: Donglin Xia, Xiao Wang, Nian Liu, Chuan Shi

    Abstract: Graph neural networks (GNNs) have become increasingly popular in modeling graph-structured data due to their ability to learn node representations by aggregating local structure information. However, it is widely acknowledged that the test graph structure may differ from the training graph structure, resulting in a structure shift. In this paper, we experimentally find that the performance of GNNs…

    Submitted 6 March, 2024; originally announced March 2024.

  11. arXiv:2402.16915  [pdf, other]

    cs.LG cs.AI

    More Than Routing: Joint GPS and Route Modeling for Refine Trajectory Representation Learning

    Authors: Zhipeng Ma, Zheyan Tu, Xinhai Chen, Yan Zhang, Deguo Xia, Guyue Zhou, Yilun Chen, Yu Zheng, Jiangtao Gong

    Abstract: Trajectory representation learning plays a pivotal role in supporting various downstream tasks. To filter the noise in GPS trajectories, traditional methods tend to focus on routing-based methods that simplify the trajectories. However, this approach ignores the motion details contained in the GPS data, limiting the representation capability of trajectory representation learning. To fil…

    Submitted 25 February, 2024; originally announced February 2024.

  12. arXiv:2402.04602  [pdf, other]

    math.ST cs.IT stat.ME

    Online Quantile Regression

    Authors: Yinan Shen, Dong Xia, Wen-Xin Zhou

    Abstract: This paper addresses the challenge of integrating sequentially arriving data within the quantile regression framework, where the number of features is allowed to grow with the number of observations, the horizon is unknown, and memory is limited. We employ stochastic sub-gradient descent to minimize the empirical check loss and study its statistical properties and regret performance. In our analys…

    Submitted 18 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  13. arXiv:2401.13639  [pdf, other]

    cs.GR

    Winding Clearness for Differentiable Point Cloud Optimization

    Authors: Dong Xiao, Yueji Ma, Zuoqiang Shi, Shiqing Xin, Wenping Wang, Bailin Deng, Bin Wang

    Abstract: We propose to explore the properties of raw point clouds through the \emph{winding clearness}, a concept we first introduce for assessing the clarity of the interior/exterior relationships represented by the winding number field of the point cloud. In geometric modeling, the winding number is a powerful tool for distinguishing the interior and exterior of a given surface $\partial Ω$, and it has b…

    Submitted 24 January, 2024; originally announced January 2024.

  14. arXiv:2401.03820  [pdf, other]

    math.ST cs.IT stat.ME stat.ML

    Optimal Differentially Private PCA and Estimation for Spiked Covariance Matrices

    Authors: T. Tony Cai, Dong Xia, Mengyue Zha

    Abstract: Estimating a covariance matrix and its associated principal components is a fundamental problem in contemporary statistics. While optimal estimation procedures have been developed with well-understood properties, the increasing demand for privacy preservation introduces new complexities to this classical problem. In this paper, we study optimal differentially private Principal Component Analysis (…

    Submitted 8 January, 2024; originally announced January 2024.

  15. arXiv:2312.08704  [pdf, other]

    cs.CV cs.GR

    PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments

    Authors: Rixin Zhou, Ding Xia, Yi Zhang, Honglin Pang, Xi Yang, Chuntao Li

    Abstract: In this paper, we propose a learning-based image fragment pair-searching and -matching approach to solve the challenging restoration problem. Existing works use rule-based methods to match similar contour shapes or textures, whose hyperparameters are difficult to tune for extensive data and which are computationally time-consuming. Therefore, we propose a neural network that can effectively utilize n…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 14 pages, 16 figures, 4 tables

  16. arXiv:2312.00305  [pdf, other]

    stat.ME cs.LG math.ST stat.ML

    Multiple Testing of Linear Forms for Noisy Matrix Completion

    Authors: Wanteng Ma, Lilun Du, Dong Xia, Ming Yuan

    Abstract: Many important tasks of large-scale recommender systems can be naturally cast as testing multiple linear forms for noisy matrix completion. These problems, however, present unique challenges because of the subtle bias-and-variance tradeoff and an intricate dependence among the estimated entries induced by the low-rank structure. In this paper, we develop a general approach to overcome these dif…

    Submitted 30 November, 2023; originally announced December 2023.

  17. arXiv:2311.15598  [pdf, other]

    math.ST cs.LG cs.SI stat.ME stat.ML

    Optimal Clustering of Discrete Mixtures: Binomial, Poisson, Block Models, and Multi-layer Networks

    Authors: Zhongyuan Lyu, Ting Li, Dong Xia

    Abstract: In this paper, we first study the fundamental limit of clustering networks when a multi-layer network is present. Under the mixture multi-layer stochastic block model (MMSBM), we show that the minimax optimal network clustering error rate takes an exponential form and is characterized by the Rényi divergence between the edge probability distributions of the component networks. We propose a…

    Submitted 27 November, 2023; originally announced November 2023.

  18. arXiv:2311.01327  [pdf, other]

    cs.LG cs.DS stat.ML

    High-dimensional Linear Bandits with Knapsacks

    Authors: Wanteng Ma, Dong Xia, Jiashuo Jiang

    Abstract: We study the contextual bandits with knapsack (CBwK) problem under the high-dimensional setting where the dimension of the feature is large. The reward of pulling each arm equals the multiplication of a sparse high-dimensional weight vector and the feature of the current arrival, with additional random noise. In this paper, we investigate how to exploit this sparsity structure to achieve improved…

    Submitted 2 November, 2023; originally announced November 2023.

  19. arXiv:2310.09143  [pdf, other]

    cs.CE cs.SI

    An Intrinsic Integrity-Driven Rating Model for a Sustainable Reputation System

    Authors: H. Wen, T. Huang, D. Xiao

    Abstract: In the era of digital markets, the challenge for consumers is discerning quality amidst information asymmetry. While traditional markets use brand mechanisms to address this issue, transferring such systems to internet-based P2P markets, where misleading practices like fake ratings are rampant, remains challenging. Current internet platforms strive to counter this through verification algorithms,…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 36 pages, 13 figures

  20. arXiv:2310.07477  [pdf, other]

    cs.IR

    GMOCAT: A Graph-Enhanced Multi-Objective Method for Computerized Adaptive Testing

    Authors: Hangyu Wang, Ting Long, Liang Yin, Weinan Zhang, Wei Xia, Qichen Hong, Dingyin Xia, Ruiming Tang, Yong Yu

    Abstract: Computerized Adaptive Testing (CAT) refers to an online system that adaptively selects the best-suited question for students with various abilities based on their historical response records. Most CAT methods only focus on the quality objective of predicting the student ability accurately, but neglect concept diversity or question exposure control, which are important considerations in ensuring the…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: KDD23

  21. arXiv:2309.02698  [pdf, ps, other]

    math.ST cs.IT stat.ME

    Quantile and pseudo-Huber Tensor Decomposition

    Authors: Yinan Shen, Dong Xia

    Abstract: This paper studies the computational and statistical aspects of quantile and pseudo-Huber tensor decomposition. The integrated investigation of computational and statistical issues of robust tensor decomposition poses challenges due to the non-smooth loss functions. We propose a projected sub-gradient descent algorithm for tensor decomposition, equipped with either the pseudo-Huber loss or the qua…

    Submitted 6 September, 2023; originally announced September 2023.

  22. arXiv:2307.10818  [pdf, other]

    cs.SE

    PHYFU: Fuzzing Modern Physics Simulation Engines

    Authors: Dongwei Xiao, Zhibo Liu, Shuai Wang

    Abstract: A physical simulation engine (PSE) is a software system that simulates physical environments and objects. Modern PSEs feature both forward and backward simulations, where the forward phase predicts the behavior of a simulated system, and the backward phase provides gradients (guidance) for learning-based control tasks, such as a robot arm learning to fetch items. This way, modern PSEs show promisi…

    Submitted 13 August, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: This paper is accepted at The 38th IEEE/ACM International Conference on Automated Software Engineering, a.k.a. ASE 2023. Please cite the published version as soon as this paper appears in the conference publications

  23. arXiv:2307.08574  [pdf, other]

    cs.LG

    FedCME: Client Matching and Classifier Exchanging to Handle Data Heterogeneity in Federated Learning

    Authors: Jun Nie, Danyang Xiao, Lei Yang, Weigang Wu

    Abstract: Data heterogeneity across clients is one of the key challenges in Federated Learning (FL), which may slow down the global model convergence and even weaken global model performance. Most existing approaches tackle the heterogeneity by constraining local model updates through reference to global information provided by the server. This can alleviate the performance degradation on the aggregated glo…

    Submitted 17 July, 2023; originally announced July 2023.

  24. arXiv:2306.12174  [pdf, other]

    cs.CV

    OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue

    Authors: Weihao Gao, Zhuo Deng, Zhiyuan Niu, Fuju Rong, Chucheng Chen, Zheng Gong, Wenze Zhang, Daimin Xiao, Fang Li, Zhenjie Cao, Zhaoyi Ma, Wenbin Wei, Lan Ma

    Abstract: Large multimodal language models (LMMs) have achieved significant success in general domains. However, due to the significant differences between medical images and text and general web content, the performance of LMMs in medical scenarios is limited. In ophthalmology, clinical diagnosis relies on multiple modalities of medical images, but unfortunately, multimodal ophthalmic large language models…

    Submitted 21 June, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: OphGLM: The first ophthalmology large language-and-vision assistant based on instructions and dialogue

  25. arXiv:2306.04234  [pdf, other]

    cs.IR cs.CY

    Set-to-Sequence Ranking-based Concept-aware Learning Path Recommendation

    Authors: Xianyu Chen, Jian Shen, Wei Xia, Jiarui Jin, Yakun Song, Weinan Zhang, Weiwen Liu, Menghui Zhu, Ruiming Tang, Kai Dong, Dingyin Xia, Yong Yu

    Abstract: With the development of the online education system, personalized education recommendation has played an essential role. In this paper, we focus on developing path recommendation systems that aim to generate and recommend an entire learning path to the given user in each session. Noticing that existing approaches fail to consider the correlations of concepts in the path, we propose a novel fr…

    Submitted 7 June, 2023; originally announced June 2023.

  26. arXiv:2306.03372  [pdf, ps, other]

    stat.ML cs.LG

    Online Tensor Learning: Computational and Statistical Trade-offs, Adaptivity and Optimal Regret

    Authors: Jian-Feng Cai, Jingyang Li, Dong Xia

    Abstract: We investigate a generalized framework for estimating latent low-rank tensors in an online setting, encompassing both linear and generalized linear models. This framework offers a flexible approach for handling continuous or categorical variables. Additionally, we investigate two specific applications: online tensor completion and online binary tensor learning. To address these challenges, we prop…

    Submitted 10 July, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

  27. arXiv:2305.06655  [pdf, other]

    cs.CL

    QURG: Question Rewriting Guided Context-Dependent Text-to-SQL Semantic Parsing

    Authors: Linzheng Chai, Dongling Xiao, Jian Yang, Liqun Yang, Qian-Wen Zhang, Yunbo Cao, Zhoujun Li, Zhao Yan

    Abstract: Context-dependent Text-to-SQL aims to translate multi-turn natural language questions into SQL queries. Although various methods have exploited context-dependence information implicitly for contextual SQL parsing, there are few attempts to explicitly address the dependencies between the current question and the question context. This paper presents QURG, a novel Question Rewriting Guided approach to help t…

    Submitted 16 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  28. arXiv:2305.06199  [pdf, ps, other]

    math.ST cs.IT stat.ME stat.ML

    Computationally Efficient and Statistically Optimal Robust High-Dimensional Linear Regression

    Authors: Yinan Shen, Jingyang Li, Jian-Feng Cai, Dong Xia

    Abstract: High-dimensional linear regression under heavy-tailed noise or outlier corruption is challenging, both computationally and statistically. Convex approaches have been proven statistically optimal but suffer from high computational costs, especially since the robust loss functions are usually non-smooth. More recently, computationally fast non-convex approaches via sub-gradient descent have been proposed,…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: This manuscript supersedes an earlier one (arXiv:2203.00953). The two manuscripts share around 60% of their content. There will be no further updates to the earlier manuscript

  29. Alternately denoising and reconstructing unoriented point sets

    Authors: Dong Xiao, Zuoqiang Shi, Bin Wang

    Abstract: We propose a new strategy to bridge point cloud denoising and surface reconstruction by alternately updating the denoised point clouds and the reconstructed surfaces. In Poisson surface reconstruction, the implicit function is generated by a set of smooth basis functions centered at the octnodes. When the octree depth is properly selected, the reconstructed surface is a good smooth approximation o…

    Submitted 24 August, 2023; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: Accepted by Computers & Graphics from CAD/Graphics 2023

  30. arXiv:2303.10462  [pdf, other]

    cs.LG

    Machine learning with data assimilation and uncertainty quantification for dynamical systems: a review

    Authors: Sibo Cheng, Cesar Quilodran-Casas, Said Ouala, Alban Farchi, Che Liu, Pierre Tandeo, Ronan Fablet, Didier Lucor, Bertrand Iooss, Julien Brajard, Dunhui Xiao, Tijana Janjic, Weiping Ding, Yike Guo, Alberto Carrassi, Marc Bocquet, Rossella Arcucci

    Abstract: Data Assimilation (DA) and Uncertainty Quantification (UQ) are extensively used in analysing and reducing error propagation in high-dimensional spatial-temporal dynamics. Typical applications span from computational fluid dynamics (CFD) to geoscience and climate systems. Recently, much effort has gone into combining DA, UQ and machine learning (ML) techniques. These research efforts seek to ad…

    Submitted 18 March, 2023; originally announced March 2023.

  31. arXiv:2302.04437  [pdf, other]

    stat.ML cs.LG stat.AP

    rMultiNet: An R Package For Multilayer Networks Analysis

    Authors: Ting Li, Zhongyuan Lyu, Chenyu Ren, Dong Xia

    Abstract: This paper develops an R package rMultiNet to analyze multilayer network data. We provide two general frameworks from recent literature, e.g. mixture multilayer stochastic block model (MMSBM) and mixture multilayer latent space model (MMLSM) to generate the multilayer network. We also provide several methods to reveal the embedding of both nodes and layers followed by further data analysis methods,…

    Submitted 8 February, 2023; originally announced February 2023.

  32. arXiv:2212.04065  [pdf, other]

    cs.LG cs.HC

    SpaceEditing: Integrating Human Knowledge into Deep Neural Networks via Interactive Latent Space Editing

    Authors: Jiafu Wei, Ding Xia, Haoran Xie, Chia-Ming Chang, Chuntao Li, Xi Yang

    Abstract: We propose an interactive editing method that allows humans to help deep neural networks (DNNs) learn a latent space more consistent with human knowledge, thereby improving classification accuracy on indistinguishable ambiguous data. Firstly, we visualize high-dimensional data features through dimensionality reduction methods and design an interactive system \textit{SpaceEditing} to display the vi…

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 13 pages, 12 figures, Video URL: https://youtu.be/UTxji6_fs5I

  33. arXiv:2211.02419  [pdf, other]

    eess.IV cs.CV cs.LG

    High-Resolution Boundary Detection for Medical Image Segmentation with Piece-Wise Two-Sample T-Test Augmented Loss

    Authors: Yucong Lin, Jinhua Su, Yuhang Li, Yuhao Wei, Hanchao Yan, Saining Zhang, Jiaan Luo, Danni Ai, Hong Song, Jingfan Fan, Tianyu Fu, Deqiang Xiao, Feifei Wang, Jue Hou, Jian Yang

    Abstract: Deep learning methods have contributed substantially to the rapid advancement of medical image segmentation, the quality of which relies on the suitable design of loss functions. Popular loss functions, including the cross-entropy and dice losses, often fall short of boundary detection, thereby limiting high-resolution downstream applications such as automated diagnoses and procedures. We develope…

    Submitted 4 November, 2022; originally announced November 2022.

  34. Point normal orientation and surface reconstruction by incorporating isovalue constraints to Poisson equation

    Authors: Dong Xiao, Zuoqiang Shi, Siyu Li, Bailin Deng, Bin Wang

    Abstract: Oriented normals are common pre-requisites for many geometric algorithms based on point clouds, such as Poisson surface reconstruction. However, it is not trivial to obtain a consistent orientation. In this work, we bridge orientation and reconstruction in the implicit space and propose a novel approach to orient point cloud normals by incorporating isovalue constraints to the Poisson equation. In…

    Submitted 30 April, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Accepted by Computer Aided Geometric Design from GMP 2023

  35. arXiv:2209.13762  [pdf, other]

    stat.ML cs.LG

    Consensus Knowledge Graph Learning via Multi-view Sparse Low Rank Block Model

    Authors: Tianxi Cai, Dong Xia, Luwan Zhang, Doudou Zhou

    Abstract: Network analysis has been a powerful tool to unveil relationships and interactions among a large number of objects. Yet its effectiveness in accurately identifying important node-node interactions is challenged by the rapidly growing network size, with data being collected at an unprecedented granularity and scale. Common wisdom to overcome such high dimensionality is collapsing nodes into smaller…

    Submitted 27 September, 2022; originally announced September 2022.

  36. arXiv:2209.00399  [pdf, other]

    cs.LG cs.IT math.OC

    Optimal Regularized Online Allocation by Adaptive Re-Solving

    Authors: Wanteng Ma, Ying Cao, Danny H. K. Tsang, Dong Xia

    Abstract: This paper introduces a dual-based algorithm framework for solving the regularized online resource allocation problems, which have potentially non-concave cumulative rewards, hard resource constraints, and a non-separable regularizer. Under a strategy of adaptively updating the resource constraints, the proposed framework only requests approximate solutions to the empirical dual problems up to a c…

    Submitted 15 July, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

  37. arXiv:2207.14667  [pdf, other]

    cs.NE

    Egret Swarm Optimization Algorithm: An Evolutionary Computation Approach for Model Free Optimization

    Authors: Zuyan Chen, Adam Francis, Shuai Li, Bolin Liao, Dunhui Xiao

    Abstract: A novel meta-heuristic algorithm, Egret Swarm Optimization Algorithm (ESOA), is proposed in this paper, which is inspired by two egret species' (Great Egret and Snowy Egret) hunting behavior. ESOA consists of three primary components: Sit-And-Wait Strategy, Aggressive Strategy as well as Discriminant Conditions. The performance of ESOA on 36 benchmark functions as well as 2 engineering problems ar…

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: 10 pages, 5 figures, 6 tables. Source code used for this work is available online: see https://github.com/Knightsll/Egret_Swarm_Optimization_Algorithm and https://ww2.mathworks.cn/matlabcentral/fileexchange/115595-egret-swarm-optimization-algorithm-esoa. This paper has been submitted to MDPI mathematics

    MSC Class: 68T05: Evolutionary algorithms; genetic algorithms (computational aspects); see also 68T20 and 90C59

  38. arXiv:2207.13381  [pdf, other]

    cs.CV

    Look Closer to Your Enemy: Learning to Attack via Teacher-Student Mimicking

    Authors: Mingjie Wang, Jianxiong Guo, Sirui Li, Dingwen Xiao, Zhiqing Tang

    Abstract: Deep neural networks have significantly advanced person re-identification (ReID) applications in the realm of the industrial internet, yet they remain vulnerable. Thus, it is crucial to study the robustness of ReID systems, as there are risks of adversaries using these vulnerabilities to compromise industrial surveillance systems. Current adversarial methods focus on generating attack samples usin…

    Submitted 2 December, 2023; v1 submitted 27 July, 2022; originally announced July 2022.

  39. arXiv:2207.04600  [pdf, other]

    math.ST cs.IT cs.LG stat.ME

    Optimal Clustering by Lloyd Algorithm for Low-Rank Mixture Model

    Authors: Zhongyuan Lyu, Dong Xia

    Abstract: This paper investigates the computational and statistical limits in clustering matrix-valued observations. We propose a low-rank mixture model (LrMM), adapted from the classical Gaussian mixture model (GMM) to treat matrix-valued observations, which assumes low-rankness for population center matrices. A computationally efficient clustering method is designed by integrating Lloyd's algorithm and lo…

    Submitted 6 June, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

  40. arXiv:2205.07686  [pdf, other]

    cs.CL

    CQR-SQL: Conversational Question Reformulation Enhanced Context-Dependent Text-to-SQL Parsers

    Authors: Dongling Xiao, Linzheng Chai, Qian-Wen Zhang, Zhao Yan, Zhoujun Li, Yunbo Cao

    Abstract: Context-dependent text-to-SQL is the task of translating multi-turn questions into database-related SQL queries. Existing methods typically focus on making full use of history context or previously predicted SQL for current SQL parsing, while neglecting to explicitly comprehend the schema and conversational dependency, such as co-reference, ellipsis and user focus change. In this paper, we propo…

    Submitted 24 October, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

    Comments: Accepted at EMNLP 2022 (findings)

  41. Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification

    Authors: Haojie Liu, Daoxun Xia, Wei Jiang, Chao Xu

    Abstract: Visible-infrared person re-identification (VI-ReID) is a challenging and essential task, which aims to retrieve a set of person images over visible and infrared camera views. In order to mitigate the impact of the large modality discrepancy existing in heterogeneous images, previous methods attempt to apply generative adversarial networks (GANs) to generate modality-consistent data. However, due to…

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: 15 pages, 9 figures

  42. arXiv:2204.03843  [pdf, other]

    cs.CR

    CFL: Cluster Federated Learning in Large-scale Peer-to-Peer Networks

    Authors: Qian Chen, Zilong Wang, Yilin Zhou, Jiawei Chen, Dan Xiao, Xiaodong Lin

    Abstract: Federated learning (FL) has sparked extensive interest in exploiting the private data on clients' local devices. However, the parameter server setting of FL not only has high bandwidth requirements, but also poses data privacy issues and a single point of failure. In this paper, we propose an efficient and privacy-preserving protocol, dubbed CFL, which is the first fine-grained global model traini…

    Submitted 6 July, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

  43. arXiv:2203.13754  [pdf]

    physics.bio-ph cs.LG

    Fast fluorescence lifetime imaging analysis via extreme learning machine

    Authors: Zhenya Zang, Dong Xiao, Quan Wang, Zinuo Li, Wujun Xie, Yu Chen, David Day Uei Li

    Abstract: We present a fast and accurate analytical method for fluorescence lifetime imaging microscopy (FLIM) using the extreme learning machine (ELM). We used extensive metrics to evaluate ELM and existing algorithms. First, we compared these algorithms using synthetic datasets. Results indicate that ELM can obtain higher fidelity, even in low-photon conditions. Afterwards, we used ELM to retrieve lifetim…

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 14 pages, 9 figures

  44. arXiv:2203.00953  [pdf, ps, other]

    math.ST cs.IT stat.ME stat.ML

    Computationally Efficient and Statistically Optimal Robust Low-rank Matrix and Tensor Estimation

    Authors: Yinan Shen, Jingyang Li, Jian-Feng Cai, Dong Xia

    Abstract: Low-rank matrix estimation under heavy-tailed noise is challenging, both computationally and statistically. Convex approaches have been proven statistically optimal but suffer from high computational costs, especially since robust loss functions are usually non-smooth. More recently, computationally fast non-convex approaches via sub-gradient descent are proposed, which, unfortunately, fail to del…

    Submitted 10 May, 2023; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: This manuscript is superseded by the new one (arXiv:2305.06199). There will be no further update of this manuscript and it will not be submitted for publication

  45. arXiv:2201.09040  [pdf, other]

    math.ST cs.IT stat.ME stat.ML

    Optimal Estimation and Computational Limit of Low-rank Gaussian Mixtures

    Authors: Zhongyuan Lyu, Dong Xia

    Abstract: Structural matrix-variate observations routinely arise in diverse fields such as multi-layer network analysis and brain image clustering. While data of this type have been extensively investigated with fruitful outcomes being delivered, the fundamental questions like its statistical optimality and computational limit are largely under-explored. In this paper, we propose a low-rank Gaussian mixture…

    Submitted 22 January, 2022; originally announced January 2022.

  46. Learning Modified Indicator Functions for Surface Reconstruction

    Authors: Dong Xiao, Siyou Lin, Zuoqiang Shi, Bin Wang

    Abstract: Surface reconstruction is a fundamental problem in 3D graphics. In this paper, we propose a learning-based approach for implicit surface reconstruction from raw point clouds without normals. Our method is inspired by Gauss Lemma in potential energy theory, which gives an explicit integral formula for the indicator functions. We design a novel deep neural network to perform surface integral and lea…

    Submitted 20 February, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: Accepted by Computers & Graphics from SMI 2021

  47. arXiv:2110.15278  [pdf]

    eess.SP cs.AI cs.LG

    Self-supervised EEG Representation Learning for Automatic Sleep Staging

    Authors: Chaoqi Yang, Danica Xiao, M. Brandon Westover, Jimeng Sun

    Abstract: Background: Deep learning models have shown great success in automating tasks in sleep medicine by learning from carefully annotated Electroencephalogram (EEG) data. However, effectively utilizing a large amount of raw EEG remains a challenge. Objective: In this paper, we aim to learn robust vector representations from massive unlabeled EEG signals, such that the learned vectorized features (1)…

    Submitted 12 February, 2023; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Preprocessing and Code in Github: https://github.com/ycq091044/ContraWR

  48. arXiv:2110.03828  [pdf, other]

    eess.IV cs.CV

    SkullEngine: A Multi-stage CNN Framework for Collaborative CBCT Image Segmentation and Landmark Detection

    Authors: Qin Liu, Han Deng, Chunfeng Lian, Xiaoyang Chen, Deqiang Xiao, Lei Ma, Xu Chen, Tianshu Kuang, Jaime Gateno, Pew-Thian Yap, James J. Xia

    Abstract: We propose a multi-stage coarse-to-fine CNN-based framework, called SkullEngine, for high-resolution segmentation and large-scale landmark detection through a collaborative, integrated, and scalable JSD model and three segmentation and landmark detection refinement models. We evaluated our framework on a clinical dataset consisting of 170 CBCT/CT images for the task of segmenting 2 bones (midface…

    Submitted 21 December, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 10 pages, 5 figures, accepted by MLMI 2021

  49. arXiv:2110.02940  [pdf, other]

    cs.CR cs.AI cs.LG

    Secure Byzantine-Robust Distributed Learning via Clustering

    Authors: Raj Kiriti Velicheti, Derek Xia, Oluwasanmi Koyejo

    Abstract: Federated learning systems that jointly preserve Byzantine robustness and privacy have remained an open problem. Robust aggregation, the standard defense for Byzantine attacks, generally requires server access to individual updates or nonlinear computation -- thus is incompatible with privacy-preserving methods such as secure aggregation via multiparty computation. To this end, we propose SHARE (S…

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: 18 pages, 9 Figures

  50. arXiv:2109.05191  [pdf, other]

    cs.CV

    A Self-Supervised Deep Framework for Reference Bony Shape Estimation in Orthognathic Surgical Planning

    Authors: Deqiang Xiao, Hannah Deng, Tianshu Kuang, Lei Ma, Qin Liu, Xu Chen, Chunfeng Lian, Yankun Lang, Daeseung Kim, Jaime Gateno, Steve Guofang Shen, Dinggang Shen, Pew-Thian Yap, James J. Xia

    Abstract: Virtual orthognathic surgical planning involves simulating surgical corrections of jaw deformities on 3D facial bony shape models. Due to the lack of necessary guidance, the planning procedure is highly experience-dependent and the planning results are often suboptimal. A reference facial bony shape model representing normal anatomies can provide an objective guidance to improve planning accuracy.…

    Submitted 11 September, 2021; originally announced September 2021.

    Journal ref: The 24th International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2021