Skip to main content

Showing 1–50 of 52 results for author: Long, Q

  1. arXiv:2406.07404  [pdf, other

    cs.LG

    Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy

    Authors: Xiaohan Huang, Dongjie Wang, Zhiyuan Ning, Ziyue Qiao, Qingqing Long, Haowei Zhu, Min Wu, Yuanchun Zhou, Meng Xiao

    Abstract: Tabular data optimization methods aim to automatically find an optimal feature transformation process that generates high-value features and improves the performance of downstream machine learning tasks. Current frameworks for automated feature transformation rely on iterative sequence generation tasks, optimizing decision strategies through performance feedback from downstream tasks. However, the… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 17 pages

  2. arXiv:2406.05372  [pdf, ps, other

    stat.ML cs.LG

    Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

    Authors: Jiancong Xiao, Ruoyu Sun, Qi Long, Weijie J. Su

    Abstract: Training Deep Neural Networks (DNNs) with adversarial examples often results in poor generalization to test-time adversarial data. This paper investigates this issue, known as adversarially robust generalization, through the lens of Rademacher complexity. Building upon the studies by Khim and Loh (2018); Yin et al. (2019), numerous works have been dedicated to this problem, yet achieving a satisfa… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: COLT 2024

  3. arXiv:2406.00611  [pdf, other

    cs.LG stat.ME

    DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation

    Authors: Yinjun Wu, Mayank Keoliya, Kan Chen, Neelay Velingker, Ziyang Li, Emily J Getzen, Qi Long, Mayur Naik, Ravi B Parikh, Eric Wong

    Abstract: Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE). ITE prediction models deployed in critical settings such as healthcare should ideally be (i) accurate, and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. 22 pages, 5 figures

  4. arXiv:2405.16455  [pdf, other

    stat.ML cs.LG stat.ME

    On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization

    Authors: Jiancong Xiao, Ziniu Li, Xingyu Xie, Emily Getzen, Cong Fang, Qi Long, Weijie J. Su

    Abstract: Accurately aligning large language models (LLMs) with human preferences is crucial for informing fair, economically sound, and statistically efficient decision-making processes. However, we argue that reinforcement learning from human feedback (RLHF) -- the predominant approach for aligning LLMs with human preferences through a reward model -- suffers from an inherent algorithmic bias due to its K… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  5. arXiv:2405.11868  [pdf, other

    cs.LG cs.AI cs.CE cs.IR cs.SI

    Towards Graph Contrastive Learning: A Survey and Beyond

    Authors: Wei Ju, Yifan Wang, Yifang Qin, Zhengyang Mao, Zhiping Xiao, Junyu Luo, Junwei Yang, Yiyang Gu, Dongjie Wang, Qingqing Long, Siyu Yi, Xiao Luo, Ming Zhang

    Abstract: In recent years, deep learning on graphs has achieved remarkable success in various domains. However, the reliance on annotated graph data remains a significant bottleneck due to its prohibitive cost and time-intensive nature. To address this challenge, self-supervised learning (SSL) on graphs has gained increasing attention and has made significant progress. SSL enables machine learning models to… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  6. arXiv:2405.01839  [pdf, other

    cs.AI cs.MA

    SocialGFs: Learning Social Gradient Fields for Multi-Agent Reinforcement Learning

    Authors: Qian Long, Fangwei Zhong, Mingdong Wu, Yizhou Wang, Song-Chun Zhu

    Abstract: Multi-agent systems (MAS) need to adaptively cope with dynamic environments, changing agent populations, and diverse tasks. However, most of the multi-agent systems cannot easily handle them, due to the complexity of the state and task space. The social impact theory regards the complex influencing factors as forces acting on an agent, emanating from the environment, other agents, and the agent's… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: AAAI 2024 Cooperative Multi-Agent Systems Decision-Making and Learning (CMASDL) Workshop

  7. arXiv:2404.15943  [pdf, other

    cs.LG cs.AI

    Decentralized Personalized Federated Learning based on a Conditional Sparse-to-Sparser Scheme

    Authors: Qianyu Long, Qiyuan Wang, Christos Anagnostopoulos, Daning Bi

    Abstract: Decentralized Federated Learning (DFL) has become popular due to its robustness and avoidance of centralized coordination. In this paradigm, clients actively engage in training by exchanging models with their networked neighbors. However, DFL introduces increased costs in terms of training and communication. Existing methods focus on minimizing communication often overlooking training efficiency a… ▽ More

    Submitted 25 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 15 pages, 9 figures, 3 pages theory

  8. arXiv:2404.07546  [pdf, other

    cs.CL

    Decomposing Label Space, Format and Discrimination: Rethinking How LLMs Respond and Solve Tasks via In-Context Learning

    Authors: Quanyu Long, Yin Wu, Wenya Wang, Sinno Jialin Pan

    Abstract: In-context Learning (ICL) has emerged as a powerful capability alongside the development of scaled-up large language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks without updating millions of parameters. However, the precise contributions of demonstrations towards improving end-task performance have not been thoroughly i… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 36 pages, 8 figures

  9. arXiv:2404.01245  [pdf, other

    math.ST cs.CL cs.CR cs.LG stat.ML

    A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules

    Authors: Xiang Li, Feng Ruan, Huiyuan Wang, Qi Long, Weijie J. Su

    Abstract: Since ChatGPT was introduced in November 2022, embedding (nearly) unnoticeable statistical signals into text generated by large language models (LLMs), also known as watermarking, has been used as a principled approach to provable detection of LLM-generated text from its human-written counterpart. In this paper, we introduce a general and flexible framework for reasoning about the statistical effi… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  10. arXiv:2402.16424  [pdf, other

    cs.CV

    COMAE: COMprehensive Attribute Exploration for Zero-shot Hashing

    Authors: Yihang Zhou, Qingqing Long, Yuchen Yan, Xiao Luo, Zeyu Dong, Xuezhi Wang, Zhen Meng, Pengfei Wang, Yuanchun Zhou

    Abstract: Zero-shot hashing (ZSH) has shown excellent success owing to its efficiency and generalization in large-scale retrieval scenarios. While considerable success has been achieved, there still exist urgent limitations. Existing works ignore the locality relationships of representations and attributes, which have effective transferability between seeable classes and unseeable classes. Also, the continu… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 13 pages, 7 figures

  11. Inductive Graph Alignment Prompt: Bridging the Gap between Graph Pre-training and Inductive Fine-tuning From Spectral Perspective

    Authors: Yuchen Yan, Peiyan Zhang, Zheng Fang, Qingqing Long

    Abstract: The "Graph pre-training and fine-tuning" paradigm has significantly improved Graph Neural Networks(GNNs) by capturing general knowledge without manual annotations for downstream tasks. However, due to the immense gap of data and tasks between the pre-training and fine-tuning stages, the model performance is still limited. Inspired by prompt fine-tuning in Natural Language Processing(NLP), many end… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    ACM Class: E.2

  12. arXiv:2402.13532  [pdf, other

    cs.CL

    Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation

    Authors: Quanyu Long, Yue Deng, LeiLei Gan, Wenya Wang, Sinno Jialin Pan

    Abstract: Dense retrievers and retrieval-augmented language models have been widely used in various NLP applications. Despite being designed to deliver reliable and secure outcomes, the vulnerability of retrievers to potential attacks remains unclear, raising concerns about their security. In this paper, we introduce a novel scenario where the attackers aim to covertly disseminate targeted misinformation, s… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  13. arXiv:2402.01231  [pdf, other

    cs.LG

    Unveiling Delay Effects in Traffic Forecasting: A Perspective from Spatial-Temporal Delay Differential Equations

    Authors: Qingqing Long, Zheng Fang, Chen Fang, Chong Chen, Pengfei Wang, Yuanchun Zhou

    Abstract: Traffic flow forecasting is a fundamental research issue for transportation planning and management, which serves as a canonical and typical example of spatial-temporal predictions. In recent years, Graph Neural Networks (GNNs) and Recurrent Neural Networks (RNNs) have achieved great success in capturing spatial-temporal correlations for traffic flow forecasting. Yet, two non-ignorable issues have… ▽ More

    Submitted 25 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 11 pages, 7 figures

  14. arXiv:2402.00447  [pdf, ps, other

    cs.LG cs.AI cs.SI

    A Survey of Data-Efficient Graph Learning

    Authors: Wei Ju, Siyu Yi, Yifan Wang, Qingqing Long, Junyu Luo, Zhiping Xiao, Ming Zhang

    Abstract: Graph-structured data, prevalent in domains ranging from social networks to biochemical analysis, serve as the foundation for diverse real-world systems. While graph neural networks demonstrate proficiency in modeling this type of data, their success is often reliant on significant amounts of labeled data, posing a challenge in practical scenarios with limited annotation resources. To tackle this… ▽ More

    Submitted 19 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted by Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI 2024)

  15. arXiv:2311.11551  [pdf, other

    cs.CL

    Adapt in Contexts: Retrieval-Augmented Domain Adaptation via In-Context Learning

    Authors: Quanyu Long, Wenya Wang, Sinno Jialin Pan

    Abstract: Large language models (LLMs) have showcased their capability with few-shot inference known as in-context learning. However, in-domain demonstrations are not always readily available in real scenarios, leading to cross-domain in-context learning. Besides, LLMs are still facing challenges in long-tail knowledge in unseen and unfamiliar domains. The above limitations demonstrate the necessity of Unsu… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  16. arXiv:2309.15809  [pdf, other

    cs.LG stat.ML

    Fair Canonical Correlation Analysis

    Authors: Zhuoping Zhou, Davoud Ataee Tarzanagh, Bojian Hou, Boning Tong, Jia Xu, Yanbo Feng, Qi Long, Li Shen

    Abstract: This paper investigates fairness and bias in Canonical Correlation Analysis (CCA), a widely used statistical technique for examining the relationship between two sets of variables. We present a framework that alleviates unfairness by minimizing the correlation disparity error associated with protected attributes. Our approach enables CCA to learn global projection matrices from all data points whi… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted for publication at NeurIPS 2023, 31 Pages, 14 Figures

  17. arXiv:2309.06805  [pdf, other

    cs.LG cs.AI cs.DC

    FedDIP: Federated Learning with Extreme Dynamic Pruning and Incremental Regularization

    Authors: Qianyu Long, Christos Anagnostopoulos, Shameem Puthiya Parambath, Daning Bi

    Abstract: Federated Learning (FL) has been successfully adopted for distributed training and inference of large-scale Deep Neural Networks (DNNs). However, DNNs are characterized by an extremely large number of parameters, thus, yielding significant challenges in exchanging these parameters among distributed nodes and managing the memory. Although recent DNN compression methods (e.g., sparsification, prunin… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted for publication at ICDM 2023 (Full version in arxiv). The associated code is available at https://github.com/EricLoong/feddip

    ACM Class: H.4; I.2

    Journal ref: 10.1109/ICDM58522.2023.00146

  18. arXiv:2307.07862  [pdf, other

    cs.RO eess.SY

    Sim2Plan: Robot Motion Planning via Message Passing between Simulation and Reality

    Authors: Yizhou Zhao, Yuanhong Zeng, Qian Long, Ying Nian Wu, Song-Chun Zhu

    Abstract: Simulation-to-real is the task of training and developing machine learning models and deploying them in real settings with minimal additional training. This approach is becoming increasingly popular in fields such as robotics. However, there is often a gap between the simulated environment and the real world, and machine learning models trained in simulation may not perform as well in the real wor… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: Published as a conference paper at FTC 2023

  19. arXiv:2305.01794  [pdf, other

    stat.ME cs.LG

    MISNN: Multiple Imputation via Semi-parametric Neural Networks

    Authors: Zhiqi Bu, Zongyu Dai, Yiliang Zhang, Qi Long

    Abstract: Multiple imputation (MI) has been widely applied to missing value problems in biomedical, social and econometric research, in order to avoid improper inference in the downstream data analysis. In the presence of high-dimensional data, imputation models that include feature selection, especially $\ell_1$ regularized regression (such as Lasso, adaptive Lasso, and Elastic Net), are common choices to… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  20. A Comprehensive Survey on Deep Graph Representation Learning

    Authors: Wei Ju, Zheng Fang, Yiyang Gu, Zequn Liu, Qingqing Long, Ziyue Qiao, Yifang Qin, Jianhao Shen, Fang Sun, Zhiping Xiao, Junwei Yang, Jingyang Yuan, Yusheng Zhao, Yifan Wang, Xiao Luo, Ming Zhang

    Abstract: Graph representation learning aims to effectively encode high-dimensional sparse graph-structured data into low-dimensional dense vectors, which is a fundamental task that has been widely studied in a range of fields, including machine learning and data mining. Classic graph embedding methods follow the basic idea that the embedding vectors of interconnected nodes in the graph can still maintain a… ▽ More

    Submitted 27 February, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: Accepted by Neural Networks 2024

  21. arXiv:2302.11466  [pdf, other

    cs.AI cs.CL

    Advancements in Federated Learning: Models, Methods, and Privacy

    Authors: Huiming Chen, Huandong Wang, Qingyue Long, Depeng Jin, Yong Li

    Abstract: Federated learning (FL) is a promising technique for addressing the rising privacy and security issues. Its main ingredient is to cooperatively learn the model among the distributed clients without uploading any sensitive data. In this paper, we conducted a thorough review of the related works, following the development context and deeply mining the key technologies behind FL from both theoretical… ▽ More

    Submitted 5 March, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: 35 pages, submitted to ACM Computing Surveys

  22. arXiv:2211.13297  [pdf, other

    cs.LG stat.ME

    Multiple Imputation with Neural Network Gaussian Process for High-dimensional Incomplete Data

    Authors: Zongyu Dai, Zhiqi Bu, Qi Long

    Abstract: Missing data are ubiquitous in real world applications and, if not adequately handled, may lead to the loss of information and biased findings in downstream analysis. Particularly, high-dimensional incomplete data with a moderate sample size, such as analysis of multi-omics data, present daunting challenges. Imputation is arguably the most popular method for handling missing data, though existing… ▽ More

    Submitted 21 December, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

  23. arXiv:2210.17051  [pdf, other

    cs.LG physics.flu-dyn

    Real-time high-resolution CO$_2$ geological storage prediction using nested Fourier neural operators

    Authors: Gege Wen, Zongyi Li, Qirui Long, Kamyar Azizzadenesheli, Anima Anandkumar, Sally M. Benson

    Abstract: Carbon capture and storage (CCS) plays an essential role in global decarbonization. Scaling up CCS deployment requires accurate and high-resolution modeling of the storage reservoir pressure buildup and the gaseous plume migration. However, such modeling is very challenging at scale due to the high computational costs of existing numerical methods. This challenge leads to significant uncertainties… ▽ More

    Submitted 1 June, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Journal ref: Energy & Environmental Science, 16(4), 1732-1741 (2023)

  24. arXiv:2207.04564  [pdf, other

    cs.CL cs.LG

    Domain Confused Contrastive Learning for Unsupervised Domain Adaptation

    Authors: Quanyu Long, Tianze Luo, Wenya Wang, Sinno Jialin Pan

    Abstract: In this work, we study Unsupervised Domain Adaptation (UDA) in a challenging self-supervised approach. One of the difficulties is how to learn task discrimination in the absence of target labels. Unlike previous literature which directly aligns cross-domain distributions or leverages reverse gradient, we propose Domain Confused Contrastive Learning (DCCL) to bridge the source and the target domain… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: 14 pages, 7 figures, NAACL 2022

  25. arXiv:2203.03185  [pdf, other

    stat.ML cs.LG

    Covariate-Balancing-Aware Interpretable Deep Learning models for Treatment Effect Estimation

    Authors: Kan Chen, Qishuo Yin, Qi Long

    Abstract: Estimating treatment effects is of great importance for many biomedical applications with observational data. Particularly, interpretability of the treatment effects is preferable for many biomedical researchers. In this paper, we first provide a theoretical analysis and derive an upper bound for the bias of average treatment effect (ATE) estimation under the strong ignorability assumption. Derive… ▽ More

    Submitted 24 June, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

  26. arXiv:2112.11507  [pdf, other

    cs.LG stat.AP

    Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems

    Authors: Zongyu Dai, Zhiqi Bu, Qi Long

    Abstract: Missing data are present in most real world problems and need careful handling to preserve the prediction accuracy and statistical consistency in the downstream analysis. As the gold standard of handling missing data, multiple imputation (MI) methods are proposed to account for the imputation uncertainty and provide proper statistical inference. In this work, we propose Multiple Imputation via G… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

  27. A Simple Standard for Sharing Ontological Mappings (SSSOM)

    Authors: Nicolas Matentzoglu, James P. Balhoff, Susan M. Bello, Chris Bizon, Matthew Brush, Tiffany J. Callahan, Christopher G Chute, William D. Duncan, Chris T. Evelo, Davera Gabriel, John Graybeal, Alasdair Gray, Benjamin M. Gyori, Melissa Haendel, Henriette Harmse, Nomi L. Harris, Ian Harrow, Harshad Hegde, Amelia L. Hoyt, Charles T. Hoyt, Dazhi Jiao, Ernesto Jiménez-Ruiz, Simon Jupp, Hyeongsik Kim, Sebastian Koehler , et al. (19 additional authors not shown)

    Abstract: Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, ar… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: Corresponding author: Christopher J. Mungall <cjmungall@lbl.gov>

  28. arXiv:2112.04899  [pdf, other

    cs.LG cs.AI

    Assessing Fairness in the Presence of Missing Data

    Authors: Yiliang Zhang, Qi Long

    Abstract: Missing data are prevalent and present daunting challenges in real data analysis. While there is a growing body of literature on fairness in analysis of fully observed data, there has been little theoretical work on investigating fairness in analysis of incomplete data. In practice, a popular analytical approach for dealing with missing data is to use only the set of complete cases, i.e., observat… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

  29. arXiv:2110.12002  [pdf, ps, other

    cs.LG cs.AI stat.ME

    Fairness in Missing Data Imputation

    Authors: Yiliang Zhang, Qi Long

    Abstract: Missing data are ubiquitous in the era of big data and, if inadequately handled, are known to lead to biased findings and have deleterious impact on data-driven decision makings. To mitigate its impact, many missing value imputation methods have been developed. However, the fairness of these imputation methods across sensitive groups has not been studied. In this paper, we conduct the first known… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: Accepted to ICML 2021 Workshop

  30. arXiv:2107.08461  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially Private Bayesian Neural Networks on Accuracy, Privacy and Reliability

    Authors: Qiyiwen Zhang, Zhiqi Bu, Kan Chen, Qi Long

    Abstract: Bayesian neural network (BNN) allows for uncertainty quantification in prediction, offering an advantage over regular neural networks that has not been explored in the differential privacy (DP) framework. We fill this important gap by leveraging recent development in Bayesian deep learning and privacy accounting to offer a more precise analysis of the trade-off between privacy and accuracy in BNN.… ▽ More

    Submitted 18 February, 2023; v1 submitted 18 July, 2021; originally announced July 2021.

  31. Spatial-Temporal Graph ODE Networks for Traffic Flow Forecasting

    Authors: Zheng Fang, Qingqing Long, Guojie Song, Kunqing Xie

    Abstract: Spatial-temporal forecasting has attracted tremendous attention in a wide range of applications, and traffic flow prediction is a canonical and typical example. The complex and long-range spatial-temporal correlations of traffic flow bring it to a most intractable challenge. Existing works typically utilize shallow graph convolution networks (GNNs) and temporal extracting modules to model spatial… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

  32. arXiv:2106.07830  [pdf, other

    cs.LG stat.ML

    On the Convergence and Calibration of Deep Learning with Differential Privacy

    Authors: Zhiqi Bu, Hua Wang, Zongyu Dai, Qi Long

    Abstract: Differentially private (DP) training preserves the data privacy usually at the cost of slower convergence (and thus lower accuracy), as well as more severe mis-calibration than its non-private counterpart. To analyze the convergence of DP training, we formulate a continuous time analysis through the lens of neural tangent kernel (NTK), which characterizes the per-sample gradient clipping and the n… ▽ More

    Submitted 19 June, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

  33. arXiv:2105.07752  [pdf, other

    cs.AI cs.IR cs.LG

    Explicit Semantic Cross Feature Learning via Pre-trained Graph Neural Networks for CTR Prediction

    Authors: Feng Li, Bencheng Yan, Qingqing Long, Pengjie Wang, Wei Lin, Jian Xu, Bo Zheng

    Abstract: Cross features play an important role in click-through rate (CTR) prediction. Most of the existing methods adopt a DNN-based model to capture the cross features in an implicit manner. These implicit methods may lead to a sub-optimized performance due to the limitation in explicit semantic modeling. Although traditional statistical explicit semantic cross features can address the problem in these i… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: SIGIR 2021, 5 pages; The first two authors contributed equally to this work; Pengjie Wang gave a lot of guidance in this work

  34. arXiv:2104.02995  [pdf, other

    cs.LG cs.AI

    Theoretically Improving Graph Neural Networks via Anonymous Walk Graph Kernels

    Authors: Qingqing Long, Yilun Jin, Yi Wu, Guojie Song

    Abstract: Graph neural networks (GNNs) have achieved tremendous success in graph mining. However, the inability of GNNs to model substructures in graphs remains a significant drawback. Specifically, message-passing GNNs (MPGNNs), as the prevailing type of GNNs, have been theoretically shown unable to distinguish, detect or count many graph substructures. While efforts have been paid to complement the inabil… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: 11 pages

  35. arXiv:2103.01901  [pdf, ps, other

    stat.ML cs.LG

    A Theorem of the Alternative for Personalized Federated Learning

    Authors: Shuxiao Chen, Qinqing Zheng, Qi Long, Weijie J. Su

    Abstract: A widely recognized difficulty in federated learning arises from the statistical heterogeneity among clients: local datasets often come from different but not entirely unrelated distributions, and personalization is, therefore, necessary to achieve optimal results from each individual's perspective. In this paper, we show how the excess risks of personalized federated learning with a smooth, stron… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: 50 pages (main manuscript: 25 pages, appendices: 25 pages)

  36. arXiv:2102.11158  [pdf, other

    stat.ML cs.AI cs.CR cs.CV cs.LG

    Federated $f$-Differential Privacy

    Authors: Qinqing Zheng, Shuxiao Chen, Qi Long, Weijie J. Su

    Abstract: Federated learning (FL) is a training paradigm where the clients collaboratively learn models by repeatedly sharing information without compromising much on the privacy of their local sensitive data. In this paper, we introduce federated $f$-differential privacy, a new notion specifically tailored to the federated setting, based on the framework of Gaussian differential privacy. Federated $f$-diff… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Accepted to AISTATS 2021

  37. arXiv:2101.12699  [pdf, other

    cs.LG cs.CV math.OC stat.ML

    Exploring Deep Neural Networks via Layer-Peeled Model: Minority Collapse in Imbalanced Training

    Authors: Cong Fang, Hangfeng He, Qi Long, Weijie J. Su

    Abstract: In this paper, we introduce the \textit{Layer-Peeled Model}, a nonconvex yet analytically tractable optimization program, in a quest to better understand deep neural networks that are trained for a sufficiently long time. As the name suggests, this new model is derived by isolating the topmost layer from the remainder of the neural network, followed by imposing certain constraints separately on th… ▽ More

    Submitted 8 September, 2021; v1 submitted 29 January, 2021; originally announced January 2021.

    Comments: Accepted at Proceedings of the National Academy of Sciences (PNAS); Changed the title

  38. arXiv:2012.14954  [pdf

    cs.CR stat.ME

    Privacy-Preserving Methods for Vertically Partitioned Incomplete Data

    Authors: Yi Deng, Xiaoqian Jiang, Qi Long

    Abstract: Distributed health data networks that use information from multiple sources have drawn substantial interest in recent years. However, missing data are prevalent in such networks and present significant analytical challenges. The current state-of-the-art methods for handling missing data require pooling data into a central repository before analysis, which may not be possible in a distributed healt… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

    Journal ref: 2020 AMIA Annual Symposium Proceedings

  39. arXiv:2012.02434  [pdf, other

    cs.SI cs.LG

    Learning Node Representations from Noisy Graph Structures

    Authors: Junshan Wang, Ziyao Li, Qingqing Long, Weiyu Zhang, Guojie Song, Chuan Shi

    Abstract: Learning low-dimensional representations on graphs has proved to be effective in various downstream tasks. However, noises prevail in real-world networks, which compromise networks to a large extent in that edges in networks propagate noises through the whole network instead of only the node itself. While existing methods tend to focus on preserving structural properties, the robustness of the lea… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: 6 pages, 3 figures, ICDM 2020

  40. arXiv:2012.00234  [pdf, other

    cs.CV cs.RO

    RaP-Net: A Region-wise and Point-wise Weighting Network to Extract Robust Features for Indoor Localization

    Authors: Dongjiang Li, Jinyu Miao, Xuesong Shi, Yuxin Tian, Qiwei Long, Tianyu Cai, Ping Guo, Hongfei Yu, Wei Yang, Haosong Yue, Qi Wei, Fei Qiao

    Abstract: Feature extraction plays an important role in visual localization. Unreliable features on dynamic objects or repetitive regions will interfere with feature matching and challenge indoor localization greatly. To address the problem, we propose a novel network, RaP-Net, to simultaneously predict region-wise invariability and point-wise reliability, and then extract features by considering both of th… ▽ More

    Submitted 22 August, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: IROS 2021

  41. arXiv:2009.09654  [pdf, other

    cs.CL cs.AI

    Generative Imagination Elevates Machine Translation

    Authors: Quanyu Long, Mingxuan Wang, Lei Li

    Abstract: There are common semantics shared across text and images. Given a sentence in a source language, whether depicting the visual scene helps translation into a target language? Existing multimodal neural machine translation methods (MNMT) require triplets of bilingual sentence - image for training and tuples of source sentence - image for inference. In this paper, we propose ImagiT, a novel machine t… ▽ More

    Submitted 12 April, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

  42. arXiv:2008.05416  [pdf, other

    cs.CV cs.RO

    DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features

    Authors: Dongjiang Li, Xuesong Shi, Qiwei Long, Shenghui Liu, Wei Yang, Fangshi Wang, Qi Wei, Fei Qiao

    Abstract: A robust and efficient Simultaneous Localization and Mapping (SLAM) system is essential for robot autonomy. For visual SLAM algorithms, though the theoretical framework has been well established for most aspects, feature extraction and association is still empirically designed in most cases, and can be vulnerable in complex environments. This paper shows that feature extraction with deep convoluti… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: 8 pages, 5 figures, to be published in IROS 2020

  43. arXiv:2008.03392  [pdf, other

    stat.ME cs.LG

    Grouping effects of sparse CCA models in variable selection

    Authors: Kefei Liu, Qi Long, Li Shen

    Abstract: The sparse canonical correlation analysis (SCCA) is a bi-multivariate association model that finds sparse linear combinations of two sets of variables that are maximally correlated with each other. In addition to the standard SCCA model, a simplified SCCA criterion which maixmizes the cross-covariance between a pair of canonical variables instead of their cross-correlation, is widely used in the l… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

  44. arXiv:2006.14278  [pdf, other

    cs.LG cs.SI stat.ML

    Graph Structural-topic Neural Network

    Authors: Qingqing Long, Yilun Jin, Guojie Song, Yi Li, Wei Lin

    Abstract: Graph Convolutional Networks (GCNs) achieved tremendous success by effectively gathering local features for nodes. However, commonly do GCNs focus more on node features but less on graph structures within the neighborhood, especially higher-order structural patterns. However, such local structural patterns are shown to be indicative of node properties in numerous fields. In addition, it is not jus… ▽ More

    Submitted 4 July, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

  45. arXiv:2005.05683  [pdf, other

    cs.CL

    On the Robustness of Language Encoders against Grammatical Errors

    Authors: Fan Yin, Quanyu Long, Tao Meng, Kai-Wei Chang

    Abstract: We conduct a thorough study to diagnose the behaviors of pre-trained language encoders (ELMo, BERT, and RoBERTa) when confronted with natural grammatical errors. Specifically, we collect real grammatical errors from non-native speakers and conduct adversarial attacks to simulate these errors on clean text data. We use this approach to facilitate debugging models on downstream applications. Results… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: ACL 2020

  46. arXiv:2004.07684  [pdf, other

    cs.CV

    Joint Semantic Segmentation and Boundary Detection using Iterative Pyramid Contexts

    Authors: Mingmin Zhen, Jinglu Wang, Lei Zhou, Shiwei Li, Tianwei Shen, Jiaxiang Shang, Tian Fang, Quan Long

    Abstract: In this paper, we present a joint multi-task learning framework for semantic segmentation and boundary detection. The critical component in the framework is the iterative pyramid context module (PCM), which couples two tasks and stores the shared latent semantics to interact between the two tasks. For semantic boundary detection, we propose the novel spatial gradient fusion to suppress nonsemantic… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

  47. arXiv:2003.10423  [pdf, other

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning

    Authors: Qian Long, Zihan Zhou, Abhibav Gupta, Fei Fang, Yi Wu, Xiaolong Wang

    Abstract: In multi-agent games, the complexity of the environment can grow exponentially as the number of agents increases, so it is particularly challenging to learn good policies when the agent population is large. In this paper, we introduce Evolutionary Population Curriculum (EPC), a curriculum learning paradigm that scales up Multi-Agent Reinforcement Learning (MARL) by progressively increasing the pop… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: The project page is https://sites.google.com/view/epciclr2020 .The source code is released at https://github.com/qian18long/epciclr2020

  48. arXiv:2003.04493  [pdf, other

    stat.ML cs.AI cs.CR cs.LG stat.ME

    Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion

    Authors: Qinqing Zheng, Jinshuo Dong, Qi Long, Weijie J. Su

    Abstract: Datasets containing sensitive information are often sequentially analyzed by many algorithms. This raises a fundamental question in differential privacy regarding how the overall privacy bound degrades under composition. To address this question, we introduce a family of analytical and sharp privacy bounds under composition using the Edgeworth expansion in the framework of the recently proposed f-… ▽ More

    Submitted 25 March, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

  49. arXiv:2001.09678  [pdf, other

    cs.CV

    A Robust Real-Time Computing-based Environment Sensing System for Intelligent Vehicle

    Authors: Qiwei Xie, Qian Long, Liming Zhang, Zhao Sun

    Abstract: For intelligent vehicles, sensing the 3D environment is the first but crucial step. In this paper, we build a real-time advanced driver assistance system based on a low-power mobile platform. The system is a real-time multi-scheme integrated innovation system, which combines stereo matching algorithm with machine learning based obstacle detection approach and takes advantage of the distributed com… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

  50. arXiv:1911.11607  [pdf, other

    cs.LG cs.CR stat.ML

    Deep Learning with Gaussian Differential Privacy

    Authors: Zhiqi Bu, Jinshuo Dong, Qi Long, Weijie J. Su

    Abstract: Deep learning models are often trained on datasets that contain sensitive information such as individuals' shopping transactions, personal contacts, and medical records. An increasingly important line of work therefore has sought to train neural networks subject to privacy constraints that are specified by differential privacy or its divergence-based relaxations. These privacy definitions, however… ▽ More

    Submitted 22 July, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: To appear in Harvard Data Science Review