Skip to main content

Showing 1–50 of 184 results for author: Ling, C

  1. arXiv:2406.10310  [pdf, other

    cs.CL cs.AI

    TEG-DB: A Comprehensive Dataset and Benchmark of Textual-Edge Graphs

    Authors: Zhuofeng Li, Zixing Gou, Xiangnan Zhang, Zhongyuan Liu, Sirui Li, Yuntong Hu, Chen Ling, Zheng Zhang, Liang Zhao

    Abstract: Text-Attributed Graphs (TAGs) augment graph structures with natural language descriptions, facilitating detailed depictions of data and their interconnections across various real-world settings. However, existing TAG datasets predominantly feature textual information only at the nodes, with edges typically represented by mere binary or categorical attributes. This lack of rich textual edge annotat… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2406.08231  [pdf, other

    cs.CV cs.AI

    Using Deep Convolutional Neural Networks to Detect Rendered Glitches in Video Games

    Authors: Carlos Garcia Ling, Konrad Tollmar, Linus Gisslen

    Abstract: In this paper, we present a method using Deep Convolutional Neural Networks (DCNNs) to detect common glitches in video games. The problem setting consists of an image (800x800 RGB) as input to be classified into one of five defined classes, normal image, or one of four different kinds of glitches (stretched, low resolution, missing and placeholder textures). Using a supervised approach, we train a… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures, AAIDE conference

  3. arXiv:2405.20790  [pdf, other

    cs.LG cs.CY

    Intersectional Unfairness Discovery

    Authors: Gezheng Xu, Qi Chen, Charles Ling, Boyu Wang, Changjian Shui

    Abstract: AI systems have been shown to produce unfair results for certain subgroups of population, highlighting the need to understand bias on certain sensitive attributes. Current research often falls short, primarily focusing on the subgroups characterized by a single sensitive attribute, while neglecting the nature of intersectional fairness of multiple sensitive attributes. This paper focuses on its on… ▽ More

    Submitted 6 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: ICML-2024 camera-ready

  4. arXiv:2405.16800  [pdf, other

    cs.LG cs.AI

    TAGA: Text-Attributed Graph Self-Supervised Learning by Synergizing Graph and Text Mutual Transformations

    Authors: Zheng Zhang, Yuntong Hu, Bo Pan, Chen Ling, Liang Zhao

    Abstract: Text-Attributed Graphs (TAGs) enhance graph structures with natural language descriptions, enabling detailed representation of data and their relationships across a broad spectrum of real-world scenarios. Despite the potential for deeper insights, existing TAG representation learning primarily relies on supervised methods, necessitating extensive labeled data and limiting applicability across dive… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  5. arXiv:2405.16606  [pdf, other

    cs.SI

    Link Prediction on Textual Edge Graphs

    Authors: Chen Ling, Zhuofeng Li, Yuntong Hu, Zheng Zhang, Zhongyuan Liu, Shuang Zheng, Liang Zhao

    Abstract: Textual-edge Graphs (TEGs), characterized by rich text annotations on edges, are increasingly significant in network science due to their ability to capture rich contextual information among entities. Existing works have proposed various edge-aware graph neural networks (GNNs) or let language models directly make predictions. However, they often fall short of fully capturing the contextualized sem… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  6. arXiv:2405.16506  [pdf, other

    cs.LG

    GRAG: Graph Retrieval-Augmented Generation

    Authors: Yuntong Hu, Zhihan Lei, Zheng Zhang, Bo Pan, Chen Ling, Liang Zhao

    Abstract: While Retrieval-Augmented Generation (RAG) enhances the accuracy and relevance of responses by generative language models, it falls short in graph-based contexts where both textual and topological information are important. Naive RAG approaches inherently neglect the structural intricacies of textual graphs, resulting in a critical gap in the generation process. To address this challenge, we intro… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 14 pages, 4 figures

  7. Modeling User Fatigue for Sequential Recommendation

    Authors: Nian Li, Xin Ban, Cheng Ling, Chen Gao, Lantao Hu, Peng Jiang, Kun Gai, Yong Li, Qingmin Liao

    Abstract: Recommender systems filter out information that meets user interests. However, users may be tired of the recommendations that are too similar to the content they have been exposed to in a short historical period, which is the so-called user fatigue. Despite the significance for a better user experience, user fatigue is seldom explored by existing recommenders. In fact, there are three main challen… ▽ More

    Submitted 22 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: SIGIR 2024

  8. arXiv:2405.10124  [pdf, ps, other

    cs.IT

    Smoothing Linear Codes by Rényi Divergence and Applications to Security Reduction

    Authors: Hao Yan, Cong Ling

    Abstract: The concept of the smoothing parameter plays a crucial role in both lattice-based and code-based cryptography, primarily due to its effectiveness in achieving nearly uniform distributions through the addition of noise. Recent research by Pathegama and Barg has determined the optimal smoothing bound for random codes under Rényi Divergence for any order $α\in (1, \infty)$ \cite{pathegama2024r}. Cons… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  9. arXiv:2405.09784  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Online bipartite matching with imperfect advice

    Authors: Davin Choo, Themis Gouleakis, Chun Kai Ling, Arnab Bhattacharyya

    Abstract: We study the problem of online unweighted bipartite matching with $n$ offline vertices and $n$ online vertices where one wishes to be competitive against the optimal offline algorithm. While the classic RANKING algorithm of Karp et al. [1990] provably attains competitive ratio of $1-1/e > 1/2$, we show that no learning-augmented method can be both 1-consistent and strictly better than $1/2$-robust… ▽ More

    Submitted 23 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted into ICML 2024

  10. arXiv:2405.04051  [pdf, ps, other

    cs.IT

    On the quantization goodness of polar lattices

    Authors: Ling Liu, Shanxiang Lyu, Cong Ling, Baoming Bai

    Abstract: In this work, we prove that polar lattices, when tailored for lossy compression, are quantization-good in the sense that their normalized second moments approach $\frac{1}{2πe}$ as the dimension of lattices increases. It has been predicted by Zamir et al. \cite{ZamirQZ96} that the Entropy Coded Dithered Quantization (ECDQ) system using quantization-good lattices can achieve the rate-distortion bou… ▽ More

    Submitted 13 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 12 pages, 5 figures, submitted to IEEE for possible publication

  11. arXiv:2405.03070  [pdf, other

    cs.GT

    Layered Graph Security Games

    Authors: Jakub Černý, Chun Kai Ling, Christian Kroer, Garud Iyengar

    Abstract: Security games model strategic interactions in adversarial real-world applications. Such applications often involve extremely large but highly structured strategy sets (e.g., selecting a distribution over all patrol routes in a given graph). In this paper, we represent each player's strategy space using a layered graph whose paths represent an exponentially large strategy space. Our formulation en… ▽ More

    Submitted 9 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: In Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence. IJCAI Press, 2024

  12. arXiv:2405.01680  [pdf, other

    cs.LG

    Physics-Informed Neural Networks: Minimizing Residual Loss with Wide Networks and Effective Activations

    Authors: Nima Hosseini Dashtbayaz, Ghazal Farhani, Boyu Wang, Charles X. Ling

    Abstract: The residual loss in Physics-Informed Neural Networks (PINNs) alters the simple recursive relation of layers in a feed-forward neural network by applying a differential operator, resulting in a loss landscape that is inherently different from those of common supervised problems. Therefore, relying on the existing theory leads to unjustified design choices and suboptimal performance. In this work,… ▽ More

    Submitted 12 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted at IJCAI 2024. V2: Corrected typos

  13. arXiv:2404.14668  [pdf, other

    cs.SI

    Source Localization for Cross Network Information Diffusion

    Authors: Chen Ling, Tanmoy Chowdhury, Jie Ji, Sirui Li, Andreas Züfle, Liang Zhao

    Abstract: Source localization aims to locate information diffusion sources only given the diffusion observation, which has attracted extensive attention in the past few years. Existing methods are mostly tailored for single networks and may not be generalized to handle more complex networks like cross-networks. Cross-network is defined as two interconnected networks, where one network's functionality depend… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Code and data are available at: https://github.com/tanmoysr/CNSL/

  14. arXiv:2403.13574  [pdf, other

    cs.IR cs.AI

    A Large Language Model Enhanced Sequential Recommender for Joint Video and Comment Recommendation

    Authors: Bowen Zheng, Zihan Lin, Enze Liu, Chen Yang, Enyang Bai, Cheng Ling, Wayne Xin Zhao, Ji-Rong Wen

    Abstract: In online video platforms, reading or writing comments on interesting videos has become an essential part of the video watching experience. However, existing video recommender systems mainly model users' interaction behaviors with videos, lacking consideration of comments in user behavior modeling. In this paper, we propose a novel recommendation approach called LSVCR by leveraging user interactio… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  15. arXiv:2403.11440  [pdf, ps, other

    cs.CV

    Boosting Continuous Emotion Recognition with Self-Pretraining using Masked Autoencoders, Temporal Convolutional Networks, and Transformers

    Authors: Weiwei Zhou, Jiada Lu, Chenkun Ling, Weifeng Wang, Shaowei Liu

    Abstract: Human emotion recognition holds a pivotal role in facilitating seamless human-computer interaction. This paper delineates our methodology in tackling the Valence-Arousal (VA) Estimation Challenge, Expression (Expr) Classification Challenge, and Action Unit (AU) Detection Challenge within the ambit of the 6th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW). Our study advo… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  16. arXiv:2402.17946  [pdf, other

    cs.CL

    SparseLLM: Towards Global Pruning for Pre-trained Language Models

    Authors: Guangji Bai, Yijiang Li, Chen Ling, Kibaek Kim, Liang Zhao

    Abstract: The transformative impact of large language models (LLMs) like LLaMA and GPT on natural language processing is countered by their prohibitive computational demands. Pruning has emerged as a pivotal compression strategy, introducing sparsity to enhance both memory and computational efficiency. Yet, traditional global pruning is impractical for LLMs due to scalability issues, while local pruning, de… ▽ More

    Submitted 23 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Preprint. Under review

  17. arXiv:2402.16898  [pdf, other

    cs.SI cs.AI cs.LG math.PR stat.ML

    MIM-Reasoner: Learning with Theoretical Guarantees for Multiplex Influence Maximization

    Authors: Nguyen Do, Tanmoy Chowdhury, Chen Ling, Liang Zhao, My T. Thai

    Abstract: Multiplex influence maximization (MIM) asks us to identify a set of seed users such as to maximize the expected number of influenced users in a multiplex network. MIM has been one of central research topics, especially in nowadays social networking landscape where users participate in multiple online social networks (OSNs) and their influences can propagate among several OSNs simultaneously. Altho… ▽ More

    Submitted 10 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Journal ref: International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  18. arXiv:2402.13098  [pdf, other

    cs.CL cs.AI

    ELAD: Explanation-Guided Large Language Models Active Distillation

    Authors: Yifei Zhang, Bo Pan, Chen Ling, Yuntong Hu, Liang Zhao

    Abstract: The deployment and application of Large Language Models (LLMs) is hindered by their memory inefficiency, computational demands, and the high costs of API inferences. Traditional distillation methods, which transfer the capabilities of LLMs to smaller models, often fail to determine whether the knowledge has been sufficiently transferred, potentially resulting in high costs or incomplete distillati… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  19. arXiv:2402.10779  [pdf, other

    cs.CL

    A Condensed Transition Graph Framework for Zero-shot Link Prediction with Large Language Models

    Authors: Mingchen Li, Chen Ling, Rui Zhang, Liang Zhao

    Abstract: Zero-shot link prediction (ZSLP) on knowledge graphs aims at automatically identifying relations between given entities. Existing methods primarily employ auxiliary information to predict tail entity given head entity and its relation, yet face challenges due to the occasional unavailability of such detailed information and the inherent simplicity of predicting tail entities based on semantic simi… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  20. arXiv:2402.10189  [pdf, other

    cs.CL cs.LG

    Uncertainty Quantification for In-Context Learning of Large Language Models

    Authors: Chen Ling, Xujiang Zhao, Xuchao Zhang, Wei Cheng, Yanchi Liu, Yiyou Sun, Mika Oishi, Takao Osaki, Katsushi Matsuda, Jie Ji, Guangji Bai, Liang Zhao, Haifeng Chen

    Abstract: In-context learning has emerged as a groundbreaking ability of Large Language Models (LLMs) and revolutionized various fields by providing a few task-relevant demonstrations in the prompt. However, trustworthy issues with LLM's response, such as hallucination, have also been actively discussed. Existing works have been devoted to quantifying the uncertainty in LLM's response, but they often overlo… ▽ More

    Submitted 28 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted to the main conference of NAACL 2024

  21. arXiv:2402.07834  [pdf, other

    cs.LG

    Generalizing across Temporal Domains with Koopman Operators

    Authors: Qiuhao Zeng, Wei Wang, Fan Zhou, Gezheng Xu, Ruizhi Pu, Changjian Shui, Christian Gagne, Shichun Yang, Boyu Wang, Charles X. Ling

    Abstract: In the field of domain generalization, the task of constructing a predictive model capable of generalizing to a target domain without access to target data remains challenging. This problem becomes further complicated when considering evolving dynamics between domains. While various approaches have been proposed to address this issue, a comprehensive understanding of the underlying generalization… ▽ More

    Submitted 15 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 15 pages, 7 figures, Accepted by AAAI 2024. arXiv admin note: text overlap with arXiv:2206.00047

  22. arXiv:2402.03030  [pdf, other

    cs.IT eess.SP

    Rejection-Sampled Universal Quantization for Smaller Quantization Errors

    Authors: Chih Wei Ling, Cheuk Ting Li

    Abstract: We construct a randomized vector quantizer which has a smaller maximum error compared to all known lattice quantizers with the same entropy for dimensions 5, 6, ..., 48, and also has a smaller mean squared error compared to known lattice quantizers with the same entropy for dimensions 35, ..., 48, in the high resolution limit. Moreover, our randomized quantizer has a desirable property that the qu… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 15 pages, 2 figures

  23. arXiv:2401.10773  [pdf, other

    cs.IT

    Multilevel lattice codes from Hurwitz quaternion integers

    Authors: Juliana G. F. Souza, Sueli I. R. Costa, Cong Ling

    Abstract: This work presents an extension of the Construction $π_A$ lattices proposed in \cite{huang2017construction}, to Hurwitz quaternion integers. This construction is provided by using an isomorphism from a version of the Chinese remainder theorem applied to maximal orders in contrast to natural orders in prior works. Exploiting this map, we analyze the performance of the resulting multilevel lattice c… ▽ More

    Submitted 27 February, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

  24. arXiv:2401.09490  [pdf, other

    q-bio.QM cs.IR

    Gene-associated Disease Discovery Powered by Large Language Models

    Authors: Jiayu Chang, Shiyu Wang, Chen Ling, Zhaohui Qin, Liang Zhao

    Abstract: The intricate relationship between genetic variation and human diseases has been a focal point of medical research, evidenced by the identification of risk genes regarding specific diseases. The advent of advanced genome sequencing techniques has significantly improved the efficiency and cost-effectiveness of detecting these genetic markers, playing a crucial role in disease diagnosis and forming… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: This is the official paper accepted by AAAI 2024 Workshop on Large Language Models for Biological Discoveries

  25. arXiv:2401.00625  [pdf, ps, other

    cs.LG

    Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models

    Authors: Guangji Bai, Zheng Chai, Chen Ling, Shiyu Wang, Jiaying Lu, Nan Zhang, Tingwei Shi, Ziyang Yu, Mengdan Zhu, Yifei Zhang, Carl Yang, Yue Cheng, Liang Zhao

    Abstract: The burgeoning field of Large Language Models (LLMs), exemplified by sophisticated models like OpenAI's ChatGPT, represents a significant advancement in artificial intelligence. These models, however, bring forth substantial challenges in the high consumption of computational, memory, energy, and financial resources, especially in environments with limited resource capabilities. This survey aims t… ▽ More

    Submitted 3 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: Preprint. GitHub repo: https://github.com/tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers

  26. arXiv:2312.15566  [pdf, other

    stat.ML cs.LG

    Deep Copula-Based Survival Analysis for Dependent Censoring with Identifiability Guarantees

    Authors: Weijia Zhang, Chun Kai Ling, Xuanhui Zhang

    Abstract: Censoring is the central problem in survival analysis where either the time-to-event (for instance, death), or the time-tocensoring (such as loss of follow-up) is observed for each sample. The majority of existing machine learning-based survival analysis methods assume that survival is conditionally independent of censoring given a set of covariates; an assumption that cannot be verified since onl… ▽ More

    Submitted 4 July, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

    Comments: To appear in AAAI 2024

  27. arXiv:2312.09058  [pdf, other

    cs.GT

    Learning Coalition Structures with Games

    Authors: Yixuan Even Xu, Chun Kai Ling, Fei Fang

    Abstract: Coalitions naturally exist in many real-world systems involving multiple decision makers such as ridesharing, security, and online ad auctions, but the coalition structure among the agents is often unknown. We propose and study an important yet previously overseen problem -- Coalition Structure Learning (CSL), where we aim to carefully design a series of games for the agents and infer the underlyi… ▽ More

    Submitted 18 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 13 pages, 4 figures, 3 tables, aaai 2024

  28. arXiv:2312.05822  [pdf, other

    cs.AI

    Toward Open-ended Embodied Tasks Solving

    Authors: William Wei Wang, Dongqi Han, Xufang Luo, Yifei Shen, Charles Ling, Boyu Wang, Dongsheng Li

    Abstract: Empowering embodied agents, such as robots, with Artificial Intelligence (AI) has become increasingly important in recent years. A major challenge is task open-endedness. In practice, robots often need to perform tasks with novel goals that are multifaceted, dynamic, lack a definitive "end-state", and were not encountered during training. To tackle this problem, this paper introduces \textit{Diffu… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  29. arXiv:2311.16392  [pdf, other

    cs.GT cs.AI

    Multi-defender Security Games with Schedules

    Authors: Zimeng Song, Chun Kai Ling, Fei Fang

    Abstract: Stackelberg Security Games are often used to model strategic interactions in high-stakes security settings. The majority of existing models focus on single-defender settings where a single entity assumes command of all security assets. However, many realistic scenarios feature multiple heterogeneous defenders with their own interests and priorities embedded in a more complex system. Furthermore, d… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Extended version of the paper accepted to GameSec 2023

  30. Hessian Aware Low-Rank Perturbation for Order-Robust Continual Learning

    Authors: Jiaqi Li, Yuanhao Lai, Rui Wang, Changjian Shui, Sabyasachi Sahoo, Charles X. Ling, Shichun Yang, Boyu Wang, Christian Gagné, Fan Zhou

    Abstract: Continual learning aims to learn a series of tasks sequentially without forgetting the knowledge acquired from the previous ones. In this work, we propose the Hessian Aware Low-Rank Perturbation algorithm for continual learning. By modeling the parameter transitions along the sequential tasks with the weight matrix transformation, we propose to apply the low-rank approximation on the task-adaptive… ▽ More

    Submitted 7 July, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted by IEEE Transactions on Knowledge and Data Engineering (TKDE)

  31. arXiv:2310.11672  [pdf, other

    cs.CL

    Open-ended Commonsense Reasoning with Unrestricted Answer Scope

    Authors: Chen Ling, Xuchao Zhang, Xujiang Zhao, Yanchi Liu, Wei Cheng, Mika Oishi, Takao Osaki, Katsushi Matsuda, Haifeng Chen, Liang Zhao

    Abstract: Open-ended Commonsense Reasoning is defined as solving a commonsense question without providing 1) a short list of answer candidates and 2) a pre-defined answer scope. Conventional ways of formulating the commonsense question into a question-answering form or utilizing external knowledge to learn retrieval-based methods are less applicable in the open-ended setting due to an inherent challenge. Wi… ▽ More

    Submitted 27 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  32. arXiv:2310.10408  [pdf, other

    eess.IV cs.CV cs.LG

    A cross Transformer for image denoising

    Authors: Chunwei Tian, Menghua Zheng, Wangmeng Zuo, Shichao Zhang, Yanning Zhang, Chia-Wen Ling

    Abstract: Deep convolutional neural networks (CNNs) depend on feedforward and feedback ways to obtain good performance in image denoising. However, how to obtain effective structural information via CNNs to efficiently represent given noisy images is key for complex scenes. In this paper, we propose a cross Transformer denoising CNN (CTNet) with a serial block (SB), a parallel block (PB), and a residual blo… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  33. arXiv:2309.08301  [pdf, other

    cs.RO

    RaSpectLoc: RAman SPECTroscopy-dependent robot LOCalisation

    Authors: Christopher Thomas Thirgood, Oscar Alejandro Mendez Maldonado, Chao Ling, Jonathan Storey, Simon J Hadfield

    Abstract: This paper presents a new information source for supporting robot localisation: material composition. The proposed method complements the existing visual, structural, and semantic cues utilized in the literature. However, it has a distinct advantage in its ability to differentiate structurally, visually or categorically similar objects such as different doors, by using Raman spectrometers. Such de… ▽ More

    Submitted 21 September, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 8 pages, 5 figures. This work will be presented at IROS 2023

  34. arXiv:2309.06982  [pdf, other

    cs.CR

    Communication-Efficient Laplace Mechanism for Differential Privacy via Random Quantization

    Authors: Ali Moradi Shahmiri, Chih Wei Ling, Cheuk Ting Li

    Abstract: We propose the first method that realizes the Laplace mechanism exactly (i.e., a Laplace noise is added to the data) that requires only a finite amount of communication (whereas the original Laplace mechanism requires the transmission of a real number) while guaranteeing privacy against the server and database. Our mechanism can serve as a drop-in replacement for local or centralized differential… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 11 pages, 3 figures, short version to be submitted at 2024 IEEE International Conference on Acoustics, Speech and Signal Processing

  35. arXiv:2309.03433  [pdf, other

    cs.CL

    Improving Open Information Extraction with Large Language Models: A Study on Demonstration Uncertainty

    Authors: Chen Ling, Xujiang Zhao, Xuchao Zhang, Yanchi Liu, Wei Cheng, Haoyu Wang, Zhengzhang Chen, Takao Osaki, Katsushi Matsuda, Haifeng Chen, Liang Zhao

    Abstract: Open Information Extraction (OIE) task aims at extracting structured facts from unstructured text, typically in the form of (subject, relation, object) triples. Despite the potential of large language models (LLMs) like ChatGPT as a general task solver, they lag behind state-of-the-art (supervised) methods in OIE tasks due to two key issues. First, LLMs struggle to distinguish irrelevant context f… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  36. arXiv:2309.02978  [pdf, other

    cs.SI cs.IR

    Helper Recommendation with seniority control in Online Health Community

    Authors: Junruo Gao, Chen Ling, Carl Yang, Liang Zhao

    Abstract: Online health communities (OHCs) are forums where patients with similar conditions communicate their experiences and provide moral support. Social support in OHCs plays a crucial role in easing and rehabilitating patients. However, many time-sensitive questions from patients often remain unanswered due to the multitude of threads and the random nature of patient visits in OHCs. To address this iss… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  37. arXiv:2308.05472  [pdf, other

    cs.IT

    PAC Codes for Source and Joint Source-Channel Coding

    Authors: Mengfan Zheng, Cong Ling

    Abstract: Polarization-adjusted convolutional (PAC) codes, as a concatenated coding scheme based on polar codes, is able to approach the finite-length bound of binary-input AWGN channel at short blocklengths. In this paper, we extend PAC codes to the fields of source coding and joint source-channel coding and show that they can also approach the corresponding finite-length bounds at short blocklengths.

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: 6 pages, 6 figures. Submitted to GC 2023 Workshop - Channel Coding Beyond 5G

  38. arXiv:2306.16077  [pdf, other

    cs.LG cs.AI cs.DC

    Secure and Fast Asynchronous Vertical Federated Learning via Cascaded Hybrid Optimization

    Authors: Ganyu Wang, Qingsong Zhang, Li Xiang, Boyu Wang, Bin Gu, Charles Ling

    Abstract: Vertical Federated Learning (VFL) attracts increasing attention because it empowers multiple parties to jointly train a privacy-preserving model over vertically partitioned data. Recent research has shown that applying zeroth-order optimization (ZOO) has many advantages in building a practical VFL algorithm. However, a vital problem with the ZOO-based VFL is its slow convergence rate, which limits… ▽ More

    Submitted 29 June, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Under Review

  39. arXiv:2306.04802  [pdf, other

    cs.AI cs.CL cs.LG cs.SI

    A Review on Knowledge Graphs for Healthcare: Resources, Applications, and Promises

    Authors: Hejie Cui, Jiaying Lu, Shiyu Wang, Ran Xu, Wenjing Ma, Shaojun Yu, Yue Yu, Xuan Kan, Chen Ling, Tianfan Fu, Liang Zhao, Joyce Ho, Fei Wang, Carl Yang

    Abstract: Healthcare knowledge graphs (HKGs) are valuable tools for organizing biomedical concepts and their relationships with interpretable structures. The recent advent of large language models (LLMs) has paved the way for building more comprehensive and accurate HKGs. This, in turn, can improve the reliability of generated content and enable better evaluation of LLMs. However, the challenges of HKGs suc… ▽ More

    Submitted 19 February, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  40. arXiv:2306.04539  [pdf, other

    cs.LG cs.CL cs.CV cs.IT stat.ML

    Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications

    Authors: Paul Pu Liang, Chun Kai Ling, Yun Cheng, Alex Obolenskiy, Yudong Liu, Rohan Pandey, Alex Wilf, Louis-Philippe Morency, Ruslan Salakhutdinov

    Abstract: In many machine learning systems that jointly learn from multiple modalities, a core research question is to understand the nature of multimodal interactions: how modalities combine to provide new task-relevant information that was not present in either alone. We study this challenge of interaction quantification in a semi-supervised setting with only labeled unimodal data and naturally co-occurri… ▽ More

    Submitted 13 June, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: ICLR 2024, Code available at: https://github.com/pliang279/PID

  41. arXiv:2305.18703  [pdf, other

    cs.CL cs.AI

    Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

    Authors: Chen Ling, Xujiang Zhao, Jiaying Lu, Chengyuan Deng, Can Zheng, Junxiang Wang, Tanmoy Chowdhury, Yun Li, Hejie Cui, Xuchao Zhang, Tianjiao Zhao, Amit Panalkar, Dhagash Mehta, Stefano Pasquali, Wei Cheng, Haoyu Wang, Yanchi Liu, Zhengzhang Chen, Haifeng Chen, Chris White, Quanquan Gu, Jian Pei, Carl Yang, Liang Zhao

    Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of dom… ▽ More

    Submitted 29 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

  42. arXiv:2305.06788  [pdf, other

    cs.IT

    Vector Quantization with Error Uniformly Distributed over an Arbitrary Set

    Authors: Chih Wei Ling, Cheuk Ting Li

    Abstract: For uniform scalar quantization, the error distribution is approximately a uniform distribution over an interval (which is also a 1-dimensional ball). Nevertheless, for lattice vector quantization, the error distribution is uniform not over a ball, but over the basic cell of the quantization lattice. In this paper, we construct vector quantizers with periodic properties, where the error is uniform… ▽ More

    Submitted 24 January, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: 22 pages, 3 figures. Short version presented at 2023 IEEE International Symposium on Information Theory

  43. arXiv:2305.02200  [pdf, other

    cs.SI cs.LG

    Deep Graph Representation Learning and Optimization for Influence Maximization

    Authors: Chen Ling, Junji Jiang, Junxiang Wang, My Thai, Lukas Xue, James Song, Meikang Qiu, Liang Zhao

    Abstract: Influence maximization (IM) is formulated as selecting a set of initial users from a social network to maximize the expected number of influenced users. Researchers have made great progress in designing various traditional methods, and their theoretical design and performance gain are close to a limit. In the past few years, learning-based IM methods have emerged to achieve stronger generalization… ▽ More

    Submitted 6 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: In Proceedings of the 40th International Conference on Machine Learning (ICML 2023), Honolulu, Hawaii, USA. PMLR 202, 2023

  44. arXiv:2303.07099  [pdf, other

    cs.CY cs.SI

    Beyond Fish and Bicycles: Exploring the Varieties of Online Women's Ideological Spaces

    Authors: Utkucan Balci, Chen Ling, Emiliano De Cristofaro, Megan Squire, Gianluca Stringhini, Jeremy Blackburn

    Abstract: The Internet has been instrumental in connecting under-represented and vulnerable groups of people. Platforms built to foster social interaction and engagement have enabled historically disenfranchised groups to have a voice. One such vulnerable group is women. In this paper, we explore the diversity in online women's ideological spaces using a multi-dimensional approach. We perform a large-scale,… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Journal ref: Published in the Proceedings of the 15th ACM Web Science Conference 2023 (ACM WebSci 2023). Please cite the WebSci version

  45. arXiv:2302.12247  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.IT

    Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework

    Authors: Paul Pu Liang, Yun Cheng, Xiang Fan, Chun Kai Ling, Suzanne Nie, Richard Chen, Zihao Deng, Nicholas Allen, Randy Auerbach, Faisal Mahmood, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: The recent explosion of interest in multimodal applications has resulted in a wide selection of datasets and methods for representing and integrating information from different modalities. Despite these empirical advances, there remain fundamental research questions: How can we quantify the interactions that are necessary to solve a multimodal task? Subsequently, what are the most suitable multimo… ▽ More

    Submitted 10 December, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023. Code available at: https://github.com/pliang279/PID

  46. arXiv:2302.02093  [pdf

    cs.AI cs.NE

    Knowledge-enhanced Neural Machine Reasoning: A Review

    Authors: Tanmoy Chowdhury, Chen Ling, Xuchao Zhang, Xujiang Zhao, Guangji Bai, Jian Pei, Haifeng Chen, Liang Zhao

    Abstract: Knowledge-enhanced neural machine reasoning has garnered significant attention as a cutting-edge yet challenging research area with numerous practical applications. Over the past few years, plenty of studies have leveraged various forms of external knowledge to augment the reasoning capabilities of deep models, tackling challenges such as effective knowledge integration, implicit knowledge mining,… ▽ More

    Submitted 6 February, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: 8 pages, 3 figures

  47. arXiv:2302.01516  [pdf, other

    cs.CV

    Class Overwhelms: Mutual Conditional Blended-Target Domain Adaptation

    Authors: Pengcheng Xu, Boyu Wang, Charles Ling

    Abstract: Current methods of blended targets domain adaptation (BTDA) usually infer or consider domain label information but underemphasize hybrid categorical feature structures of targets, which yields limited performance, especially under the label distribution shift. We demonstrate that domain labels are not directly necessary for BTDA if categorical distributions of various domains are sufficiently alig… ▽ More

    Submitted 8 March, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Journal ref: AAAI2023 Oral

  48. arXiv:2301.13381  [pdf, other

    cs.LG cs.CV

    When Source-Free Domain Adaptation Meets Learning with Noisy Labels

    Authors: Li Yi, Gezheng Xu, Pengcheng Xu, Jiaqi Li, Ruizhi Pu, Charles Ling, A. Ian McLeod, Boyu Wang

    Abstract: Recent state-of-the-art source-free domain adaptation (SFDA) methods have focused on learning meaningful cluster structures in the feature space, which have succeeded in adapting the knowledge from source domain to unlabeled target domain without accessing the private source data. However, existing methods rely on the pseudo-labels generated by source models that can be noisy due to domain shift.… ▽ More

    Submitted 24 February, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: ICLR 2023 camera-ready

  49. arXiv:2301.09159  [pdf, other

    cs.GT cs.AI cs.LG

    Abstracting Imperfect Information Away from Two-Player Zero-Sum Games

    Authors: Samuel Sokota, Ryan D'Orazio, Chun Kai Ling, David J. Wu, J. Zico Kolter, Noam Brown

    Abstract: In their seminal work, Nayyar et al. (2013) showed that imperfect information can be abstracted away from common-payoff games by having players publicly announce their policies as they play. This insight underpins sound solvers and decision-time planning algorithms for common-payoff games. Unfortunately, a naive application of the same insight to two-player zero-sum games fails because Nash equili… ▽ More

    Submitted 31 July, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

  50. arXiv:2301.07845  [pdf, other

    cs.CV cs.AI

    Foresee What You Will Learn: Data Augmentation for Domain Generalization in Non-stationary Environment

    Authors: Qiuhao Zeng, Wei Wang, Fan Zhou, Charles Ling, Boyu Wang

    Abstract: Existing domain generalization aims to learn a generalizable model to perform well even on unseen domains. For many real-world machine learning applications, the data distribution often shifts gradually along domain indices. For example, a self-driving car with a vision system drives from dawn to dusk, with the sky darkening gradually. Therefore, the system must be able to adapt to changes in ambi… ▽ More

    Submitted 8 March, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: 12 pages, 6 figures, accepted by AAAI 2023