Skip to main content

Showing 101–150 of 494 results for author: Yu, P S

  1. arXiv:2306.11986  [pdf, other

    cs.IR cs.LG

    Addressing the Rank Degeneration in Sequential Recommendation via Singular Spectrum Smoothing

    Authors: Ziwei Fan, Zhiwei Liu, Hao Peng, Philip S. Yu

    Abstract: Sequential recommendation (SR) investigates the dynamic user preferences modeling and generates the next-item prediction. The next item preference is typically generated by the affinity between the sequence and item representations. However, both sequence and item representations suffer from the rank degeneration issue due to the data sparsity problem. The rank degeneration issue significantly imp… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 18 pages, regularizations on preserving embedding rank are surrogates of intra-list recommendation diversity (controllable diversity). The code is in https://github.com/zfan20/SPMRec

  2. arXiv:2306.06470  [pdf, other

    cs.DB

    TALENT: Targeted Mining of Non-overlapping Sequential Patterns

    Authors: Zefeng Chen, Wensheng Gan, Gengsen Huang, Zhenlian Qi, Yan Li, Philip S. Yu

    Abstract: With the widespread application of efficient pattern mining algorithms, sequential patterns that allow gap constraints have become a valuable tool to discover knowledge from biological data such as DNA and protein sequences. Among all kinds of gap-constrained mining, non-overlapping sequence mining can mine interesting patterns and satisfy the anti-monotonic property (the Apriori property). Howeve… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Preprint. 9 figures, 5 tables

  3. arXiv:2306.05257  [pdf, other

    cs.LG q-bio.QM

    Comprehensive evaluation of deep and graph learning on drug-drug interactions prediction

    Authors: Xuan Lin, Lichang Dai, Yafang Zhou, Zu-Guo Yu, Wen Zhang, Jian-Yu Shi, Dong-Sheng Cao, Li Zeng, Haowen Chen, Bosheng Song, Philip S. Yu, Xiangxiang Zeng

    Abstract: Recent advances and achievements of artificial intelligence (AI) as well as deep and graph learning models have established their usefulness in biomedical applications, especially in drug-drug interactions (DDIs). DDIs refer to a change in the effect of one drug to the presence of another drug in the human body, which plays an essential role in drug discovery and clinical research. DDIs prediction… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted by Briefings in Bioinformatics

  4. arXiv:2306.03558  [pdf, other

    cs.CR cs.LG

    Machine Unlearning: A Survey

    Authors: Heng Xu, Tianqing Zhu, Lefeng Zhang, Wanlei Zhou, Philip S. Yu

    Abstract: Machine learning has attracted widespread attention and evolved into an enabling technology for a wide range of highly successful applications, such as intelligent computer vision, speech recognition, medical diagnosis, and more. Yet a special need has arisen where, due to privacy, usability, and/or the right to be forgotten, information about some specific samples needs to be removed from a model… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  5. Inconsistent Matters: A Knowledge-guided Dual-consistency Network for Multi-modal Rumor Detection

    Authors: Mengzhu Sun, Xi Zhang, Jianqiang Ma, Sihong Xie, Yazheng Liu, Philip S. Yu

    Abstract: Rumor spreaders are increasingly utilizing multimedia content to attract the attention and trust of news consumers. Though quite a few rumor detection models have exploited the multi-modal data, they seldom consider the inconsistent semantics between images and texts, and rarely spot the inconsistency among the post contents and background knowledge. In addition, they commonly assume the completen… ▽ More

    Submitted 18 June, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Journal ref: IEEE Transactions on Knowledge and Data Engineering, 2023

  6. arXiv:2306.01603  [pdf, other

    cs.LG cs.CY cs.DC cs.NI

    Decentralized Federated Learning: A Survey and Perspective

    Authors: Liangqi Yuan, Ziran Wang, Lichao Sun, Philip S. Yu, Christopher G. Brinton

    Abstract: Federated learning (FL) has been gaining attention for its ability to share knowledge while maintaining user data, protecting privacy, increasing learning efficiency, and reducing communication overhead. Decentralized FL (DFL) is a decentralized network architecture that eliminates the need for a central server in contrast to centralized FL (CFL). DFL enables direct communication between clients,… ▽ More

    Submitted 4 May, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

  7. arXiv:2305.16663  [pdf, other

    cs.CL

    GDA: Generative Data Augmentation Techniques for Relation Extraction Tasks

    Authors: Xuming Hu, Aiwei Liu, Zeqi Tan, Xin Zhang, Chenwei Zhang, Irwin King, Philip S. Yu

    Abstract: Relation extraction (RE) tasks show promising performance in extracting relations from two entities mentioned in sentences, given sufficient annotations available during training. Such annotations would be labor-intensive to obtain in practice. Existing work adopts data augmentation techniques to generate pseudo-annotated sentences beyond limited annotations. These techniques neither preserve the… ▽ More

    Submitted 14 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023 (Findings), Long Paper, 12 pages

    MSC Class: 68T01 ACM Class: I.2.7

    Journal ref: ACL 2023

  8. arXiv:2305.16166  [pdf, other

    cs.CL

    Multimodal Relation Extraction with Cross-Modal Retrieval and Synthesis

    Authors: Xuming Hu, Zhijiang Guo, Zhiyang Teng, Irwin King, Philip S. Yu

    Abstract: Multimodal relation extraction (MRE) is the task of identifying the semantic relationships between two entities based on the context of the sentence image pair. Existing retrieval-augmented approaches mainly focused on modeling the retrieved textual knowledge, but this may not be able to accurately identify complex relations. To improve the prediction, this research proposes to retrieve textual an… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  9. arXiv:2305.16128  [pdf, other

    cs.CL

    Give Me More Details: Improving Fact-Checking with Latent Retrieval

    Authors: Xuming Hu, Junzhe Chen, Zhijiang Guo, Philip S. Yu

    Abstract: Evidence plays a crucial role in automated fact-checking. When verifying real-world claims, existing fact-checking systems either assume the evidence sentences are given or use the search snippets returned by the search engine. Such methods ignore the challenges of collecting evidence and may not provide sufficient information to verify real-world claims. Aiming at building a better fact-checking… ▽ More

    Submitted 27 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Fixed minor issues, 11 pages

  10. arXiv:2305.12761  [pdf, other

    cs.CL cs.AI

    Enhancing Cross-lingual Natural Language Inference by Soft Prompting with Multilingual Verbalizer

    Authors: Shuang Li, Xuming Hu, Aiwei Liu, Yawen Yang, Fukun Ma, Philip S. Yu, Lijie Wen

    Abstract: Cross-lingual natural language inference is a fundamental problem in cross-lingual language understanding. Many recent works have used prompt learning to address the lack of annotated parallel corpora in XNLI. However, these methods adopt discrete prompting by simply translating the templates to the target language and need external expert knowledge to design the templates. Besides, discrete promp… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accept at ACL2023

  11. arXiv:2305.12334  [pdf, other

    cs.LG cs.CE physics.atom-ph

    Towards Complex Dynamic Physics System Simulation with Graph Neural ODEs

    Authors: Guangsi Shi, Daokun Zhang, Ming Jin, Shirui Pan, Philip S. Yu

    Abstract: The great learning ability of deep learning models facilitates us to comprehend the real physical world, making learning to simulate complicated particle systems a promising endeavour. However, the complex laws of the physical world pose significant challenges to the learning based simulations, such as the varying spatial dependencies between interacting particles and varying temporal dependencies… ▽ More

    Submitted 29 June, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: 12 pages,5 figures, 6 tables, 49 references

  12. arXiv:2305.07633  [pdf, other

    cs.IR cs.AI cs.LG

    Zero-shot Item-based Recommendation via Multi-task Product Knowledge Graph Pre-Training

    Authors: Ziwei Fan, Zhiwei Liu, Shelby Heinecke, Jianguo Zhang, Huan Wang, Caiming Xiong, Philip S. Yu

    Abstract: Existing recommender systems face difficulties with zero-shot items, i.e. items that have no historical interactions with users during the training stage. Though recent works extract universal item representation via pre-trained language models (PLMs), they ignore the crucial item relationships. This paper presents a novel paradigm for the Zero-Shot Item-based Recommendation (ZSIR) task, which pre… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 11 pages

  13. arXiv:2305.07266  [pdf, other

    cs.CL cs.AI

    Gaussian Prior Reinforcement Learning for Nested Named Entity Recognition

    Authors: Yawen Yang, Xuming Hu, Fukun Ma, Shu'ang Li, Aiwei Liu, Lijie Wen, Philip S. Yu

    Abstract: Named Entity Recognition (NER) is a well and widely studied task in natural language processing. Recently, the nested NER has attracted more attention since its practicality and difficulty. Existing works for nested NER ignore the recognition order and boundary position relation of nested entities. To address these issues, we propose a novel seq2seq model named GPRL, which formulates the nested NE… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted by ICASSP 2023

  14. arXiv:2305.03555  [pdf, other

    cs.LG stat.ML

    Contrastive Graph Clustering in Curvature Spaces

    Authors: Li Sun, Feiyang Wang, Junda Ye, Hao Peng, Philip S. Yu

    Abstract: Graph clustering is a longstanding research topic, and has achieved remarkable success with the deep learning methods in recent years. Nevertheless, we observe that several important issues largely remain open. On the one hand, graph clustering from the geometric perspective is appealing but has rarely been touched before, as it lacks a promising space for geometric clustering. On the other hand,… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted by IJCAI'23

  15. arXiv:2305.03507  [pdf, other

    cs.CL cs.IR

    Read it Twice: Towards Faithfully Interpretable Fact Verification by Revisiting Evidence

    Authors: Xuming Hu, Zhaochen Hong, Zhijiang Guo, Lijie Wen, Philip S. Yu

    Abstract: Real-world fact verification task aims to verify the factuality of a claim by retrieving evidence from the source document. The quality of the retrieved evidence plays an important role in claim verification. Ideally, the retrieved evidence should be faithful (reflecting the model's decision-making process in claim verification) and plausible (convincing to humans), and can improve the accuracy of… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: SIGIR 2023

  16. arXiv:2305.03503  [pdf, other

    cs.CL cs.IR

    Think Rationally about What You See: Continuous Rationale Extraction for Relation Extraction

    Authors: Xuming Hu, Zhaochen Hong, Chenwei Zhang, Irwin King, Philip S. Yu

    Abstract: Relation extraction (RE) aims to extract potential relations according to the context of two entities, thus, deriving rational contexts from sentences plays an important role. Previous works either focus on how to leverage the entity information (e.g., entity types, entity verbalization) to inference relations, but ignore context-focused content, or use counterfactual thinking to remove the model'… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: SIGIR 2023

  17. arXiv:2304.13931  [pdf, other

    cs.DB

    Open Metaverse: Issues, Evolution, and Future

    Authors: Zefeng Chen, Wensheng Gan, Jiayi Sun, Jiayang Wu, Philip S. Yu

    Abstract: With the evolution of content on the web and the Internet, there is a need for cyberspace that can be used to work, live, and play in digital worlds regardless of geography. The Metaverse provides the possibility of future Internet and represents a future trend. In the future, the Metaverse will be a space where the real and the virtual are combined. In this article, we have a comprehensive survey… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Preprint. 6 figures, 2 tables

  18. arXiv:2304.12000  [pdf, other

    cs.AI

    Hierarchical State Abstraction Based on Structural Information Principles

    Authors: Xianghua Zeng, Hao Peng, Angsheng Li, Chunyang Liu, Lifang He, Philip S. Yu

    Abstract: State abstraction optimizes decision-making by ignoring irrelevant environmental information in reinforcement learning with rich observations. Nevertheless, recent approaches focus on adequate representational capacities resulting in essential information loss, affecting their performances on challenging tasks. In this article, we propose a novel mathematical Structural Information principles-base… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  19. arXiv:2304.11433  [pdf, other

    cs.LG cs.AI cs.IR

    Conditional Denoising Diffusion for Sequential Recommendation

    Authors: Yu Wang, Zhiwei Liu, Liangwei Yang, Philip S. Yu

    Abstract: Generative models have attracted significant interest due to their ability to handle uncertainty by learning the inherent data distributions. However, two prominent generative models, namely Generative Adversarial Networks (GANs) and Variational AutoEncoders (VAEs), exhibit challenges that impede achieving optimal performance in sequential recommendation tasks. Specifically, GANs suffer from unsta… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

  20. arXiv:2304.06111  [pdf, other

    cs.CY cs.NI

    Web3: The Next Internet Revolution

    Authors: Shicheng Wan, Hong Lin, Wensheng Gan, Jiahui Chen, Philip S. Yu

    Abstract: Since the first appearance of the World Wide Web, people more rely on the Web for their cyber social activities. The second phase of World Wide Web, named Web 2.0, has been extensively attracting worldwide people that participate in building and enjoying the virtual world. Nowadays, the next internet revolution: Web3 is going to open new opportunities for traditional social models. The decentraliz… ▽ More

    Submitted 22 March, 2023; originally announced April 2023.

    Comments: Preprint. 5 figures, 2 tables

  21. arXiv:2304.06032  [pdf, other

    cs.CY

    Web 3.0: The Future of Internet

    Authors: Wensheng Gan, Zhenqiang Ye, Shicheng Wan, Philip S. Yu

    Abstract: With the rapid growth of the Internet, human daily life has become deeply bound to the Internet. To take advantage of massive amounts of data and information on the internet, the Web architecture is continuously being reinvented and upgraded. From the static informative characteristics of Web 1.0 to the dynamic interactive features of Web 2.0, scholars and engineers have worked hard to make the in… ▽ More

    Submitted 23 March, 2023; originally announced April 2023.

    Comments: ACM Web Conference 2023

  22. arXiv:2304.03344  [pdf, other

    cs.IR cs.AI cs.LG

    Graph Collaborative Signals Denoising and Augmentation for Recommendation

    Authors: Ziwei Fan, Ke Xu, Zhang Dong, Hao Peng, Jiawei Zhang, Philip S. Yu

    Abstract: Graph collaborative filtering (GCF) is a popular technique for capturing high-order collaborative signals in recommendation systems. However, GCF's bipartite adjacency matrix, which defines the neighbors being aggregated based on user-item interactions, can be noisy for users/items with abundant interactions and insufficient for users/items with scarce interactions. Additionally, the adjacency mat… ▽ More

    Submitted 10 April, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: Short Paper Accepted by SIGIR 2023, 6 pages

    Journal ref: SIGIR 2023

  23. arXiv:2304.01689  [pdf, other

    cs.CR

    Privacy-Preserving Federated Discovery of DNA Motifs with Differential Privacy

    Authors: Yao Chen, Wensheng Gan, Gengsen Huang, Yongdong Wu, Philip S. Yu

    Abstract: DNA motif discovery is an important issue in gene research, which aims to identify transcription factor binding sites (i.e., motifs) in DNA sequences to reveal the mechanisms that regulate gene expression. However, the phenomenon of data silos and the problem of privacy leakage have seriously hindered the development of DNA motif discovery. On the one hand, the phenomenon of data silos makes data… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Preprint. 7 figures, 1 table

  24. A Post-Training Framework for Improving Heterogeneous Graph Neural Networks

    Authors: Cheng Yang, Xumeng Gong, Chuan Shi, Philip S. Yu

    Abstract: Recent years have witnessed the success of heterogeneous graph neural networks (HGNNs) in modeling heterogeneous information networks (HINs). In this paper, we focus on the benchmark task of HGNNs, i.e., node classification, and empirically find that typical HGNNs are not good at predicting the label of a test node whose receptive field (1) has few training nodes from the same category or (2) has… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: WWW 2023, 12 pages

  25. arXiv:2303.13547  [pdf, other

    cs.CL cs.AI

    A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability

    Authors: Aiwei Liu, Xuming Hu, Lijie Wen, Philip S. Yu

    Abstract: This paper presents the first comprehensive analysis of ChatGPT's Text-to-SQL ability. Given the recent emergence of large-scale conversational language model ChatGPT and its impressive capabilities in both conversational abilities and code generation, we sought to evaluate its Text-to-SQL performance. We conducted experiments on 12 benchmark datasets with different languages, settings, or scenari… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 6 pages, 1 figures

    MSC Class: 68T50 ACM Class: I.2.7

  26. SE-GSL: A General and Effective Graph Structure Learning Framework through Structural Entropy Optimization

    Authors: Dongcheng Zou, Hao Peng, Xiang Huang, Renyu Yang, Jianxin Li, Jia Wu, Chunyang Liu, Philip S. Yu

    Abstract: Graph Neural Networks (GNNs) are de facto solutions to structural data learning. However, it is susceptible to low-quality and unreliable structure, which has been a norm rather than an exception in real-world graphs. Existing graph structure learning (GSL) frameworks still lack robustness and interpretability. This paper proposes a general GSL framework, SE-GSL, through structural entropy and the… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 12 pages,5 figures, accepted by WWW2023

  27. arXiv:2303.04226  [pdf, other

    cs.AI cs.CL cs.LG

    A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT

    Authors: Yihan Cao, Siyu Li, Yixin Liu, Zhiling Yan, Yutong Dai, Philip S. Yu, Lichao Sun

    Abstract: Recently, ChatGPT, along with DALL-E-2 and Codex,has been gaining significant attention from society. As a result, many individuals have become interested in related resources and are seeking to uncover the background and secrets behind its impressive performance. In fact, ChatGPT and other Generative AI (GAI) techniques belong to the category of Artificial Intelligence Generated Content (AIGC), w… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: 44 pages, 15 figures

  28. arXiv:2302.09419  [pdf, other

    cs.AI cs.CL cs.LG

    A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT

    Authors: Ce Zhou, Qian Li, Chen Li, Jun Yu, Yixin Liu, Guangjing Wang, Kai Zhang, Cheng Ji, Qiben Yan, Lifang He, Hao Peng, Jianxin Li, Jia Wu, Ziwei Liu, Pengtao Xie, Caiming Xiong, Jian Pei, Philip S. Yu, Lichao Sun

    Abstract: Pretrained Foundation Models (PFMs) are regarded as the foundation for various downstream tasks with different data modalities. A PFM (e.g., BERT, ChatGPT, and GPT-4) is trained on large-scale data which provides a reasonable parameter initialization for a wide range of downstream applications. BERT learns bidirectional encoder representations from Transformers, which are trained on large datasets… ▽ More

    Submitted 1 May, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: 99 pages, 16 figures

  29. arXiv:2302.07363  [pdf, other

    cs.SI

    Attacking Fake News Detectors via Manipulating News Social Engagement

    Authors: Haoran Wang, Yingtong Dou, Canyu Chen, Lichao Sun, Philip S. Yu, Kai Shu

    Abstract: Social media is one of the main sources for news consumption, especially among the younger generation. With the increasing popularity of news consumption on various social media platforms, there has been a surge of misinformation which includes false information or unfounded claims. As various text- and social context-based fake news detectors are proposed to detect misinformation on social media,… ▽ More

    Submitted 27 April, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: ACM Web Conference 2023 (WWW'23)

  30. Graph-based Village Level Poverty Identification

    Authors: Jing Ma, Liangwei Yang, Qiong Feng, Weizhi Zhang, Philip S. Yu

    Abstract: Poverty status identification is the first obstacle to eradicating poverty. Village-level poverty identification is very challenging due to the arduous field investigation and insufficient information. The development of the Web infrastructure and its modeling tools provides fresh approaches to identifying poor villages. Upon those techniques, we build a village graph for village poverty status id… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 5 pages, accepted by theWebConf 2023

  31. arXiv:2302.06611  [pdf, other

    eess.IV

    Deep Learning and Medical Imaging for COVID-19 Diagnosis: A Comprehensive Survey

    Authors: Song Wu, Yazhou Ren, Aodi Yang, Xinyue Chen, Xiaorong Pu, Jing He, Liqiang Nie, Philip S. Yu

    Abstract: COVID-19 (Coronavirus disease 2019) has been quickly spreading since its outbreak, impacting financial markets and healthcare systems globally. Countries all around the world have adopted a number of extraordinary steps to restrict the spreading virus, where early COVID-19 diagnosis is essential. Medical images such as X-ray images and Computed Tomography scans are becoming one of the main diagnos… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

  32. arXiv:2302.04549  [pdf, other

    cs.LG cs.AI

    Weakly Supervised Anomaly Detection: A Survey

    Authors: Minqi Jiang, Chaochuan Hou, Ao Zheng, Xiyang Hu, Songqiao Han, Hailiang Huang, Xiangnan He, Philip S. Yu, Yue Zhao

    Abstract: Anomaly detection (AD) is a crucial task in machine learning with various applications, such as detecting emerging diseases, identifying financial frauds, and detecting fake news. However, obtaining complete, accurate, and precise labels for AD tasks can be expensive and challenging due to the cost and difficulties in data annotation. To address this issue, researchers have developed AD methods th… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Code available at https://github.com/yzhao062/wsad

  33. ConsRec: Learning Consensus Behind Interactions for Group Recommendation

    Authors: Xixi Wu, Yun Xiong, Yao Zhang, Yizhu Jiao, Jiawei Zhang, Yangyong Zhu, Philip S. Yu

    Abstract: Since group activities have become very common in daily life, there is an urgent demand for generating recommendations for a group of users, referred to as group recommendation task. Existing group recommendation methods usually infer groups' preferences via aggregating diverse members' interests. Actually, groups' ultimate choice involves compromises between members, and finally, an agreement can… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: Accepted by WWW'2023

  34. arXiv:2302.00109  [pdf, other

    cs.LG

    OrthoReg: Improving Graph-regularized MLPs via Orthogonality Regularization

    Authors: Hengrui Zhang, Shen Wang, Vassilis N. Ioannidis, Soji Adeshina, Jiani Zhang, Xiao Qin, Christos Faloutsos, Da Zheng, George Karypis, Philip S. Yu

    Abstract: Graph Neural Networks (GNNs) are currently dominating in modeling graph-structure data, while their high reliance on graph structure for inference significantly impedes them from widespread applications. By contrast, Graph-regularized MLPs (GR-MLPs) implicitly inject the graph structure information into model weights, while their performance can hardly match that of GNNs in most tasks. This motiva… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

  35. arXiv:2301.12197  [pdf, other

    cs.LG cs.AI cs.IR

    Mutual Wasserstein Discrepancy Minimization for Sequential Recommendation

    Authors: Ziwei Fan, Zhiwei Liu, Hao Peng, Philip S Yu

    Abstract: Self-supervised sequential recommendation significantly improves recommendation performance by maximizing mutual information with well-designed data augmentations. However, the mutual information estimation is based on the calculation of Kullback Leibler divergence with several limitations, including asymmetrical estimation, the exponential need of the sample size, and training instability. Also,… ▽ More

    Submitted 19 June, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: Updated with the correction of the asymmetric mistake on the mutual information connection

  36. arXiv:2301.12104  [pdf, other

    cs.LG cs.AI

    Unbiased and Efficient Self-Supervised Incremental Contrastive Learning

    Authors: Cheng Ji, Jianxin Li, Hao Peng, Jia Wu, Xingcheng Fu, Qingyun Sun, Phillip S. Yu

    Abstract: Contrastive Learning (CL) has been proved to be a powerful self-supervised approach for a wide range of domains, including computer vision and graph representation learning. However, the incremental learning issue of CL has rarely been studied, which brings the limitation in applying it to real-world applications. Contrastive learning identifies the samples with the negative ones from the noise di… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

  37. arXiv:2301.09069  [pdf, other

    cs.LG

    Provable Unrestricted Adversarial Training without Compromise with Generalizability

    Authors: Lilin Zhang, Ning Yang, Yanchao Sun, Philip S. Yu

    Abstract: Adversarial training (AT) is widely considered as the most promising strategy to defend against adversarial attacks and has drawn increasing interest from researchers. However, the existing AT methods still suffer from two challenges. First, they are unable to handle unrestricted adversarial examples (UAEs), which are built from scratch, as opposed to restricted adversarial examples (RAEs), which… ▽ More

    Submitted 18 May, 2024; v1 submitted 22 January, 2023; originally announced January 2023.

  38. arXiv:2301.03780  [pdf, other

    cs.IR cs.LG cs.SI

    Time-aware Hyperbolic Graph Attention Network for Session-based Recommendation

    Authors: Xiaohan Li, Yuqing Liu, Zheng Liu, Philip S. Yu

    Abstract: Session-based Recommendation (SBR) is to predict users' next interested items based on their previous browsing sessions. Existing methods model sessions as graphs or sequences to estimate user interests based on their interacted items to make recommendations. In recent years, graph-based methods have achieved outstanding performance on SBR. However, none of these methods consider temporal informat… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: IEEE Bigdata 2022

  39. arXiv:2301.00015  [pdf, other

    cs.LG cs.AI

    Self-organization Preserved Graph Structure Learning with Principle of Relevant Information

    Authors: Qingyun Sun, Jianxin Li, Beining Yang, Xingcheng Fu, Hao Peng, Philip S. Yu

    Abstract: Most Graph Neural Networks follow the message-passing paradigm, assuming the observed structure depicts the ground-truth node relationships. However, this fundamental assumption cannot always be satisfied, as real-world graphs are always incomplete, noisy, or redundant. How to reveal the inherent graph structure in a unified way remains under-explored. We proposed PRI-GSL, a Graph Structure Learni… ▽ More

    Submitted 30 December, 2022; originally announced January 2023.

    Comments: Accepted by AAAI 2023

  40. arXiv:2212.14255  [pdf, other

    cs.DB cs.AI

    HUSP-SP: Faster Utility Mining on Sequence Data

    Authors: Chunkai Zhang, Yuting Yang, Zilin Du, Wensheng Gan, Philip S. Yu

    Abstract: High-utility sequential pattern mining (HUSPM) has emerged as an important topic due to its wide application and considerable popularity. However, due to the combinatorial explosion of the search space when the HUSPM problem encounters a low utility threshold or large-scale data, it may be time-consuming and memory-costly to address the HUSPM problem. Several algorithms have been proposed for addr… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: ACM TKDD, 7 figures, 2 tables

  41. arXiv:2212.10452  [pdf, other

    cs.DB cs.AI

    Towards Sequence Utility Maximization under Utility Occupancy Measure

    Authors: Gengsen Huang, Wensheng Gan, Philip S. Yu

    Abstract: The discovery of utility-driven patterns is a useful and difficult research topic. It can extract significant and interesting information from specific and varied databases, increasing the value of the services provided. In practice, the measure of utility is often used to demonstrate the importance, profit, or risk of an object or a pattern. In the database, although utility is a flexible criteri… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Preprint. 7 figures, 8 tables

  42. arXiv:2212.04604  [pdf, other

    cs.LG

    Localized Contrastive Learning on Graphs

    Authors: Hengrui Zhang, Qitian Wu, Yu Wang, Shaofeng Zhang, Junchi Yan, Philip S. Yu

    Abstract: Contrastive learning methods based on InfoNCE loss are popular in node representation learning tasks on graph-structured data. However, its reliance on data augmentation and its quadratic computational complexity might lead to inconsistency and inefficiency problems. To mitigate these limitations, in this paper, we introduce a simple yet effective contrastive model named Localized Graph Contrastiv… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  43. arXiv:2212.00301  [pdf, other

    cs.CL

    Learning to Select from Multiple Options

    Authors: Jiangshu Du, Wenpeng Yin, Congying Xia, Philip S. Yu

    Abstract: Many NLP tasks can be regarded as a selection problem from a set of options, such as classification tasks, multi-choice question answering, etc. Textual entailment (TE) has been shown as the state-of-the-art (SOTA) approach to dealing with those selection problems. TE treats input texts as premises (P), options as hypotheses (H), then handles the selection problem by modeling (P, H) pairwise. Two… ▽ More

    Submitted 11 September, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: Accepted by AAAI 2023

  44. arXiv:2211.17068  [pdf, other

    cs.LG

    Self-Supervised Continual Graph Learning in Adaptive Riemannian Spaces

    Authors: Li Sun, Junda Ye, Hao Peng, Feiyang Wang, Philip S. Yu

    Abstract: Continual graph learning routinely finds its role in a variety of real-world applications where the graph data with different tasks come sequentially. Despite the success of prior works, it still faces great challenges. On the one hand, existing methods work with the zero-curvature Euclidean space, and largely ignore the fact that curvature varies over the coming graph sequence. On the other hand,… ▽ More

    Submitted 2 June, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: Accepted by AAAI 2023 (Main Track), 9 pages, 4 figures

  45. arXiv:2211.15588  [pdf, other

    cs.DB

    Internet of Behaviors: A Survey

    Authors: Jiayi Sun, Wensheng Gan, Han-Chieh Chao, Philip S. Yu, Weiping Ding

    Abstract: The Internet of Behavior is a research theme that aims to analyze human behavior data on the Internet from the perspective of behavioral psychology, obtain insights about human behavior, and better understand the intention behind the behavior. In this way, the Internet of Behavior can predict human behavioral trends in the future and even change human behavior, which can provide more convenience f… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: Preprint. 9 figures, 1 table

  46. DGRec: Graph Neural Network for Recommendation with Diversified Embedding Generation

    Authors: Liangwei Yang, Shengjie Wang, Yunzhe Tao, Jiankai Sun, Xiaolong Liu, Philip S. Yu, Taiqing Wang

    Abstract: Graph Neural Network (GNN) based recommender systems have been attracting more and more attention in recent years due to their excellent performance in accuracy. Representing user-item interactions as a bipartite graph, a GNN model generates user and item representations by aggregating embeddings of their neighbors. However, such an aggregation procedure often accumulates information purely based… ▽ More

    Submitted 26 November, 2022; v1 submitted 18 November, 2022; originally announced November 2022.

    Comments: 9 pages, WSDM 2023

  47. arXiv:2211.07104  [pdf, other

    cs.IR

    MetaKRec: Collaborative Meta-Knowledge Enhanced Recommender System

    Authors: Liangwei Yang, Shen Wang, Jibing Gong, Shaojie Zheng, Shuying Du, Zhiwei Liu, Philip S. Yu

    Abstract: Knowledge graph (KG) enhanced recommendation has demonstrated improved performance in the recommendation system (RecSys) and attracted considerable research interest. Recently the literature has adopted neural graph networks (GNNs) on the collaborative knowledge graph and built an end-to-end KG-enhanced RecSys. However, the majority of these approaches have three limitations: (1) treat the collabo… ▽ More

    Submitted 13 November, 2022; originally announced November 2022.

    Comments: 10 pages accepted by IEEE BigData

  48. arXiv:2211.06014  [pdf, other

    cs.CL cs.AI

    Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction

    Authors: Xuming Hu, Shiao Meng, Chenwei Zhang, Xiangli Yang, Lijie Wen, Irwin King, Philip S. Yu

    Abstract: Information Extraction (IE) aims to extract structured information from heterogeneous sources. IE from natural language texts include sub-tasks such as Named Entity Recognition (NER), Relation Extraction (RE), and Event Extraction (EE). Most IE systems require comprehensive understandings of sentence structure, implied semantics, and domain knowledge to perform well; thus, IE tasks always need ade… ▽ More

    Submitted 14 November, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: This work has been submitted to the IEEE for possible publication. This work is a substantially extended version of arXiv:2109.06415, with the summary of difference provided in the appendix

  49. arXiv:2211.01830  [pdf, other

    cs.SI cs.AI

    Ranking-based Group Identification via Factorized Attention on Social Tripartite Graph

    Authors: Mingdai Yang, Zhiwei Liu, Liangwei Yang, Xiaolong Liu, Chen Wang, Hao Peng, Philip S. Yu

    Abstract: Due to the proliferation of social media, a growing number of users search for and join group activities in their daily life. This develops a need for the study on the ranking-based group identification (RGI) task, i.e., recommending groups to users. The major challenge in this task is how to effectively and efficiently leverage both the item interaction and group participation of users' online be… ▽ More

    Submitted 16 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: 9 pages. Accepted by WSDM'23. Github: https://github.com/mdyfrank/CFAG

  50. arXiv:2210.16282  [pdf, other

    cs.DB cs.CY

    Big Data Meets Metaverse: A Survey

    Authors: Jiayi Sun, Wensheng Gan, Zefeng Chen, Junhui Li, Philip S. Yu

    Abstract: We are living in the era of big data. The Metaverse is an emerging technology in the future, and it has a combination of big data, AI (artificial intelligence), VR (Virtual Reality), AR (Augmented Reality), MR (mixed reality), and other technologies that will diminish the difference between online and real-life interaction. It has the goal of becoming a platform where we can work, go shopping, pla… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Preprint. 8 figures, 2 tables