Skip to main content

Showing 151–200 of 494 results for author: Yu, P S

  1. arXiv:2210.15882  [pdf, other

    cs.LG cs.AI cs.CL

    Can Current Explainability Help Provide References in Clinical Notes to Support Humans Annotate Medical Codes?

    Authors: Byung-Hak Kim, Zhongfen Deng, Philip S. Yu, Varun Ganapathi

    Abstract: The medical codes prediction problem from clinical notes has received substantial interest in the NLP community, and several recent studies have shown the state-of-the-art (SOTA) code prediction results of full-fledged deep learning-based methods. However, most previous SOTA works based on deep learning are still in early stages in terms of providing textual references and explanations of the pred… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: To appear in Proceedings of the 13th International Workshop on Health Text Mining and Information Analysis (Louhi 2022), Virtual, December 7, 2022

  2. arXiv:2210.13572  [pdf, other

    cs.IR cs.AI cs.LG

    Sequential Recommendation with Auxiliary Item Relationships via Multi-Relational Transformer

    Authors: Ziwei Fan, Zhiwei Liu, Chen Wang, Peijie Huang, Hao Peng, Philip S. Yu

    Abstract: Sequential Recommendation (SR) models user dynamics and predicts the next preferred items based on the user history. Existing SR methods model the 'was interacted before' item-item transitions observed in sequences, which can be viewed as an item relationship. However, there are multiple auxiliary item relationships, e.g., items from similar brands and with similar contents in real-world scenarios… ▽ More

    Submitted 28 October, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to BigData 2022. The code is at https://github.com/zfan20/MT4SR

  3. arXiv:2210.10343  [pdf, other

    cs.CL cs.AI

    Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks

    Authors: Xuming Hu, Yong Jiang, Aiwei Liu, Zhongqiang Huang, Pengjun Xie, Fei Huang, Lijie Wen, Philip S. Yu

    Abstract: Data augmentation techniques have been used to alleviate the problem of scarce labeled data in various NER tasks (flat, nested, and discontinuous NER tasks). Existing augmentation techniques either manipulate the words in the original text that break the semantic coherence of the text, or exploit generative models that ignore preserving entities in the original text, which impedes the use of augme… ▽ More

    Submitted 26 May, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to ACL 2023 (Findings), Long Paper, 14 pages

  4. CLARE: A Semi-supervised Community Detection Algorithm

    Authors: Xixi Wu, Yun Xiong, Yao Zhang, Yizhu Jiao, Caihua Shan, Yiheng Sun, Yangyong Zhu, Philip S. Yu

    Abstract: Community detection refers to the task of discovering closely related subgraphs to understand the networks. However, traditional community detection algorithms fail to pinpoint a particular kind of community. This limits its applicability in real-world networks, e.g., distinguishing fraud groups from normal ones in transaction networks. Recently, semi-supervised community detection emerges as a so… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: Accepted by KDD'2022

  5. arXiv:2210.07990  [pdf, other

    cs.DB cs.CR

    Metaverse: Survey, Applications, Security, and Opportunities

    Authors: Jiayi Sun, Wensheng Gan, Han-Chieh Chao, Philip S. Yu

    Abstract: As a fusion of various emerging digital technologies, the Metaverse aims to build a virtual shared digital space. It is closely related to extended reality, digital twin, blockchain, and other technologies. Its goal is to build a digital space based on the real world, form a virtual economic system, and expand the space of human activities, which injects new vitality into the social, economic, and… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Preprint. 5 figures, 4 tables

  6. arXiv:2210.07011  [pdf, other

    cs.LG

    Variational Graph Generator for Multi-View Graph Clustering

    Authors: Jianpeng Chen, Yawen Ling, Jie Xu, Yazhou Ren, Shudong Huang, Xiaorong Pu, Zhifeng Hao, Philip S. Yu, Lifang He

    Abstract: Multi-view graph clustering (MGC) methods are increasingly being studied due to the explosion of multi-view data with graph structural information. The critical point of MGC is to better utilize the view-specific and view-common information in features and graphs of multiple views. However, existing works have an inherent limitation that they are unable to concurrently utilize the consensus graph… ▽ More

    Submitted 16 December, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: submitted to TNNLS

  7. arXiv:2210.04142  [pdf, other

    cs.LG

    Deep Clustering: A Comprehensive Survey

    Authors: Yazhou Ren, Jingyu Pu, Zhimeng Yang, Jie Xu, Guofeng Li, Xiaorong Pu, Philip S. Yu, Lifang He

    Abstract: Cluster analysis plays an indispensable role in machine learning and data mining. Learning a good data representation is crucial for clustering algorithms. Recently, deep clustering, which can learn clustering-friendly representations using deep neural networks, has been broadly applied in a wide range of clustering tasks. Existing surveys for deep clustering mainly focus on the single-view fields… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

  8. arXiv:2209.13556  [pdf, other

    cs.DB cs.AI

    Contrast Pattern Mining: A Survey

    Authors: Yao Chen, Wensheng Gan, Yongdong Wu, Philip S. Yu

    Abstract: Contrast pattern mining (CPM) is an important and popular subfield of data mining. Traditional sequential patterns cannot describe the contrast information between different classes of data, while contrast patterns involving the concept of contrast can describe the significant differences between datasets under different contrast conditions. Based on the number of papers published in this field, w… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Preprint. 3 figures, 6 tables

  9. arXiv:2209.13501  [pdf, other

    cs.AI

    Totally-ordered Sequential Rules for Utility Maximization

    Authors: Chunkai Zhang, Maohua Lyu, Wensheng Gan, Philip S. Yu

    Abstract: High utility sequential pattern mining (HUSPM) is a significant and valuable activity in knowledge discovery and data analytics with many real-world applications. In some cases, HUSPM can not provide an excellent measure to predict what will happen. High utility sequential rule mining (HUSRM) discovers high utility and high confidence sequential rules, allowing it to solve the problem in HUSPM. Al… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Preprint. 4 figures, 8 tables

  10. arXiv:2209.09093  [pdf, other

    cs.CV cs.AI cs.LG

    Scene Graph Modification as Incremental Structure Expanding

    Authors: Xuming Hu, Zhijiang Guo, Yu Fu, Lijie Wen, Philip S. Yu

    Abstract: A scene graph is a semantic representation that expresses the objects, attributes, and relationships between objects in a scene. Scene graphs play an important role in many cross modality tasks, as they are able to capture the interactions between images and texts. In this paper, we focus on scene graph modification (SGM), where the system is required to learn how to update an existing scene graph… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: In COLING 2022 as a long paper. Code and data available at https://github.com/THU-BPM/SGM

  11. arXiv:2209.02908  [pdf, other

    cs.SI

    PERFECT: A Hyperbolic Embedding for Joint User and Community Alignment

    Authors: Li Sun, Zhongbao Zhang, Jiawei Zhang, Feiyang Wang, Yang Du, Sen Su, Philip S. Yu

    Abstract: Social network alignment shows fundamental importance in a wide spectrum of applications. To the best of our knowledge, existing studies mainly focus on network alignment at the individual user level, requiring abundant common information between shared individual users. For the networks that cannot meet such requirements, social community structures actually provide complementary and critical inf… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: Published in IEEE ICDM 2020

  12. arXiv:2209.01539  [pdf, other

    cs.SI cs.AI

    Cross-Network Social User Embedding with Hybrid Differential Privacy Guarantees

    Authors: Jiaqian Ren, Lei Jiang, Hao Peng, Lingjuan Lyu, Zhiwei Liu, Chaochao Chen, Jia Wu, Xu Bai, Philip S. Yu

    Abstract: Integrating multiple online social networks (OSNs) has important implications for many downstream social mining tasks, such as user preference modelling, recommendation, and link prediction. However, it is unfortunately accompanied by growing privacy concerns about leaking sensitive user information. How to fully utilize the data from different online social networks while preserving user privacy… ▽ More

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: accepted by CIKM22

  13. arXiv:2209.00456  [pdf, other

    cs.IR cs.LG

    ContrastVAE: Contrastive Variational AutoEncoder for Sequential Recommendation

    Authors: Yu Wang, Hengrui Zhang, Zhiwei Liu, Liangwei Yang, Philip S. Yu

    Abstract: Aiming at exploiting the rich information in user behaviour sequences, sequential recommendation has been widely adopted in real-world recommender systems. However, current methods suffer from the following issues: 1) sparsity of user-item interactions, 2) uncertainty of sequential records, 3) long-tail items. In this paper, we propose to incorporate contrastive learning into the framework of Vari… ▽ More

    Submitted 5 December, 2022; v1 submitted 26 August, 2022; originally announced September 2022.

    Comments: Accepted by CIKM 2022

  14. arXiv:2208.14230  [pdf, other

    cs.DB cs.AI

    A Generic Algorithm for Top-K On-Shelf Utility Mining

    Authors: Jiahui Chen, Xu Guo, Wensheng Gan, Shichen Wan, Philip S. Yu

    Abstract: On-shelf utility mining (OSUM) is an emerging research direction in data mining. It aims to discover itemsets that have high relative utility in their selling time period. Compared with traditional utility mining, OSUM can find more practical and meaningful patterns in real-life applications. However, there is a major drawback to traditional OSUM. For normal users, it is hard to define a minimum t… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: Preprint. 4 figures, 4 table

  15. A Self-supervised Riemannian GNN with Time Varying Curvature for Temporal Graph Learning

    Authors: Li Sun, Junda Ye, Hao Peng, Philip S. Yu

    Abstract: Representation learning on temporal graphs has drawn considerable research attention owing to its fundamental importance in a wide spectrum of real-world applications. Though a number of studies succeed in obtaining time-dependent representations, it still faces significant challenges. On the one hand, most of the existing methods restrict the embedding space with a certain curvature. However, the… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted by CIKM22 (Full Paper)

    ACM Class: I.2

  16. Position-aware Structure Learning for Graph Topology-imbalance by Relieving Under-reaching and Over-squashing

    Authors: Qingyun Sun, Jianxin Li, Haonan Yuan, Xingcheng Fu, Hao Peng, Cheng Ji, Qian Li, Philip S. Yu

    Abstract: Topology-imbalance is a graph-specific imbalance problem caused by the uneven topology positions of labeled nodes, which significantly damages the performance of GNNs. What topology-imbalance means and how to measure its impact on graph learning remain under-explored. In this paper, we provide a new understanding of topology-imbalance from a global view of the supervision information distribution… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: Accepted by CIKM 2022

  17. arXiv:2208.06973  [pdf, other

    cs.SI

    From Known to Unknown: Quality-aware Self-improving Graph Neural Network for Open Set Social Event Detection

    Authors: Jiaqian Ren, Lei Jiang, Hao Peng, Yuwei Cao, Jia Wu, Philip S. Yu, Lifang He

    Abstract: State-of-the-art Graph Neural Networks (GNNs) have achieved tremendous success in social event detection tasks when restricted to a closed set of events. However, considering the large amount of data needed for training a neural network and the limited ability of a neural network in handling previously unknown data, it remains a challenge for existing GNN-based methods to operate in an open set se… ▽ More

    Submitted 14 August, 2022; originally announced August 2022.

  18. arXiv:2208.04760  [pdf, other

    cs.IR

    Time Lag Aware Sequential Recommendation

    Authors: Lihua Chen, Ning Yang, Philip S Yu

    Abstract: Although a variety of methods have been proposed for sequential recommendation, it is still far from being well solved partly due to two challenges. First, the existing methods often lack the simultaneous consideration of the global stability and local fluctuation of user preference, which might degrade the learning of a user's current preference. Second, the existing methods often use a scalar ba… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: This paper has been accepted by CIKM 2022

  19. arXiv:2208.04537  [pdf, other

    cs.LG cs.IR

    Automating DBSCAN via Deep Reinforcement Learning

    Authors: Ruitong Zhang, Hao Peng, Yingtong Dou, Jia Wu, Qingyun Sun, Jingyi Zhang, Philip S. Yu

    Abstract: DBSCAN is widely used in many scientific and engineering fields because of its simplicity and practicality. However, due to its high sensitivity parameters, the accuracy of the clustering result depends heavily on practical experience. In this paper, we first propose a novel Deep Reinforcement Learning guided automatic DBSCAN parameters search framework, namely DRL-DBSCAN. The framework models the… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted by CIKM 2022. The code is available at https://github.com/RingBDStack/DRL-DBSCAN

  20. arXiv:2206.11863  [pdf, other

    cs.CL

    CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking

    Authors: Xuming Hu, Zhijiang Guo, Guanyu Wu, Aiwei Liu, Lijie Wen, Philip S. Yu

    Abstract: The explosion of misinformation spreading in the media ecosystem urges for automated fact-checking. While misinformation spans both geographic and linguistic boundaries, most work in the field has focused on English. Datasets and tools available in other languages, such as Chinese, are limited. In order to bridge this gap, we construct CHEF, the first CHinese Evidence-based Fact-checking dataset o… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: In NAACL 2022 as a long paper. Code and data available at https://github.com/THU-BPM/CHEF

  21. arXiv:2206.10071  [pdf, other

    cs.LG cs.SI

    BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs

    Authors: Kay Liu, Yingtong Dou, Yue Zhao, Xueying Ding, Xiyang Hu, Ruitong Zhang, Kaize Ding, Canyu Chen, Hao Peng, Kai Shu, Lichao Sun, Jundong Li, George H. Chen, Zhihao Jia, Philip S. Yu

    Abstract: Detecting which nodes in graphs are outliers is a relatively new machine learning task with numerous applications. Despite the proliferation of algorithms developed in recent years for this task, there has been no standard comprehensive setting for performance evaluation. Consequently, it has been difficult to understand which methods work well and when under a broad range of settings. To bridge t… ▽ More

    Submitted 15 October, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022. Benchmark available at https://github.com/pygod-team/pygod/tree/main/benchmark

  22. arXiv:2206.07472  [pdf, other

    cs.AI

    Collaborative Knowledge Graph Fusion by Exploiting the Open Corpus

    Authors: Yue Wang, Yao Wan, Lu Bai, Lixin Cui, Zhuo Xu, Ming Li, Philip S. Yu, Edwin R Hancock

    Abstract: To alleviate the challenges of building Knowledge Graphs (KG) from scratch, a more general task is to enrich a KG using triples from an open corpus, where the obtained triples contain noisy entities and relations. It is challenging to enrich a KG with newly harvested triples while maintaining the quality of the knowledge representation. This paper proposes a system to refine a KG using information… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Under review by IEEE Transactions on Knowledge and Data Engineering (TKDE)

  23. arXiv:2206.04728  [pdf, other

    cs.DB cs.AI

    Towards Target Sequential Rules

    Authors: Wensheng Gan, Gengsen Huang, Jian Weng, Tianlong Gu, Philip S. Yu

    Abstract: In many real-world applications, sequential rule mining (SRM) can provide prediction and recommendation functions for a variety of services. It is an important technique of pattern mining to discover all valuable rules that belong to high-frequency and high-confidence sequential rules. Although several algorithms of SRM are proposed to solve various practical problems, there are no studies on targ… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Preprint. 6 figures, 3 tables

  24. arXiv:2206.01535  [pdf, other

    cs.LG cs.AI

    Rethinking and Scaling Up Graph Contrastive Learning: An Extremely Efficient Approach with Group Discrimination

    Authors: Yizhen Zheng, Shirui Pan, Vincent Cs Lee, Yu Zheng, Philip S. Yu

    Abstract: Graph contrastive learning (GCL) alleviates the heavy reliance on label information for graph representation learning (GRL) via self-supervised learning schemes. The core idea is to learn by maximising mutual information for similar instances, which requires similarity computation between two node instances. However, GCL is inefficient in both time and memory consumption. In addition, GCL normally… ▽ More

    Submitted 16 October, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Accepted in NeurIPS 2022

  25. arXiv:2205.15550  [pdf, other

    cs.CL

    A Multi-level Supervised Contrastive Learning Framework for Low-Resource Natural Language Inference

    Authors: Shu'ang Li, Xuming Hu, Li Lin, Aiwei Liu, Lijie Wen, Philip S. Yu

    Abstract: Natural Language Inference (NLI) is a growingly essential task in natural language understanding, which requires inferring the relationship between the sentence pairs (premise and hypothesis). Recently, low-resource natural language inference has gained increasing attention, due to significant savings in manual annotation costs and a better fit with real-world scenarios. Existing works fail to cha… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

  26. arXiv:2205.12179  [pdf, other

    cs.AI

    Evidential Temporal-aware Graph-based Social Event Detection via Dempster-Shafer Theory

    Authors: Jiaqian Ren, Lei Jiang, Hao Peng, Zhiwei Liu, Jia Wu, Philip S. Yu

    Abstract: The rising popularity of online social network services has attracted lots of research on mining social media data, especially on mining social events. Social event detection, due to its wide applications, has now become a trivial task. State-of-the-art approaches exploiting Graph Neural Networks (GNNs) usually follow a two-step strategy: 1) constructing text graphs based on various views (\textit… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted by ICWS2022

  27. arXiv:2205.02225  [pdf, other

    cs.CL cs.AI

    HiURE: Hierarchical Exemplar Contrastive Learning for Unsupervised Relation Extraction

    Authors: Xuming Hu, Shuliang Liu, Chenwei Zhang, Shu`ang Li, Lijie Wen, Philip S. Yu

    Abstract: Unsupervised relation extraction aims to extract the relationship between entities from natural language sentences without prior information on relational scope or distribution. Existing works either utilize self-supervised schemes to refine relational feature signals by iteratively leveraging adaptive clustering and classification that provoke gradual drift problems, or adopt instance-wise contra… ▽ More

    Submitted 20 February, 2023; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: In NAACL 2022 as a long paper. Code and data available at https://github.com/THU-BPM/HiURE

    Journal ref: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics

  28. arXiv:2205.01757  [pdf, other

    cs.CL cs.LG

    XLTime: A Cross-Lingual Knowledge Transfer Framework for Temporal Expression Extraction

    Authors: Yuwei Cao, William Groves, Tanay Kumar Saha, Joel R. Tetreault, Alex Jaimes, Hao Peng, Philip S. Yu

    Abstract: Temporal Expression Extraction (TEE) is essential for understanding time in natural language. It has applications in Natural Language Processing (NLP) tasks such as question answering, information retrieval, and causal inference. To date, work in this area has mostly focused on English as there is a scarcity of labeled data for other languages. We propose XLTime, a novel framework for multilingual… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: This paper is accepted by the Findings of NAACL 2022

  29. arXiv:2204.12095  [pdf, other

    cs.LG cs.SI

    PyGOD: A Python Library for Graph Outlier Detection

    Authors: Kay Liu, Yingtong Dou, Xueying Ding, Xiyang Hu, Ruitong Zhang, Hao Peng, Lichao Sun, Philip S. Yu

    Abstract: PyGOD is an open-source Python library for detecting outliers in graph data. As the first comprehensive library of its kind, PyGOD supports a wide array of leading graph-based methods for outlier detection under an easy-to-use, well-documented API designed for use by both researchers and practitioners. PyGOD provides modularized components of the different detectors implemented so that users can e… ▽ More

    Submitted 2 June, 2024; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: 9 pages, 1 figures. Published in JMLR Volume 25 MLOSS track. Library available at https://pygod.org

    Journal ref: Journal of Machine Learning Research. 25(141):1-9, 2024

  30. arXiv:2204.08005  [pdf, other

    cs.SI cs.GT

    A Survey on Location-Driven Influence Maximization

    Authors: Taotao Cai, Quan Z. Sheng, Xiangyu Song, Jian Yang, Shuang Wang, Wei Emma Zhang, Jia Wu, Philip S. Yu

    Abstract: Influence Maximization (IM), which aims to select a set of users from a social network to maximize the expected number of influenced users, is an evergreen hot research topic. Its research outcomes significantly impact real-world applications such as business marketing. The booming location-based network platforms of the last decade appeal to the researchers embedding the location information into… ▽ More

    Submitted 14 September, 2022; v1 submitted 17 April, 2022; originally announced April 2022.

  31. arXiv:2204.00266  [pdf, other

    cs.CL cs.IR

    Multifaceted Improvements for Conversational Open-Domain Question Answering

    Authors: Tingting Liang, Yixuan Jiang, Congying Xia, Ziqiang Zhao, Yuyu Yin, Philip S. Yu

    Abstract: Open-domain question answering (OpenQA) is an important branch of textual QA which discovers answers for the given questions based on a large number of unstructured documents. Effectively mining correct answers from the open-domain sources still has a fair way to go. Existing OpenQA systems might suffer from the issues of question complexity and ambiguity, as well as insufficient background knowle… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

  32. arXiv:2203.15508  [pdf, other

    cs.LG cs.AI cs.IR

    Improving Contrastive Learning with Model Augmentation

    Authors: Zhiwei Liu, Yongjun Chen, Jia Li, Man Luo, Philip S. Yu, Caiming Xiong

    Abstract: The sequential recommendation aims at predicting the next items in user behaviors, which can be solved by characterizing item relationships in sequences. Due to the data sparsity and noise issues in sequences, a new self-supervised learning (SSL) paradigm is proposed to improve the performance, which employs contrastive learning between positive and negative views of sequences. However, existing… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: Preprint. Still under reivew

  33. arXiv:2203.10093  [pdf, other

    cs.LG cs.AI cs.NE q-bio.NC

    Deep reinforcement learning guided graph neural networks for brain network analysis

    Authors: Xusheng Zhao, Jia Wu, Hao Peng, Amin Beheshti, Jessica J. M. Monaghan, David McAlpine, Heivet Hernandez-Perez, Mark Dras, Qiong Dai, Yangyang Li, Philip S. Yu, Lifang He

    Abstract: Modern neuroimaging techniques, such as diffusion tensor imaging (DTI) and functional magnetic resonance imaging (fMRI), enable us to model the human brain as a brain network or connectome. Capturing brain networks' structural information and hierarchical patterns is essential for understanding brain functions and disease states. Recently, the promising network representation learning capability o… ▽ More

    Submitted 24 July, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

  34. arXiv:2203.06467  [pdf, other

    cs.IR cs.AI cs.LG

    G$^3$SR: Global Graph Guided Session-based Recommendation

    Authors: Zhi-Hong Deng, Chang-Dong Wang, Ling Huang, Jian-Huang Lai, Philip S. Yu

    Abstract: Session-based recommendation tries to make use of anonymous session data to deliver high-quality recommendation under the condition that user-profiles and the complete historical behavioral data of a target user are unavailable. Previous works consider each session individually and try to capture user interests within a session. Despite their encouraging results, these models can only perceive int… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

  35. arXiv:2203.00732  [pdf, other

    cs.CL

    Attend, Memorize and Generate: Towards Faithful Table-to-Text Generation in Few Shots

    Authors: Wenting Zhao, Ye Liu, Yao Wan, Philip S. Yu

    Abstract: Few-shot table-to-text generation is a task of composing fluent and faithful sentences to convey table content using limited data. Despite many efforts having been made towards generating impressive fluent sentences by fine-tuning powerful pre-trained language models, the faithfulness of generated content still needs to be improved. To this end, this paper proposes a novel approach Attend, Memoriz… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

  36. arXiv:2202.13202  [pdf, other

    cs.DB cs.AI

    TaSPM: Targeted Sequential Pattern Mining

    Authors: Gengsen Huang, Wensheng Gan, Philip S. Yu

    Abstract: Sequential pattern mining (SPM) is an important technique of pattern mining, which has many applications in reality. Although many efficient sequential pattern mining algorithms have been proposed, there are few studies can focus on target sequences. Targeted querying sequential patterns can not only reduce the number of sequences generated by SPM, but also improve the efficiency of users in perfo… ▽ More

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: Preprint. 5 figures, 3 tables

  37. arXiv:2202.13041  [pdf, other

    cs.AI

    Towards Revenue Maximization with Popular and Profitable Products

    Authors: Wensheng Gan, Guoting Chen, Hongzhi Yin, Philippe Fournier-Viger, Chien-Ming Chen, Philip S. Yu

    Abstract: Economic-wise, a common goal for companies conducting marketing is to maximize the return revenue/profit by utilizing the various effective marketing strategies. Consumer behavior is crucially important in economy and targeted marketing, in which behavioral economics can provide valuable insights to identify the biases and profit from customers. Finding credible and reliable information on product… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: ACM/IMS Transactions on Data Science. 4 figures, 5 tables

  38. arXiv:2202.08391  [pdf, other

    cs.LG cs.IR

    Graph Masked Autoencoders with Transformers

    Authors: Sixiao Zhang, Hongxu Chen, Haoran Yang, Xiangguo Sun, Philip S. Yu, Guandong Xu

    Abstract: Recently, transformers have shown promising performance in learning graph representations. However, there are still some challenges when applying transformers to real-world scenarios due to the fact that deep transformers are hard to train from scratch and the quadratic memory consumption w.r.t. the number of nodes. In this paper, we propose Graph Masked Autoencoders (GMAEs), a self-supervised tra… ▽ More

    Submitted 12 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  39. arXiv:2202.07082  [pdf, other

    cs.LG

    Graph Neural Networks for Graphs with Heterophily: A Survey

    Authors: Xin Zheng, Yi Wang, Yixin Liu, Ming Li, Miao Zhang, Di Jin, Philip S. Yu, Shirui Pan

    Abstract: Recent years have witnessed fast developments of graph neural networks (GNNs) that have benefited myriads of graph analytic tasks and applications. In general, most GNNs depend on the homophily assumption that nodes belonging to the same class are more likely to be connected. However, as a ubiquitous graph property in numerous real-world scenarios, heterophily, i.e., nodes with different labels te… ▽ More

    Submitted 24 February, 2024; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 22 pages

  40. arXiv:2202.05145  [pdf

    q-bio.BM cs.LG

    Deep learning for drug repurposing: methods, databases, and applications

    Authors: Xiaoqin Pan, Xuan Lin, Dongsheng Cao, Xiangxiang Zeng, Philip S. Yu, Lifang He, Ruth Nussinov, Feixiong Cheng

    Abstract: Drug development is time-consuming and expensive. Repurposing existing drugs for new therapies is an attractive solution that accelerates drug development at reduced experimental costs, specifically for Coronavirus Disease 2019 (COVID-19), an infectious disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). However, comprehensively obtaining and productively integrating av… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted by WIREs Computational Molecular Science

  41. Large-scale Personalized Video Game Recommendation via Social-aware Contextualized Graph Neural Network

    Authors: Liangwei Yang, Zhiwei Liu, Yu Wang, Chen Wang, Ziwei Fan, Philip S. Yu

    Abstract: Because of the large number of online games available nowadays, online game recommender systems are necessary for users and online game platforms. The former can discover more potential online games of their interests, and the latter can attract users to dwell longer in the platform. This paper investigates the characteristics of user behaviors with respect to the online games on the Steam platfor… ▽ More

    Submitted 10 February, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

  42. arXiv:2201.10069  [pdf, other

    cs.SI cs.AI

    Link Prediction with Contextualized Self-Supervision

    Authors: Daokun Zhang, Jie Yin, Philip S. Yu

    Abstract: Link prediction aims to infer the link existence between pairs of nodes in networks/graphs. Despite their wide application, the success of traditional link prediction algorithms is hindered by three major challenges -- link sparsity, node attribute noise and dynamic changes -- that are faced by many real-world networks. To address these challenges, we propose a Contextualized Self-Supervised Learn… ▽ More

    Submitted 7 September, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: Accepted by TKDE, 14 pages

  43. Dual Space Graph Contrastive Learning

    Authors: Haoran Yang, Hongxu Chen, Shirui Pan, Lin Li, Philip S. Yu, Guandong Xu

    Abstract: Unsupervised graph representation learning has emerged as a powerful tool to address real-world problems and achieves huge success in the graph learning domain. Graph contrastive learning is one of the unsupervised graph representation learning methods, which recently attracts attention from researchers and has achieved state-of-the-art performances on various tasks. The key to the success of grap… ▽ More

    Submitted 4 March, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

  44. arXiv:2201.06035  [pdf, other

    cs.IR cs.AI cs.LG

    Sequential Recommendation via Stochastic Self-Attention

    Authors: Ziwei Fan, Zhiwei Liu, Alice Wang, Zahra Nazari, Lei Zheng, Hao Peng, Philip S. Yu

    Abstract: Sequential recommendation models the dynamics of a user's previous behaviors in order to forecast the next item, and has drawn a lot of attention. Transformer-based approaches, which embed items as vectors and use dot-product self-attention to measure the relationship between items, demonstrate superior capabilities among existing sequential methods. However, users' real-world sequential behaviors… ▽ More

    Submitted 5 March, 2022; v1 submitted 16 January, 2022; originally announced January 2022.

    Comments: updated version for camera-ready

  45. arXiv:2201.05973  [pdf, other

    cs.IR

    Multi-Sparse-Domain Collaborative Recommendation via Enhanced Comprehensive Aspect Preference Learning

    Authors: Xiaoyun Zhao, Ning Yang, Philip S. Yu

    Abstract: Cross-domain recommendation (CDR) has been attracting increasing attention of researchers for its ability to alleviate the data sparsity problem in recommender systems. However, the existing single-target or dual-target CDR methods often suffer from two drawbacks, the assumption of at least one rich domain and the heavy dependence on domain-invariant preference, which are impractical in real world… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

  46. arXiv:2201.05970  [pdf, other

    cs.IR

    Learning from Atypical Behavior: Temporary Interest Aware Recommendation Based on Reinforcement Learning

    Authors: Ziwen Du, Ning Yang, Zhonghua Yu, Philip S. Yu

    Abstract: Traditional robust recommendation methods view atypical user-item interactions as noise and aim to reduce their impact with some kind of noise filtering technique, which often suffers from two challenges. First, in real world, atypical interactions may signal users' temporary interest different from their general preference. Therefore, simply filtering out the atypical interactions as noise may be… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

  47. arXiv:2201.05819  [pdf, other

    cs.LG cs.CR cs.SI

    Interpretable and Effective Reinforcement Learning for Attacking against Graph-based Rumor Detection

    Authors: Yuefei Lyu, Xiaoyu Yang, Jiaxin Liu, Philip S. Yu, Sihong Xie, Xi Zhang

    Abstract: Social networks are frequently polluted by rumors, which can be detected by advanced models such as graph neural networks. However, the models are vulnerable to attacks and understanding the vulnerabilities is critical to rumor detection in practice. To discover subtle vulnerabilities, we design a powerful attacking algorithm to camouflage rumors in social networks based on reinforcement learning… ▽ More

    Submitted 14 October, 2022; v1 submitted 15 January, 2022; originally announced January 2022.

  48. arXiv:2112.10871  [pdf, other

    cs.CV

    Translational Concept Embedding for Generalized Compositional Zero-shot Learning

    Authors: He Huang, Wei Tang, Jiawei Zhang, Philip S. Yu

    Abstract: Generalized compositional zero-shot learning means to learn composed concepts of attribute-object pairs in a zero-shot fashion, where a model is trained on a set of seen concepts and tested on a combined set of seen and unseen concepts. This task is very challenging because of not only the gap between seen and unseen concepts but also the contextual dependency between attributes and objects. This… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

  49. arXiv:2112.08903  [pdf, other

    cs.LG cs.AI

    Graph Structure Learning with Variational Information Bottleneck

    Authors: Qingyun Sun, Jianxin Li, Hao Peng, Jia Wu, Xingcheng Fu, Cheng Ji, Philip S. Yu

    Abstract: Graph Neural Networks (GNNs) have shown promising results on a broad spectrum of applications. Most empirical studies of GNNs directly take the observed graph as input, assuming the observed structure perfectly depicts the accurate and complete relations between nodes. However, graphs in the real world are inevitably noisy or incomplete, which could even exacerbate the quality of graph representat… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI 2022, Preprint version with Appendix

  50. arXiv:2112.07373  [pdf, other

    cs.SI

    Semi-Supervised Variational User Identity Linkage via Noise-Aware Self-Learning

    Authors: Chaozhuo Li, Senzhang Wang, Zheng Liu, Xing Xie, Lei Chen, Philip S. Yu

    Abstract: User identity linkage, which aims to link identities of a natural person across different social platforms, has attracted increasing research interest recently. Existing approaches usually first embed the identities as deterministic vectors in a shared latent space, and then learn a classifier based on the available annotations. However, the formation and characteristics of real-world social platf… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.