Showing 1–50 of 150 results for author: Miao, C

  1. arXiv:2406.08835

    cs.SD eess.AS

    A Single-Step Non-Autoregressive Automatic Speech Recognition Architecture with High Accuracy and Inference Speed

    Authors: Ziyang Zhuang, Chenfeng Miao, Kun Zou, Shuai Gong, Ming Fang, Tao Wei, Zijian Li, Wei Hu, Shaojun Wang, Jing Xiao

    Abstract: Non-autoregressive (NAR) automatic speech recognition (ASR) models predict tokens independently and simultaneously, bringing high inference speed. However, there is still a gap in the accuracy of the NAR models compared to the autoregressive (AR) models. To further narrow the gap between the NAR and AR models, we propose a single-step NAR ASR architecture with high accuracy and inference speed, ca…

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2406.06633

    cs.LG

    PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning

    Authors: Xiaoqi Qiu, Yongjie Wang, Xu Guo, Zhiwei Zeng, Yue Yu, Yuhong Feng, Chunyan Miao

    Abstract: Counterfactually Augmented Data (CAD) involves creating new data samples by applying minimal yet sufficient modifications to flip the label of existing data samples to other classes. Training with CAD enhances model robustness against spurious features that happen to correlate with labels by spreading the causal relationships across different classes. Yet, recent research reveals that training wit…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024 main conference

    MSC Class: 68T50 ACM Class: I.2; I.2.7

  3. arXiv:2406.06559

    cs.CL cs.AI cs.LG

    Harnessing Business and Media Insights with Large Language Models

    Authors: Yujia Bao, Ankit Parag Shah, Neeru Narang, Jonathan Rivers, Rajeev Maksey, Lan Guan, Louise N. Barrere, Shelley Evenson, Rahul Basole, Connie Miao, Ankit Mehta, Fabien Boulay, Su Min Park, Natalie E. Pearson, Eldhose Joy, Tiger He, Sumiran Thakur, Koustav Ghosal, Josh On, Phoebe Morrison, Tim Major, Eva Siqi Wang, Gina Escobar, Jiaheng Wei, Tharindu Cyril Weerasooriya, et al. (8 additional authors not shown)

    Abstract: This paper introduces Fortune Analytics Language Model (FALM). FALM empowers users with direct access to comprehensive business analysis, including market trends, company performance metrics, and expert insights. Unlike generic LLMs, FALM leverages a curated knowledge base built from professional journalism, enabling it to deliver precise and in-depth answers to intricate business questions. Users…

    Submitted 2 June, 2024; originally announced June 2024.

  4. arXiv:2405.13082

    cs.LG cs.AI cs.CV

    A Survey of Artificial Intelligence in Gait-Based Neurodegenerative Disease Diagnosis

    Authors: Haocong Rao, Minlin Zeng, Xuejiao Zhao, Chunyan Miao

    Abstract: Recent years have witnessed an increasing global population affected by neurodegenerative diseases (NDs), which traditionally require extensive healthcare resources and human effort for medical diagnosis and monitoring. As a crucial disease-related motor symptom, human gait can be exploited to characterize different NDs. The current advances in artificial intelligence (AI) models enable automatic…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 35 pages, 9 figures, 5 tables, citing 272 papers, under review at ACM Computing Surveys (CSUR). An up-to-date resource (papers, data, etc.) of this survey (AI4NDD) is provided at https://github.com/Kali-Hac/AI4NDD-Survey

  5. arXiv:2404.01650

    cs.LG

    Test-Time Model Adaptation with Only Forward Passes

    Authors: Shuaicheng Niu, Chunyan Miao, Guohao Chen, Pengcheng Wu, Peilin Zhao

    Abstract: Test-time adaptation has proven effective in adapting a given trained model to unseen test samples with potential distribution shifts. However, in real-world scenarios, models are usually deployed on resource-limited devices, e.g., FPGAs, and are often quantized and hard-coded with non-modifiable parameters for acceleration. In light of this, existing methods are often infeasible since they heavil…

    Submitted 29 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 18 pages, 4 figures, 17 tables, accepted by International Conference on Machine Learning

  6. arXiv:2402.17292

    cs.CV

    DivAvatar: Diverse 3D Avatar Generation with a Single Prompt

    Authors: Weijing Tao, Biwen Lei, Kunhao Liu, Shijian Lu, Miaomiao Cui, Xuansong Xie, Chunyan Miao

    Abstract: Text-to-Avatar generation has recently made significant strides due to advancements in diffusion models. However, most existing work remains constrained by limited diversity, producing avatars with subtle differences in appearance for a given text prompt. We design DivAvatar, a novel framework that generates diverse avatars, empowering 3D creatives with a multitude of distinct and richly varied 3D…

    Submitted 27 February, 2024; originally announced February 2024.

  7. arXiv:2402.16110

    cs.IR cs.MM

    Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation with Interpretability

    Authors: Xin Zhou, Chunyan Miao

    Abstract: Multimodal recommender systems amalgamate multimodal information (e.g., textual descriptions, images) into a collaborative filtering framework to provide more accurate recommendations. While the incorporation of multimodal information could enhance the interpretability of these systems, current multimodal models represent users and items utilizing entangled numerical vectors, rendering them arduou…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 12 pages, 7 figures

  8. arXiv:2401.15296

    cs.CV cs.AI

    A Survey on 3D Skeleton Based Person Re-Identification: Approaches, Designs, Challenges, and Future Directions

    Authors: Haocong Rao, Chunyan Miao

    Abstract: Person re-identification via 3D skeletons is an important emerging research area that triggers great interest in the pattern recognition community. With distinctive advantages for many application scenarios, a great diversity of 3D skeleton based person re-identification (SRID) methods have been proposed in recent years, effectively addressing prominent problems in skeleton modeling and feature le…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: An up-to-date resource (papers, codes, data, etc.) of this survey is provided at https://github.com/Kali-Hac/3D-skeleton-based-person-re-ID-survey

  9. arXiv:2401.09945

    cs.LG cs.CR cs.IR

    HGAttack: Transferable Heterogeneous Graph Adversarial Attack

    Authors: He Zhao, Zhiwei Zeng, Yongwei Wang, Deheng Ye, Chunyan Miao

    Abstract: Heterogeneous Graph Neural Networks (HGNNs) are increasingly recognized for their performance in areas like the web and e-commerce, where resilience against adversarial attacks is crucial. However, existing adversarial attack methods, which are primarily designed for homogeneous graphs, fall short when applied to HGNNs due to their limited ability to address the structural and semantic complexity…

    Submitted 18 January, 2024; originally announced January 2024.

  10. arXiv:2312.13596

    cs.LG cs.AI

    Anchoring Path for Inductive Relation Prediction in Knowledge Graphs

    Authors: Zhixiang Su, Di Wang, Chunyan Miao, Lizhen Cui

    Abstract: Aiming to accurately predict missing edges representing relations between entities, which are pervasive in real-world Knowledge Graphs (KGs), relation prediction plays a critical role in enhancing the comprehensiveness and utility of KGs. Recent research focuses on path-based methods due to their inductive and explainable properties. However, these methods face a great challenge when lots of reaso…

    Submitted 21 December, 2023; originally announced December 2023.

  11. arXiv:2312.12191

    cs.LG cs.AI stat.ML

    CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning

    Authors: Chenyu Sun, Hangwei Qian, Chunyan Miao

    Abstract: Offline reinforcement learning (RL) aims to learn an effective policy from a pre-collected dataset. Most existing works are to develop sophisticated learning algorithms, with less emphasis on improving the data collection process. Moreover, it is even challenging to extend the single-task setting and collect a task-agnostic dataset that allows an agent to perform multiple downstream tasks. In this…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI-24

  12. arXiv:2311.16922

    cs.CV cs.AI cs.CL

    Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

    Authors: Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing

    Abstract: Large Vision-Language Models (LVLMs) have advanced considerably, intertwining visual recognition and language understanding to generate content that is not only coherent but also contextually attuned. Despite their success, LVLMs still suffer from the issue of object hallucinations, where models generate plausible yet incorrect outputs that include objects that do not exist in the images. To mitig…

    Submitted 28 November, 2023; originally announced November 2023.

  13. arXiv:2311.06122

    cs.CV

    Fight Fire with Fire: Combating Adversarial Patch Attacks using Pattern-randomized Defensive Patches

    Authors: Jianan Feng, Jiachun Li, Changqing Miao, Jianjun Huang, Wei You, Wenchang Shi, Bin Liang

    Abstract: Object detection has found extensive applications in various tasks, but it is also susceptible to adversarial patch attacks. Existing defense methods often necessitate modifications to the target model or result in unacceptable time overhead. In this paper, we adopt a counterattack approach, following the principle of "fight fire with fire," and propose a novel and general methodology for defendin…

    Submitted 10 November, 2023; originally announced November 2023.

  14. arXiv:2310.08069

    cs.SE cs.CL cs.IR cs.LG

    Rethinking Negative Pairs in Code Search

    Authors: Haochen Li, Xin Zhou, Luu Anh Tuan, Chunyan Miao

    Abstract: Recently, contrastive learning has become a key component in fine-tuning code search models for software development efficiency and effectiveness. It pulls together positive code snippets while pushing negative samples away given search queries. Among contrastive learning, InfoNCE is the most widely used loss function due to its better performance. However, the following problems in negative sampl…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  15. arXiv:2310.03747

    eess.SP cs.AI cs.LG

    A Knowledge-Driven Cross-view Contrastive Learning for EEG Representation

    Authors: Weining Weng, Yang Gu, Qihui Zhang, Yingying Huang, Chunyan Miao, Yiqiang Chen

    Abstract: Due to the abundant neurophysiological information in the electroencephalogram (EEG) signal, EEG signals integrated with deep learning methods have gained substantial traction across numerous real-world tasks. However, the development of supervised learning methods based on EEG signals has been hindered by the high cost and significant label discrepancies to manually label large-scale EEG datasets…

    Submitted 21 September, 2023; originally announced October 2023.

    Comments: 14 pages, 7 figures

    MSC Class: 68T30 Knowledge representation ACM Class: I.2.4; I.5.2; J.3.1

  16. arXiv:2309.14727

    eess.SY cs.AI cs.LG

    Effective Multi-Agent Deep Reinforcement Learning Control with Relative Entropy Regularization

    Authors: Chenyang Miao, Yunduan Cui, Huiyun Li, Xinyu Wu

    Abstract: In this paper, a novel Multi-agent Reinforcement Learning (MARL) approach, Multi-Agent Continuous Dynamic Policy Gradient (MACDPP) was proposed to tackle the issues of limited capability and sample efficiency in various scenarios controlled by multiple agents. It alleviates the inconsistency of multiple agents' policy updates by introducing the relative entropy regularization to the Centralized Tr…

    Submitted 26 September, 2023; originally announced September 2023.

  17. arXiv:2309.12657

    cs.CV

    Exploiting Modality-Specific Features For Multi-Modal Manipulation Detection And Grounding

    Authors: Jiazhen Wang, Bin Liu, Changtao Miao, Zhiwei Zhao, Wanyi Zhuang, Qi Chu, Nenghai Yu

    Abstract: AI-synthesized text and images have gained significant attention, particularly due to the widespread dissemination of multi-modal manipulations on the internet, which has resulted in numerous negative impacts on society. Existing methods for multi-modal manipulation detection and grounding primarily focus on fusing vision-language features to make predictions, while overlooking the importance of m…

    Submitted 13 January, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. Camera-ready version and supplementary material

  18. arXiv:2309.04676

    cs.LG cs.AI stat.ME

    Flexible and Robust Counterfactual Explanations with Minimal Satisfiable Perturbations

    Authors: Yongjie Wang, Hangwei Qian, Yongjie Liu, Wei Guo, Chunyan Miao

    Abstract: Counterfactual explanations (CFEs) exemplify how to minimally modify a feature vector to achieve a different prediction for an instance. CFEs can enhance informational fairness and trustworthiness, and provide suggestions for users who receive adverse predictions. However, recent research has shown that multiple CFEs can be offered for the same instance or instances with slight differences. Multip…

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted by CIKM 2023

  19. Hierarchical Skeleton Meta-Prototype Contrastive Learning with Hard Skeleton Mining for Unsupervised Person Re-Identification

    Authors: Haocong Rao, Cyril Leung, Chunyan Miao

    Abstract: With rapid advancements in depth sensors and deep learning, skeleton-based person re-identification (re-ID) models have recently achieved remarkable progress with many advantages. Most existing solutions learn single-level skeleton features from body joints with the assumption of equal skeleton importance, while they typically lack the ability to exploit more informative skeleton features from var…

    Submitted 18 September, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: Published at International Journal of Computer Vision (IJCV) 2023. Codes are available at https://github.com/Kali-Hac/Hi-MPC. The Appendix A for Proof (6 pages) and Appendix B for Experiments (13 pages) are included in the version [v3] at arXiv:2307.12917

  20. arXiv:2305.13628

    cs.CL

    Improving Self-training for Cross-lingual Named Entity Recognition with Contrastive and Prototype Learning

    Authors: Ran Zhou, Xin Li, Lidong Bing, Erik Cambria, Chunyan Miao

    Abstract: In cross-lingual named entity recognition (NER), self-training is commonly used to bridge the linguistic gap by training on pseudo-labeled target-language data. However, due to sub-optimal performance on target languages, the pseudo labels are often noisy and limit the overall performance. In this work, we aim to improve self-training for cross-lingual NER by combining representation learning and…

    Submitted 4 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023

  21. arXiv:2305.10794

    cs.CV

    Multi-spectral Class Center Network for Face Manipulation Detection and Localization

    Authors: Changtao Miao, Qi Chu, Zhentao Tan, Zhenchao Jin, Tao Gong, Wanyi Zhuang, Yue Wu, Bin Liu, Honggang Hu, Nenghai Yu

    Abstract: As deepfake content proliferates online, advancing face manipulation forensics has become crucial. To combat this emerging threat, previous methods mainly focus on studying how to distinguish authentic and manipulated face images. Although impressive, image-level classification lacks explainability and is limited to specific application scenarios, spurring recent research on pixel-level prediction…

    Submitted 13 July, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Update Version

  22. arXiv:2303.15805

    cs.CV eess.IV

    StarNet: Style-Aware 3D Point Cloud Generation

    Authors: Yunfan Zhang, Hao Wang, Guosheng Lin, Vun Chan Hua Nicholas, Zhiqi Shen, Chunyan Miao

    Abstract: This paper investigates an open research task of reconstructing and generating 3D point clouds. Most existing works of 3D generative models directly take the Gaussian prior as input for the decoder to generate 3D point clouds, which fail to learn disentangled latent codes, leading to noisy interpolated results. Most of the GAN-based models fail to discriminate the local geometries, resulting in the p…

    Submitted 28 March, 2023; originally announced March 2023.

  23. arXiv:2303.06819

    cs.CV

    TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning with Structure-Trajectory Prompted Reconstruction for Person Re-Identification

    Authors: Haocong Rao, Chunyan Miao

    Abstract: Person re-identification (re-ID) via 3D skeleton data is an emerging topic with prominent advantages. Existing methods usually design skeleton descriptors with raw body joints or perform skeleton sequence representation learning. However, they typically cannot concurrently model different body-component relations, and rarely explore useful semantics from fine-grained representations of body joints…

    Submitted 30 July, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023. Codes are available at https://github.com/Kali-Hac/TranSG. Supplemental materials are included in the conference proceedings

  24. arXiv:2303.02836

    cs.CR

    Blockchain-Empowered Lifecycle Management for AI-Generated Content (AIGC) Products in Edge Networks

    Authors: Yinqiu Liu, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Chunyan Miao, Xuemin Shen, Abbas Jamalipour

    Abstract: The rapid development of Artificial Intelligence-Generated Content (AIGC) has brought daunting challenges regarding service latency, security, and trustworthiness. Recently, researchers presented the edge AIGC paradigm, effectively optimizing the service latency by distributing AIGC services to edge devices. However, AIGC products are still unprotected and vulnerable to tampering and plagiarization.…

    Submitted 5 March, 2023; originally announced March 2023.

  25. arXiv:2303.01248

    cs.CL cs.AI

    Can ChatGPT Assess Human Personalities? A General Evaluation Framework

    Authors: Haocong Rao, Cyril Leung, Chunyan Miao

    Abstract: Large Language Models (LLMs) especially ChatGPT have produced impressive results in various areas, but their potential human-like psychology is still largely unexplored. Existing works study the virtual personalities of LLMs but rarely explore the possibility of analyzing human personalities via LLMs. This paper presents a generic evaluation framework for LLMs to assess human personalities based o…

    Submitted 13 October, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted to EMNLP 2023. Our codes are available at https://github.com/Kali-Hac/ChatGPT-MBTI

  26. arXiv:2302.00907

    cs.CL

    History-Aware Hierarchical Transformer for Multi-session Open-domain Dialogue System

    Authors: Tong Zhang, Yong Liu, Boyang Li, Zhiwei Zeng, Pengwei Wang, Yuan You, Chunyan Miao, Lizhen Cui

    Abstract: With the evolution of pre-trained language models, current open-domain dialogue systems have achieved great progress in conducting one-session conversations. In contrast, Multi-Session Conversation (MSC), which consists of multiple sessions over a long term with the same user, is under-investigated. In this paper, we propose History-Aware Hierarchical Transformer (HAHT) for multi-session open-doma…

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: EMNLP 2022 (Findings)

  27. arXiv:2301.01664

    cs.CL cs.LG

    Multi-Aspect Explainable Inductive Relation Prediction by Sentence Transformer

    Authors: Zhixiang Su, Di Wang, Chunyan Miao, Lizhen Cui

    Abstract: Recent studies on knowledge graphs (KGs) show that path-based methods empowered by pre-trained language models perform well in the provision of inductive and explainable relation predictions. In this paper, we introduce the concepts of relation path coverage and relation path confidence to filter out unreliable paths prior to model training to elevate the model performance. Moreover, we propose Kn…

    Submitted 1 May, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

  28. arXiv:2212.01853

    cs.CL

    Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE

    Authors: Qihuang Zhong, Liang Ding, Yibing Zhan, Yu Qiao, Yonggang Wen, Li Shen, Juhua Liu, Baosheng Yu, Bo Du, Yixin Chen, Xinbo Gao, Chunyan Miao, Xiaoou Tang, Dacheng Tao

    Abstract: This technical report briefly describes our JDExplore d-team's Vega v2 submission on the SuperGLUE leaderboard. SuperGLUE is more challenging than the widely used general language understanding evaluation (GLUE) benchmark, containing eight difficult language understanding tasks, including question answering, natural language inference, word sense disambiguation, coreference resolution, and reasoni…

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: Technical report

  29. arXiv:2211.09394

    cs.CL

    ConNER: Consistency Training for Cross-lingual Named Entity Recognition

    Authors: Ran Zhou, Xin Li, Lidong Bing, Erik Cambria, Luo Si, Chunyan Miao

    Abstract: Cross-lingual named entity recognition (NER) suffers from data scarcity in the target languages, especially under zero-shot settings. Existing translate-train or knowledge distillation methods attempt to bridge the language gap, but often introduce a high level of noise. To solve this problem, consistency training methods regularize the model to be robust towards perturbations on data or hidden st…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted by EMNLP 2022

  30. arXiv:2211.08200

    cs.LG cs.AI

    On Inferring User Socioeconomic Status with Mobility Records

    Authors: Zheng Wang, Mingrui Liu, Cheng Long, Qianru Zhang, Jiangneng Li, Chunyan Miao

    Abstract: When users move in a physical space (e.g., an urban space), they would have some records called mobility records (e.g., trajectories) generated by devices such as mobile phones and GPS devices. Naturally, mobility records capture essential information of how users work, live and entertain in their daily lives, and therefore, they have been used in a wide range of tasks such as user profile inferen…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: IEEE International Conference on Big Data (IEEE BigData 2022)

  31. arXiv:2211.03057

    cs.NI

    Towards Green Metaverse Networking: Technologies, Advancements and Future Directions

    Authors: Siyue Zhang, Wei Yang Bryan Lim, Wei Chong Ng, Zehui Xiong, Dusit Niyato, Xuemin Sherman Shen, Chunyan Miao

    Abstract: As the Metaverse is iteratively being defined, its potential to unleash the next wave of digital disruption and create real-life value becomes increasingly clear. With distinctive features of immersive experience, simultaneous interactivity, and user agency, the Metaverse has the capability to transform all walks of life. However, the enabling technologies of the Metaverse, i.e., digital twin, art…

    Submitted 13 April, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

  32. arXiv:2210.12752

    cs.CV

    UIA-ViT: Unsupervised Inconsistency-Aware Method based on Vision Transformer for Face Forgery Detection

    Authors: Wanyi Zhuang, Qi Chu, Zhentao Tan, Qiankun Liu, Haojie Yuan, Changtao Miao, Zixiang Luo, Nenghai Yu

    Abstract: Intra-frame inconsistency has been proved to be effective for the generalization of face forgery detection. However, learning to focus on these inconsistencies requires extra pixel-level forged location annotations. Acquiring such annotations is non-trivial. Some existing methods generate large-scale synthesized data with location annotations, which is only composed of real images and cannot capture…

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted by ECCV 2022 (oral)

  33. arXiv:2210.12285

    cs.SE cs.IR cs.LG

    Exploring Representation-Level Augmentation for Code Search

    Authors: Haochen Li, Chunyan Miao, Cyril Leung, Yanxian Huang, Yuan Huang, Hongyu Zhang, Yanlin Wang

    Abstract: Code search, which aims at retrieving the most relevant code fragment for a given natural language query, is a common activity in software development practice. Recently, contrastive learning is widely used in code search research, where many data augmentation approaches for source code (e.g., semantic-preserving program transformation) are proposed to learn better representations. However, these…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  34. arXiv:2208.14661

    cs.GT

    Stochastic Resource Allocation for Semantic Communication-aided Virtual Transportation Networks in the Metaverse

    Authors: Wei Chong Ng, Hongyang Du, Wei Yang Bryan Lim, Zehui Xiong, Dusit Niyato, Chunyan Miao

    Abstract: The physical-virtual world synchronization to develop the Metaverse will require a massive transmission and exchange of data. In this paper, we introduce semantic communication for the development of virtual transportation networks in the Metaverse. Leveraging the perception capabilities of edge devices, virtual service providers (VSPs) can subscribe to their preferred edge devices to receive the…

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: 6 pages, 5 figures and 3 tables

  35. arXiv:2208.11814

    cs.CV cs.AI

    Skeleton Prototype Contrastive Learning with Multi-Level Graph Relation Modeling for Unsupervised Person Re-Identification

    Authors: Haocong Rao, Chunyan Miao

    Abstract: Person re-identification (re-ID) via 3D skeletons is an important emerging topic with many merits. Existing solutions rarely explore valuable body-component relations in skeletal structure or motion, and they typically lack the ability to learn general representations with unlabeled skeleton data for person re-ID. This paper proposes a generic unsupervised Skeleton Prototype Contrastive learning p…

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: Submitted to TPAMI in 2021. A preliminary version of this work has been accepted for oral presentation at IJCAI 2021. Our codes are available at https://github.com/Kali-Hac/SPC-MGR

  36. arXiv:2208.05040

    cs.NI cs.AI cs.GT

    Economics of Semantic Communication System: An Auction Approach

    Authors: Zi Qin Liew, Hongyang Du, Wei Yang Bryan Lim, Zehui Xiong, Dusit Niyato, Chunyan Miao, Dong In Kim

    Abstract: Semantic communication technologies enable wireless edge devices to communicate effectively by transmitting semantic meaning of data. Edge components, such as vehicles in next-generation intelligent transport systems, use well-trained semantic models to encode and decode semantic information extracted from raw and sensor data. However, the limitation in computing resources makes it difficult to su…

    Submitted 1 August, 2022; originally announced August 2022.

  37. arXiv:2207.14428

    cs.CV

    Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval

    Authors: Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

    Abstract: This paper investigates an open research problem of generating text-image pairs to improve the training of fine-grained image-to-text cross-modal retrieval task, and proposes a novel framework for paired data augmentation by uncovering the hidden semantic information of StyleGAN2 model. Specifically, we first train a StyleGAN2 model on the given dataset. We then project the real images back to the…

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: Accepted at ACM MM 2022

  38. arXiv:2207.14425

    cs.CV

    3D Cartoon Face Generation with Controllable Expressions from a Single GAN Image

    Authors: Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

    Abstract: In this paper, we investigate an open research task of generating 3D cartoon face shapes from single 2D GAN generated human faces and without 3D supervision, where we can also manipulate the facial expressions of the 3D shapes. To this end, we discover the semantic meanings of StyleGAN latent space, such that we are able to produce face images of various expressions, poses, and lighting by control…

    Submitted 28 July, 2022; originally announced July 2022.

  39. arXiv:2207.11088

    cs.IR

    Layer-refined Graph Convolutional Networks for Recommendation

    Authors: Xin Zhou, Donghui Lin, Yong Liu, Chunyan Miao

    Abstract: Recommendation models utilizing Graph Convolutional Networks (GCNs) have achieved state-of-the-art performance, as they can integrate both the node information and the topological structure of the user-item interaction graph. However, these GCN-based recommendation models not only suffer from over-smoothing when stacking too many layers but also bear performance degeneration resulting from the exi…

    Submitted 24 November, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Accepted as a research track paper in ICDE 2023

  40. Bootstrap Latent Representations for Multi-modal Recommendation

    Authors: Xin Zhou, Hongyu Zhou, Yong Liu, Zhiwei Zeng, Chunyan Miao, Pengwei Wang, Yuan You, Feijun Jiang

    Abstract: This paper studies the multi-modal recommendation problem, where the item multi-modality information (e.g., images and textual descriptions) is exploited to improve the recommendation accuracy. Besides the user-item interaction graph, existing state-of-the-art methods usually use auxiliary graphs (e.g., user-user or item-item relation graph) to augment the learned representations of users and/or i…

    Submitted 30 April, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted by Proceedings of the ACM Web Conference 2023 (WWW'23)

  41. arXiv:2207.03776

    cs.CV

    Towards Intrinsic Common Discriminative Features Learning for Face Forgery Detection using Adversarial Learning

    Authors: Wanyi Zhuang, Qi Chu, Haojie Yuan, Changtao Miao, Bin Liu, Nenghai Yu

    Abstract: Existing face forgery detection methods usually treat face forgery detection as a binary classification problem and adopt deep convolution neural networks to learn discriminative features. The ideal discriminative features should be only related to the real/fake labels of facial images. However, we observe that the features learned by vanilla classification networks are correlated to unnecessary p…

    Submitted 8 July, 2022; originally announced July 2022.

  42. arXiv:2207.02812  [pdf, other]

    cs.CV

    Towards Counterfactual Image Manipulation via CLIP

    Authors: Yingchen Yu, Fangneng Zhan, Rongliang Wu, Jiahui Zhang, Shijian Lu, Miaomiao Cui, Xuansong Xie, Xian-Sheng Hua, Chunyan Miao

    Abstract: Leveraging StyleGAN's expressivity and its disentangled latent codes, existing methods can achieve realistic editing of different visual attributes such as age and gender of facial images. An intriguing yet challenging problem arises: Can generative models achieve counterfactual editing against their learnt priors? Due to the lack of counterfactual samples in natural datasets, we investigate this…

    Submitted 12 July, 2022; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: This paper has been accepted to ACM MM 2022, code may be found here: https://github.com/yingchen001/CF-CLIP

  43. arXiv:2207.00427  [pdf, other]

    cs.NI eess.SP

    Semantic Communications for Future Internet: Fundamentals, Applications, and Challenges

    Authors: Wanting Yang, Hongyang Du, Ziqin Liew, Wei Yang Bryan Lim, Zehui Xiong, Dusit Niyato, Xuefen Chi, Xuemin Sherman Shen, Chunyan Miao

    Abstract: With the increasing demand for intelligent services, the sixth-generation (6G) wireless networks will shift from a traditional architecture that focuses solely on high transmission rate to a new architecture that is based on the intelligent connection of everything. Semantic communication (SemCom), a revolutionary architecture that integrates user as well as application requirements and meaning of…

    Submitted 13 November, 2022; v1 submitted 10 June, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2103.05391 by other authors

  44. arXiv:2206.14923  [pdf, other]

    cs.CV cs.LG

    On Non-Random Missing Labels in Semi-Supervised Learning

    Authors: Xinting Hu, Yulei Niu, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang

    Abstract: Semi-Supervised Learning (SSL) is fundamentally a missing label problem, in which the label Missing Not At Random (MNAR) problem is more realistic and challenging, compared to the widely-adopted yet naive Missing Completely At Random assumption where both labeled and unlabeled data share the same class distribution. Different from existing SSL solutions that overlook the role of "class" in causing…

    Submitted 29 June, 2022; originally announced June 2022.

    Journal ref: ICLR 2022

  45. arXiv:2206.14468  [pdf, other]

    cs.IR

    Minimalist and High-performance Conversational Recommendation with Uncertainty Estimation for User Preference

    Authors: Yinan Zhang, Boyang Li, Yong Liu, You Yuan, Chunyan Miao

    Abstract: Conversational recommendation system (CRS) is emerging as a user-friendly way to capture users' dynamic preferences over candidate items and attributes. Multi-shot CRS is designed to make recommendations multiple times until the user either accepts the recommendation or leaves at the end of their patience. Existing works are trained with reinforcement learning (RL), which may suffer from unstable…

    Submitted 30 June, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

  46. arXiv:2205.14837  [pdf, other]

    cs.IR cs.AI

    Enhancing Sequential Recommendation with Graph Contrastive Learning

    Authors: Yixin Zhang, Yong Liu, Yonghui Xu, Hao Xiong, Chenyi Lei, Wei He, Lizhen Cui, Chunyan Miao

    Abstract: The sequential recommendation systems capture users' dynamic behavior patterns to predict their next interaction behaviors. Most existing sequential recommendation methods only exploit the local context information of an individual interaction sequence and learn model parameters solely based on the item prediction loss. Thus, they usually fail to learn appropriate sequence representations. This pa…

    Submitted 6 June, 2022; v1 submitted 29 May, 2022; originally announced May 2022.

    Comments: 8 pages, 3 figures, Accepted by IJCAI 2022

  47. arXiv:2205.07493  [pdf, other]

    cs.LG

    Multi-scale Attention Flow for Probabilistic Time Series Forecasting

    Authors: Shibo Feng, Chunyan Miao, Ke Xu, Jiaxiang Wu, Pengcheng Wu, Yang Zhang, Peilin Zhao

    Abstract: The probability prediction of multivariate time series is a notoriously challenging but practical task. On the one hand, the challenge is how to effectively capture the cross-series correlations between interacting time series, to achieve accurate distribution modeling. On the other hand, we should consider how to capture the contextual information within time series more accurately to model multi…

    Submitted 21 July, 2023; v1 submitted 16 May, 2022; originally announced May 2022.

  48. arXiv:2205.06504  [pdf, other]

    cs.CR cs.AI cs.LG

    DualCF: Efficient Model Extraction Attack from Counterfactual Explanations

    Authors: Yongjie Wang, Hangwei Qian, Chunyan Miao

    Abstract: Cloud service providers have launched Machine-Learning-as-a-Service (MLaaS) platforms to allow users to access large-scale cloud-based models via APIs. In addition to prediction outputs, these APIs can also provide other information in a more human-understandable way, such as counterfactual explanations (CF). However, such extra information inevitably causes the cloud models to be more vulnerable t…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: in Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22), June 21-24, 2022, Seoul, Republic of Korea

  49. arXiv:2205.00943  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning

    Authors: Chenyu Sun, Hangwei Qian, Chunyan Miao

    Abstract: In reinforcement learning (RL), it is challenging to learn directly from high-dimensional observations, where data augmentation has recently been shown to remedy this via encoding invariances from raw pixels. Nevertheless, we empirically find that not all samples are equally important and hence simply injecting more augmented inputs may instead cause instability in Q-learning. In this paper, we ap…

    Submitted 3 May, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: Full paper with supplementary material, accepted by IJCAI 2022. Acknowledgements and affiliations are updated

  50. arXiv:2204.11351  [pdf, other]

    cs.LG cs.AI

    An empirical study of the effect of background data size on the stability of SHapley Additive exPlanations (SHAP) for deep learning models

    Authors: Han Yuan, Mingxuan Liu, Lican Kang, Chenkui Miao, Ying Wu

    Abstract: Nowadays, the interpretation of why a machine learning (ML) model makes certain inferences is as crucial as the accuracy of such inferences. Some ML models like the decision tree possess inherent interpretability that can be directly comprehended by humans. Others like artificial neural networks (ANN), however, rely on external methods to uncover the deduction mechanism. SHapley Additive exPlanati…

    Submitted 9 April, 2023; v1 submitted 24 April, 2022; originally announced April 2022.