Skip to main content

Showing 1–50 of 58 results for author: Zhong, Q

  1. arXiv:2407.00341  [pdf, other

    cs.CL

    Iterative Data Augmentation with Large Language Models for Aspect-based Sentiment Analysis

    Authors: Haiyun Li, Qihuang Zhong, Ke Zhu, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Aspect-based Sentiment Analysis (ABSA) is an important sentiment analysis task, which aims to determine the sentiment polarity towards an aspect in a sentence. Due to the expensive and limited labeled data, data augmentation (DA) has become the standard for improving the performance of ABSA. However, current DA methods usually have some shortcomings: 1) poor fluency and coherence, 2) lack of diver… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: Work in process

  2. arXiv:2404.14963  [pdf, other

    cs.CL cs.AI

    Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems

    Authors: Qihuang Zhong, Kang Wang, Ziyang Xu, Juhua Liu, Liang Ding, Bo Du, Dacheng Tao

    Abstract: Chain-of-Thought (CoT) prompting has enhanced the performance of Large Language Models (LLMs) across various reasoning tasks. However, CoT still falls short in dealing with complex math word problems, as it usually suffers from three pitfalls: semantic misunderstanding errors, calculation errors and step-missing errors. Prior studies involve addressing the calculation errors and step-missing error… ▽ More

    Submitted 29 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Work in progress

  3. arXiv:2403.07673  [pdf, other

    cs.CR

    Towards Model Extraction Attacks in GAN-Based Image Translation via Domain Shift Mitigation

    Authors: Di Mi, Yanjun Zhang, Leo Yu Zhang, Shengshan Hu, Qi Zhong, Haizhuan Yuan, Shirui Pan

    Abstract: Model extraction attacks (MEAs) enable an attacker to replicate the functionality of a victim deep neural network (DNN) model by only querying its API service remotely, posing a severe threat to the security and integrity of pay-per-query DNN-based services. Although the majority of current research on MEAs has primarily concentrated on neural classifiers, there is a growing prevalence of image-to… ▽ More

    Submitted 19 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by AAAI 2024

  4. arXiv:2402.16072  [pdf

    cs.ET quant-ph

    Demonstration of 3 V Programmable Josephson Junction Arrays Using Non-Integer-Multiple Logic

    Authors: Wenhui Cao, Erkun Yang, Jinjin Li, Huan Qiao, Yuan Zhong, Qing Zhong, Da Xu, Xueshen Wang, Xiaolong Xu, Shijian Wang, Jian Chen

    Abstract: This article demonstrates a new kind of programmable logic for the representation of an integer that can be used for the programmable Josephson voltage standard. It can enable the numbers of junctions in most bits to be variable integer values, which is different from normal binary logic or ternary logic. Consequently, missing junctions due to superconducting short circuits can be tolerated under… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  5. arXiv:2402.11890  [pdf, other

    cs.CL

    Revisiting Knowledge Distillation for Autoregressive Language Models

    Authors: Qihuang Zhong, Liang Ding, Li Shen, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Knowledge distillation (KD) is a common approach to compress a teacher model to reduce its inference cost and memory footprint, by training a smaller student model. However, in the context of autoregressive language models (LMs), we empirically find that larger teacher LMs might dramatically result in a poorer student. In response to this problem, we conduct a series of analyses and reveal that di… ▽ More

    Submitted 16 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL2024 Main Conference

  6. arXiv:2402.11889  [pdf, other

    cs.CL

    ROSE Doesn't Do That: Boosting the Safety of Instruction-Tuned Large Language Models with Reverse Prompt Contrastive Decoding

    Authors: Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: With the development of instruction-tuned large language models (LLMs), improving the safety of LLMs has become more critical. However, the current approaches for aligning the LLMs output with expected safety usually require substantial training efforts, e.g., high-quality safety data and expensive computational resources, which are costly and inefficient. To this end, we present reverse prompt co… ▽ More

    Submitted 16 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL2024 Findings

  7. arXiv:2401.11117  [pdf

    eess.SP cs.CY

    A Finger on the Pulse of Cardiovascular Health: Smartphone Photoplethysmography-Based Pulse Waveform Analysis for Blood Pressure Measurement

    Authors: Ivan Liu, Fangyuan Liu, Qi Zhong, Shiguang Ni

    Abstract: Routine blood pressure (BP) monitoring, crucial for health assessment, faces challenges such as limited access to medical-grade equipment and expertise. Portable cuff BP devices, on the other hand, are cumbersome to carry all day and often cost-prohibitive in less developed countries. Besides, these sphygmomanometer-based devices can cause discomfort and disrupt blood flow during measurement. This… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 33 pages, 9 figures

  8. arXiv:2401.09145  [pdf

    cs.CY

    Your blush gives you away: detecting hidden mental states with remote photoplethysmography and thermal imaging

    Authors: Ivan Liu, Fangyuan Liu, Qi Zhong, Fei Ma, Shiguang Ni

    Abstract: Multimodal emotion recognition techniques are increasingly essential for assessing mental states. Image-based methods, however, tend to focus predominantly on overt visual cues and often overlook subtler mental state changes. Psychophysiological research has demonstrated that HR and skin temperature are effective in detecting ANS activities, thereby revealing these subtle changes. However, traditi… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 28 pages, 6 figures

  9. arXiv:2311.16831  [pdf, other

    cs.CY

    Tracking a Year of Polarized Twitter Discourse on Abortion

    Authors: Ashwin Rao, Rong-Ching Chang, Qiankun Zhong, Kristina Lerman, Magdalena Wojcieszak

    Abstract: Abortion is one of the most contentious issues in American politics. The Dobbs v. Jackson Women's Health Organization ruling in 2022, which shifted the authority to regulate abortion from the federal government to the states, triggering intense protests and emotional debates across the nation. Yet, little is known about how online discourse about abortion rights fluctuated on social media platform… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  10. arXiv:2310.13315  [pdf, other

    cs.CL

    Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models

    Authors: Miaoxi Zhu, Qihuang Zhong, Li Shen, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Quantization is a promising approach for reducing memory overhead and accelerating inference, especially in large pre-trained language model (PLM) scenarios. While having no access to original training data due to security and privacy concerns has emerged the demand for zero-shot quantization. Most of the cutting-edge zero-shot quantization methods primarily 1) apply to computer vision tasks, and… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP2023 (Main). Miaoxi Zhu and Qihuang Zhong contribute equally to this work

  11. arXiv:2310.01753  [pdf, other

    cs.LG stat.ML

    CausalTime: Realistically Generated Time-series for Benchmarking of Causal Discovery

    Authors: Yuxiao Cheng, Ziqian Wang, Tingxiong Xiao, Qin Zhong, Jinli Suo, Kunlun He

    Abstract: Time-series causal discovery (TSCD) is a fundamental problem of machine learning. However, existing synthetic datasets cannot properly evaluate or predict the algorithms' performance on real data. This study introduces the CausalTime pipeline to generate time-series that highly resemble the real data and with ground truth causal graphs for quantitative performance evaluation. The pipeline starts f… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  12. arXiv:2309.08096  [pdf, other

    cs.RO

    GelSplitter: Tactile Reconstruction from Near Infrared and Visible Images

    Authors: Yuankai Lin, Yulin Zhou, Kaiji Huang, Qi Zhong, Tao Cheng, Hua Yang, Zhouping Yin

    Abstract: The GelSight-like visual tactile (VT) sensor has gained popularity as a high-resolution tactile sensing technology for robots, capable of measuring touch geometry using a single RGB camera. However, the development of multi-modal perception for VT sensors remains a challenge, limited by the mono camera. In this paper, we propose the GelSplitter, a new framework approach the multi-modal VT sensor w… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  13. arXiv:2307.12616  [pdf, other

    cs.CV cs.AI

    CTVIS: Consistent Training for Online Video Instance Segmentation

    Authors: Kaining Ying, Qing Zhong, Weian Mao, Zhenhua Wang, Hao Chen, Lin Yuanbo Wu, Yifan Liu, Chengxiang Fan, Yunzhi Zhuge, Chunhua Shen

    Abstract: The discrimination of instance embeddings plays a vital role in associating instances across time for online video instance segmentation (VIS). Instance embedding learning is directly supervised by the contrastive loss computed upon the contrastive items (CIs), which are sets of anchor/positive/negative embeddings. Recent online VIS methods leverage CIs sourced from one reference frame only, which… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023. The code is available at https://github.com/KainingYing/CTVIS

  14. arXiv:2305.15275  [pdf, other

    cs.CL

    Self-Evolution Learning for Discriminative Language Model Pretraining

    Authors: Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Masked language modeling, widely used in discriminative language model (e.g., BERT) pretraining, commonly adopts a random masking strategy. However, random masking does not consider the importance of the different words in the sentence meaning, where some of them are more worthy to be predicted. Therefore, various masking strategies (e.g., entity-level masking) are proposed, but most of them requi… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL2023

  15. arXiv:2305.15273  [pdf, other

    cs.CL

    Revisiting Token Dropping Strategy in Efficient BERT Pretraining

    Authors: Qihuang Zhong, Liang Ding, Juhua Liu, Xuebo Liu, Min Zhang, Bo Du, Dacheng Tao

    Abstract: Token dropping is a recently-proposed strategy to speed up the pretraining of masked language models, such as BERT, by skipping the computation of a subset of the input tokens at several middle layers. It can effectively reduce the training time without degrading much performance on downstream tasks. However, we empirically find that token dropping is prone to a semantic loss problem and falls sho… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL2023 Main Conference

  16. arXiv:2305.13547  [pdf, other

    cs.CL cs.NI

    Self-Evolution Learning for Mixup: Enhance Data Augmentation on Few-Shot Text Classification Tasks

    Authors: Haoqi Zheng, Qihuang Zhong, Liang Ding, Zhiliang Tian, Xin Niu, Dongsheng Li, Dacheng Tao

    Abstract: Text classification tasks often encounter few shot scenarios with limited labeled data, and addressing data scarcity is crucial. Data augmentation with mixup has shown to be effective on various text classification tasks. However, most of the mixup methods do not consider the varying degree of learning difficulty in different stages of training and generate new samples with one hot labels, resulti… ▽ More

    Submitted 27 November, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  17. arXiv:2305.05890  [pdf, other

    cs.LG stat.ME

    CUTS+: High-dimensional Causal Discovery from Irregular Time-series

    Authors: Yuxiao Cheng, Lianglong Li, Tingxiong Xiao, Zongren Li, Qin Zhong, Jinli Suo, Kunlun He

    Abstract: Causal discovery in time-series is a fundamental problem in the machine learning community, enabling causal reasoning and decision-making in complex scenarios. Recently, researchers successfully discover causality by combining neural networks with Granger causality, but their performances degrade largely when encountering high-dimensional data because of the highly redundant network design and hug… ▽ More

    Submitted 16 August, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: Submit to AAAI-24

  18. arXiv:2304.08767  [pdf, other

    cs.CR cs.AI

    Masked Language Model Based Textual Adversarial Example Detection

    Authors: Xiaomei Zhang, Zhaoxi Zhang, Qi Zhong, Xufei Zheng, Yanjun Zhang, Shengshan Hu, Leo Yu Zhang

    Abstract: Adversarial attacks are a serious threat to the reliable deployment of machine learning models in safety-critical applications. They can misguide current models to predict incorrectly by slightly modifying the inputs. Recently, substantial work has shown that adversarial examples tend to deviate from the underlying data manifold of normal examples, whereas pre-trained masked language models can fi… ▽ More

    Submitted 28 January, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: 13 pages,3 figures

  19. arXiv:2304.03898  [pdf, other

    cs.CL cs.AI

    The Short Text Matching Model Enhanced with Knowledge via Contrastive Learning

    Authors: Ruiqiang Liu, Qiqiang Zhong, Mengmeng Cui, Hanjie Mai, Qiang Zhang, Shaohua Xu, Xiangzheng Liu, Yanlong Du

    Abstract: In recent years, short Text Matching tasks have been widely applied in the fields ofadvertising search and recommendation. The difficulty lies in the lack of semantic information and word ambiguity caused by the short length of the text. Previous works have introduced complement sentences or knowledge bases to provide additional feature information. However, these methods have not fully interacted… ▽ More

    Submitted 19 December, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: 11 pages,2 figures

  20. arXiv:2304.02205  [pdf, other

    cs.AI cs.IR

    MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs

    Authors: Jifan Yu, Mengying Lu, Qingyang Zhong, Zijun Yao, Shangqing Tu, Zhengshan Liao, Xiaoya Li, Manli Li, Lei Hou, Hai-Tao Zheng, Juanzi Li, Jie Tang

    Abstract: Student modeling, the task of inferring a student's learning characteristics through their interactions with coursework, is a fundamental issue in intelligent education. Although the recent attempts from knowledge tracing and cognitive diagnosis propose several promising directions for improving the usability and effectiveness of current models, the existing public datasets are still insufficient… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted by SIGIR 2023

  21. arXiv:2303.13780  [pdf, other

    cs.CL

    Towards Making the Most of ChatGPT for Machine Translation

    Authors: Keqin Peng, Liang Ding, Qihuang Zhong, Li Shen, Xuebo Liu, Min Zhang, Yuanxin Ouyang, Dacheng Tao

    Abstract: ChatGPT shows remarkable capabilities for machine translation (MT). Several prior studies have shown that it achieves comparable results to commercial systems for high-resource languages, but lags behind in complex tasks, e.g., low-resource and distant-language-pairs translation. However, they usually adopt simple prompts which can not fully elicit the capability of ChatGPT. In this paper, we aim… ▽ More

    Submitted 20 October, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: EMNLP 2023 (findings)

  22. arXiv:2303.00565  [pdf, other

    cs.LG cs.DC math.OC

    AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural Networks

    Authors: Hao Sun, Li Shen, Qihuang Zhong, Liang Ding, Shixiang Chen, Jingwei Sun, Jing Li, Guangzhong Sun, Dacheng Tao

    Abstract: Sharpness aware minimization (SAM) optimizer has been extensively explored as it can generalize better for training deep neural networks via introducing extra perturbation steps to flatten the landscape of deep learning models. Integrating SAM with adaptive learning rate and momentum acceleration, dubbed AdaSAM, has already been explored empirically to train large-scale deep neural networks withou… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 18 pages

  23. arXiv:2302.10198  [pdf, other

    cs.CL

    Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT

    Authors: Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Recently, ChatGPT has attracted great attention, as it can generate fluent and high-quality responses to human inquiries. Several prior studies have shown that ChatGPT attains remarkable generation ability compared with existing models. However, the quantitative analysis of ChatGPT's understanding ability has been given little attention. In this report, we explore the understanding ability of Chat… ▽ More

    Submitted 2 March, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: Work in progress. Added results of advanced prompting strategies, e.g., CoT. (19 pages)

  24. arXiv:2302.09268  [pdf, other

    cs.CL

    Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE

    Authors: Qihuang Zhong, Liang Ding, Keqin Peng, Juhua Liu, Bo Du, Li Shen, Yibing Zhan, Dacheng Tao

    Abstract: This technical report briefly describes our JDExplore d-team's submission Vega v1 on the General Language Understanding Evaluation (GLUE) leaderboard, where GLUE is a collection of nine natural language understanding tasks, including question answering, linguistic acceptability, sentiment analysis, text similarity, paraphrase detection, and natural language inference. [Method] We investigate sever… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: Technical report. arXiv admin note: text overlap with arXiv:2212.01853

  25. arXiv:2302.01439  [pdf, other

    cs.CY cs.SI

    #RoeOverturned: Twitter Dataset on the Abortion Rights Controversy

    Authors: Rong-Ching Chang, Ashwin Rao, Qiankun Zhong, Magdalena Wojcieszak, Kristina Lerman

    Abstract: On June 24, 2022, the United States Supreme Court overturned landmark rulings made in its 1973 verdict in Roe v. Wade. The justices by way of a majority vote in Dobbs v. Jackson Women's Health Organization, decided that abortion wasn't a constitutional right and returned the issue of abortion to the elected representatives. This decision triggered multiple protests and debates across the US, espec… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 9 pages, 5 figures

  26. arXiv:2212.01853  [pdf, other

    cs.CL

    Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE

    Authors: Qihuang Zhong, Liang Ding, Yibing Zhan, Yu Qiao, Yonggang Wen, Li Shen, Juhua Liu, Baosheng Yu, Bo Du, Yixin Chen, Xinbo Gao, Chunyan Miao, Xiaoou Tang, Dacheng Tao

    Abstract: This technical report briefly describes our JDExplore d-team's Vega v2 submission on the SuperGLUE leaderboard. SuperGLUE is more challenging than the widely used general language understanding evaluation (GLUE) benchmark, containing eight difficult language understanding tasks, including question answering, natural language inference, word sense disambiguation, coreference resolution, and reasoni… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: Technical report

  27. arXiv:2210.05497  [pdf, other

    cs.CL

    Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models

    Authors: Qihuang Zhong, Liang Ding, Li Shen, Peng Mi, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Fine-tuning large pretrained language models on a limited training corpus usually suffers from poor generalization. Prior works show that the recently-proposed sharpness-aware minimization (SAM) optimization method can improve the model generalization. However, SAM adds a perturbation to each model parameter equally (but not all parameters contribute equally to the optimization of training), which… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted by EMNLP 2022 (Findings)

  28. arXiv:2208.10160  [pdf, other

    cs.CL

    PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation

    Authors: Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Prompt Transfer (PoT) is a recently-proposed approach to improve prompt-tuning, by initializing the target prompt with the existing prompt trained on similar source tasks. However, such a vanilla PoT approach usually achieves sub-optimal performance, as (i) the PoT is sensitive to the similarity of source-target pair and (ii) directly fine-tuning the prompt initialized with source prompt on target… ▽ More

    Submitted 2 April, 2024; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted by IEEE TKDE

  29. arXiv:2208.04708  [pdf, other

    cs.CY cs.AI cs.LG

    Towards a General Pre-training Framework for Adaptive Learning in MOOCs

    Authors: Qingyang Zhong, Jifan Yu, Zheyuan Zhang, Yiming Mao, Yuquan Wang, Yankai Lin, Lei Hou, Juanzi Li, Jie Tang

    Abstract: Adaptive learning aims to stimulate and meet the needs of individual learners, which requires sophisticated system-level coordination of diverse tasks, including modeling learning resources, estimating student states, and making personalized recommendations. Existing deep learning methods have achieved great success over statistical models; however, they still lack generalization for diverse tasks… ▽ More

    Submitted 18 July, 2022; originally announced August 2022.

    Comments: 13 pages, 8 figures

  30. arXiv:2206.07992  [pdf, other

    cs.CY

    Deconstructing written rules and hierarchy in peer produced software communities

    Authors: Mahasweta Chakraborti, Beril Bulat, Qiankun Zhong, Anamika Sen, Seth Frey

    Abstract: We employ recent advances in computational institutional analysis and NLP to investigate the systems of authority that are reflected in the written policy documents of the ASF. Our study to decipher the effective similarities or departures of the ASF model from conventional software companies reveals evidence of both flat and bureaucratic governance in a peer production set up, suggesting a compli… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 9 pages

    ACM Class: H.5.3

  31. E2S2: Encoding-Enhanced Sequence-to-Sequence Pretraining for Language Understanding and Generation

    Authors: Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Sequence-to-sequence (seq2seq) learning is a popular fashion for large-scale pretraining language models. However, the prior seq2seq pretraining models generally focus on reconstructive objectives on the decoder side and neglect the effect of encoder-side supervision, which we argue may lead to sub-optimal performance. To verify our hypothesis, we first empirically study the functionalities of the… ▽ More

    Submitted 9 January, 2024; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted by IEEE TKDE 2023

  32. arXiv:2205.11126  [pdf, other

    cs.LG cs.CV

    KRNet: Towards Efficient Knowledge Replay

    Authors: Yingying Zhang, Qiaoyong Zhong, Di Xie, Shiliang Pu

    Abstract: The knowledge replay technique has been widely used in many tasks such as continual learning and continuous domain adaptation. The key lies in how to effectively encode the knowledge extracted from previous data and replay them during current training procedure. A simple yet effective model to achieve knowledge replay is autoencoder. However, the number of stored latent codes in autoencoder increa… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted by ICPR 2022

  33. arXiv:2205.11071  [pdf, other

    cs.CV

    Self-distilled Knowledge Delegator for Exemplar-free Class Incremental Learning

    Authors: Fanfan Ye, Liang Ma, Qiaoyong Zhong, Di Xie, Shiliang Pu

    Abstract: Exemplar-free incremental learning is extremely challenging due to inaccessibility of data from old tasks. In this paper, we attempt to exploit the knowledge encoded in a previously trained classification model to handle the catastrophic forgetting problem in continual learning. Specifically, we introduce a so-called knowledge delegator, which is capable of transferring knowledge from the trained… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted by IJCNN 2022

  34. arXiv:2204.12521  [pdf

    physics.soc-ph cs.SI stat.AP

    Quantifying the selective, stochastic, and complementary drivers of the institutional evolution in online communities

    Authors: Qiankun Zhong, Seth Frey, Martin Hilbert

    Abstract: Institutions and cultures evolve adaptively in response to the current environmental incentives, usually. But sometimes institutional change is due to stochastic drives beyond current fitness, including drift, path dependency, blind imitation, and complementary cooperation in fluctuating environments. Disentangling the selective and stochastic components of social system change enables us to ident… ▽ More

    Submitted 21 August, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: 34 pages, 5 figures

  35. arXiv:2204.07832  [pdf, other

    cs.CL

    A Contrastive Cross-Channel Data Augmentation Framework for Aspect-based Sentiment Analysis

    Authors: Bing Wang, Liang Ding, Qihuang Zhong, Ximing Li, Dacheng Tao

    Abstract: Aspect-based sentiment analysis (ABSA) is a fine-grained sentiment analysis task, which focuses on detecting the sentiment polarity towards the aspect in a sentence. However, it is always sensitive to the multi-aspect challenge, where features of multiple aspects in a sentence will affect each other. To mitigate this issue, we design a novel training framework, called Contrastive Cross-Channel Dat… ▽ More

    Submitted 7 September, 2022; v1 submitted 16 April, 2022; originally announced April 2022.

    Comments: COLING 2022

  36. arXiv:2204.01934  [pdf, other

    cs.CV cs.CR

    Attention Distraction: Watermark Removal Through Continual Learning with Selective Forgetting

    Authors: Qi Zhong, Leo Yu Zhang, Shengshan Hu, Longxiang Gao, Jun Zhang, Yong Xiang

    Abstract: Fine-tuning attacks are effective in removing the embedded watermarks in deep learning models. However, when the source data is unavailable, it is challenging to just erase the watermark without jeopardizing the model performance. In this context, we introduce Attention Distraction (AD), a novel source data-free watermark removal attack, to make the model selectively forget the embedded watermarks… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Accepted by ICME2022

  37. arXiv:2202.09769  [pdf, other

    cs.CV

    Dynamic Spatial Propagation Network for Depth Completion

    Authors: Yuankai Lin, Tao Cheng, Qi Zhong, Wending Zhou, Hua Yang

    Abstract: Image-guided depth completion aims to generate dense depth maps with sparse depth measurements and corresponding RGB images. Currently, spatial propagation networks (SPNs) are the most popular affinity-based methods in depth completion, but they still suffer from the representation limitation of the fixed affinity and the over smoothing during iterations. Our solution is to estimate independent af… ▽ More

    Submitted 20 February, 2022; originally announced February 2022.

  38. arXiv:2202.01317  [pdf

    cs.SI cs.CY cs.HC

    Governing online goods: Maturity and formalization in Minecraft, Reddit, and World of Warcraft communities

    Authors: Seth Frey, Qiankun Zhong, Beril Bulat, William D. Weisman, Caitlyn Liu, Stephen Fujimoto, Hannah M. Wang, Charles M. Schweik

    Abstract: Building a successful community means governing active populations and limited resources. This challenge often requires communities to design formal governance systems from scratch. But the characteristics of successful institutional designs are unclear. Communities that are more mature and established may have more elaborate formal policy systems. Alternatively, they may require less formalizatio… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

    Comments: 23 pages. 4 figures

    ACM Class: J.4; H.5.3

  39. Knowledge Graph Augmented Network Towards Multiview Representation Learning for Aspect-based Sentiment Analysis

    Authors: Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Hua Jin, Dacheng Tao

    Abstract: Aspect-based sentiment analysis (ABSA) is a fine-grained task of sentiment analysis. To better comprehend long complicated sentences and obtain accurate aspect-specific information, linguistic and commonsense knowledge are generally required in this task. However, most current methods employ complicated and inefficient approaches to incorporate external knowledge, e.g., directly searching the grap… ▽ More

    Submitted 13 March, 2023; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: Accepted by IEEE TKDE 2023

  40. arXiv:2112.04178  [pdf, other

    cs.CV

    Topology-aware Convolutional Neural Network for Efficient Skeleton-based Action Recognition

    Authors: Kailin Xu, Fanfan Ye, Qiaoyong Zhong, Di Xie

    Abstract: In the context of skeleton-based action recognition, graph convolutional networks (GCNs) have been rapidly developed, whereas convolutional neural networks (CNNs) have received less attention. One reason is that CNNs are considered poor in modeling the irregular skeleton topology. To alleviate this limitation, we propose a pure CNN architecture named Topology-aware CNN (Ta-CNN) in this paper. In p… ▽ More

    Submitted 8 December, 2021; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI 2022

  41. arXiv:2110.13398  [pdf, other

    cs.CL cs.AI

    Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis

    Authors: Juhua Liu, Qihuang Zhong, Liang Ding, Hua Jin, Bo Du, Dacheng Tao

    Abstract: Aspect-based Sentiment Analysis (ABSA) aims to determine the sentiment polarity towards an aspect. Because of the expensive and limited labelled data, the pretraining strategy has become the de-facto standard for ABSA. However, there always exists severe domain shift between the pretraining and downstream ABSA datasets, hindering the effective knowledge transfer when directly finetuning and making… ▽ More

    Submitted 26 June, 2023; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted by IEEE TASLP 2023

  42. arXiv:2107.13118  [pdf, other

    cs.CV

    Divide-and-Assemble: Learning Block-wise Memory for Unsupervised Anomaly Detection

    Authors: Jinlei Hou, Yingying Zhang, Qiaoyong Zhong, Di Xie, Shiliang Pu, Hong Zhou

    Abstract: Reconstruction-based methods play an important role in unsupervised anomaly detection in images. Ideally, we expect a perfect reconstruction for normal samples and poor reconstruction for abnormal samples. Since the generalizability of deep neural networks is difficult to control, existing models such as autoencoder do not work well. In this work, we interpret the reconstruction of an image as a d… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: accepted by ICCV 2021

  43. arXiv:2107.00316  [pdf, other

    cs.AI

    Leveraging Domain Agnostic and Specific Knowledge for Acronym Disambiguation

    Authors: Qiwei Zhong, Guanxiong Zeng, Danqing Zhu, Yang Zhang, Wangli Lin, Ben Chen, Jiayu Tang

    Abstract: An obstacle to scientific document understanding is the extensive use of acronyms which are shortened forms of long technical phrases. Acronym disambiguation aims to find the correct meaning of an ambiguous acronym in a given text. Recent efforts attempted to incorporate word embeddings and deep learning architectures, and achieved significant effects in this task. In general domains, kinds of fin… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: Second Place Solution, Accepted to SDU@AAAI-21

  44. arXiv:2105.03567  [pdf, other

    cs.LG cs.AI cs.CR

    Multimodal and Contrastive Learning for Click Fraud Detection

    Authors: Weibin Li, Qiwei Zhong, Qingyang Zhao, Hongchun Zhang, Xiaonan Meng

    Abstract: Advertising click fraud detection plays one of the vital roles in current E-commerce websites as advertising is an essential component of its business model. It aims at, given a set of corresponding features, e.g., demographic information of users and statistical features of clicks, predicting whether a click is fraudulent or not in the community. Recent efforts attempted to incorporate attributed… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted to DeMal@WWW 2021

  45. arXiv:2104.13636  [pdf, ps, other

    cs.CV

    Point Cloud Learning with Transformer

    Authors: Qi Zhong, Xian-Feng Han

    Abstract: Remarkable performance from Transformer networks in Natural Language Processing promote the development of these models in dealing with computer vision tasks such as image recognition and segmentation. In this paper, we introduce a novel framework, called Multi-level Multi-scale Point Transformer (MLMSPT) that works directly on the irregular point clouds for representation learning. Specifically,… ▽ More

    Submitted 24 October, 2022; v1 submitted 28 April, 2021; originally announced April 2021.

    Comments: 10 pages, 4 figures

  46. arXiv:2103.10685  [pdf, other

    cs.CL cs.AI cs.LG

    Controllable Generation from Pre-trained Language Models via Inverse Prompting

    Authors: Xu Zou, Da Yin, Qingyang Zhong, Ming Ding, Hongxia Yang, Zhilin Yang, Jie Tang

    Abstract: Large-scale pre-trained language models have demonstrated strong capabilities of generating realistic text. However, it remains challenging to control the generation results. Previous approaches such as prompting are far from sufficient, which limits the usage of language models. To tackle this challenge, we propose an innovative method, inverse prompting, to better control text generation. The co… ▽ More

    Submitted 9 November, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Comments: Slightly different from the KDD version

  47. arXiv:2103.08958  [pdf, other

    cs.CV

    Modulating Localization and Classification for Harmonized Object Detection

    Authors: Taiheng Zhang, Qiaoyong Zhong, Shiliang Pu, Di Xie

    Abstract: Object detection involves two sub-tasks, i.e. localizing objects in an image and classifying them into various categories. For existing CNN-based detectors, we notice the widespread divergence between localization and classification, which leads to degradation in performance. In this work, we propose a mutual learning framework to modulate the two tasks. In particular, the two tasks are forced to… ▽ More

    Submitted 25 March, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Accepted by ICME 2021

  48. arXiv:2101.03700  [pdf, other

    cs.CL

    AT-BERT: Adversarial Training BERT for Acronym Identification Winning Solution for SDU@AAAI-21

    Authors: Danqing Zhu, Wangli Lin, Yang Zhang, Qiwei Zhong, Guanxiong Zeng, Weilin Wu, Jiayu Tang

    Abstract: Acronym identification focuses on finding the acronyms and the phrases that have been abbreviated, which is crucial for scientific document understanding tasks. However, the limited size of manually annotated datasets hinders further improvement for the problem. Recent breakthroughs of language models pre-trained on large corpora clearly show that unsupervised pre-training can vastly improve the p… ▽ More

    Submitted 12 January, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: Accepted to SDU @ AAAI 2021, 8 pages, 3 figures

  49. arXiv:2009.04597  [pdf

    cs.SI cs.CY

    Institutional Similarity Drives Cultural Similarity among Online Communities

    Authors: Qiankun Zhong, Seth Frey

    Abstract: Understanding online communities requires an appreciation of both structure and culture. But basic questions remain difficult to pose. How do these facets interact and drive each other? Using data on the membership and governance styles of 5,000 small-scale online communities, we construct empirical measures for cross-server similarities in institutional structure and culture to explore the influe… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

    Comments: 39 pages, 8 figures

    MSC Class: H.5.3; J.4; K.6.4

  50. arXiv:2007.14690  [pdf, other

    cs.CV

    Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition

    Authors: Fanfan Ye, Shiliang Pu, Qiaoyong Zhong, Chao Li, Di Xie, Huiming Tang

    Abstract: Graph Convolutional Networks (GCNs) have attracted increasing interests for the task of skeleton-based action recognition. The key lies in the design of the graph structure, which encodes skeleton topology information. In this paper, we propose Dynamic GCN, in which a novel convolutional neural network named Contextencoding Network (CeN) is introduced to learn skeleton topology automatically. In p… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: accepted by ACMMM2020