Skip to main content

Showing 1–50 of 71 results for author: Zong, C

  1. arXiv:2406.14537  [pdf, other

    cs.LG q-fin.TR

    MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading

    Authors: Chuqiao Zong, Chaojie Wang, Molei Qin, Lei Feng, Xinrun Wang, Bo An

    Abstract: High-frequency trading (HFT) that executes algorithmic trading in short time scales, has recently occupied the majority of cryptocurrency market. Besides traditional quantitative trading methods, reinforcement learning (RL) has become another appealing approach for HFT due to its terrific ability of handling high-dimensional financial data and solving sophisticated sequential decision-making probl… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted to KDD 2024

  2. arXiv:2406.06594  [pdf, other

    q-fin.CP cs.AI cs.LG

    Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism

    Authors: Chang Zong, Jian Shao, Weiming Lu, Yueting Zhuang

    Abstract: The accurate prediction of stock movements is crucial for investment strategies. Stock prices are subject to the influence of various forms of information, including financial indicators, sentiment analysis, news documents, and relational structures. Predominant analytical approaches, however, tend to address only unimodal or bimodal sources, neglecting the complexity of multimodal data. Further c… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 29 pages, 10 figures

    MSC Class: 68T07 ACM Class: I.2.6; J.4

  3. arXiv:2406.03872  [pdf, other

    cs.CL cs.SD eess.AS

    BLSP-Emo: Towards Empathetic Large Speech-Language Models

    Authors: Chen Wang, Minpeng Liao, Zhongqiang Huang, Junhong Wu, Chengqing Zong, Jiajun Zhang

    Abstract: The recent release of GPT-4o showcased the potential of end-to-end multimodal models, not just in terms of low latency but also in their ability to understand and generate expressive speech with rich emotions. While the details are unknown to the open research community, it likely involves significant amounts of curated data and compute, neither of which is readily accessible. In this paper, we pr… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  4. arXiv:2406.02237  [pdf, other

    cs.CL

    Self-Modifying State Modeling for Simultaneous Machine Translation

    Authors: Donglei Yu, Xiaomian Kang, Yuchen Liu, Yu Zhou, Chengqing Zong

    Abstract: Simultaneous Machine Translation (SiMT) generates target outputs while receiving stream source inputs and requires a read/write policy to decide whether to wait for the next source token or generate a new target token, whose decisions form a \textit{decision path}. Existing SiMT methods, which learn the policy by exploring various decision paths in training, face inherent limitations. These method… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accept to ACL 2024 main conference. 15 pages, 13 figures, 9 tables

  5. arXiv:2405.19744  [pdf, other

    cs.CL cs.AI

    X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions

    Authors: Chong Li, Wen Yang, Jiajun Zhang, Jinliang Lu, Shaonan Wang, Chengqing Zong

    Abstract: Large language models respond well in high-resource languages like English but struggle in low-resource languages. It may arise from the lack of high-quality instruction following data in these languages. Directly translating English samples into these languages can be a solution but unreliable, leading to responses with translation errors and lacking language-specific or cultural knowledge. To ad… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: ACL 2024. Our codes, data and model weights are available at https://github.com/ZNLP/X-Instruction

  6. arXiv:2405.10558  [pdf, other

    cs.SI

    CACL: Community-Aware Heterogeneous Graph Contrastive Learning for Social Media Bot Detection

    Authors: Sirry Chen, Shuo Feng, Songsong Liang, Chen-Chen Zong, Jing Li, Piji Li

    Abstract: Social media bot detection is increasingly crucial with the rise of social media platforms. Existing methods predominantly construct social networks as graph and utilize graph neural networks (GNNs) for bot detection. However, most of these methods focus on how to improve the performance of GNNs while neglecting the community structure within social networks. Moreover, GNNs based methods still fac… ▽ More

    Submitted 3 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024 findings

  7. arXiv:2404.19364  [pdf, other

    cs.CL

    Navigating Brain Language Representations: A Comparative Analysis of Neural Language Models and Psychologically Plausible Models

    Authors: Yunhao Zhang, Shaonan Wang, Xinyi Dong, Jiajun Yu, Chengqing Zong

    Abstract: Neural language models, particularly large-scale ones, have been consistently proven to be most effective in predicting brain neural activity across a range of studies. However, previous research overlooked the comparison of these models with psychologically plausible ones. Moreover, evaluations were reliant on limited, single-modality, and English cognitive datasets. To address these questions, w… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  8. arXiv:2404.19186  [pdf, ps, other

    cs.IT cs.CR math.MG math.NT

    The Mathematical Foundation of Post-Quantum Cryptography

    Authors: Chuanming Zong

    Abstract: On July 5, 2022, the National Institute of Standards and Technology announced four possible post-quantum cryptography standards, three of them are based on lattice theory and the other one is based on Hash function. It is well-known that the security of the lattice cryptography relies on the hardness of the shortest vector problem (SVP) and the closest vector problem (CVP). In fact, the SVP is a s… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 23 pages

    MSC Class: 94A60; 52C17; 11H31

  9. arXiv:2404.06676  [pdf

    cs.LG eess.SP stat.AP

    Topological Feature Search Method for Multichannel EEG: Application in ADHD classification

    Authors: Tianming Cai, Guoying Zhao, Junbin Zang, Chen Zong, Zhidong Zhang, Chenyang Xue

    Abstract: In recent years, the preliminary diagnosis of Attention Deficit Hyperactivity Disorder (ADHD) using electroencephalography (EEG) has garnered attention from researchers. EEG, known for its expediency and efficiency, plays a pivotal role in the diagnosis and treatment of ADHD. However, the non-stationarity of EEG signals and inter-subject variability pose challenges to the diagnostic and classifica… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  10. arXiv:2404.04846  [pdf, other

    cs.CL

    F-MALLOC: Feed-forward Memory Allocation for Continual Learning in Neural Machine Translation

    Authors: Junhong Wu, Yuchen Liu, Chengqing Zong

    Abstract: In the evolving landscape of Neural Machine Translation (NMT), the pretrain-then-finetune paradigm has yielded impressive results. However, the persistent challenge of Catastrophic Forgetting (CF) remains a hurdle. While previous work has introduced Continual Learning (CL) methods to address CF, these approaches grapple with the delicate balance between avoiding forgetting and maintaining system e… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted to the main conference of NAACL 2024

  11. arXiv:2403.17516  [pdf, other

    cs.CL cs.AI

    MapGuide: A Simple yet Effective Method to Reconstruct Continuous Language from Brain Activities

    Authors: Xinpei Zhao, Jingyuan Sun, Shaonan Wang, Jing Ye, Xiaohan Zhang, Chengqing Zong

    Abstract: Decoding continuous language from brain activity is a formidable yet promising field of research. It is particularly significant for aiding people with speech disabilities to communicate through brain signals. This field addresses the complex task of mapping brain signals to text. The previous best attempt reverse-engineered this process in an indirect way: it began by learning to encode brain act… ▽ More

    Submitted 2 April, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024 main conference

  12. arXiv:2403.13368  [pdf, other

    cs.CL cs.AI

    Computational Models to Study Language Processing in the Human Brain: A Survey

    Authors: Shaonan Wang, Jingyuan Sun, Yunhao Zhang, Nan Lin, Marie-Francine Moens, Chengqing Zong

    Abstract: Despite differing from the human language processing mechanism in implementation and algorithms, current language models demonstrate remarkable human-like or surpassing language capabilities. Should computational language models be employed in studying the brain, and if so, when and how? To delve into this topic, this paper reviews efforts in using computational models for brain research, highligh… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  13. arXiv:2403.09131  [pdf, other

    cs.CL cs.AI

    ProSwitch: Knowledge-Guided Instruction Tuning to Generate Professional and Non-Professional Styled Text

    Authors: Chang Zong, Yuyan Chen, Weiming Lu, Jian Shao, Yueting Zhuang

    Abstract: Large Language Models (LLMs) have demonstrated efficacy in various linguistic applications, including text summarization and controlled text generation. However, studies into their capacity of switching between styles via fine-tuning remain underexplored. This study concentrates on textual professionalism and introduces a novel methodology, named ProSwitch, which equips a language model with the a… ▽ More

    Submitted 15 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 8 pages

    MSC Class: 68T50 ACM Class: I.2.7

  14. arXiv:2403.03186  [pdf, other

    cs.AI

    Cradle: Empowering Foundation Agents Towards General Computer Control

    Authors: Weihao Tan, Wentao Zhang, Xinrun Xu, Haochong Xia, Ziluo Ding, Boyu Li, Bohan Zhou, Junpeng Yue, Jiechuan Jiang, Yewen Li, Ruyi An, Molei Qin, Chuqiao Zong, Longtao Zheng, Yujie Wu, Xiaoqiang Chai, Yifei Bi, Tianbao Xie, Pengjie Gu, Xiyun Li, Ceyao Zhang, Long Tian, Chaojie Wang, Xinrun Wang, Börje F. Karlsson , et al. (3 additional authors not shown)

    Abstract: Despite the success in specific scenarios, existing foundation agents still struggle to generalize across various virtual scenarios, mainly due to the dramatically different encapsulations of environments with manually designed observation and action spaces. To handle this issue, we propose the General Computer Control (GCC) setting to restrict foundation agents to interact with software through t… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  15. arXiv:2403.01116  [pdf, other

    cs.CL

    MulCogBench: A Multi-modal Cognitive Benchmark Dataset for Evaluating Chinese and English Computational Language Models

    Authors: Yunhao Zhang, Xiaohan Zhang, Chong Li, Shaonan Wang, Chengqing Zong

    Abstract: Pre-trained computational language models have recently made remarkable progress in harnessing the language abilities which were considered unique to humans. Their success has raised interest in whether these models represent and process language like humans. To answer this question, this paper proposes MulCogBench, a multi-modal cognitive benchmark dataset collected from native Chinese and Englis… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  16. arXiv:2402.15198  [pdf, other

    cs.LG

    Bidirectional Uncertainty-Based Active Learning for Open Set Annotation

    Authors: Chen-Chen Zong, Ye-Wen Wang, Kun-Peng Ning, Hai-Bo Ye, Sheng-Jun Huang

    Abstract: Active learning (AL) in open set scenarios presents a novel challenge of identifying the most valuable examples in an unlabeled data pool that comprises data from both known and unknown classes. Traditional methods prioritize selecting informative examples with low confidence, with the risk of mistakenly selecting unknown-class examples with similarly low confidence. Recent methods favor the most… ▽ More

    Submitted 6 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted to ECCV 2024

  17. arXiv:2402.14320  [pdf, other

    cs.CL cs.AI

    Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering

    Authors: Chang Zong, Yuchen Yan, Weiming Lu, Jian Shao, Eliot Huang, Heng Chang, Yueting Zhuang

    Abstract: Recent progress with LLM-based agents has shown promising results across various tasks. However, their use in answering questions from knowledge bases remains largely unexplored. Implementing a KBQA system using traditional methods is challenging due to the shortage of task-specific training data and the complexity of creating task-focused model structures. In this paper, we present Triad, a unifi… ▽ More

    Submitted 15 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 8 pages

    MSC Class: 68T50 ACM Class: I.2.7

  18. Dirichlet-Based Prediction Calibration for Learning with Noisy Labels

    Authors: Chen-Chen Zong, Ye-Wen Wang, Ming-Kun Xie, Sheng-Jun Huang

    Abstract: Learning with noisy labels can significantly hinder the generalization performance of deep neural networks (DNNs). Existing approaches address this issue through loss correction or example selection methods. However, these methods often rely on the model's predictions obtained from the softmax function, which can be over-confident and unreliable. In this study, we identify the translation invarian… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

  19. arXiv:2311.15653  [pdf, other

    cs.CL

    MoDS: Model-oriented Data Selection for Instruction Tuning

    Authors: Qianlong Du, Chengqing Zong, Jiajun Zhang

    Abstract: Instruction tuning has become the de facto method to equip large language models (LLMs) with the ability of following user instructions. Usually, hundreds of thousands or millions of instruction-following pairs are employed to fine-tune the foundation LLMs. Recently, some studies show that a small number of high-quality instruction data is enough. However, how to select appropriate instruction dat… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  20. arXiv:2311.08089  [pdf, other

    cs.CL

    Improving In-context Learning of Multilingual Generative Language Models with Cross-lingual Alignment

    Authors: Chong Li, Shaonan Wang, Jiajun Zhang, Chengqing Zong

    Abstract: Multilingual generative models obtain remarkable cross-lingual in-context learning capabilities through pre-training on large-scale corpora. However, they still exhibit a performance bias toward high-resource languages and learn isolated distributions of multilingual sentence representations, which may hinder knowledge transfer across languages. To bridge this gap, we propose a simple yet effectiv… ▽ More

    Submitted 12 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: NAACL 2024; Our code is available at https://github.com/chongli17/CrossLingualAlignment

  21. arXiv:2311.01149  [pdf, other

    cs.CL

    ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model

    Authors: Jianghao Chen, Pu Jian, Tengxiao Xi, Dongyi Yi, Qianlong Du, Chenglin Ding, Guibo Zhu, Chengqing Zong, Jinqiao Wang, Jiajun Zhang

    Abstract: During the development of large language models (LLMs), the scale and quality of the pre-training data play a crucial role in shaping LLMs' capabilities. To accelerate the research of LLMs, several large-scale datasets, such as C4 [1], Pile [2], RefinedWeb [3] and WanJuan [4], have been released to the public. However, most of the released corpus focus mainly on English, and there is still lack of… ▽ More

    Submitted 10 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

  22. arXiv:2310.10318  [pdf, other

    cs.CL cs.AI

    Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning

    Authors: Chong Li, Shaonan Wang, Yunhao Zhang, Jiajun Zhang, Chengqing Zong

    Abstract: Transformer-based models, even though achieving super-human performance on several downstream tasks, are often regarded as a black box and used as a whole. It is still unclear what mechanisms they have learned, especially their core module: multi-head attention. Inspired by functional specialization in the human brain, which helps to efficiently handle multiple tasks, this work attempts to figure… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 main conference. Our code is available at https://github.com/ZNLP/FunctionalSpecializationInMHA

  23. arXiv:2309.00916  [pdf, other

    cs.CL cs.SD eess.AS

    BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing

    Authors: Chen Wang, Minpeng Liao, Zhongqiang Huang, Jinliang Lu, Junhong Wu, Yuchen Liu, Chengqing Zong, Jiajun Zhang

    Abstract: The emergence of large language models (LLMs) has sparked significant interest in extending their remarkable language capabilities to speech. However, modality alignment between speech and text still remains an open problem. Current solutions can be categorized into two strategies. One is a cascaded approach where outputs (tokens or states) of a separately trained speech recognition system are use… ▽ More

    Submitted 28 May, 2024; v1 submitted 2 September, 2023; originally announced September 2023.

  24. P2M: A Fast Solver for Querying Distance from Point to Mesh Surface

    Authors: Chen Zong, Jiacheng Xu, Jiantao Song, Shuangmin Chen, Shiqing Xin, Wenping Wang, Changhe Tu

    Abstract: Most of the existing point-to-mesh distance query solvers, such as Proximity Query Package (PQP), Embree and Fast Closest Point Query (FCPW), are based on bounding volume hierarchy (BVH). The hierarchical organizational structure enables one to eliminate the vast majority of triangles that do not help find the closest point. In this paper, we develop a totally different algorithmic paradigm, named… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Journal ref: ACM Transactions on Graphics, Volume 42, Issue 4, July 2023

  25. arXiv:2308.06453  [pdf, other

    cs.LG cs.AI cs.CV

    Multi-Label Knowledge Distillation

    Authors: Penghui Yang, Ming-Kun Xie, Chen-Chen Zong, Lei Feng, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

    Abstract: Existing knowledge distillation methods typically work by imparting the knowledge of output logits or intermediate feature maps from the teacher network to the student network, which is very successful in multi-class single-label learning. However, these methods can hardly be extended to the multi-label learning scenario, where each instance is associated with multiple semantic labels, because the… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023. The first two authors contributed equally to this work

  26. arXiv:2307.02716  [pdf, other

    cs.CL cs.CV

    CFSum: A Coarse-to-Fine Contribution Network for Multimodal Summarization

    Authors: Min Xiao, Junnan Zhu, Haitao Lin, Yu Zhou, Chengqing Zong

    Abstract: Multimodal summarization usually suffers from the problem that the contribution of the visual modality is unclear. Existing multimodal summarization approaches focus on designing the fusion methods of different modalities, while ignoring the adaptive conditions under which visual modalities are useful. Therefore, we propose a novel Coarse-to-Fine contribution network for multimodal Summarization (… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: acl2023

  27. arXiv:2305.18098  [pdf, other

    cs.CL

    BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages

    Authors: Wen Yang, Chong Li, Jiajun Zhang, Chengqing Zong

    Abstract: Large language models (LLMs) demonstrate promising translation performance among various natural languages. However, many LLMs especially the open-sourced ones, such as BLOOM and LLaMA, are English-dominant and support only dozens of natural languages, making the potential of LLMs on language translation less explored. In this work, we present BigTranslate which adapts LLaMA that covers only 20 la… ▽ More

    Submitted 21 November, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: 16 pages, 4 figures. Our model is available at https://github.com/ZNLP/BigTranslate

  28. arXiv:2305.05226  [pdf, other

    cs.CL

    Multi-Teacher Knowledge Distillation For Text Image Machine Translation

    Authors: Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, Chengqing Zong

    Abstract: Text image machine translation (TIMT) has been widely used in various real-world applications, which translates source language texts in images into another target language sentence. Existing methods on TIMT are mainly divided into two categories: the recognition-then-translation pipeline model and the end-to-end model. However, how to transfer knowledge from the pipeline model into the end-to-end… ▽ More

    Submitted 9 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted at The 17th International Conference on Document Analysis and Recognition (ICDAR 2023)

  29. arXiv:2305.05166  [pdf, other

    cs.CL

    E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation

    Authors: Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, Chengqing Zong

    Abstract: Text image machine translation (TIMT) aims to translate texts embedded in images from one source language to another target language. Existing methods, both two-stage cascade and one-stage end-to-end architectures, suffer from different issues. The cascade models can benefit from the large-scale optical character recognition (OCR) and MT datasets but the two-stage architecture is redundant. The en… ▽ More

    Submitted 9 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted at The 17th International Conference on Document Analysis and Recognition (ICDAR 2023)

  30. arXiv:2303.10429  [pdf, other

    cs.LG

    Protein Sequence Design with Batch Bayesian Optimisation

    Authors: Chuanjiao Zong

    Abstract: Protein sequence design is a challenging problem in protein engineering, which aims to discover novel proteins with useful biological functions. Directed evolution is a widely-used approach for protein sequence design, which mimics the evolution cycle in a laboratory environment and conducts an iterative protocol. However, the burden of laboratory experiments can be reduced by using machine learni… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: 8 pages, 1 figure and 1 table shows final result, 1 algorithm pipeline

  31. arXiv:2301.04788  [pdf, ps, other

    cs.CL

    Language Cognition and Language Computation -- Human and Machine Language Understanding

    Authors: Shaonan Wang, Nai Ding, Nan Lin, Jiajun Zhang, Chengqing Zong

    Abstract: Language understanding is a key scientific issue in the fields of cognitive and computer science. However, the two disciplines differ substantially in the specific research questions. Cognitive science focuses on analyzing the specific mechanism of the brain and investigating the brain's response to language; few studies have examined the brain's language system as a whole. By contrast, computer s… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: A survey of language comprehension in cognitive sciences and language understanding in computer sciences and their relations

  32. RFEPS: Reconstructing Feature-line Equipped Polygonal Surface

    Authors: Rui Xu, Zixiong Wang, Zhiyang Dou, Chen Zong, Shiqing Xin, Mingyan Jiang, Tao Ju, Changhe Tu

    Abstract: Feature lines are important geometric cues in characterizing the structure of a CAD model. Despite great progress in both explicit reconstruction and implicit reconstruction, it remains a challenging task to reconstruct a polygonal surface equipped with feature lines, especially when the input point cloud is noisy and lacks faithful normal vectors. In this paper, we develop a multistage algorithm,… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: SIGGRAPH Asia 2022

  33. arXiv:2212.02800  [pdf, other

    cs.CL

    Life-long Learning for Multilingual Neural Machine Translation with Knowledge Distillation

    Authors: Yang Zhao, Junnan Zhu, Lu Xiang, Jiajun Zhang, Yu Zhou, Feifei Zhai, Chengqing Zong

    Abstract: A common scenario of Multilingual Neural Machine Translation (MNMT) is that each translation task arrives in a sequential manner, and the training data of previous tasks is unavailable. In this scenario, the current methods suffer heavily from catastrophic forgetting (CF). To alleviate the CF, we investigate knowledge distillation based life-long learning methods. Specifically, in one-tomany scena… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  34. arXiv:2210.09556  [pdf, other

    cs.CL cs.SD eess.AS

    Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation

    Authors: Chen Wang, Yuchen Liu, Boxing Chen, Jiajun Zhang, Wei Luo, Zhongqiang Huang, Chengqing Zong

    Abstract: End-to-end Speech Translation (ST) aims at translating the source language speech into target language text without generating the intermediate transcriptions. However, the training of end-to-end methods relies on parallel ST data, which are difficult and expensive to obtain. Fortunately, the supervised data for automatic speech recognition (ASR) and machine translation (MT) are usually more acces… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted by the main conference of EMNLP 2022

  35. arXiv:2210.00450  [pdf, other

    cs.AI cs.SI

    Citation Trajectory Prediction via Publication Influence Representation Using Temporal Knowledge Graph

    Authors: Chang Zong, Yueting Zhuang, Weiming Lu, Jian Shao, Siliang Tang

    Abstract: Predicting the impact of publications in science and technology has become an important research area, which is useful in various real world scenarios such as technology investment, research direction selection, and technology policymaking. Citation trajectory prediction is one of the most popular tasks in this area. Existing approaches mainly rely on mining temporal and graph data from academic a… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures

    ACM Class: I.2.4; J.4

  36. arXiv:2209.01334  [pdf, other

    cs.LG cs.AI

    Noise-Robust Bidirectional Learning with Dynamic Sample Reweighting

    Authors: Chen-Chen Zong, Zheng-Tao Cao, Hong-Tao Guo, Yun Du, Ming-Kun Xie, Shao-Yuan Li, Sheng-Jun Huang

    Abstract: Deep neural networks trained with standard cross-entropy loss are more prone to memorize noisy labels, which degrades their performance. Negative learning using complementary labels is more robust when noisy labels intervene but with an extremely slow model convergence speed. In this paper, we first introduce a bidirectional learning scheme, where positive learning ensures convergence speed while… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

  37. arXiv:2205.13190  [pdf, other

    cs.CL

    Other Roles Matter! Enhancing Role-Oriented Dialogue Summarization via Role Interactions

    Authors: Haitao Lin, Junnan Zhu, Lu Xiang, Yu Zhou, Jiajun Zhang, Chengqing Zong

    Abstract: Role-oriented dialogue summarization is to generate summaries for different roles in the dialogue, e.g., merchants and consumers. Existing methods handle this task by summarizing each role's content separately and thus are prone to ignore the information from other roles. However, we believe that other roles' content could benefit the quality of summaries, such as the omitted information mentioned… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: Accepted by ACL 2022 main conference

  38. arXiv:2201.07126  [pdf, other

    cs.CL

    Instance-aware Prompt Learning for Language Understanding and Generation

    Authors: Feihu Jin, Jinliang Lu, Jiajun Zhang, Chengqing Zong

    Abstract: Recently, prompt learning has become a new paradigm to utilize pre-trained language models (PLMs) and achieves promising results in downstream tasks with a negligible increase of parameters. The current usage of discrete and continuous prompts assumes that the prompt is fixed for a specific task and all samples in the task share the same prompt. However, a task may contain quite diverse samples in… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: 7 pages, 5 figures

  39. arXiv:2112.04104  [pdf, other

    cs.CL

    Learning to Select the Next Reasonable Mention for Entity Linking

    Authors: Jian Sun, Yu Zhou, Chengqing Zong

    Abstract: Entity linking aims to establish a link between entity mentions in a document and the corresponding entities in knowledge graphs (KGs). Previous work has shown the effectiveness of global coherence for entity linking. However, most of the existing global linking methods based on sequential decisions focus on how to utilize previously linked entities to enhance the later decisions. In those methods… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted to AAAI-2022 Workshop on Knowledge Discovery from Unstructured Data in Financial Services

  40. arXiv:2108.13139  [pdf, other

    cs.CL

    CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization

    Authors: Haitao Lin, Liqun Ma, Junnan Zhu, Lu Xiang, Yu Zhou, Jiajun Zhang, Chengqing Zong

    Abstract: Dialogue summarization has drawn much attention recently. Especially in the customer service domain, agents could use dialogue summaries to help boost their works by quickly knowing customer's issues and service progress. These applications require summaries to contain the perspective of a single speaker and have a clear topic flow structure, while neither are available in existing datasets. There… ▽ More

    Submitted 6 September, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: Accepted by EMNLP2021 main conference

  41. Augmenting Slot Values and Contexts for Spoken Language Understanding with Pretrained Models

    Authors: Haitao Lin, Lu Xiang, Yu Zhou, Jiajun Zhang, Chengqing Zong

    Abstract: Spoken Language Understanding (SLU) is one essential step in building a dialogue system. Due to the expensive cost of obtaining the labeled data, SLU suffers from the data scarcity problem. Therefore, in this paper, we focus on data augmentation for slot filling task in SLU. To achieve that, we aim at generating more diverse data based on existing data. Specifically, we try to exploit the latent l… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: Accepted by Interspeech2021

    Journal ref: https://www.isca-speech.org/archive/interspeech_2021/index.html

  42. arXiv:2010.14920  [pdf, other

    cs.CL

    Bridging the Modality Gap for Speech-to-Text Translation

    Authors: Yuchen Liu, Junnan Zhu, Jiajun Zhang, Chengqing Zong

    Abstract: End-to-end speech translation aims to translate speech in one language into text in another language via an end-to-end way. Most existing methods employ an encoder-decoder structure with a single encoder to learn acoustic representation and semantic information simultaneously, which ignores the speech-and-text modality differences and makes the encoder overloaded, leading to great difficulty in le… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

  43. arXiv:2010.04314  [pdf, other

    cs.CL

    Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

    Authors: Xiaomian Kang, Yang Zhao, Jiajun Zhang, Chengqing Zong

    Abstract: Document-level neural machine translation has yielded attractive improvements. However, majority of existing methods roughly use all context sentences in a fixed scope. They neglect the fact that different source sentences need different sizes of context. To address this problem, we propose an effective approach to select dynamic context so that the document-level translation model can utilize the… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: Accepted to EMNLP 2020 long paper

  44. arXiv:2004.05809  [pdf, other

    cs.CL

    Neural Machine Translation: Challenges, Progress and Future

    Authors: Jiajun Zhang, Chengqing Zong

    Abstract: Machine translation (MT) is a technique that leverages computers to translate human languages automatically. Nowadays, neural machine translation (NMT) which models direct mapping between source and target languages with deep neural networks has achieved a big breakthrough in translation performance and become the de facto paradigm of MT. This article makes a review of NMT framework, discusses the… ▽ More

    Submitted 13 April, 2020; originally announced April 2020.

    Comments: Invited Review of Science China Technological Sciences

  45. arXiv:1912.07240  [pdf, other

    cs.CL cs.LG

    Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding

    Authors: Yuchen Liu, Jiajun Zhang, Hao Xiong, Long Zhou, Zhongjun He, Hua Wu, Haifeng Wang, Chengqing Zong

    Abstract: Speech-to-text translation (ST), which translates source language speech into target language text, has attracted intensive attention in recent years. Compared to the traditional pipeline system, the end-to-end ST model has potential benefits of lower latency, smaller model size, and less error propagation. However, it is notoriously difficult to implement such a model without transcriptions as in… ▽ More

    Submitted 16 December, 2019; originally announced December 2019.

    Comments: Accepted by AAAI 2020

  46. arXiv:1909.00156  [pdf, other

    cs.CL

    NCLS: Neural Cross-Lingual Summarization

    Authors: Junnan Zhu, Qian Wang, Yining Wang, Yu Zhou, Jiajun Zhang, Shaonan Wang, Chengqing Zong

    Abstract: Cross-lingual summarization (CLS) is the task to produce a summary in one particular language for a source document in a different language. Existing methods simply divide this task into two steps: summarization and translation, leading to the problem of error propagation. To handle that, we present an end-to-end CLS framework, which we refer to as Neural Cross-Lingual Summarization (NCLS), for th… ▽ More

    Submitted 31 August, 2019; originally announced September 2019.

    Comments: Accepted to EMNLP-IJCNLP 2019

  47. arXiv:1908.06820  [pdf, other

    cs.CL cs.AI

    Are You for Real? Detecting Identity Fraud via Dialogue Interactions

    Authors: Weikang Wang, Jiajun Zhang, Qian Li, Chengqing Zong, Zhifei Li

    Abstract: Identity fraud detection is of great importance in many real-world scenarios such as the financial industry. However, few studies addressed this problem before. In this paper, we focus on identity fraud detection in loan applications and propose to solve this problem with a novel interactive dialogue system which consists of two modules. One is the knowledge graph (KG) constructor organizing the p… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

    Comments: EMNLP-IJCNLP 2019

  48. arXiv:1907.00820  [pdf, other

    cs.LG cs.CL cs.NE

    Understanding Memory Modules on Learning Simple Algorithms

    Authors: Kexin Wang, Yu Zhou, Shaonan Wang, Jiajun Zhang, Chengqing Zong

    Abstract: Recent work has shown that memory modules are crucial for the generalization ability of neural networks on learning simple algorithms. However, we still have little understanding of the working mechanism of memory modules. To alleviate this problem, we apply a two-step analysis pipeline consisting of first inferring hypothesis about what strategy the model has learned according to visualization an… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: Accepted at the XAI Workshop in IJCAI 2019

  49. arXiv:1906.09601  [pdf, other

    cs.CL cs.AI cs.LG

    Sequence Generation: From Both Sides to the Middle

    Authors: Long Zhou, Jiajun Zhang, Chengqing Zong, Heng Yu

    Abstract: The encoder-decoder framework has achieved promising process for many sequence generation tasks, such as neural machine translation and text summarization. Such a framework usually generates a sequence token by token from left to right, hence (1) this autoregressive decoding procedure is time-consuming when the output sentence becomes longer, and (2) it lacks the guidance of future context which i… ▽ More

    Submitted 23 June, 2019; originally announced June 2019.

    Comments: Accepted by IJCAI 2019

  50. arXiv:1906.04991  [pdf, other

    cs.CL

    Incremental Learning from Scratch for Task-Oriented Dialogue Systems

    Authors: Weikang Wang, Jiajun Zhang, Qian Li, Mei-Yuh Hwang, Chengqing Zong, Zhifei Li

    Abstract: Clarifying user needs is essential for existing task-oriented dialogue systems. However, in real-world applications, developers can never guarantee that all possible user demands are taken into account in the design phase. Consequently, existing systems will break down when encountering unconsidered user needs. To address this problem, we propose a novel incremental learning framework to design ta… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: ACL2019