Skip to main content

Showing 1–50 of 132 results for author: Lam, W

  1. arXiv:2407.04093  [pdf, other

    cs.CL

    Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in Social Conversations

    Authors: Hao Yang, Hongyuan Lu, Xinhua Zeng, Yang Liu, Xiang Zhang, Haoran Yang, Yumeng Zhang, Yiran Wei, Wai Lam

    Abstract: In the rapidly evolving field of natural language processing, dialogue systems primarily employ a single-step dialogue paradigm. Although this paradigm is efficient, it lacks the depth and fluidity of human interactions and does not appear natural. We introduce a novel \textbf{Step}-by-Step Dialogue Paradigm (Stephanie), designed to mimic the ongoing dynamic nature of human conversations. By emplo… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2406.17312  [pdf, other

    cs.CL

    Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning

    Authors: Sen Yang, Leyang Cui, Deng Cai, Xinting Huang, Shuming Shi, Wai Lam

    Abstract: Iterative preference learning, though yielding superior performances, requires online annotated preference labels. In this work, we study strategies to select worth-annotating response pairs for cost-efficient annotation while achieving competitive or even better performances compared with the random selection baseline for iterative preference learning. Built on assumptions regarding uncertainty a… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.10248  [pdf, other

    cs.CL cs.AI

    On the Worst Prompt Performance of Large Language Models

    Authors: Bowen Cao, Deng Cai, Zhisong Zhang, Yuexian Zou, Wai Lam

    Abstract: The performance of large language models (LLMs) is acutely sensitive to the phrasing of prompts, which raises significant concerns about their reliability in real-world scenarios. Existing studies often divide prompts into task-level instructions and case-level inputs and primarily focus on evaluating and improving robustness against variations in tasks-level instructions. However, this setup fail… ▽ More

    Submitted 21 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

  4. arXiv:2404.05955  [pdf, other

    cs.CL cs.AI

    VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?

    Authors: Junpeng Liu, Yifan Song, Bill Yuchen Lin, Wai Lam, Graham Neubig, Yuanzhi Li, Xiang Yue

    Abstract: Multimodal Large Language models (MLLMs) have shown promise in web-related tasks, but evaluating their performance in the web domain remains a challenge due to the lack of comprehensive benchmarks. Existing benchmarks are either designed for general multimodal tasks, failing to capture the unique characteristics of web pages, or focus on end-to-end web agent tasks, unable to measure fine-grained a… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  5. arXiv:2404.01240  [pdf, other

    cs.SE cs.CL cs.CV cs.HC

    AURORA: Navigating UI Tarpits via Automated Neural Screen Understanding

    Authors: Safwat Ali Khan, Wenyu Wang, Yiran Ren, Bin Zhu, Jiangfan Shi, Alyssa McGowan, Wing Lam, Kevin Moran

    Abstract: Nearly a decade of research in software engineering has focused on automating mobile app testing to help engineers in overcoming the unique challenges associated with the software platform. Much of this work has come in the form of Automated Input Generation tools (AIG tools) that dynamically explore app screens. However, such tools have repeatedly been demonstrated to achieve lower-than-expected… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Published at 17th IEEE International Conference on Software Testing, Verification and Validation (ICST) 2024, 12 pages

  6. arXiv:2403.11873  [pdf, other

    cs.CL

    CO3: Low-resource Contrastive Co-training for Generative Conversational Query Rewrite

    Authors: Yifei Yuan, Chen Shi, Runze Wang, Liyi Chen, Renjun Hu, Zengming Zhang, Feijun Jiang, Wai Lam

    Abstract: Generative query rewrite generates reconstructed query rewrites using the conversation history while rely heavily on gold rewrite pairs that are expensive to obtain. Recently, few-shot learning is gaining increasing popularity for this task, whereas these methods are sensitive to the inherent noise due to limited data size. Besides, both attempts face performance degradation when there exists lang… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to COLING 2024

  7. arXiv:2403.09162  [pdf, other

    cs.CL

    Unveiling the Generalization Power of Fine-Tuned Large Language Models

    Authors: Haoran Yang, Yumeng Zhang, Jiaqi Xu, Hongyuan Lu, Pheng Ann Heng, Wai Lam

    Abstract: While Large Language Models (LLMs) have demonstrated exceptional multitasking abilities, fine-tuning these models on downstream, domain-specific datasets is often necessary to yield superior performance on test sets compared to their counterparts without fine-tuning. However, the comprehensive effects of fine-tuning on the LLMs' generalization ability are not fully understood. This paper delves in… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: NAACL 2024

  8. arXiv:2403.07865  [pdf, other

    cs.CL cs.AI cs.CR cs.LG cs.SE

    CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion

    Authors: Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Wai Lam, Lizhuang Ma

    Abstract: The rapid advancement of Large Language Models (LLMs) has brought about remarkable generative capabilities but also raised concerns about their potential misuse. While strategies like supervised fine-tuning and reinforcement learning from human feedback have enhanced their safety, these methods primarily focus on natural languages, which may not generalize to other domains. This paper introduces C… ▽ More

    Submitted 9 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: ACL Findings 2024, Code is available at https://github.com/renqibing/CodeAttack

  9. arXiv:2403.05330  [pdf, other

    cs.CL

    Consecutive Model Editing with Batch alongside HooK Layers

    Authors: Shuaiyi Li, Yang Deng, Deng Cai, Hongyuan Lu, Liang Chen, Wai Lam

    Abstract: As the typical retraining paradigm is unacceptably time- and resource-consuming, researchers are turning to model editing in order to seek an effective, consecutive, and batch-supportive way to edit the model behavior directly. Despite all these practical expectations, existing model editing methods fail to realize all of them. Furthermore, the memory demands for such succession-supportive model e… ▽ More

    Submitted 17 April, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Under review

  10. arXiv:2402.13064  [pdf, other

    cs.CL

    Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

    Authors: Haoran Li, Qingxiu Dong, Zhengyang Tang, Chaojun Wang, Xingxing Zhang, Haoyang Huang, Shaohan Huang, Xiaolong Huang, Zeqiang Huang, Dongdong Zhang, Yuxian Gu, Xin Cheng, Xun Wang, Si-Qing Chen, Li Dong, Wei Lu, Zhifang Sui, Benyou Wang, Wai Lam, Furu Wei

    Abstract: We introduce Generalized Instruction Tuning (called GLAN), a general and scalable method for instruction tuning of Large Language Models (LLMs). Unlike prior work that relies on seed examples or existing datasets to construct instruction tuning data, GLAN exclusively utilizes a pre-curated taxonomy of human knowledge and capabilities as input and generates large-scale synthetic instruction data ac… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Work in progress

  11. arXiv:2402.07742  [pdf, other

    cs.CL cs.CV

    Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search

    Authors: Yifei Yuan, Clemencia Siro, Mohammad Aliannejadi, Maarten de Rijke, Wai Lam

    Abstract: In mixed-initiative conversational search systems, clarifying questions are used to help users who struggle to express their intentions in a single query. These questions aim to uncover user's information needs and resolve query ambiguities. We hypothesize that in scenarios where multimodal information is pertinent, the clarification process can be improved by using non-textual information. Theref… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to WWW24

  12. arXiv:2402.06925  [pdf, other

    cs.CL

    A Thorough Examination of Decoding Methods in the Era of LLMs

    Authors: Chufan Shi, Haoran Yang, Deng Cai, Zhisong Zhang, Yifan Wang, Yujiu Yang, Wai Lam

    Abstract: Decoding methods play an indispensable role in converting language models from next-token predictors into practical task solvers. Prior research on decoding methods, primarily focusing on task-specific models, may not extend to the current era of general-purpose large language models (LLMs). Moreover, the recent influx of decoding strategies has further complicated this landscape. This paper provi… ▽ More

    Submitted 17 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  13. arXiv:2312.14591  [pdf, other

    cs.CL

    Reasons to Reject? Aligning Language Models with Judgments

    Authors: Weiwen Xu, Deng Cai, Zhisong Zhang, Wai Lam, Shuming Shi

    Abstract: As humans, we consistently interact with our peers and receive feedback in the form of natural language. This language feedback allows us to maintain appropriate behavior, and rectify potential errors. The question arises naturally: can we use language feedback to align large language models (LLMs)? In contrast to previous research that aligns LLMs with scalar rewards, we present the first systema… ▽ More

    Submitted 6 June, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted at ACL 2024 Findings. Our source codes and models are publicly available at https://github.com/wwxu21/CUT

  14. arXiv:2311.09802  [pdf, other

    cs.AI cs.CL

    Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs

    Authors: Sen Yang, Xin Li, Leyang Cui, Lidong Bing, Wai Lam

    Abstract: Though prompting LLMs with various reasoning structures produces reasoning proofs along with answers, these proofs are not ensured to be causal and reliable due to the inherent defects of LLMs. Tracking such deficiencies, we present a neuro-symbolic integration method, in which a neural LLM is used to represent the knowledge of the problem while an LLM-free symbolic solver is adopted to do deliber… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  15. arXiv:2311.08803  [pdf, other

    cs.CL

    StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving

    Authors: Chang Gao, Haiyun Jiang, Deng Cai, Shuming Shi, Wai Lam

    Abstract: Most existing prompting methods suffer from the issues of generalizability and consistency, as they often rely on instance-specific solutions that may not be applicable to other instances and lack task-level consistency across the selected few-shot examples. To address these limitations, we propose a comprehensive framework, StrategyLLM, allowing LLMs to perform inductive reasoning, deriving gener… ▽ More

    Submitted 24 May, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  16. arXiv:2311.00262  [pdf, other

    cs.CL cs.AI

    Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents

    Authors: Yang Deng, Wenxuan Zhang, Wai Lam, See-Kiong Ng, Tat-Seng Chua

    Abstract: Proactive dialogues serve as a practical yet challenging dialogue problem in the era of large language models (LLMs), where the dialogue policy planning is the key to improving the proactivity of LLMs. Most existing studies enable the dialogue policy planning of LLMs using various prompting schemes or iteratively enhance this capability in handling the given case with verbal AI feedback. However,… ▽ More

    Submitted 11 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Accepted by ICLR 2024

  17. arXiv:2310.14709  [pdf, other

    cs.CL

    Once Upon a $\textit{Time}$ in $\textit{Graph}$: Relative-Time Pretraining for Complex Temporal Reasoning

    Authors: Sen Yang, Xin Li, Lidong Bing, Wai Lam

    Abstract: Our physical world is constantly evolving over time, rendering challenges for pre-trained language models to understand and reason over the temporal contexts of texts. Existing work focuses on strengthening the direct association between a piece of text and its time-stamp. However, the knowledge-time association is usually insufficient for the downstream tasks that require reasoning over temporal… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 main

  18. arXiv:2310.12557  [pdf, other

    cs.CL cs.AI

    DepWiGNN: A Depth-wise Graph Neural Network for Multi-hop Spatial Reasoning in Text

    Authors: Shuaiyi Li, Yang Deng, Wai Lam

    Abstract: Spatial reasoning in text plays a crucial role in various real-world applications. Existing approaches for spatial reasoning typically infer spatial relations from pure text, which overlooks the gap between natural language and symbolic structures. Graph neural networks (GNNs) have showcased exceptional proficiency in inducing and aggregating symbolic structures. However, classical GNNs face chall… ▽ More

    Submitted 8 March, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  19. arXiv:2310.12132  [pdf, other

    cs.SE

    The Effects of Computational Resources on Flaky Tests

    Authors: Denini Silva, Martin Gruber, Satyajit Gokhale, Ellen Arteca, Alexi Turcotte, Marcelo d'Amorim, Wing Lam, Stefan Winter, Jonathan Bell

    Abstract: Flaky tests are tests that nondeterministically pass and fail in unchanged code. These tests can be detrimental to developers' productivity. Particularly when tests run in continuous integration environments, the tests may be competing for access to limited computational resources (CPUs, memory etc.), and we hypothesize that resource (in)availability may be a significant factor in the failure rate… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  20. arXiv:2310.02953  [pdf, other

    cs.CL

    JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning

    Authors: Chang Gao, Wenxuan Zhang, Guizhen Chen, Wai Lam

    Abstract: Instruction tuning has become an essential process for optimizing the performance of large language models (LLMs). However, current text-to-text instruction tuning methods, referred to as TextTuning, exhibit significant limitations in terms of generalization, robustness, and controllability, primarily due to the absence of explicit task structures. In this paper, we introduce JsonTuning, a novel s… ▽ More

    Submitted 24 May, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  21. arXiv:2309.16270  [pdf, other

    cs.CL

    Social Media Fashion Knowledge Extraction as Captioning

    Authors: Yifei Yuan, Wenxuan Zhang, Yang Deng, Wai Lam

    Abstract: Social media plays a significant role in boosting the fashion industry, where a massive amount of fashion-related posts are generated every day. In order to obtain the rich fashion information from the posts, we study the task of social media fashion knowledge extraction. Fashion knowledge, which typically consists of the occasion, person attributes, and fashion item information, can be effectivel… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: Accepted by SIGIR-AP 2023

  22. arXiv:2309.08325  [pdf, other

    cs.CL

    Distributional Inclusion Hypothesis and Quantifications: Probing for Hypernymy in Functional Distributional Semantics

    Authors: Chun Hei Lo, Wai Lam, Hong Cheng, Guy Emerson

    Abstract: Functional Distributional Semantics (FDS) models the meaning of words by truth-conditional functions. This provides a natural representation for hypernymy but no guarantee that it can be learnt when FDS models are trained on a corpus. In this paper, we probe into FDS models and study the representations learnt, drawing connections between quantifications, the Distributional Inclusion Hypothesis (D… ▽ More

    Submitted 10 February, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 12 pages

  23. arXiv:2309.04725  [pdf, other

    cs.CL

    EPA: Easy Prompt Augmentation on Large Language Models via Multiple Sources and Multiple Targets

    Authors: Hongyuan Lu, Wai Lam

    Abstract: Large language models (LLMs) have shown promising performance on various NLP tasks via task prompting. And their performance can be further improved by appending task demonstrations to the head of the prompt. And usually, a better performance can be achieved with more demonstrations. However, asking the users to write the demonstrations can be cumbersome. As a simple yet cost-effective workaround,… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

  24. arXiv:2306.10022  [pdf

    cs.IR

    The News Delivery Channel Recommendation Based on Granular Neural Network

    Authors: Lin Wu, Rui Li, Jiaxuan Liu, Wong-Hing Lam

    Abstract: With the continuous maturation and expansion of neural network technology, deep neural networks have been widely utilized as the fundamental building blocks of deep learning in a variety of applications, including speech recognition, machine translation, image processing, and the creation of recommendation systems. Therefore, many real-world complex problems can be solved by the deep learning tech… ▽ More

    Submitted 30 May, 2023; originally announced June 2023.

  25. arXiv:2306.02051  [pdf, other

    cs.CL cs.AI

    A Comprehensive Survey on Relation Extraction: Recent Advances and New Frontiers

    Authors: Xiaoyan Zhao, Yang Deng, Min Yang, Lingzhi Wang, Rui Zhang, Hong Cheng, Wai Lam, Ying Shen, Ruifeng Xu

    Abstract: Relation extraction (RE) involves identifying the relations between entities from underlying content. RE serves as the foundation for many natural language processing (NLP) and information retrieval applications, such as knowledge graph completion and question answering. In recent years, deep neural networks have dominated the field of RE and made noticeable progress. Subsequently, the large pre-t… ▽ More

    Submitted 24 June, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

  26. arXiv:2305.18880  [pdf

    cs.CL

    Research on Multilingual News Clustering Based on Cross-Language Word Embeddings

    Authors: Lin Wu, Rui Li, Wong-Hing Lam

    Abstract: Classifying the same event reported by different countries is of significant importance for public opinion control and intelligence gathering. Due to the diverse types of news, relying solely on transla-tors would be costly and inefficient, while depending solely on translation systems would incur considerable performance overheads in invoking translation interfaces and storing translated texts. T… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  27. arXiv:2305.16749  [pdf, other

    cs.SD eess.AS

    Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model

    Authors: Xiang Li, Songxiang Liu, Max W. Y. Lam, Zhiyong Wu, Chao Weng, Helen Meng

    Abstract: Expressive human speech generally abounds with rich and flexible speech prosody variations. The speech prosody predictors in existing expressive speech synthesis methods mostly produce deterministic predictions, which are learned by directly minimizing the norm of prosody prediction error. Its unimodal nature leads to a mismatch with ground truth distribution and harms the model's ability in makin… ▽ More

    Submitted 7 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Proceedings of Interspeech 2023 (doi: 10.21437/Interspeech.2023-715), demo site at https://thuhcsi.github.io/interspeech2023-DiffVar/

  28. arXiv:2305.15719  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Efficient Neural Music Generation

    Authors: Max W. Y. Lam, Qiao Tian, Tang Li, Zongyu Yin, Siyuan Feng, Ming Tu, Yuliang Ji, Rui Xia, Mingbo Ma, Xuchen Song, Jitong Chen, Yuping Wang, Yuxuan Wang

    Abstract: Recent progress in music generation has been remarkably advanced by the state-of-the-art MusicLM, which comprises a hierarchy of three LMs, respectively, for semantic, coarse acoustic, and fine acoustic modelings. Yet, sampling with the MusicLM requires processing through these LMs one by one to obtain the fine-grained acoustic tokens, making it computationally expensive and prohibitive for a real… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  29. arXiv:2305.15676  [pdf, other

    cs.CL

    Enhancing Grammatical Error Correction Systems with Explanations

    Authors: Yuejiao Fei, Leyang Cui, Sen Yang, Wai Lam, Zhenzhong Lan, Shuming Shi

    Abstract: Grammatical error correction systems improve written communication by detecting and correcting language mistakes. To help language learners better understand why the GEC system makes a certain correction, the causes of errors (evidence words) and the corresponding error types are two key factors. To enhance GEC systems with explanations, we introduce EXPECT, a large dataset annotated with evidence… ▽ More

    Submitted 10 June, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 9 pages, 7 figures, accepted to the main conference of ACL 2023

  30. arXiv:2305.13645  [pdf, other

    cs.CL

    mPMR: A Multilingual Pre-trained Machine Reader at Scale

    Authors: Weiwen Xu, Xin Li, Wai Lam, Lidong Bing

    Abstract: We present multilingual Pre-trained Machine Reader (mPMR), a novel method for multilingual machine reading comprehension (MRC)-style pre-training. mPMR aims to guide multilingual pre-trained language models (mPLMs) to perform natural language understanding (NLU) including both sequence classification and span extraction in multiple languages. To achieve cross-lingual generalization when only sourc… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: To appear at ACL 2023 main conference

  31. arXiv:2305.12675  [pdf, other

    cs.CL

    A Frustratingly Simple Decoding Method for Neural Text Generation

    Authors: Haoran Yang, Deng Cai, Huayang Li, Wei Bi, Wai Lam, Shuming Shi

    Abstract: We introduce a frustratingly simple, super efficient and surprisingly effective decoding method, which we call Frustratingly Simple Decoding (FSD), for neural text generation. The idea behind FSD is straightforward: we build an anti-LM based on previously generated text and use this anti-LM to penalize future generation of what has been generated. The anti-LM can be implemented as simple as an n-g… ▽ More

    Submitted 27 February, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: LREC-Coling 2024

  32. arXiv:2305.10276  [pdf, other

    cs.CL

    Chain-of-Symbol Prompting Elicits Planning in Large Langauge Models

    Authors: Hanxu Hu, Hongyuan Lu, Huajian Zhang, Yun-Ze Song, Wai Lam, Yue Zhang

    Abstract: In this paper, we take the initiative to investigate the performance of LLMs on complex planning tasks that require LLMs to understand a virtual spatial environment simulated via natural language and act correspondingly in text. We propose a benchmark named Natural Language Planning and Action (Natala) composed of a set of novel tasks: Brick World, NLVR-based Manipulations, and Natural Language Na… ▽ More

    Submitted 3 October, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  33. arXiv:2305.10172  [pdf, other

    cs.CL cs.IR

    Knowledge-enhanced Mixed-initiative Dialogue System for Emotional Support Conversations

    Authors: Yang Deng, Wenxuan Zhang, Yifei Yuan, Wai Lam

    Abstract: Unlike empathetic dialogues, the system in emotional support conversations (ESC) is expected to not only convey empathy for comforting the help-seeker, but also proactively assist in exploring and addressing their problems during the conversation. In this work, we study the problem of mixed-initiative ESC where the user and system can both take the initiative in leading the conversation. Specifica… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023 main conference

  34. arXiv:2305.09193  [pdf, other

    cs.CL

    Easy-to-Hard Learning for Information Extraction

    Authors: Chang Gao, Wenxuan Zhang, Wai Lam, Lidong Bing

    Abstract: Information extraction (IE) systems aim to automatically extract structured information, such as named entities, relations between entities, and events, from unstructured texts. While most existing work addresses a particular IE task, universally modeling various IE tasks with one model has achieved great success recently. Despite their success, they employ a one-stage learning strategy, i.e., dir… ▽ More

    Submitted 19 May, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  35. arXiv:2305.09154  [pdf, other

    cs.CL

    Progressive Translation: Improving Domain Robustness of Neural Machine Translation with Intermediate Sequences

    Authors: Chaojun Wang, Yang Liu, Wai Lam

    Abstract: Previous studies show that intermediate supervision signals benefit various Natural Language Processing tasks. However, it is not clear whether there exist intermediate signals that benefit Neural Machine Translation (NMT). Borrowing techniques from Statistical Machine Translation, we propose intermediate signals which are intermediate sequences from the "source-like" structure to the "target-like… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: ACL 2023 (Findings)

  36. arXiv:2305.06575  [pdf, other

    cs.CL

    Chain-of-Dictionary Prompting Elicits Translation in Large Language Models

    Authors: Hongyuan Lu, Haoyang Huang, Dongdong Zhang, Haoran Yang, Wai Lam, Furu Wei

    Abstract: Large language models (LLMs) have shown surprisingly good performance in multilingual neural machine translation (MNMT) even when trained without parallel data. Yet, despite the fact that the amount of training data is gigantic, they still struggle with translating rare words, particularly for low-resource languages. Even worse, it is usually unrealistic to retrieve relevant demonstrations for in-… ▽ More

    Submitted 24 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  37. arXiv:2305.02750  [pdf, other

    cs.CL cs.AI

    A Survey on Proactive Dialogue Systems: Problems, Methods, and Prospects

    Authors: Yang Deng, Wenqiang Lei, Wai Lam, Tat-Seng Chua

    Abstract: Proactive dialogue systems, related to a wide range of real-world conversational applications, equip the conversational agent with the capability of leading the conversation direction towards achieving pre-defined targets or fulfilling certain goals from the system side. It is empowered by advanced techniques to progress to more complicated tasks that require strategical and motivational interacti… ▽ More

    Submitted 9 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted by IJCAI 2023 Survey Track

  38. arXiv:2304.04052  [pdf, other

    cs.CL cs.AI cs.LG

    Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder

    Authors: Zihao Fu, Wai Lam, Qian Yu, Anthony Man-Cho So, Shengding Hu, Zhiyuan Liu, Nigel Collier

    Abstract: The sequence-to-sequence (seq2seq) task aims at generating the target sequence based on the given input source sequence. Traditionally, most of the seq2seq task is resolved by the Encoder-Decoder framework which requires an encoder to encode the source sequence and a decoder to generate the target text. Recently, a bunch of new approaches have emerged that apply decoder-only language models direct… ▽ More

    Submitted 8 April, 2023; originally announced April 2023.

  39. arXiv:2302.10331  [pdf, ps, other

    cs.LG cs.LO stat.ML

    Causal Razors

    Authors: Wai-yin Lam

    Abstract: When performing causal discovery, assumptions have to be made on how the true causal mechanism corresponds to the underlying joint probability distribution. These assumptions are labeled as causal razors in this work. We review numerous causal razors that appeared in the literature, and offer a comprehensive logical comparison of them. In particular, we scrutinize an unpopular causal razor, namely… ▽ More

    Submitted 7 August, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 29 pages for the main paper. 14 pages for the supplementary materials

  40. arXiv:2302.08092  [pdf, other

    cs.CL cs.IR

    Product Question Answering in E-Commerce: A Survey

    Authors: Yang Deng, Wenxuan Zhang, Qian Yu, Wai Lam

    Abstract: Product question answering (PQA), aiming to automatically provide instant responses to customer's questions in E-Commerce platforms, has drawn increasing attention in recent years. Compared with typical QA problems, PQA exhibits unique challenges such as the subjectivity and reliability of user-generated contents in E-commerce platforms. Therefore, various problem settings and novel methods have b… ▽ More

    Submitted 3 May, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Accepted by ACL 2023 main conference

  41. ChatGPT and Software Testing Education: Promises & Perils

    Authors: Sajed Jalil, Suzzana Rafi, Thomas D. LaToza, Kevin Moran, Wing Lam

    Abstract: Over the past decade, predictive language modeling for code has proven to be a valuable tool for enabling new forms of automation for developers. More recently, we have seen the advent of general purpose "large language models", based on neural transformer architectures, that have been trained on massive datasets of human written text spanning code and natural language. However, despite the demons… ▽ More

    Submitted 11 March, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 2023 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW), 8 pages, 2 tables, 6 figures

    ACM Class: D.2.5

    Journal ref: 2023 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW)

  42. arXiv:2212.07752  [pdf, other

    cs.CL

    Advancing Multilingual Pre-training: TRIP Triangular Document-level Pre-training for Multilingual Language Models

    Authors: Hongyuan Lu, Haoyang Huang, Shuming Ma, Dongdong Zhang, Wai Lam, Furu Wei

    Abstract: Despite the success of multilingual sequence-to-sequence pre-training, most existing approaches rely on document-level monolingual corpora in many different languages, sentence-level bilingual corpora,\footnote{In this paper, we use `bilingual corpora' to denote parallel corpora with `bilingual translation pairs' in many different language pairs, each consisting of two sentences/documents with the… ▽ More

    Submitted 13 May, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  43. arXiv:2212.04755  [pdf, other

    cs.CL

    From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader

    Authors: Weiwen Xu, Xin Li, Wenxuan Zhang, Meng Zhou, Wai Lam, Luo Si, Lidong Bing

    Abstract: We present Pre-trained Machine Reader (PMR), a novel method for retrofitting pre-trained masked language models (MLMs) to pre-trained machine reading comprehension (MRC) models without acquiring labeled data. PMR can resolve the discrepancy between model pre-training and downstream fine-tuning of existing MLMs. To build the proposed PMR, we constructed a large volume of general-purpose and high-qu… ▽ More

    Submitted 16 October, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: Accepted to NeurIPS 2023

  44. arXiv:2211.15583  [pdf, other

    cs.CL cs.AI cs.LG

    On the Effectiveness of Parameter-Efficient Fine-Tuning

    Authors: Zihao Fu, Haoran Yang, Anthony Man-Cho So, Wai Lam, Lidong Bing, Nigel Collier

    Abstract: Fine-tuning pre-trained models has been ubiquitously proven to be effective in a wide range of NLP tasks. However, fine-tuning the whole model is parameter inefficient as it always yields an entirely new model for each task. Currently, many research works propose to only fine-tune a small portion of the parameters while keeping most of the parameters shared across different tasks. These methods ac… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  45. arXiv:2210.12775  [pdf, other

    cs.CL cs.AI

    McQueen: a Benchmark for Multimodal Conversational Query Rewrite

    Authors: Yifei Yuan, Chen Shi, Runze Wang, Liyi Chen, Feijun Jiang, Yuan You, Wai Lam

    Abstract: The task of query rewrite aims to convert an in-context query to its fully-specified version where ellipsis and coreference are completed and referred-back according to the history context. Although much progress has been made, less efforts have been paid to real scenario conversations that involve drawing information from more than one modalities. In this paper, we propose the task of multimodal… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted by EMNLP22

  46. arXiv:2210.12674  [pdf, other

    cs.CL

    Towards Generalizable and Robust Text-to-SQL Parsing

    Authors: Chang Gao, Bowen Li, Wenxuan Zhang, Wai Lam, Binhua Li, Fei Huang, Luo Si, Yongbin Li

    Abstract: Text-to-SQL parsing tackles the problem of mapping natural language questions to executable SQL queries. In practice, text-to-SQL parsers often encounter various challenging scenarios, requiring them to be generalizable and robust. While most existing work addresses a particular generalization or robustness challenge, we aim to study it in a more comprehensive manner. In specific, we believe that… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  47. arXiv:2210.09773  [pdf, other

    cs.CL cs.AI

    Retrofitting Multilingual Sentence Embeddings with Abstract Meaning Representation

    Authors: Deng Cai, Xin Li, Jackie Chun-Sing Ho, Lidong Bing, Wai Lam

    Abstract: We introduce a new method to improve existing multilingual sentence embeddings with Abstract Meaning Representation (AMR). Compared with the original textual input, AMR is a structured semantic representation that presents the core concepts and relations in a sentence explicitly and unambiguously. It also helps reduce surface variations across different expressions and languages. Unlike most prior… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: EMNLP2022

  48. arXiv:2210.08855  [pdf, other

    cs.CL

    PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks

    Authors: Weiwen Xu, Xin Li, Yang Deng, Wai Lam, Lidong Bing

    Abstract: Span identification aims at identifying specific text spans from text input and classifying them into pre-defined categories. Different from previous works that merely leverage the Subordinate (SUB) relation (i.e. if a span is an instance of a certain category) to train models, this paper for the first time explores the Peer (PR) relation, which indicates that two spans are instances of the same c… ▽ More

    Submitted 18 May, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: To appear at ACL 2023 main conference

  49. arXiv:2210.08817  [pdf, other

    cs.CL

    PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance

    Authors: Yang Deng, Wenqiang Lei, Wenxuan Zhang, Wai Lam, Tat-Seng Chua

    Abstract: To facilitate conversational question answering (CQA) over hybrid contexts in finance, we present a new dataset, named PACIFIC. Compared with existing CQA datasets, PACIFIC exhibits three key features: (i) proactivity, (ii) numerical reasoning, and (iii) hybrid context of tables and text. A new task is defined accordingly to study Proactive Conversational Question Answering (PCQA), which combines… ▽ More

    Submitted 18 March, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted by EMNLP 2022 (main conference)

  50. arXiv:2210.08697  [pdf, other

    cs.CL

    ConReader: Exploring Implicit Relations in Contracts for Contract Clause Extraction

    Authors: Weiwen Xu, Yang Deng, Wenqiang Lei, Wenlong Zhao, Tat-Seng Chua, Wai Lam

    Abstract: We study automatic Contract Clause Extraction (CCE) by modeling implicit relations in legal contracts. Existing CCE methods mostly treat contracts as plain text, creating a substantial barrier to understanding contracts of high complexity. In this work, we first comprehensively analyze the complexity issues of contracts and distill out three implicit relations commonly found in contracts, namely,… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: To appear at EMNLP 2022 main conference