Showing 1–50 of 119 results for author: Bing, L

  1. arXiv:2406.17294  [pdf, other]

    cs.CL

    Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

    Authors: Wenhao Shi, Zhiqiang Hu, Yi Bin, Junhua Liu, Yang Yang, See-Kiong Ng, Lidong Bing, Roy Ka-Wei Lee

    Abstract: Large language models (LLMs) have demonstrated impressive reasoning capabilities, particularly in textual mathematical problem-solving. However, existing open-source image instruction fine-tuning datasets, containing limited question-answer pairs per image, do not fully exploit visual information to enhance the multimodal mathematical reasoning capabilities of Multimodal LLMs (MLLMs). To bridge th…

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 8 pages

  2. arXiv:2406.07476  [pdf, other]

    cs.CV cs.CL

    VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

    Authors: Zesen Cheng, Sicong Leng, Hang Zhang, Yifei Xin, Xin Li, Guanzheng Chen, Yongxin Zhu, Wenqi Zhang, Ziyang Luo, Deli Zhao, Lidong Bing

    Abstract: In this paper, we present the VideoLLaMA 2, a set of Video Large Language Models (Video-LLMs) designed to enhance spatial-temporal modeling and audio understanding in video and audio-oriented tasks. Building upon its predecessor, VideoLLaMA 2 incorporates a tailor-made Spatial-Temporal Convolution (STC) connector, which effectively captures the intricate spatial and temporal dynamics of video data…

    Submitted 17 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: ZC, SL, HZ, YX, and XL contributed equally to this project

  3. arXiv:2405.20267  [pdf, other]

    cs.CL

    Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions

    Authors: Ruochen Zhao, Wenxuan Zhang, Yew Ken Chia, Deli Zhao, Lidong Bing

    Abstract: As LLMs evolve on a daily basis, there is an urgent need for a trustworthy evaluation method that can provide robust evaluation results in a timely fashion. Currently, as static benchmarks are prone to contamination concerns, users tend to trust human voting platforms, such as Chatbot Arena. However, human annotations require extensive manual efforts. To provide an automatic, robust, and trustwort…

    Submitted 12 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  4. arXiv:2404.12872  [pdf, other]

    cs.DB cs.CL

    LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency

    Authors: Zhaodonghui Li, Haitao Yuan, Huiming Wang, Gao Cong, Lidong Bing

    Abstract: Query rewrite, which aims to generate more efficient queries by altering a SQL query's structure without changing the query result, has been an important research problem. In order to maintain equivalence between the rewritten query and the original one during rewriting, traditional query rewrite methods always rewrite the queries following certain rewrite rules. However, some problems still remai…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 12 pages

  5. arXiv:2404.00570  [pdf, other]

    cs.CL

    ParaICL: Towards Robust Parallel In-Context Learning

    Authors: Xingxuan Li, Xuan-Phi Nguyen, Shafiq Joty, Lidong Bing

    Abstract: Large language models (LLMs) have become the norm in natural language processing (NLP), excelling in few-shot in-context learning (ICL) with their remarkable abilities. Nonetheless, the success of ICL largely hinges on the choice of few-shot demonstration examples, making the selection process increasingly crucial. Existing methods have delved into optimizing the quantity and semantic similarity o…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Work in progress

  6. arXiv:2403.13315  [pdf, other]

    cs.CV

    PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns

    Authors: Yew Ken Chia, Vernon Toh Yan Han, Deepanway Ghosal, Lidong Bing, Soujanya Poria

    Abstract: Large multimodal models extend the impressive capabilities of large language models by integrating multimodal understanding abilities. However, it is not clear how they can emulate the general intelligence and reasoning ability of humans. As recognizing patterns and abstracting concepts are key to general intelligence, we introduce PuzzleVQA, a collection of puzzles based on abstract patterns. Wit…

    Submitted 30 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  7. arXiv:2403.10258  [pdf, other]

    cs.CL

    Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models

    Authors: Chaoqun Liu, Wenxuan Zhang, Yiran Zhao, Anh Tuan Luu, Lidong Bing

    Abstract: Large language models (LLMs) have demonstrated multilingual capabilities; yet, they are mostly English-centric due to the imbalanced training corpora. Existing works leverage this phenomenon to improve their multilingual performances through translation, primarily on natural language processing (NLP) tasks. This work extends the evaluation from NLP tasks to real user queries and from English-centr…

    Submitted 20 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 19 pages

  8. arXiv:2402.18913  [pdf, other]

    cs.CL cs.AI

    AdaMergeX: Cross-Lingual Transfer with Large Language Models via Adaptive Adapter Merging

    Authors: Yiran Zhao, Wenxuan Zhang, Huiming Wang, Kenji Kawaguchi, Lidong Bing

    Abstract: As an effective alternative to the direct fine-tuning on target tasks in specific languages, cross-lingual transfer addresses the challenges of limited training data by decoupling "task ability" and "language ability" by fine-tuning on the target task in the source language and another selected task in the target language, respectively. However, they fail to fully separate the task ability fro…

    Submitted 29 February, 2024; originally announced February 2024.

  9. arXiv:2402.18815  [pdf, other]

    cs.CL cs.AI

    How do Large Language Models Handle Multilingualism?

    Authors: Yiran Zhao, Wenxuan Zhang, Guizhen Chen, Kenji Kawaguchi, Lidong Bing

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities across diverse languages. This study explores how LLMs handle multilingualism. Based on observed language ratio shifts among layers and the relationships between network structures and certain capabilities, we hypothesize the LLM's multilingual workflow ($\texttt{MWork}$): LLMs initially understand the query, converting multili…

    Submitted 24 May, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  10. arXiv:2312.00738  [pdf, other]

    cs.CL

    SeaLLMs -- Large Language Models for Southeast Asia

    Authors: Xuan-Phi Nguyen, Wenxuan Zhang, Xin Li, Mahani Aljunied, Zhiqiang Hu, Chenhui Shen, Yew Ken Chia, Xingxuan Li, Jianyu Wang, Qingyu Tan, Liying Cheng, Guanzheng Chen, Yue Deng, Sen Yang, Chaoqun Liu, Hang Zhang, Lidong Bing

    Abstract: Despite the remarkable achievements of large language models (LLMs) in various tasks, there remains a linguistic bias that favors high-resource languages, such as English, often at the expense of low-resource and regional languages. To address this imbalance, we introduce SeaLLMs, an innovative series of language models that specifically focuses on Southeast Asian (SEA) languages. SeaLLMs are buil…

    Submitted 1 July, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Technical report, ACL 2024 DEMO TRACK

  11. arXiv:2311.16922  [pdf, other]

    cs.CV cs.AI cs.CL

    Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

    Authors: Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing

    Abstract: Large Vision-Language Models (LVLMs) have advanced considerably, intertwining visual recognition and language understanding to generate content that is not only coherent but also contextually attuned. Despite their success, LVLMs still suffer from the issue of object hallucinations, where models generate plausible yet incorrect outputs that include objects that do not exist in the images. To mitig…

    Submitted 28 November, 2023; originally announced November 2023.

  12. arXiv:2311.09821  [pdf, other]

    cs.CL

    Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning

    Authors: Qingyu Tan, Hwee Tou Ng, Lidong Bing

    Abstract: Knowledge in the real world is being updated constantly. However, it is costly to frequently update large language models (LLMs). Therefore, it is crucial for LLMs to understand the concept of temporal knowledge. However, prior works on temporal question answering (TQA) did not emphasize multi-answer and multi-hop types of temporal reasoning. In this paper, we propose a complex temporal question-a…

    Submitted 12 July, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: To appear in Findings of ACL 2024

  13. arXiv:2311.09802  [pdf, other]

    cs.AI cs.CL

    Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs

    Authors: Sen Yang, Xin Li, Leyang Cui, Lidong Bing, Wai Lam

    Abstract: Though prompting LLMs with various reasoning structures produces reasoning proofs along with answers, these proofs are not ensured to be causal and reliable due to the inherent defects of LLMs. Tracking such deficiencies, we present a neuro-symbolic integration method, in which a neural LLM is used to represent the knowledge of the problem while an LLM-free symbolic solver is adopted to do deliber…

    Submitted 16 November, 2023; originally announced November 2023.

  14. arXiv:2311.09277  [pdf, other]

    cs.CL

    Contrastive Chain-of-Thought Prompting

    Authors: Yew Ken Chia, Guizhen Chen, Luu Anh Tuan, Soujanya Poria, Lidong Bing

    Abstract: Despite the success of chain of thought in enhancing language model reasoning, the underlying process remains less well understood. Although logically sound reasoning appears inherently crucial for chain of thought, prior studies surprisingly reveal minimal impact when using invalid demonstrations instead. Furthermore, the conventional chain of thought does not inform language models on what mista…

    Submitted 15 November, 2023; originally announced November 2023.

  15. arXiv:2311.09022  [pdf, other]

    cs.CL

    Exploring the Potential of Large Language Models in Computational Argumentation

    Authors: Guizhen Chen, Liying Cheng, Luu Anh Tuan, Lidong Bing

    Abstract: Computational argumentation has become an essential tool in various domains, including law, public policy, and artificial intelligence. It is an emerging research field in natural language processing that attracts increasing attention. Research on computational argumentation mainly involves two types of tasks: argument mining and argument generation. As large language models (LLMs) have demonstrat…

    Submitted 1 July, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted at ACL 2024 Main

  16. arXiv:2311.02205  [pdf, other]

    cs.CL

    An Introduction to Natural Language Processing Techniques and Framework for Clinical Implementation in Radiation Oncology

    Authors: Reza Khanmohammadi, Mohammad M. Ghassemi, Kyle Verdecchia, Ahmed I. Ghanem, Luo Bing, Indrin J. Chetty, Hassan Bagher-Ebadian, Farzan Siddiqui, Mohamed Elshaikh, Benjamin Movsas, Kundan Thind

    Abstract: Natural Language Processing (NLP) is a key technique for developing Medical Artificial Intelligence (AI) systems that leverage Electronic Health Record (EHR) data to build diagnostic and prognostic models. NLP enables the conversion of unstructured clinical text into structured data that can be fed into AI algorithms. The emergence of the transformer architecture and large language models (LLMs) h…

    Submitted 8 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

  17. arXiv:2310.17924  [pdf, other]

    cs.CL

    SOUL: Towards Sentiment and Opinion Understanding of Language

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: Sentiment analysis is a well-established natural language processing task, with sentiment polarity classification being one of its most popular and representative tasks. However, despite the success of pre-trained language models in this area, they often fall short of capturing the broader complexities of sentiment analysis. To address this issue, we propose a new task called Sentiment and Opinion…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference, Short Paper

  18. arXiv:2310.16450  [pdf, other]

    cs.CL

    CLEX: Continuous Length Extrapolation for Large Language Models

    Authors: Guanzheng Chen, Xin Li, Zaiqiao Meng, Shangsong Liang, Lidong Bing

    Abstract: Transformer-based Large Language Models (LLMs) are pioneering advances in many natural language processing tasks; however, their exceptional capabilities are restricted within the preset context window of Transformer. Position Embedding (PE) scaling methods, while effective in extending the context window to a specific length, demonstrate either notable limitations in their extrapolation abilities…

    Submitted 24 March, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  19. arXiv:2310.14709  [pdf, other]

    cs.CL

    Once Upon a $\textit{Time}$ in $\textit{Graph}$: Relative-Time Pretraining for Complex Temporal Reasoning

    Authors: Sen Yang, Xin Li, Lidong Bing, Wai Lam

    Abstract: Our physical world is constantly evolving over time, rendering challenges for pre-trained language models to understand and reason over the temporal contexts of texts. Existing work focuses on strengthening the direct association between a piece of text and its time-stamp. However, the knowledge-time association is usually insufficient for the downstream tasks that require reasoning over temporal…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 main

  20. arXiv:2310.10962  [pdf, other]

    cs.CL

    Large Language Models can Contrastively Refine their Generation for Better Sentence Representation Learning

    Authors: Huiming Wang, Zhaodonghui Li, Liying Cheng, Soh De Wen, Lidong Bing

    Abstract: Recently, large language models (LLMs) have emerged as a groundbreaking technology and their unparalleled text generation capabilities have sparked interest in their application to the fundamental sentence representation learning task. Existing methods have explored utilizing LLMs as data annotators to generate synthesized data for training contrastive learning based sentence embedding models such…

    Submitted 17 May, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: NAACL 2024

  21. arXiv:2310.06474  [pdf, other]

    cs.CL

    Multilingual Jailbreak Challenges in Large Language Models

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: While large language models (LLMs) exhibit remarkable capabilities across a wide range of tasks, they pose potential safety concerns, such as the "jailbreak" problem, wherein malicious instructions can manipulate LLMs to exhibit undesirable behavior. Although several preventive measures have been developed to mitigate the potential risks associated with LLMs, they have primarily focused on Engli…

    Submitted 3 March, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  22. arXiv:2306.11372  [pdf, other]

    cs.CL cs.AI

    Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts

    Authors: Xuan-Phi Nguyen, Sharifah Mahani Aljunied, Shafiq Joty, Lidong Bing

    Abstract: Large language models (LLMs) are known to effectively perform tasks by simply observing few exemplars. However, in low-resource languages, obtaining such hand-picked exemplars can still be challenging, where unsupervised techniques may be necessary. Moreover, competent generative capabilities of LLMs are observed only in high-resource languages, while their performances among under-represented lan…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Pre-print

  23. arXiv:2306.09697  [pdf, other]

    cs.CL

    Class-Adaptive Self-Training for Relation Extraction with Incompletely Annotated Training Data

    Authors: Qingyu Tan, Lu Xu, Lidong Bing, Hwee Tou Ng

    Abstract: Relation extraction (RE) aims to extract relations from sentences and documents. Existing relation extraction models typically rely on supervised machine learning. However, recent studies showed that many RE datasets are incompletely annotated. This is known as the false negative problem in which valid relations are falsely annotated as 'no_relation'. Models trained with such data inevitably make…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Findings

  24. arXiv:2306.08952  [pdf, other]

    cs.CL cs.AI

    Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models

    Authors: Qingyu Tan, Hwee Tou Ng, Lidong Bing

    Abstract: Reasoning about time is of fundamental importance. Many facts are time-dependent. For example, athletes change teams from time to time, and different government officials are elected periodically. Previous time-dependent question answering (QA) datasets tend to be biased in either their coverage of time spans or question types. In this paper, we introduce a comprehensive probing dataset \tempreaso…

    Submitted 27 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  25. arXiv:2306.05179  [pdf, other]

    cs.CL cs.CV

    M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models

    Authors: Wenxuan Zhang, Sharifah Mahani Aljunied, Chang Gao, Yew Ken Chia, Lidong Bing

    Abstract: Despite the existence of various benchmarks for evaluating natural language processing models, we argue that human exams are a more suitable means of evaluating general intelligence for large language models (LLMs), as they inherently demand a much wider range of abilities such as language understanding, domain knowledge, and problem-solving skills. To this end, we introduce M3Exam, a novel benchm…

    Submitted 9 November, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 (Datasets and Benchmarks)

  26. arXiv:2306.04757  [pdf, other]

    cs.CL cs.AI

    INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models

    Authors: Yew Ken Chia, Pengfei Hong, Lidong Bing, Soujanya Poria

    Abstract: Instruction-tuned large language models have revolutionized natural language processing and have shown great potential in applications such as conversational agents. These models, such as GPT-4, can not only master language but also solve complex tasks in areas like mathematics, coding, medicine, and law. Despite their impressive capabilities, there is still a lack of comprehensive understanding r…

    Submitted 15 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Github: https://github.com/declare-lab/instruct-eval Leaderboard: https://declare-lab.github.io/instruct-eval/

  27. arXiv:2306.02858  [pdf, other]

    cs.CL cs.CV cs.SD eess.AS

    Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

    Authors: Hang Zhang, Xin Li, Lidong Bing

    Abstract: We present Video-LLaMA, a multi-modal framework that empowers Large Language Models (LLMs) with the capability of understanding both visual and auditory content in the video. Video-LLaMA bootstraps cross-modal training from the frozen pre-trained visual and audio encoders and the frozen LLMs. Unlike previous works that complement LLMs to process the visual or audio signals only, Video-LLaMA enables…

    Submitted 25 October, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by EMNLP 2023's demo track; Code, Pretrained Model, and Dataset: https://github.com/DAMO-NLP-SG/Video-LLaMA

  28. arXiv:2305.19902  [pdf, other]

    cs.CL

    AQE: Argument Quadruplet Extraction via a Quad-Tagging Augmented Generative Approach

    Authors: Jia Guo, Liying Cheng, Wenxuan Zhang, Stanley Kok, Xin Li, Lidong Bing

    Abstract: Argument mining involves multiple sub-tasks that automatically identify argumentative elements, such as claim detection, evidence extraction, stance classification, etc. However, each subtask alone is insufficient for a thorough understanding of the argumentative structure and reasoning process. To learn a complete view of an argument essay and capture the interdependence among argumentative compo…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  29. arXiv:2305.15038  [pdf, other]

    cs.CL

    Is GPT-4 a Good Data Analyst?

    Authors: Liying Cheng, Xingxuan Li, Lidong Bing

    Abstract: As large language models (LLMs) have demonstrated their powerful capabilities in plenty of domains and tasks, including context understanding, code generation, language generation, data storytelling, etc., many data analysts may raise concerns if their jobs will be replaced by artificial intelligence (AI). This controversial topic has drawn great attention in public. However, we are still at a sta…

    Submitted 22 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 19 pages, 2 figures

  30. arXiv:2305.15014  [pdf, other]

    cs.CL

    Unlocking Temporal Question Answering for Large Language Models Using Code Execution

    Authors: Xingxuan Li, Liying Cheng, Qingyu Tan, Hwee Tou Ng, Shafiq Joty, Lidong Bing

    Abstract: Large language models (LLMs) have made significant progress in natural language processing (NLP), and are utilized extensively in various applications. Recent works, such as chain-of-thought (CoT), have shown that intermediate reasoning steps can improve the performance of LLMs for complex reasoning tasks, such as math problems and symbolic question-answering tasks. However, we notice the challeng…

    Submitted 24 May, 2023; originally announced May 2023.

  31. arXiv:2305.15005  [pdf, other]

    cs.CL

    Sentiment Analysis in the Era of Large Language Models: A Reality Check

    Authors: Wenxuan Zhang, Yue Deng, Bing Liu, Sinno Jialin Pan, Lidong Bing

    Abstract: Sentiment analysis (SA) has been a long-standing research area in natural language processing. It can offer rich insights into human sentiments and opinions and has thus seen considerable interest from both academia and industry. With the advent of large language models (LLMs) such as ChatGPT, there is a great potential for their employment on SA problems. However, the extent to which existing LLM…

    Submitted 24 May, 2023; originally announced May 2023.

  32. arXiv:2305.14434  [pdf, other]

    cs.CL

    Domain-Expanded ASTE: Rethinking Generalization in Aspect Sentiment Triplet Extraction

    Authors: Yew Ken Chia, Hui Chen, Wei Han, Guizhen Chen, Sharifah Mahani Aljunied, Soujanya Poria, Lidong Bing

    Abstract: Aspect Sentiment Triplet Extraction (ASTE) is a subtask of Aspect-Based Sentiment Analysis (ABSA) that considers each opinion term, their expressed sentiment, and the corresponding aspect targets. However, existing methods are limited to the in-domain setting with two domains. Hence, we propose a domain-expanded benchmark to address the in-domain, out-of-domain and cross-domain settings. We suppor…

    Submitted 23 May, 2023; originally announced May 2023.

  33. arXiv:2305.13645  [pdf, other]

    cs.CL

    mPMR: A Multilingual Pre-trained Machine Reader at Scale

    Authors: Weiwen Xu, Xin Li, Wai Lam, Lidong Bing

    Abstract: We present multilingual Pre-trained Machine Reader (mPMR), a novel method for multilingual machine reading comprehension (MRC)-style pre-training. mPMR aims to guide multilingual pre-trained language models (mPLMs) to perform natural language understanding (NLU) including both sequence classification and span extraction in multiple languages. To achieve cross-lingual generalization when only sourc…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: To appear at ACL 2023 main conference

  34. arXiv:2305.13628  [pdf, other]

    cs.CL

    Improving Self-training for Cross-lingual Named Entity Recognition with Contrastive and Prototype Learning

    Authors: Ran Zhou, Xin Li, Lidong Bing, Erik Cambria, Chunyan Miao

    Abstract: In cross-lingual named entity recognition (NER), self-training is commonly used to bridge the linguistic gap by training on pseudo-labeled target-language data. However, due to sub-optimal performance on target languages, the pseudo labels are often noisy and limit the overall performance. In this work, we aim to improve self-training for cross-lingual NER by combining representation learning and…

    Submitted 4 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL2023

  35. arXiv:2305.13269  [pdf, other]

    cs.CL

    Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources

    Authors: Xingxuan Li, Ruochen Zhao, Yew Ken Chia, Bosheng Ding, Shafiq Joty, Soujanya Poria, Lidong Bing

    Abstract: We present chain-of-knowledge (CoK), a novel framework that augments large language models (LLMs) by dynamically incorporating grounding information from heterogeneous sources. It results in more factual rationales and reduced hallucination in generation. Specifically, CoK consists of three stages: reasoning preparation, dynamic knowledge adapting, and answer consolidation. Given a knowledge-inten…

    Submitted 21 February, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by ICLR 2024

  36. arXiv:2305.13142  [pdf, other]

    cs.CL

    Better Sampling of Negatives for Distantly Supervised Named Entity Recognition

    Authors: Lu Xu, Lidong Bing, Wei Lu

    Abstract: Distantly supervised named entity recognition (DS-NER) has been proposed to exploit the automatically labeled training data instead of human annotations. The distantly annotated datasets are often noisy and contain a considerable number of false negatives. The recent approach uses a weighted sampling approach to select a subset of negative samples for training. However, it requires a good classifi…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL Findings 2023

  37. arXiv:2305.13091  [pdf, other]

    cs.CL

    Large Language Models are Not Yet Human-Level Evaluators for Abstractive Summarization

    Authors: Chenhui Shen, Liying Cheng, Xuan-Phi Nguyen, Yang You, Lidong Bing

    Abstract: With the recent undeniable advancement in reasoning abilities in large language models (LLMs) like ChatGPT and GPT-4, there is a growing trend for using LLMs on various tasks. One area where LLMs can be employed is as an alternative evaluation metric for complex generative tasks, which generally demands expensive human judges to complement the traditional automatic metrics for various evaluation d…

    Submitted 19 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 19 pages, 5 figures

    Journal ref: Findings of EMNLP 2023

  38. arXiv:2305.12678  [pdf, other]

    cs.CL

    Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Anh Tuan Luu, Cong-Duy Nguyen, Zhen Hai, Lidong Bing

    Abstract: Multimodal Review Helpfulness Prediction (MRHP) aims to rank product reviews based on predicted helpfulness scores and has been widely applied in e-commerce via presenting customers with useful reviews. Previous studies commonly employ fully-connected neural networks (FCNNs) as the final score predictor and pairwise loss as the training objective. However, FCNNs have been shown to perform ineffici…

    Submitted 25 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Published in ACL 2023 (Findings)

  39. arXiv:2305.11791  [pdf, other]

    cs.CL

    Enhancing Few-shot NER with Prompt Ordering based Data Augmentation

    Authors: Huiming Wang, Liying Cheng, Wenxuan Zhang, De Wen Soh, Lidong Bing

    Abstract: Recently, data augmentation (DA) methods have been proven to be effective for pre-trained language models (PLMs) in low-resource settings, including few-shot named entity recognition (NER). However, conventional NER DA methods are mostly aimed at sequence labeling models, i.e., token-level classification, and few are compatible with unified autoregressive generation frameworks, which can handle a…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 7 pages, 2 figures

  40. arXiv:2305.11719  [pdf, other]

    cs.CV cs.CL

    Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling

    Authors: Shengqiong Wu, Hao Fei, Yixin Cao, Lidong Bing, Tat-Seng Chua

    Abstract: Existing research on multimodal relation extraction (MRE) faces two co-existing challenges, internal-information over-utilization and external-information under-exploitation. To combat that, we propose a novel framework that simultaneously implements the idea of internal-information screening and external-information exploiting. First, we represent the fine-grained semantic structures of the input…

    Submitted 25 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  41. arXiv:2305.11442  [pdf, other]

    cs.CL cs.AI cs.LG

    Zero-Shot Text Classification via Self-Supervised Tuning

    Authors: Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu, Chip Hong Chang, Lidong Bing

    Abstract: Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data…

    Submitted 25 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted to the Findings of ACL 2023

  42. arXiv:2305.11255  [pdf, other]

    cs.CL

    Reasoning Implicit Sentiment with Chain-of-Thought Prompting

    Authors: Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li, Tat-Seng Chua

    Abstract: While sentiment analysis systems try to determine the sentiment polarities of given targets based on the key opinion expressions in input texts, in implicit sentiment analysis (ISA) the opinion cues come in an implicit and obscure manner. Thus detecting implicit sentiment requires the common-sense and multi-hop reasoning ability to infer the latent intent of opinion. Inspired by the recent chain-o…

    Submitted 8 June, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: ACL2023 Short Paper

  43. arXiv:2305.09509  [pdf, other]

    cs.CL

    Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment Analysis

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: Cross-domain aspect-based sentiment analysis (ABSA) aims to perform various fine-grained sentiment analysis tasks on a target domain by transferring knowledge from a source domain. Since labeled data only exists in the source domain, a model is expected to bridge the domain gap for tackling cross-domain ABSA. Though domain adaptation methods have proven to be effective, most of them are based on a…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: ACL 2023 main conference

  44. arXiv:2305.09193  [pdf, other]

    cs.CL

    Easy-to-Hard Learning for Information Extraction

    Authors: Chang Gao, Wenxuan Zhang, Wai Lam, Lidong Bing

    Abstract: Information extraction (IE) systems aim to automatically extract structured information, such as named entities, relations between entities, and events, from unstructured texts. While most existing work addresses a particular IE task, universally modeling various IE tasks with one model has achieved great success recently. Despite their success, they employ a one-stage learning strategy, i.e., dir…

    Submitted 19 May, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  45. arXiv:2305.08503  [pdf, other]

    cs.CL

    A Hierarchical Encoding-Decoding Scheme for Abstractive Multi-document Summarization

    Authors: Chenhui Shen, Liying Cheng, Xuan-Phi Nguyen, Yang You, Lidong Bing

    Abstract: Pre-trained language models (PLMs) have achieved outstanding achievements in abstractive single-document summarization (SDS). However, such benefits may not fully extend to multi-document summarization (MDS), where the handling of cross-document information is more complex. Previous works either design new MDS architectures or apply PLMs bluntly with concatenated source documents as a reformulated…

    Submitted 1 November, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 16 pages, 3 figures

    Journal ref: Findings of EMNLP 2023

  46. arXiv:2305.03268  [pdf, other]

    cs.CL

    Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework

    Authors: Ruochen Zhao, Xingxuan Li, Shafiq Joty, Chengwei Qin, Lidong Bing

    Abstract: As large language models (LLMs) have become the norm in NLP, demonstrating good performance in generation and reasoning tasks, one of its most fatal disadvantages is the lack of factual correctness. Generating unfactual texts not only leads to lower performances but also degrades the trust and validity of their applications. Chain-of-Thought (CoT) prompting improves trust and model performance on…

    Submitted 4 May, 2023; originally announced May 2023.

  47. arXiv:2304.11076  [pdf, other]

    cs.CL cs.AI

    Can ChatGPT-like Generative Models Guarantee Factual Accuracy? On the Mistakes of New Generation Search Engines

    Authors: Ruochen Zhao, Xingxuan Li, Yew Ken Chia, Bosheng Ding, Lidong Bing

    Abstract: Although large conversational AI models such as OpenAI's ChatGPT have demonstrated great potential, we question whether such models can guarantee factual accuracy. Recently, technology companies such as Microsoft and Google have announced new services which aim to combine search engines with conversational AI. However, we have found numerous mistakes in the public demonstrations that suggest we sh…

    Submitted 2 March, 2023; originally announced April 2023.

  48. arXiv:2304.01933  [pdf, other]

    cs.CL

    LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models

    Authors: Zhiqiang Hu, Lei Wang, Yihuai Lan, Wanyu Xu, Ee-Peng Lim, Lidong Bing, Xing Xu, Soujanya Poria, Roy Ka-Wei Lee

    Abstract: The success of large language models (LLMs), like GPT-4 and ChatGPT, has led to the development of numerous cost-effective and accessible alternatives that are created by finetuning open-access LLMs with task-specific data (e.g., ChatDoctor) or instruction data (e.g., Alpaca). Among the various fine-tuning methods, adapter-based parameter-efficient fine-tuning (PEFT) is undoubtedly one of the most…

    Submitted 9 October, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: EMNLP 2023. The code of our framework can be found at https://github.com/AGI-Edgerunners/LLM-Adapters. We will keep all of the code open-source and continue to update the framework with new adapters, LLMs, and tasks

  49. arXiv:2304.00824  [pdf, other]

    cs.CL

    Towards Integration of Discriminability and Robustness for Document-Level Relation Extraction

    Authors: Jia Guo, Stanley Kok, Lidong Bing

    Abstract: Document-level relation extraction (DocRE) predicts relations for entity pairs that rely on long-range context-dependent reasoning in a document. As a typical multi-label classification problem, DocRE faces the challenge of effectively distinguishing a small set of positive relations from the majority of negative ones. This challenge becomes even more difficult to overcome when there exists a sign…

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: EACL 2023 (Main conference, Long paper)

  50. arXiv:2212.10529  [pdf, other]

    cs.CL cs.AI cs.CY

    Evaluating Psychological Safety of Large Language Models

    Authors: Xingxuan Li, Yutong Li, Lin Qiu, Shafiq Joty, Lidong Bing

    Abstract: In this work, we designed unbiased prompts to systematically evaluate the psychological safety of large language models (LLMs). First, we tested five different LLMs by using two personality tests: Short Dark Triad (SD-3) and Big Five Inventory (BFI). All models scored higher than the human average on SD-3, suggesting a relatively darker personality pattern. Despite being instruction fine-tuned wit…

    Submitted 29 February, 2024; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Preprint. Under review