Showing 1–50 of 65 results for author: Diab, M

  1. arXiv:2406.17660  [pdf, other]

    cs.LG

    Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients

    Authors: Aashiq Muhamed, Oscar Li, David Woodruff, Mona Diab, Virginia Smith

    Abstract: Large language model (LLM) training and finetuning are often bottlenecked by limited GPU memory. While existing projection-based optimization methods address this by projecting gradients into a lower-dimensional subspace to reduce optimizer state memory, they typically rely on dense projection matrices, which can introduce computational and memory overheads. In this work, we propose Grass (GRAdien…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2405.20253  [pdf, other]

    cs.CL

    Evaluating Large Language Model Biases in Persona-Steered Generation

    Authors: Andy Liu, Mona Diab, Daniel Fried

    Abstract: The task of persona-steered text generation requires large language models (LLMs) to generate text that reflects the distribution of views that an individual fitting a persona could have. People have multifaceted personas, but prior work on bias in LLM-generated opinions has only explored multiple-choice settings or one-dimensional personas. We define an incongruous persona as a persona with multi…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted to Findings of ACL 2024. Code and data available at https://github.com/andyjliu/persona-steered-generation-bias

  3. arXiv:2405.06258  [pdf, other]

    cs.CL

    Automatic Generation of Model and Data Cards: A Step Towards Responsible AI

    Authors: Jiarui Liu, Wenkai Li, Zhijing Jin, Mona Diab

    Abstract: In an era of model and data proliferation in machine learning/AI especially marked by the rapid advancement of open-sourced technologies, there arises a critical need for standardized consistent documentation. Our work addresses the information incompleteness in current human-generated model and data cards. We propose an automated generation approach using Large Language Models (LLMs). Our key con…

    Submitted 18 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: NAACL 2024 (Oral)

  4. arXiv:2405.01502  [pdf, other]

    cs.CL cs.AI cs.LG

    Analyzing the Role of Semantic Representations in the Era of Large Language Models

    Authors: Zhijing Jin, Yuen Chen, Fernando Gonzalez, Jiarui Liu, Jiayi Zhang, Julian Michael, Bernhard Schölkopf, Mona Diab

    Abstract: Traditionally, natural language processing (NLP) models often use a rich set of features created by linguistic expertise, such as semantic representations. However, in the era of large language models (LLMs), more and more tasks are turned into generic, end-to-end sequence generation problems. In this paper, we investigate the question: what is the role of semantic representations in the era of LL…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: NAACL 2024

  5. arXiv:2404.00756  [pdf, other]

    cs.AI cs.LG cs.LO cs.RO

    Recover: A Neuro-Symbolic Framework for Failure Detection and Recovery

    Authors: Cristina Cornelio, Mohammed Diab

    Abstract: Recognizing failures during task execution and implementing recovery procedures is challenging in robotics. Traditional approaches rely on the availability of extensive data or a tight set of constraints, while more recent approaches leverage large language models (LLMs) to verify task steps and replan accordingly. However, these methods often operate offline, necessitating scene resets and incurr…

    Submitted 31 March, 2024; originally announced April 2024.

  6. arXiv:2402.18424  [pdf, other]

    cs.CL cs.AI cs.LG

    Emotion Classification in Low and Moderate Resource Languages

    Authors: Shabnam Tafreshi, Shubham Vatsal, Mona Diab

    Abstract: It is important to be able to analyze the emotional state of people around the globe. There are 7100+ active languages spoken around the world and building emotion classification for each language is labor intensive. Particularly for low-resource and endangered languages, building emotion classification can be quite challenging. We present a cross-lingual emotion classifier, where we train an emot…

    Submitted 28 February, 2024; originally announced February 2024.

  7. arXiv:2402.13231  [pdf, other]

    cs.CL cs.CY

    Investigating Cultural Alignment of Large Language Models

    Authors: Badr AlKhamissi, Muhammad ElNokrashy, Mai AlKhamissi, Mona Diab

    Abstract: The intricate relationship between language and culture has long been a subject of exploration within the realm of linguistic anthropology. Large Language Models (LLMs), promoted as repositories of collective human knowledge, raise a pivotal question: do these models genuinely encapsulate the diverse knowledge adopted by different cultures? Our study reveals that these models demonstrate greater c…

    Submitted 6 July, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: ACL 2024 (Main)

  8. arXiv:2402.11710  [pdf, other]

    cs.CL

    A Note on Bias to Complete

    Authors: Jia Xu, Mona Diab

    Abstract: Minimizing social bias strengthens societal bonds, promoting shared understanding and better decision-making. We revisit the definition of bias by discovering new bias types (e.g., societal status) in dynamic environments and describe them relative to context, such as culture, region, time, and personal background. Our framework includes eight hypotheses about bias and a minimizing bias strategy f…

    Submitted 18 February, 2024; originally announced February 2024.

  9. arXiv:2306.05836  [pdf, other]

    cs.CL cs.AI cs.LG

    Can Large Language Models Infer Causation from Correlation?

    Authors: Zhijing Jin, Jiarui Liu, Zhiheng Lyu, Spencer Poff, Mrinmaya Sachan, Rada Mihalcea, Mona Diab, Bernhard Schölkopf

    Abstract: Causal inference is one of the hallmarks of human intelligence. While the field of CausalNLP has attracted much interest in the recent years, existing causal inference datasets in NLP primarily rely on discovering causality from empirical knowledge (e.g., commonsense knowledge). In this work, we propose the first benchmark dataset to test the pure causal inference skills of large language models (…

    Submitted 17 April, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: ICLR 2024

  10. OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models

    Authors: Badr AlKhamissi, Siddharth Verma, Ping Yu, Zhijing Jin, Asli Celikyilmaz, Mona Diab

    Abstract: In this paper, we conduct a thorough investigation into the reasoning capabilities of Large Language Models (LLMs), focusing specifically on the Open Pretrained Transformers (OPT) models as a representative of such models. Our study entails finetuning three different sizes of OPT on a carefully curated reasoning corpus, resulting in two sets of finetuned models: OPT-R, finetuned without explanatio…

    Submitted 24 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Proceedings of the 1st Workshop on Natural Language Reasoning and Structured Explanations (NLRSE) at ACL 2023

  11. arXiv:2212.08286  [pdf, other]

    cs.CL

    ALERT: Adapting Language Models to Reasoning Tasks

    Authors: Ping Yu, Tianlu Wang, Olga Golovneva, Badr AlKhamissi, Siddharth Verma, Zhijing Jin, Gargi Ghosh, Mona Diab, Asli Celikyilmaz

    Abstract: Current large language models can perform reasonably well on complex tasks that require step-by-step reasoning with few-shot learning. Are these models applying reasoning skills they have learnt during pre-training and reason outside of their training context, or are they simply memorizing their training corpus at finer granularity and have learnt to better understand their context? To tease apart…

    Submitted 7 July, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  12. arXiv:2210.07652  [pdf, other]

    cs.CL cs.AI

    Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values

    Authors: Yejin Bang, Tiezheng Yu, Andrea Madotto, Zhaojiang Lin, Mona Diab, Pascale Fung

    Abstract: Many NLP classification tasks, such as sexism/racism detection or toxicity detection, are based on human values. Yet, human values can vary under diverse cultural conditions. Therefore, we introduce a framework for value-aligned classification that performs prediction based on explicitly written human values in the command. Along with the task, we propose a practical approach that distills value-a…

    Submitted 14 October, 2022; originally announced October 2022.

  13. arXiv:2210.01734  [pdf, other]

    cs.CL cs.LG

    Text Characterization Toolkit

    Authors: Daniel Simig, Tianlu Wang, Verna Dankers, Peter Henderson, Khuyagbaatar Batsuren, Dieuwke Hupkes, Mona Diab

    Abstract: In NLP, models are usually evaluated by reporting single-number performance scores on a number of readily available benchmarks, without much deeper analysis. Here, we argue that - especially given the well-known fact that benchmarks often contain biases, artefacts, and spurious correlations - deeper results analysis should become the de-facto standard when presenting new models or benchmarks. We p…

    Submitted 4 October, 2022; originally announced October 2022.

  14. arXiv:2209.15168  [pdf, other]

    cs.CL cs.LG

    Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification

    Authors: Muhammad ElNokrashy, Badr AlKhamissi, Mona Diab

    Abstract: Language Models pretrained on large textual data have been shown to encode different types of knowledge simultaneously. Traditionally, only the features from the last layer are used when adapting to new tasks or data. We put forward that, when using or finetuning deep pretrained models, intermediate layer features that may be relevant to the downstream task are buried too deep to be used efficient…

    Submitted 7 May, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted Oral Presentation at LREC-COLING 2024; 10 pages, 9 figures

  15. arXiv:2205.12495  [pdf, other]

    cs.CL

    ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection

    Authors: Badr AlKhamissi, Faisal Ladhak, Srini Iyer, Ves Stoyanov, Zornitsa Kozareva, Xian Li, Pascale Fung, Lambert Mathias, Asli Celikyilmaz, Mona Diab

    Abstract: Hate speech detection is complex; it relies on commonsense reasoning, knowledge of stereotypes, and an understanding of social nuance that differs from one culture to the next. It is also difficult to collect a large-scale hate speech annotated dataset. In this work, we frame this problem as a few-shot learning task, and show significant gains with decomposing the task into its "constituent" parts…

    Submitted 20 May, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted at EMNLP 2022

    Journal ref: In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 2109-2120, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics

  16. arXiv:2205.12484  [pdf, other]

    cs.CL cs.AI

    GisPy: A Tool for Measuring Gist Inference Score in Text

    Authors: Pedram Hosseini, Christopher R. Wolfe, Mona Diab, David A. Broniatowski

    Abstract: Decision making theories such as Fuzzy-Trace Theory (FTT) suggest that individuals tend to rely on gist, or bottom-line meaning, in the text when making decisions. In this work, we delineate the process of developing GisPy, an open-source tool in Python for measuring the Gist Inference Score (GIS) in text. Evaluation of GisPy on documents in three benchmarks from the news and scientific text domai…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted to the 4th Workshop on Narrative Understanding @ NAACL 2022

  17. arXiv:2205.08533  [pdf, ps, other]

    cs.CL

    Consistent Human Evaluation of Machine Translation across Language Pairs

    Authors: Daniel Licht, Cynthia Gao, Janice Lam, Francisco Guzman, Mona Diab, Philipp Koehn

    Abstract: Obtaining meaningful quality scores for machine translation systems through human evaluation remains a challenge given the high variability between human evaluators, partly due to subjective expectations for translation quality for different language pairs. We propose a new metric called XSTS that is more focused on semantic equivalence and a cross-lingual calibration method that enables more cons…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 10 pages

  18. arXiv:2205.07960  [pdf, other]

    cs.CL

    Meta AI at Arabic Hate Speech 2022: MultiTask Learning with Self-Correction for Hate Speech Classification

    Authors: Badr AlKhamissi, Mona Diab

    Abstract: In this paper, we tackle the Arabic Fine-Grained Hate Speech Detection shared task and demonstrate significant improvements over reported baselines for its three subtasks. The tasks are to predict if a tweet contains (1) Offensive language; and whether it is considered (2) Hate Speech or not and if so, then predict the (3) Fine-Grained Hate Speech label from one of six categories. Our final soluti…

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: Accepted at the 5th Workshop on Open-Source Arabic Corpora and Processing Tools (OSACT5/LREC 2022)

  19. arXiv:2205.01068  [pdf, other]

    cs.CL cs.LG

    OPT: Open Pre-trained Transformer Language Models

    Authors: Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer

    Abstract: Large language models, which are often trained for hundreds of thousands of compute days, have shown remarkable capabilities for zero- and few-shot learning. Given their computational cost, these models are difficult to replicate without significant capital. For the few that are available through APIs, no access is granted to the full model weights, making them difficult to study. We present Open…

    Submitted 21 June, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

  20. arXiv:2204.06031  [pdf, other]

    cs.CL cs.AI

    A Review on Language Models as Knowledge Bases

    Authors: Badr AlKhamissi, Millicent Li, Asli Celikyilmaz, Mona Diab, Marjan Ghazvininejad

    Abstract: Recently, there has been a surge of interest in the NLP community on the use of pretrained Language Models (LMs) as Knowledge Bases (KBs). Researchers have shown that LMs trained on a sufficiently large (web) corpus will encode a significant amount of knowledge implicitly in its parameters. The resulting LM can be probed for different kinds of knowledge and thus acting as a KB. This has a major ad…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: Preprint

  21. arXiv:2203.09597  [pdf, other]

    cs.CL cs.CY

    Towards Responsible Natural Language Annotation for the Varieties of Arabic

    Authors: A. Stevie Bergman, Mona T. Diab

    Abstract: When building NLP models, there is a tendency to aim for broader coverage, often overlooking cultural and (socio)linguistic nuance. In this position paper, we make the case for care and attention to such nuances, particularly in dataset annotation, as well as the inclusion of cultural and linguistic expertise in the process. We present a playbook for responsible dataset creation for polyglossic, m…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: ACL 2022 Findings

  22. arXiv:2202.09625  [pdf, other]

    cs.CL

    CALCS 2021 Shared Task: Machine Translation for Code-Switched Data

    Authors: Shuguang Chen, Gustavo Aguilar, Anirudh Srinivasan, Mona Diab, Thamar Solorio

    Abstract: To date, efforts in the code-switching literature have focused for the most part on language identification, POS, NER, and syntactic parsing. In this paper, we address machine translation for code-switched social media data. We create a community shared task. We provide two modalities for participation: supervised and unsupervised. For the supervised setting, participants are challenged to transla…

    Submitted 19 February, 2022; originally announced February 2022.

  23. arXiv:2201.10430  [pdf, other]

    cs.CL

    A Quantitative and Qualitative Analysis of Schizophrenia Language

    Authors: Amal Alqahtani, Efsun Sarioglu Kay, Sardar Hamidian, Michael Compton, Mona Diab

    Abstract: Schizophrenia is one of the most disabling mental health conditions to live with. Approximately one percent of the population has schizophrenia which makes it fairly common, and it affects many people and their families. Patients with schizophrenia suffer different symptoms: formal thought disorder (FTD), delusions, and emotional flatness. In this paper, we quantitatively and qualitatively analyze…

    Submitted 25 January, 2022; originally announced January 2022.

  24. arXiv:2112.10684  [pdf, other]

    cs.CL cs.AI cs.LG

    Efficient Large Scale Language Modeling with Mixtures of Experts

    Authors: Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, Jingfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giri Anantharaman, Xian Li, Shuohui Chen, Halil Akin, Mandeep Baines, Louis Martin, Xing Zhou, Punit Singh Koura, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Mona Diab, Zornitsa Kozareva, Ves Stoyanov

    Abstract: Mixture of Experts layers (MoEs) enable efficient scaling of language models through conditional computation. This paper presents a detailed empirical study of how autoregressive MoE language models scale in comparison with dense models in a wide range of settings: in- and out-of-domain language modeling, zero- and few-shot priming, and full-shot fine-tuning. With the exception of fine-tuning, we…

    Submitted 26 October, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022

  25. arXiv:2112.10668  [pdf, other]

    cs.CL cs.AI

    Few-shot Learning with Multilingual Language Models

    Authors: Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona Diab, Veselin Stoyanov, Xian Li

    Abstract: Large-scale generative language models such as GPT-3 are competitive few-shot learners. While these models are known to be able to jointly represent many different languages, their training data is dominated by English, potentially limiting their cross-lingual generalization. In this work, we train multilingual generative language models on a corpus covering a diverse set of languages, and study t…

    Submitted 10 November, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: Accepted to EMNLP 2022; 34 pages

  26. arXiv:2112.08615  [pdf, other]

    cs.CL

    Knowledge-Augmented Language Models for Cause-Effect Relation Classification

    Authors: Pedram Hosseini, David A. Broniatowski, Mona Diab

    Abstract: Previous studies have shown the efficacy of knowledge augmentation methods in pretrained language models. However, these methods behave differently across domains and downstream tasks. In this work, we investigate the augmentation of pretrained language models with commonsense knowledge in the cause-effect relation classification and commonsense causal reasoning tasks. After automatically verbaliz…

    Submitted 1 June, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted to Commonsense Representation and Reasoning (CSRR) @ ACL 2022

  27. arXiv:2111.13654  [pdf, other]

    cs.CL cs.AI cs.LG

    Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs

    Authors: Peter Hase, Mona Diab, Asli Celikyilmaz, Xian Li, Zornitsa Kozareva, Veselin Stoyanov, Mohit Bansal, Srinivasan Iyer

    Abstract: Do language models have beliefs about the world? Dennett (1995) famously argues that even thermostats have beliefs, on the view that a belief is simply an informational state decoupled from any motivational state. In this paper, we discuss approaches to detecting when models have beliefs about the world, and we improve on methods for updating model beliefs to be more truthful, with a focus on meth…

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: 19 pages

  28. arXiv:2111.06474  [pdf, other]

    cs.CL

    AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization

    Authors: Alexander R. Fabbri, Xiaojian Wu, Srini Iyer, Haoran Li, Mona Diab

    Abstract: Community Question Answering (CQA) fora such as Stack Overflow and Yahoo! Answers contain a rich resource of answers to a wide range of community-based questions. Each question thread can receive a large number of answers with different perspectives. One goal of answer summarization is to produce a summary that reflects the range of answer perspectives. A major obstacle for this task is the absenc…

    Submitted 29 April, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: NAACL 2022; arXiv admin note: substantial text overlap with arXiv:2104.08536

  29. arXiv:2106.00934  [pdf, other]

    cs.CL

    Discrete Cosine Transform as Universal Sentence Encoder

    Authors: Nada Almarwani, Mona Diab

    Abstract: Modern sentence encoders are used to generate dense vector representations that capture the underlying linguistic characteristics for a sequence of words, including phrases, sentences, or paragraphs. These kinds of representations are ideal for training a classifier for an end task such as sentiment analysis, question answering and text classification. Different models have been proposed to effici…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: to be published in ACL-IJCNLP 2021

  30. arXiv:2106.00169  [pdf, other]

    cs.CL

    Gender Bias Amplification During Speed-Quality Optimization in Neural Machine Translation

    Authors: Adithya Renduchintala, Denise Diaz, Kenneth Heafield, Xian Li, Mona Diab

    Abstract: Is bias amplified when neural machine translation (NMT) models are optimized for speed and evaluated on generic test sets using BLEU? We investigate architectures and techniques commonly used to speed up decoding in Transformer-based models, such as greedy search, quantization, average attention networks (AANs) and shallow decoder models and show their effect on gendered noun translation. We const…

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: Accepted at ACL 2021

  31. arXiv:2105.15071  [pdf, other]

    cs.CL

    Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data

    Authors: Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona Diab

    Abstract: The scarcity of parallel data is a major obstacle for training high-quality machine translation systems for low-resource languages. Fortunately, some low-resource languages are linguistically related or similar to high-resource languages; these related languages may share many lexical or syntactic structures. In this work, we exploit this linguistic overlap to facilitate translating to and from a…

    Submitted 1 June, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: ACL 2021

  32. arXiv:2104.08536  [pdf, other]

    cs.CL

    Multi-Perspective Abstractive Answer Summarization

    Authors: Alexander R. Fabbri, Xiaojian Wu, Srini Iyer, Mona Diab

    Abstract: Community Question Answering (CQA) forums such as Stack Overflow and Yahoo! Answers contain a rich resource of answers to a wide range of questions. Each question thread can receive a large number of answers with different perspectives. The goal of multi-perspective answer summarization is to produce a summary that includes all perspectives of the answer. A major obstacle for multi-perspective, ab…

    Submitted 17 April, 2021; originally announced April 2021.

  33. arXiv:2103.13606  [pdf, other]

    cs.CL cs.AI

    Predicting Directionality in Causal Relations in Text

    Authors: Pedram Hosseini, David A. Broniatowski, Mona Diab

    Abstract: In this work, we test the performance of two bidirectional transformer-based language models, BERT and SpanBERT, on predicting directionality in causal pairs in the textual content. Our preliminary results show that predicting direction for inter-sentence and implicit causal relations is more challenging. And, SpanBERT performs better than BERT on causal samples with longer span length. We also in…

    Submitted 25 March, 2021; originally announced March 2021.

  34. arXiv:2101.10894  [pdf]

    cs.CV cs.CY

    White Paper: Challenges and Considerations for the Creation of a Large Labelled Repository of Online Videos with Questionable Content

    Authors: Thamar Solorio, Mahsa Shafaei, Christos Smailis, Mona Diab, Theodore Giannakopoulos, Heng Ji, Yang Liu, Rada Mihalcea, Smaranda Muresan, Ioannis Kakadiaris

    Abstract: This white paper presents a summary of the discussions regarding critical considerations to develop an extensive repository of online videos annotated with labels indicating questionable content. The main discussion points include: 1) the type of appropriate labels that will result in a valuable repository for the larger AI community; 2) how to design the collection and annotation process, as well…

    Submitted 25 January, 2021; originally announced January 2021.

  35. arXiv:2011.02593  [pdf, other]

    cs.CL cs.AI

    Detecting Hallucinated Content in Conditional Neural Sequence Generation

    Authors: Chunting Zhou, Graham Neubig, Jiatao Gu, Mona Diab, Paco Guzman, Luke Zettlemoyer, Marjan Ghazvininejad

    Abstract: Neural sequence models can generate highly fluent sentences, but recent studies have also shown that they are also prone to hallucinate additional content not supported by the input. These variety of fluent but wrong outputs are particularly problematic, as it will not be possible for users to tell they are being presented incorrect content. To detect these errors, we propose a task to predict whe…

    Submitted 2 June, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

    Comments: Accepted by ACL-Finding 2021

  36. arXiv:2006.04016  [pdf, other]

    cs.CL cs.LG

    A Multitask Learning Approach for Diacritic Restoration

    Authors: Sawsan Alqahtani, Ajay Mishra, Mona Diab

    Abstract: In many languages like Arabic, diacritics are used to specify pronunciations as well as meanings. Such diacritics are often omitted in written text, increasing the number of possible pronunciations and meanings for a word. This results in a more ambiguous text making computational processing on such text more difficult. Diacritic restoration is the task of restoring missing diacritics in the writt…

    Submitted 6 June, 2020; originally announced June 2020.

    Journal ref: ACL 2020

  37. FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization

    Authors: Esin Durmus, He He, Mona Diab

    Abstract: Neural abstractive summarization models are prone to generate content inconsistent with the source document, i.e. unfaithful. Existing automatic metrics do not capture such mistakes effectively. We tackle the problem of evaluating faithfulness of a generated summary given its source document. We first collected human annotations of faithfulness for outputs from numerous models on two datasets. We…

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: Accepted to ACL 2020

  38. arXiv:2004.14961  [pdf, other]

    cs.CL

    Mutlitask Learning for Cross-Lingual Transfer of Semantic Dependencies

    Authors: Maryam Aminian, Mohammad Sadegh Rasooli, Mona Diab

    Abstract: We describe a method for developing broad-coverage semantic dependency parsers for languages for which no semantically annotated resource is available. We leverage a multitask learning framework coupled with an annotation projection method. We transfer supervised semantic dependency parse annotations from a rich-resource language to a low-resource language through parallel data, and train a semant…

    Submitted 30 April, 2020; originally announced April 2020.

  39. arXiv:2004.12864  [pdf, other]

    cs.CL

    DeSePtion: Dual Sequence Prediction and Adversarial Examples for Improved Fact-Checking

    Authors: Christopher Hidey, Tuhin Chakrabarty, Tariq Alhindi, Siddharth Varia, Kriste Krstovski, Mona Diab, Smaranda Muresan

    Abstract: The increased focus on misinformation has spurred development of data and systems for detecting the veracity of a claim as well as retrieving authoritative evidence. The Fact Extraction and VERification (FEVER) dataset provides such a resource for evaluating end-to-end fact-checking, requiring retrieval of evidence from Wikipedia to validate a veracity prediction. We show that current systems for…

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: ACL 2020

  40. arXiv:2004.10793  [pdf, other]

    cs.CL

    Learning to Classify Intents and Slot Labels Given a Handful of Examples

    Authors: Jason Krone, Yi Zhang, Mona Diab

    Abstract: Intent classification (IC) and slot filling (SF) are core components in most goal-oriented dialogue systems. Current IC/SF models perform poorly when the number of training examples per class is small. We propose a new few-shot learning task, few-shot IC/SF, to study and improve the performance of IC and SF models on classes not seen at training time in ultra low resource scenarios. We establish a…

    Submitted 22 April, 2020; originally announced April 2020.

    Comments: 8 pages, 2 figures

  41. arXiv:2003.08529  [pdf, other]

    cs.CL

    Diversity, Density, and Homogeneity: Quantitative Characteristic Metrics for Text Collections

    Authors: Yi-An Lai, Xuan Zhu, Yi Zhang, Mona Diab

    Abstract: Summarizing data samples by quantitative measures has a long history, with descriptive statistics being a case in point. However, as natural language processing methods flourish, there are still insufficient characteristic metrics to describe a collection of texts in terms of the words, sentences, or paragraphs they comprise. In this work, we propose metrics of diversity, density, and homogeneity…

    Submitted 18 March, 2020; originally announced March 2020.

    Comments: Accepted by LREC 2020

  42. arXiv:1912.08926  [pdf]

    cs.SI cs.LG stat.ML

    Rumor Detection and Classification for Twitter Data

    Authors: Sardar Hamidian, Mona T Diab

    Abstract: With the pervasiveness of online media data as a source of information verifying the validity of this information is becoming even more important yet quite challenging. Rumors spread a large quantity of misinformation on microblogs. In this study we address two common issues within the context of microblog social media. First we detect rumors as a type of misinformation propagation and next we go…

    Submitted 24 November, 2019; originally announced December 2019.

  43. Efficient Convolutional Neural Networks for Diacritic Restoration

    Authors: Sawsan Alqahtani, Ajay Mishra, Mona Diab

    Abstract: Diacritic restoration has gained importance with the growing need for machines to understand written texts. The task is typically modeled as a sequence labeling problem and currently Bidirectional Long Short Term Memory (BiLSTM) models provide state-of-the-art results. Recently, Bai et al. (2018) show the advantages of Temporal Convolutional Neural Networks (TCN) over Recurrent Neural Networks (RN…

    Submitted 14 December, 2019; originally announced December 2019.

    Comments: accepted in EMNLP 2019

    Journal ref: EMNLP 2019

  44. Homograph Disambiguation Through Selective Diacritic Restoration

    Authors: Sawsan Alqahtani, Hanan Aldarmaki, Mona Diab

    Abstract: Lexical ambiguity, a challenging phenomenon in all natural languages, is particularly prevalent for languages with diacritics that tend to be omitted in writing, such as Arabic. Omitting diacritics leads to an increase in the number of homographs: different words with the same spelling. Diacritic restoration could theoretically help disambiguate these words, but in practice, the increase in overal…

    Submitted 9 December, 2019; originally announced December 2019.

    Comments: accepted in WANLP 2019

  45. Identifying Nuances in Fake News vs. Satire: Using Semantic and Linguistic Cues

    Authors: Or Levi, Pedram Hosseini, Mona Diab, David A. Broniatowski

    Abstract: The blurry line between nefarious fake news and protected-speech satire has been a notorious struggle for social media platforms. Further to the efforts of reducing exposure to misinformation on social media, purveyors of fake news have begun to masquerade as satire sites to avoid being demoted. In this work, we address the challenge of automatically classifying fake news versus satire. Previous w…

    Submitted 5 November, 2019; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted to the 2nd Workshop on NLP for Internet Freedom (NLP4IF): Censorship, Disinformation, and Propaganda. Co-located with EMNLP-IJCNLP 2019

  46. arXiv:1909.13016  [pdf, ps, other]

    cs.CL

    Overview for the Second Shared Task on Language Identification in Code-Switched Data

    Authors: Giovanni Molina, Fahad AlGhamdi, Mahmoud Ghoneim, Abdelati Hawwari, Nicolas Rey-Villamizar, Mona Diab, Thamar Solorio

    Abstract: We present an overview of the second shared task on language identification in code-switched data. For the shared task, we had code-switched data from two different language pairs: Modern Standard Arabic-Dialectal Arabic (MSA-DA) and Spanish-English (SPA-ENG). We had a total of nine participating teams, with all teams submitting a system for SPA-ENG and four submitting for MSA-DA. Through evaluati…

    Submitted 27 September, 2019; originally announced September 2019.

  47. arXiv:1909.13009  [pdf]

    cs.CL

    Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data

    Authors: Mona Diab, Mahmoud Ghoneim, Abdelati Hawwari, Fahad AlGhamdi, Nada AlMarwani, Mohamed Al-Badrashiny

    Abstract: We present our effort to create a large Multi-Layered representational repository of Linguistic Code-Switched Arabic data. The process involves developing clear annotation standards and Guidelines, streamlining the annotation process, and implementing quality control measures. We used two main protocols for annotation: in-lab gold annotations and crowd sourcing annotations. We developed a web-base…

    Submitted 27 September, 2019; originally announced September 2019.

  48. arXiv:1909.13008  [pdf, other]

    cs.CL

    WASA: A Web Application for Sequence Annotation

    Authors: Fahad AlGhamdi, Mona Diab

    Abstract: Data annotation is an important and necessary task for all NLP applications. Designing and implementing a web-based application that enables many annotators to annotate and enter their input into one central database is not a trivial task. These kinds of web-based applications require a consistent and robust backup for the underlying database and support to enhance the efficiency and speed of the…

    Submitted 27 September, 2019; originally announced September 2019.

    Journal ref: Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018, LREC

  49. Part of speech tagging for code switched data

    Authors: Fahad AlGhamdi, Giovanni Molina, Mona Diab, Thamar Solorio, Abdelati Hawwari, Victor Soto, Julia Hirschberg

    Abstract: We address the problem of Part of Speech tagging (POS) in the context of linguistic code switching (CS). CS is the phenomenon where a speaker switches between two languages or variants of the same language within or across utterances, known as intra-sentential or inter-sentential CS, respectively. Processing CS data is especially challenging in intra-sentential data given state of the art monoling…

    Submitted 3 November, 2019; v1 submitted 27 September, 2019; originally announced September 2019.

    Comments: Association for Computational Linguistics

  50. arXiv:1909.08705  [pdf, other]

    cs.CL cs.LG

    CASA-NLU: Context-Aware Self-Attentive Natural Language Understanding for Task-Oriented Chatbots

    Authors: Arshit Gupta, Peng Zhang, Garima Lalwani, Mona Diab

    Abstract: Natural Language Understanding (NLU) is a core component of dialog systems. It typically involves two tasks - intent classification (IC) and slot labeling (SL), which are then followed by a dialogue management (DM) component. Such NLU systems cater to utterances in isolation, thus pushing the problem of context management to DM. However, contextual information is critical to the correct prediction…

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: To appear at EMNLP 2019