Showing 1–48 of 48 results for author: Ritter, A

  1. arXiv:2407.04952  [pdf, other]

    cs.CL cs.CV

    Granular Privacy Control for Geolocation with Vision Language Models

    Authors: Ethan Mendes, Yang Chen, James Hays, Sauvik Das, Wei Xu, Alan Ritter

    Abstract: Vision Language Models (VLMs) are rapidly advancing in their capability to answer information-seeking questions. As these models are widely deployed in consumer applications, they could lead to new privacy risks due to emergent abilities to identify people in photos, geolocate images, etc. As we demonstrate, somewhat surprisingly, current open-source and proprietary VLMs are very capable image geo…

    Submitted 6 July, 2024; originally announced July 2024.

  2. arXiv:2406.12034  [pdf, other]

    cs.CL cs.LG

    Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts

    Authors: Junmo Kang, Leonid Karlinsky, Hongyin Luo, Zhen Wang, Jacob Hansen, James Glass, David Cox, Rameswar Panda, Rogerio Feris, Alan Ritter

    Abstract: We present Self-MoE, an approach that transforms a monolithic LLM into a compositional, modular system of self-specialized experts, named MiXSE (MiXture of Self-specialized Experts). Our approach leverages self-specialization, which constructs expert modules using self-generated synthetic data, each equipped with a shared base LLM and incorporating self-optimized routing. This allows for dynamic a…

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2402.12261  [pdf, other]

    cs.CL

    NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms

    Authors: Jonathan Zheng, Alan Ritter, Wei Xu

    Abstract: The performance of Large Language Models (LLMs) degrades from the temporal drift between data used for model training and newer text seen during inference. One understudied avenue of language change causing data drift is the emergence of neologisms -- new word forms -- over time. We create a diverse resource of recent English neologisms by using several popular collection methods. We analyze tempo…

    Submitted 5 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: accepted to ACL 2024 main conference, 9 pages

  4. arXiv:2402.03642  [pdf, other]

    cs.CL cs.CY cs.LG cs.SI

    Stanceosaurus 2.0: Classifying Stance Towards Russian and Spanish Misinformation

    Authors: Anton Lavrouk, Ian Ligon, Tarek Naous, Jonathan Zheng, Alan Ritter, Wei Xu

    Abstract: The Stanceosaurus corpus (Zheng et al., 2022) was designed to provide high-quality, annotated, 5-way stance data extracted from Twitter, suitable for analyzing cross-cultural and cross-lingual misinformation. In the Stanceosaurus 2.0 iteration, we extend this framework to encompass Russian and Spanish. The former is of current significance due to prevalent misinformation amid escalating tensions w…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: WNUT2024

  5. arXiv:2402.03131  [pdf, other]

    cs.CL cs.LG

    Constrained Decoding for Cross-lingual Label Projection

    Authors: Duong Minh Le, Yang Chen, Alan Ritter, Wei Xu

    Abstract: Zero-shot cross-lingual transfer utilizing multilingual LLMs has become a popular learning paradigm for low-resource languages with no labeled training data. However, for NLP tasks that involve fine-grained predictions on words and phrases, the performance of zero-shot cross-lingual transfer learning lags far behind supervised fine-tuning methods. Therefore, it is common to exploit translation and…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024

  6. arXiv:2311.17136  [pdf, other]

    cs.CV cs.AI cs.CL cs.IR

    UniIR: Training and Benchmarking Universal Multimodal Information Retrievers

    Authors: Cong Wei, Yang Chen, Haonan Chen, Hexiang Hu, Ge Zhang, Jie Fu, Alan Ritter, Wenhu Chen

    Abstract: Existing information retrieval (IR) models often assume a homogeneous format, limiting their applicability to diverse user needs, such as searching for images with text descriptions, searching for a news article with a headline image, or finding a similar photo with a query image. To approach such different information-seeking demands, we introduce UniIR, a unified instruction-guided multimodal re…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Our code and dataset are available on this project page: https://tiger-ai-lab.github.io/UniIR/

  7. arXiv:2311.09538  [pdf, other]

    cs.CL cs.HC

    Reducing Privacy Risks in Online Self-Disclosures with Language Models

    Authors: Yao Dou, Isadora Krsek, Tarek Naous, Anubha Kabra, Sauvik Das, Alan Ritter, Wei Xu

    Abstract: Self-disclosure, while being common and rewarding in social media interaction, also poses privacy risks. In this paper, we take the initiative to protect the user-side privacy associated with online self-disclosure through detection and abstraction. We develop a taxonomy of 19 self-disclosure categories and curate a large corpus consisting of 4.8K annotated disclosure spans. We then fine-tune a la…

    Submitted 23 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted at ACL 2024

  8. arXiv:2311.01011  [pdf, other]

    cs.LG cs.CR

    Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

    Authors: Sam Toyer, Olivia Watkins, Ethan Adrian Mendes, Justin Svegliato, Luke Bailey, Tiffany Wang, Isaac Ong, Karim Elmaaroufi, Pieter Abbeel, Trevor Darrell, Alan Ritter, Stuart Russell

    Abstract: While Large Language Models (LLMs) are increasingly being used in real-world applications, they remain vulnerable to prompt injection attacks: malicious third party prompts that subvert the intent of the system designer. To help researchers study this problem, we present a dataset of over 126,000 prompt injection attacks and 46,000 prompt-based "defenses" against prompt injection, all created by p…

    Submitted 2 November, 2023; originally announced November 2023.

  9. arXiv:2310.02224  [pdf, other]

    cs.CL

    Can Language Models be Instructed to Protect Personal Information?

    Authors: Yang Chen, Ethan Mendes, Sauvik Das, Wei Xu, Alan Ritter

    Abstract: Large multimodal language models have proven transformative in numerous applications. However, these models have been shown to memorize and leak pre-training data, raising serious user privacy and information security concerns. While data leaks should be prevented, it is also crucial to examine the trade-off between the privacy protection and model utility of proposed approaches. In this paper, we…

    Submitted 3 October, 2023; originally announced October 2023.

  10. arXiv:2310.00160  [pdf, other]

    cs.CL cs.AI

    Self-Specialization: Uncovering Latent Expertise within Large Language Models

    Authors: Junmo Kang, Hongyin Luo, Yada Zhu, Jacob Hansen, James Glass, David Cox, Alan Ritter, Rogerio Feris, Leonid Karlinsky

    Abstract: Recent works have demonstrated the effectiveness of self-alignment in which a large language model is aligned to follow general instructions using instructional data generated from the model itself starting from a handful of human-written seeds. Instead of general alignment, in this work, we focus on self-alignment for expert domain specialization (e.g., biomedicine, finance). As a preliminary, we…

    Submitted 5 June, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: ACL 2024 (Findings; Long Paper)

  11. arXiv:2305.17280  [pdf, other]

    cs.CL

    Improved Instruction Ordering in Recipe-Grounded Conversation

    Authors: Duong Minh Le, Ruohao Guo, Wei Xu, Alan Ritter

    Abstract: In this paper, we study the task of instructional dialogue and focus on the cooking domain. Analyzing the generated output of the GPT-J model, we reveal that the primary challenge for a recipe-grounded dialog system is how to provide the instructions in the correct order. We hypothesize that this is due to the model's lack of understanding of user intent and inability to track the instruction stat…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 main conference

  12. arXiv:2305.14592  [pdf, other]

    cs.CL cs.LG

    Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding

    Authors: Ruohao Guo, Wei Xu, Alan Ritter

    Abstract: Language style is often used by writers to convey their intentions, identities, and mastery of language. In this paper, we show that current large language models struggle to capture some language styles without fine-tuning. To address this challenge, we investigate whether LLMs can be meta-trained based on representative lexicons to recognize new styles they have not been fine-tuned on. Experimen…

    Submitted 5 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2024 main conference

  13. arXiv:2305.14489  [pdf, other]

    cs.CL

    Are Large Language Models Robust Coreference Resolvers?

    Authors: Nghia T. Le, Alan Ritter

    Abstract: Recent work on extending coreference resolution across domains and languages relies on annotated data in both the target domain and language. At the same time, pre-trained large language models (LMs) have been reported to exhibit strong zero- and few-shot learning abilities across a wide range of NLP tasks. However, prior work mostly studied this ability using artificial sentence-level datasets su…

    Submitted 14 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  14. arXiv:2305.14456  [pdf, other]

    cs.CL cs.AI cs.LG

    Having Beer after Prayer? Measuring Cultural Bias in Large Language Models

    Authors: Tarek Naous, Michael J. Ryan, Alan Ritter, Wei Xu

    Abstract: As the reach of large language models (LMs) expands globally, their ability to cater to diverse cultural contexts becomes crucial. Despite advancements in multilingual capabilities, models are not designed with appropriate cultural nuances. In this paper, we show that multilingual and Arabic monolingual LMs exhibit bias towards entities associated with Western culture. We introduce CAMeL, a novel…

    Submitted 20 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  15. arXiv:2305.14336  [pdf, other]

    cs.CL

    Schema-Driven Information Extraction from Heterogeneous Tables

    Authors: Fan Bai, Junmo Kang, Gabriel Stanovsky, Dayne Freitag, Alan Ritter

    Abstract: In this paper, we explore the question of whether large language models can support cost-efficient information extraction from tables. We introduce schema-driven information extraction, a new task that transforms tabular data into structured records following a human-authored schema. To assess various LLM's capabilities on this task, we present a benchmark comprised of tables from four diverse dom…

    Submitted 12 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  16. arXiv:2305.13582  [pdf, other]

    cs.CL

    Translation and Fusion Improves Zero-shot Cross-lingual Information Extraction

    Authors: Yang Chen, Vedaant Shah, Alan Ritter

    Abstract: Large language models (LLMs) combined with instruction tuning have shown significant progress in information extraction (IE) tasks, exhibiting strong generalization capabilities to unseen datasets by following annotation guidelines. However, their applicability to low-resource languages remains limited due to lack of both labeled data for fine-tuning, and unlabeled text for pre-training. In this p…

    Submitted 20 June, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

  17. arXiv:2305.01645  [pdf, other]

    cs.CL

    Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models

    Authors: Junmo Kang, Wei Xu, Alan Ritter

    Abstract: Fine-tuning large models is highly effective, however, inference can be expensive and produces carbon emissions. Knowledge distillation has been shown to be a practical solution to reduce inference costs, but the distillation process itself requires significant computational resources. Rather than buying or renting GPUs to fine-tune, then distill a large model, an NLP practitioner might instead ch…

    Submitted 5 July, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 main conference

  18. arXiv:2302.11713  [pdf, other]

    cs.CV cs.AI cs.CL

    Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?

    Authors: Yang Chen, Hexiang Hu, Yi Luan, Haitian Sun, Soravit Changpinyo, Alan Ritter, Ming-Wei Chang

    Abstract: Pre-trained vision and language models have demonstrated state-of-the-art capabilities over existing tasks involving images and texts, including visual question answering. However, it remains unclear whether these models possess the capability to answer questions that are not only querying visual content but knowledge-intensive and information-seeking. In this study, we introduce InfoSeek, a visua…

    Submitted 17 October, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: EMNLP 2023 (main conference); Our dataset and evaluation is available at https://open-vision-language.github.io/infoseek/

  19. arXiv:2212.09747  [pdf, other]

    cs.CL

    Do CoNLL-2003 Named Entity Taggers Still Work Well in 2023?

    Authors: Shuheng Liu, Alan Ritter

    Abstract: The CoNLL-2003 English named entity recognition (NER) dataset has been widely used to train and evaluate NER models for almost 20 years. However, it is unclear how well models that are trained on this 20-year-old data and developed over a period of decades using the same test set will perform when applied on modern data. In this paper, we evaluate the generalization of over 20 different models tra…

    Submitted 11 July, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

  20. arXiv:2212.09683  [pdf, other]

    cs.CL

    Human-in-the-loop Evaluation for Early Misinformation Detection: A Case Study of COVID-19 Treatments

    Authors: Ethan Mendes, Yang Chen, Wei Xu, Alan Ritter

    Abstract: We present a human-in-the-loop evaluation framework for fact-checking novel misinformation claims and identifying social media messages that support them. Our approach extracts check-worthy claims, which are aggregated and ranked for review. Stance classifiers are then used to identify tweets supporting novel misinformation claims, which are further reviewed to determine whether they violate relev…

    Submitted 3 July, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted to ACL 2023 (main conference)

  21. arXiv:2211.15613  [pdf, other]

    cs.CL cs.AI

    Frustratingly Easy Label Projection for Cross-lingual Transfer

    Authors: Yang Chen, Chao Jiang, Alan Ritter, Wei Xu

    Abstract: Translating training data into many languages has emerged as a practical solution for improving cross-lingual transfer. For tasks that involve span-level annotations, such as information extraction or question answering, an additional label projection step is required to map annotated spans onto the translated texts. Recently, a few efforts have utilized a simple mark-then-translate method to join…

    Submitted 15 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

  22. arXiv:2210.15954  [pdf, other]

    cs.CL

    Stanceosaurus: Classifying Stance Towards Multilingual Misinformation

    Authors: Jonathan Zheng, Ashutosh Baheti, Tarek Naous, Wei Xu, Alan Ritter

    Abstract: We present Stanceosaurus, a new corpus of 28,033 tweets in English, Hindi, and Arabic annotated with stance towards 251 misinformation claims. As far as we are aware, it is the largest corpus annotated with stance towards misinformation claims. The claims in Stanceosaurus originate from 15 fact-checking sources that cover diverse geographical regions and cultures. Unlike existing stance datasets,…

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022 main conference

  23. arXiv:2210.03690  [pdf, other]

    cs.CL cs.AI

    Few-Shot Anaphora Resolution in Scientific Protocols via Mixtures of In-Context Experts

    Authors: Nghia T. Le, Fan Bai, Alan Ritter

    Abstract: Anaphora resolution is an important task for information extraction across a range of languages, text genres, and domains, motivating the need for methods that do not require large annotated datasets. In-context learning has emerged as a promising approach, yet there are a number of challenges in applying in-context learning to resolve anaphora. For example, encoding a single in-context demonstrat…

    Submitted 14 November, 2022; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  24. arXiv:2208.07400  [pdf, other]

    cs.CL

    SynKB: Semantic Search for Synthetic Procedures

    Authors: Fan Bai, Alan Ritter, Peter Madrid, Dayne Freitag, John Niekrasz

    Abstract: In this paper we present SynKB, an open-source, automatically extracted knowledge base of chemical synthesis protocols. Similar to proprietary chemistry databases such as Reaxsys, SynKB allows chemists to retrieve structured knowledge about synthetic procedures. By taking advantage of recent advances in natural language processing for procedural texts, SynKB supports more flexible queries about re…

    Submitted 6 October, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

    Comments: Accepted to EMNLP 2022 Demo track

  25. arXiv:2109.04711  [pdf, other]

    cs.CL

    Pre-train or Annotate? Domain Adaptation with a Constrained Budget

    Authors: Fan Bai, Alan Ritter, Wei Xu

    Abstract: Recent work has demonstrated that pre-training in-domain language models can boost performance when adapting to a new domain. However, the costs associated with pre-training raise an important question: given a fixed budget, what steps should an NLP practitioner take to maximize performance? In this paper, we view domain adaptation with a constrained budget as a consumer choice problem, where the…

    Submitted 13 May, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021

  26. arXiv:2108.11830  [pdf, other]

    cs.CL

    Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts

    Authors: Ashutosh Baheti, Maarten Sap, Alan Ritter, Mark Riedl

    Abstract: Dialogue models trained on human conversations inadvertently learn to generate toxic responses. In addition to producing explicitly offensive utterances, these models can also implicitly insult a group or individual by aligning themselves with an offensive statement. To better understand the dynamics of contextually offensive language, we investigate the stance of dialogue model responses in offen…

    Submitted 13 September, 2021; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: Accepted at EMNLP 2021

  27. arXiv:2101.10244  [pdf, other]

    cs.CL

    Process-Level Representation of Scientific Protocols with Interactive Annotation

    Authors: Ronen Tamari, Fan Bai, Alan Ritter, Gabriel Stanovsky

    Abstract: We develop Process Execution Graphs (PEG), a document-level representation of real-world wet lab biochemistry protocols, addressing challenges such as cross-sentence relations, long-range coreference, grounding, and implicit arguments. We manually annotate PEGs in a corpus of complex lab protocols with a novel interactive textual simulator that keeps track of entity traits and semantic constraints…

    Submitted 14 April, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: EACL 2021 camera ready. Data, models and code at https://textlabs.github.io/

  28. arXiv:2010.14576  [pdf, other]

    cs.CL

    WNUT-2020 Task 1 Overview: Extracting Entities and Relations from Wet Lab Protocols

    Authors: Jeniya Tabassum, Sydney Lee, Wei Xu, Alan Ritter

    Abstract: This paper presents the results of the wet lab information extraction task at WNUT 2020. This task consisted of two sub tasks: (1) a Named Entity Recognition (NER) task with 13 participants and (2) a Relation Extraction (RE) task with 2 participants. We outline the task, data annotation process, corpus statistics, and provide a high-level overview of the participating systems for each sub task.

    Submitted 18 November, 2020; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: to appear in EMNLP 2020 (WNUT)

  29. arXiv:2010.06127  [pdf, other]

    cs.CL cs.LG

    Model Selection for Cross-Lingual Transfer

    Authors: Yang Chen, Alan Ritter

    Abstract: Transformers that are pre-trained on multilingual corpora, such as, mBERT and XLM-RoBERTa, have achieved impressive cross-lingual transfer capabilities. In the zero-shot transfer setting, only English training data is used, and the fine-tuned model is evaluated on another target language. While this works surprisingly well, substantial variance has been observed in target language performance betw…

    Submitted 9 September, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: EMNLP 2021

  30. arXiv:2006.07425  [pdf, other]

    cs.CL

    Measuring Forecasting Skill from Text

    Authors: Shi Zong, Alan Ritter, Eduard Hovy

    Abstract: People vary in their ability to make accurate predictions about the future. Prior studies have shown that some individuals can predict the outcome of future events with consistently better accuracy. This leads to a natural question: what makes some forecasters better than others? In this paper we explore connections between the language people use to describe their predictions and their forecastin…

    Submitted 16 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Accepted at ACL 2020

  31. arXiv:2006.02567  [pdf, other]

    cs.CL cs.SI

    Extracting a Knowledge Base of COVID-19 Events from Social Media

    Authors: Shi Zong, Ashutosh Baheti, Wei Xu, Alan Ritter

    Abstract: In this paper, we present a manually annotated corpus of 10,000 tweets containing public reports of five COVID-19 events, including positive and negative tests, deaths, denied access to testing, claimed cures and preventions. We designed slot-filling questions for each event type and annotated a total of 31 fine-grained slots, such as the location of events, recent travel, and close contacts. We s…

    Submitted 9 September, 2022; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Accepted at COLING 2022

  32. arXiv:2005.10464  [pdf, other]

    cs.CL

    Fluent Response Generation for Conversational Question Answering

    Authors: Ashutosh Baheti, Alan Ritter, Kevin Small

    Abstract: Question answering (QA) is an important aspect of open-domain conversational agents, garnering specific research focus in the conversational QA (ConvQA) subtask. One notable limitation of recent ConvQA efforts is the response being answer span extraction from the target corpus, thus ignoring the natural language generation (NLG) aspect of high-quality conversational agents. In this work, we propos…

    Submitted 16 December, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: 2020 Annual Conference of the Association for Computational Linguistics

  33. arXiv:2005.01634  [pdf, other]

    cs.CL

    Code and Named Entity Recognition in StackOverflow

    Authors: Jeniya Tabassum, Mounica Maddela, Wei Xu, Alan Ritter

    Abstract: There is an increasing interest in studying natural language and computer code together, as large corpora of programming texts become readily available on the Internet. For example, StackOverflow currently has over 15 million programming related questions written by 8.5 million users. Meanwhile, there is still a lack of fundamental NLP techniques for identifying code tokens or software-related nam…

    Submitted 15 November, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: updated with better results. (To appear in ACL 2020)

    Report number: Submission ID: 3161779

  34. arXiv:2004.14519  [pdf, other]

    cs.CL

    An Empirical Study of Pre-trained Transformers for Arabic Information Extraction

    Authors: Wuwei Lan, Yang Chen, Wei Xu, Alan Ritter

    Abstract: Multilingual pre-trained Transformers, such as mBERT (Devlin et al., 2019) and XLM-RoBERTa (Conneau et al., 2020a), have been shown to enable the effective cross-lingual zero-shot transfer. However, their performance on Arabic information extraction (IE) tasks is not very well studied. In this paper, we pre-train a customized bilingual BERT, dubbed GigaBERT, that is designed specifically for Arabi…

    Submitted 7 November, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: 8 pages, EMNLP 2020

  35. arXiv:1912.06806  [pdf, other]

    cs.CL cs.IR cs.LG

    SemEval-2013 Task 2: Sentiment Analysis in Twitter

    Authors: Preslav Nakov, Zornitsa Kozareva, Alan Ritter, Sara Rosenthal, Veselin Stoyanov, Theresa Wilson

    Abstract: In recent years, sentiment analysis in social media has attracted a lot of research interest and has been used for a number of applications. Unfortunately, research has been hindered by the lack of suitable datasets, complicating the comparison between approaches. To address this issue, we have proposed SemEval-2013 Task 2: Sentiment Analysis in Twitter, which included two subtasks: A, an expressi…

    Submitted 14 December, 2019; originally announced December 2019.

    Comments: Sentiment analysis, microblog sentiment analysis, Twitter opinion mining, SMS

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: SemEval-2013

  36. arXiv:1912.02990  [pdf, ps, other]

    cs.CL cs.IR cs.LG cs.SI

    SemEval-2014 Task 9: Sentiment Analysis in Twitter

    Authors: Sara Rosenthal, Preslav Nakov, Alan Ritter, Veselin Stoyanov

    Abstract: We describe the Sentiment Analysis in Twitter task, ran as part of SemEval-2014. It is a continuation of the last year's task that ran successfully as part of SemEval-2013. As in 2013, this was the most popular SemEval task; a total of 46 teams contributed 27 submissions for subtask A (21 teams) and 50 submissions for subtask B (44 teams). This year, we introduced three new test sets: (i) regular…

    Submitted 6 December, 2019; originally announced December 2019.

    Comments: Sentiment analysis, microblog sentiment analysis, Twitter opinion mining, sarcasm, LiveJournal, SMS

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: SemEval-2014

  37. arXiv:1912.02387  [pdf, other]

    cs.CL cs.IR cs.LG

    SemEval-2015 Task 10: Sentiment Analysis in Twitter

    Authors: Sara Rosenthal, Saif M Mohammad, Preslav Nakov, Alan Ritter, Svetlana Kiritchenko, Veselin Stoyanov

    Abstract: In this paper, we describe the 2015 iteration of the SemEval shared task on Sentiment Analysis in Twitter. This was the most popular sentiment analysis shared task to date with more than 40 teams participating in each of the last three years. This year's shared task competition consisted of five sentiment prediction subtasks. Two were reruns from previous years: (A) sentiment expressed by a phrase…

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: Sentiment analysis, sentiment towards a topic, quantification, microblog sentiment analysis; Twitter opinion mining

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: SemEval-2015

  38. arXiv:1912.01973  [pdf, other]

    cs.CL cs.IR

    SemEval-2016 Task 4: Sentiment Analysis in Twitter

    Authors: Preslav Nakov, Alan Ritter, Sara Rosenthal, Fabrizio Sebastiani, Veselin Stoyanov

    Abstract: This paper discusses the fourth year of the ``Sentiment Analysis in Twitter Task''. SemEval-2016 Task 4 comprises five subtasks, three of which represent a significant departure from previous editions. The first two subtasks are reruns from prior years and ask to predict the overall sentiment, and the sentiment towards a topic in a tweet. The three new subtasks focus on two variants of the basic `…

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: Sentiment analysis, sentiment towards a topic, quantification, microblog sentiment analysis; Twitter opinion mining. arXiv admin note: text overlap with arXiv:1912.00741

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: Final version published in the Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval 2016), San Diego, US, 2016, pp. 1-18

  39. arXiv:1904.00118  [pdf, other]

    cs.CL

    Structured Minimally Supervised Learning for Neural Relation Extraction

    Authors: Fan Bai, Alan Ritter

    Abstract: We present an approach to minimally supervised relation extraction that combines the benefits of learned representations and structured learning, and accurately predicts sentence-level relation mentions given only proposition-level supervision from a KB. By explicitly reasoning about missing data during learning, our approach enables large-scale training of 1D convolutional neural networks while m…

    Submitted 18 November, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

    Comments: Accepted to NAACL 2019. This version improves the model description (present original "Bag-Size Adaptive Learning Rate" as "Bag-Size Weighting Function"). No result/conclusion change

  40. arXiv:1902.10680  [pdf, other]

    cs.CL cs.CR

    Analyzing the Perceived Severity of Cybersecurity Threats Reported on Social Media

    Authors: Shi Zong, Alan Ritter, Graham Mueller, Evan Wright

    Abstract: Breaking cybersecurity events are shared across a range of websites, including security blogs (FireEye, Kaspersky, etc.), in addition to social media platforms such as Facebook and Twitter. In this paper, we investigate methods to analyze the severity of cybersecurity threats based on the language that is used to describe them online. A corpus of 6,000 tweets describing software vulnerabilities is…

    Submitted 3 May, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: Accepted at NAACL 2019

  41. arXiv:1809.01215  [pdf, other]

    cs.CL

    Generating More Interesting Responses in Neural Conversation Models with Distributional Constraints

    Authors: Ashutosh Baheti, Alan Ritter, Jiwei Li, Bill Dolan

    Abstract: Neural conversation models tend to generate safe, generic responses for most inputs. This is due to the limitations of likelihood-based decoding objectives in generation tasks with diverse outputs, such as conversation. To address this challenge, we propose a simple yet effective approach for incorporating side information in the form of distributional constraints over the generated responses. We…

    Submitted 4 September, 2018; originally announced September 2018.

  42. arXiv:1805.00195  [pdf, other]

    cs.CL cs.AI

    An Annotated Corpus for Machine Reading of Instructions in Wet Lab Protocols

    Authors: Chaitanya Kulkarni, Wei Xu, Alan Ritter, Raghu Machiraju

    Abstract: We describe an effort to annotate a corpus of natural language instructions consisting of 622 wet lab protocols to facilitate automatic or semi-automatic conversion of protocols into a machine-readable format and benefit biological research. Experimental results demonstrate the utility of our corpus for developing machine learning approaches to shallow semantic parsing of instructional texts. We m…

    Submitted 1 May, 2018; originally announced May 2018.

  43. arXiv:1707.07212  [pdf, other]

    cs.CL

    "i have a feeling trump will win..................": Forecasting Winners and Losers from User Predictions on Twitter

    Authors: Sandesh Swamy, Alan Ritter, Marie-Catherine de Marneffe

    Abstract: Social media users often make explicit predictions about upcoming events. Such statements vary in the degree of certainty the author expresses toward the outcome: "Leonardo DiCaprio will win Best Actor" vs. "Leonardo DiCaprio may win" or "No way Leonardo wins!". Can popular beliefs on social media predict who will win? To answer this question, we build a corpus of tweets annotated for veridicality…

    Submitted 31 August, 2017; v1 submitted 22 July, 2017; originally announced July 2017.

    Comments: Accepted at EMNLP 2017 (long paper)

  44. arXiv:1701.06547  [pdf, ps, other]

    cs.CL

    Adversarial Learning for Neural Dialogue Generation

    Authors: Jiwei Li, Will Monroe, Tianlin Shi, Sébastien Jean, Alan Ritter, Dan Jurafsky

    Abstract: In this paper, drawing intuition from the Turing test, we propose using adversarial training for open-domain dialogue generation: the system is trained to produce sequences that are indistinguishable from human-generated dialogue utterances. We cast the task as a reinforcement learning (RL) problem where we jointly train two systems, a generative model to produce response sequences, and a discrimi…

    Submitted 23 September, 2017; v1 submitted 23 January, 2017; originally announced January 2017.

  45. arXiv:1608.02904  [pdf, other]

    cs.IR cs.CL

    TweeTIME: A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter

    Authors: Jeniya Tabassum, Alan Ritter, Wei Xu

    Abstract: We describe TweeTIME, a temporal tagger for recognizing and normalizing time expressions in Twitter. Most previous work in social media analysis has to rely on temporal resolvers that are designed for well-edited text, and therefore suffer from the reduced performance due to domain mismatch. We present a minimally supervised method that learns from large quantities of unlabeled data and requires n…

    Submitted 15 November, 2020; v1 submitted 9 August, 2016; originally announced August 2016.

    Comments: EMNLP 2016

  46. arXiv:1606.01541  [pdf, other]

    cs.CL

    Deep Reinforcement Learning for Dialogue Generation

    Authors: Jiwei Li, Will Monroe, Alan Ritter, Michel Galley, Jianfeng Gao, Dan Jurafsky

    Abstract: Recent neural models of dialogue generation offer great promise for generating responses for conversational agents, but tend to be shortsighted, predicting utterances one at a time while ignoring their influence on future outcomes. Modeling the future direction of a dialogue is crucial to generating coherent, interesting dialogues, a need which led traditional NLP models of dialogue to draw on rei…

    Submitted 29 September, 2016; v1 submitted 5 June, 2016; originally announced June 2016.

  47. arXiv:1510.05198  [pdf, other]

    cs.SI cs.CL

    Learning multi-faceted representations of individuals from heterogeneous evidence using neural networks

    Authors: Jiwei Li, Alan Ritter, Dan Jurafsky

    Abstract: Inferring latent attributes of people online is an important social computing task, but requires integrating the many heterogeneous sources of information available on the web. We propose learning individual representations of people using neural nets to integrate rich linguistic and network evidence gathered from social media. The algorithm is able to combine diverse cues, such as the text a pers…

    Submitted 11 May, 2017; v1 submitted 18 October, 2015; originally announced October 2015.

  48. arXiv:1411.2679  [pdf, other]

    cs.SI cs.AI cs.CL cs.LG

    Inferring User Preferences by Probabilistic Logical Reasoning over Social Networks

    Authors: Jiwei Li, Alan Ritter, Dan Jurafsky

    Abstract: We propose a framework for inferring the latent attitudes or preferences of users by performing probabilistic first-order logical reasoning over the social network graph. Our method answers questions about Twitter users like "Does this user like sushi?" or "Is this user a New York Knicks fan?" by building a probabilistic model that reasons over user attributes (the user's location or gende…

    Submitted 10 November, 2014; originally announced November 2014.