Skip to main content

Showing 1–4 of 4 results for author: Bates, L

  1. arXiv:2311.06649  [pdf, other

    cs.CL

    A Template Is All You Meme

    Authors: Luke Bates, Peter Ebert Christensen, Preslav Nakov, Iryna Gurevych

    Abstract: Memes are a modern form of communication and meme templates possess a base semantics that is customizable by whomever posts it on social media. Machine learning systems struggle with memes, which is likely due to such systems having insufficient context to understand memes, as there is more to memes than the obvious image and text. Here, to aid understanding of memes, we release a knowledge base o… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 9 pages, 11 supplemental pages, 6 Tables, 10 Figures

  2. arXiv:2304.12836  [pdf, other

    cs.CL

    Lessons Learned from a Citizen Science Project for Natural Language Processing

    Authors: Jan-Christoph Klie, Ji-Ung Lee, Kevin Stowe, Gözde Gül Şahin, Nafise Sadat Moosavi, Luke Bates, Dominic Petrak, Richard Eckart de Castilho, Iryna Gurevych

    Abstract: Many Natural Language Processing (NLP) systems use annotated corpora for training and evaluation. However, labeled data is often costly to obtain and scaling annotation projects is difficult, which is why annotation tasks are often outsourced to paid crowdworkers. Citizen Science is an alternative to crowdsourcing that is relatively unexplored in the context of NLP. To investigate whether and how… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted to EACL 2023. Code will be published on github: https://github.com/UKPLab/eacl2023-citizen-science-lessons-learned

  3. arXiv:2302.08957  [pdf, other

    cs.CL

    Like a Good Nearest Neighbor: Practical Content Moderation and Text Classification

    Authors: Luke Bates, Iryna Gurevych

    Abstract: Few-shot text classification systems have impressive capabilities but are infeasible to deploy and use reliably due to their dependence on prompting and billion-parameter language models. SetFit (Tunstall et al., 2022) is a recent, practical approach that fine-tunes a Sentence Transformer under a contrastive learning paradigm and achieves similar results to more unwieldy systems. Inexpensive text… ▽ More

    Submitted 29 January, 2024; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Accepted to EACL 2024

  4. arXiv:2209.11055  [pdf, other

    cs.CL

    Efficient Few-Shot Learning Without Prompts

    Authors: Lewis Tunstall, Nils Reimers, Unso Eun Seo Jo, Luke Bates, Daniel Korat, Moshe Wasserblat, Oren Pereg

    Abstract: Recent few-shot methods, such as parameter-efficient fine-tuning (PEFT) and pattern exploiting training (PET), have achieved impressive results in label-scarce settings. However, they are difficult to employ since they are subject to high variability from manually crafted prompts, and typically require billion-parameter language models to achieve high accuracy. To address these shortcomings, we pr… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.