Showing 1–24 of 24 results for author: Gangal, V

  1. arXiv:2312.06881  [pdf, other]

    cs.LG cs.CL

    DYAD: A Descriptive Yet Abjuring Density efficient approximation to linear neural network layers

    Authors: Sarin Chandy, Varun Gangal, Yi Yang, Gabriel Maggiotti

    Abstract: We devise, implement and performance-assess DYAD, a layer which can serve as a faster and more memory-efficient approximate replacement for linear layers (nn.Linear() in PyTorch). These layers appear in common subcomponents, such as in the ff module of Transformers. DYAD is based on a bespoke near-sparse matrix structure which approximates the dense "weight" matrix W that matrix-multiplies the inp…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted at WANT workshop at NeurIPS 2023; code at https://github.com/asappresearch/dyad

  2. arXiv:2303.02182  [pdf, other]

    cs.LG cs.AI

    CoRL: Environment Creation and Management Focused on System Integration

    Authors: Justin D. Merrick, Benjamin K. Heiner, Cameron Long, Brian Stieber, Steve Fierro, Vardaan Gangal, Madison Blake, Joshua Blackburn

    Abstract: Existing reinforcement learning environment libraries use monolithic environment classes, provide shallow methods for altering agent observation and action spaces, and/or are tied to a specific simulation environment. The Core Reinforcement Learning library (CoRL) is a modular, composable, and hyper-configurable environment creation tool. It allows minute control over agent observations, rewards,…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: for code, see https://github.com/act3-ace/CoRL

  3. arXiv:2210.12846  [pdf, other]

    cs.CL

    EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation

    Authors: Sedrick Scott Keh, Rohit K. Bharadwaj, Emmy Liu, Simone Tedeschi, Varun Gangal, Roberto Navigli

    Abstract: We introduce EUREKA, an ensemble-based approach for performing automatic euphemism detection. We (1) identify and correct potentially mislabelled rows in the dataset, (2) curate an expanded corpus called EuphAug, (3) leverage model representations of Potentially Euphemistic Terms (PETs), and (4) explore using representations of semantically close sentences to aid in classification. Using our augme…

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022 Figurative Language Workshop; first place for Euphemism Detection Shared Task. Code at https://github.com/sedrickkeh/EUREKA

  4. arXiv:2209.07752  [pdf, other]

    cs.CL cs.AI cs.LG

    PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation

    Authors: Sedrick Scott Keh, Kevin Lu, Varun Gangal, Steven Y. Feng, Harsh Jhamtani, Malihe Alikhani, Eduard Hovy

    Abstract: A personification is a figure of speech that endows inanimate entities with properties and actions typically seen as requiring animacy. In this paper, we explore the task of personification generation. To this end, we propose PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation. We curate a corpus of personifications called Personif…

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022; official Github repo at https://github.com/sedrickkeh/PINEAPPLE

  5. arXiv:2209.06275  [pdf, other]

    cs.CL cs.AI cs.LG

    PANCETTA: Phoneme Aware Neural Completion to Elicit Tongue Twisters Automatically

    Authors: Sedrick Scott Keh, Steven Y. Feng, Varun Gangal, Malihe Alikhani, Eduard Hovy

    Abstract: Tongue twisters are meaningful sentences that are difficult to pronounce. The process of automatically generating tongue twisters is challenging since the generated utterance must satisfy two conditions at once: phonetic difficulty and semantic meaning. Furthermore, phonetic difficulty is itself hard to characterize and is expressed in natural tongue twisters through a heterogeneous mix of phenome…

    Submitted 14 February, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: EACL 2023. Code at https://github.com/sedrickkeh/PANCETTA

  6. arXiv:2112.02721  [pdf, other]

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo, et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split…

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  7. arXiv:2110.00687  [pdf, other]

    cs.CL

    Investigating Robustness of Dialog Models to Popular Figurative Language Constructs

    Authors: Harsh Jhamtani, Varun Gangal, Eduard Hovy, Taylor Berg-Kirkpatrick

    Abstract: Humans often employ figurative language in communication, including during interactions with dialog systems. Thus, it is important for real-world dialog systems to be able to handle popular figurative language constructs like metaphor and simile. In this work, we analyze the performance of existing dialog models in situations where the input dialog context exhibits use of figurative language.…

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021 Short Paper

  8. arXiv:2109.10856  [pdf, other]

    cs.CL cs.LG

    Coarse2Fine: Fine-grained Text Classification on Coarsely-grained Annotated Data

    Authors: Dheeraj Mekala, Varun Gangal, Jingbo Shang

    Abstract: Existing text classification methods mainly focus on a fixed label set, whereas many real-world applications require extending to new fine-grained classes as the number of samples per label increases. To accommodate such requirements, we introduce a new problem called coarse-to-fine grained classification, which aims to perform fine-grained classification on coarsely annotated data. Instead of ask…

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Accepted to appear in EMNLP 2021

  9. arXiv:2109.03892  [pdf, other]

    cs.CL cs.AI cs.LG

    Retrieve, Caption, Generate: Visual Grounding for Enhancing Commonsense in Text Generation Models

    Authors: Steven Y. Feng, Kevin Lu, Zhuofu Tao, Malihe Alikhani, Teruko Mitamura, Eduard Hovy, Varun Gangal

    Abstract: We investigate the use of multimodal information contained in images as an effective method for enhancing the commonsense of Transformer models for text generation. We perform experiments using BART and T5 on concept-to-text generation, specifically the task of generative commonsense reasoning, or CommonGen. We call our approach VisCTG: Visually Grounded Concept-to-Text Generation. VisCTG involves…

    Submitted 25 March, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to AAAI 2022. Code at https://github.com/styfeng/VisCTG

  10. arXiv:2108.06643  [pdf, other]

    cs.CL cs.AI cs.LG

    SAPPHIRE: Approaches for Enhanced Concept-to-Text Generation

    Authors: Steven Y. Feng, Jessica Huynh, Chaitanya Narisetty, Eduard Hovy, Varun Gangal

    Abstract: We motivate and propose a suite of simple but effective improvements for concept-to-text generation called SAPPHIRE: Set Augmentation and Post-hoc PHrase Infilling and REcombination. We demonstrate their effectiveness on generative commonsense reasoning, a.k.a. the CommonGen task, through experiments using both BART and T5 models. Through extensive automatic and human evaluation, we show that SAPP…

    Submitted 1 December, 2021; v1 submitted 14 August, 2021; originally announced August 2021.

    Comments: INLG 2021 [Best Long Paper]. Code available at https://github.com/styfeng/SAPPHIRE

  11. arXiv:2106.09069  [pdf, other]

    cs.CL cs.LG

    Automatic Construction of Evaluation Suites for Natural Language Generation Datasets

    Authors: Simon Mille, Kaustubh D. Dhole, Saad Mahamood, Laura Perez-Beltrachini, Varun Gangal, Mihir Kale, Emiel van Miltenburg, Sebastian Gehrmann

    Abstract: Machine learning approaches applied to NLP are often evaluated by summarizing their performance in a single number, for example accuracy. Since most test sets are constructed as an i.i.d. sample from the overall data, this approach overly simplifies the complexity of language and encourages overfitting to the head of the data distribution. As such, rare language phenomena or text about underrepres…

    Submitted 16 June, 2021; originally announced June 2021.

  12. arXiv:2106.02833  [pdf, other]

    cs.CL

    Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation

    Authors: Varun Gangal, Harsh Jhamtani, Eduard Hovy, Taylor Berg-Kirkpatrick

    Abstract: Multiple different responses are often plausible for a given open domain dialog context. Prior work has shown the importance of having multiple valid reference responses for meaningful and robust automated evaluations. In such cases, common practice has been to collect more human written references. However, such collection can be expensive, time consuming, and not easily scalable. Instead, we pro…

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: Findings of ACL 2021

  13. arXiv:2105.03075  [pdf, other]

    cs.CL cs.AI cs.LG

    A Survey of Data Augmentation Approaches for NLP

    Authors: Steven Y. Feng, Varun Gangal, Jason Wei, Sarath Chandar, Soroush Vosoughi, Teruko Mitamura, Eduard Hovy

    Abstract: Data augmentation has recently seen increased interest in NLP due to more work in low-resource domains, new tasks, and the popularity of large-scale neural networks that require large amounts of training data. Despite this recent upsurge, this area is still relatively underexplored, perhaps due to the challenges posed by the discrete nature of language data. In this paper, we present a comprehensi…

    Submitted 1 December, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted to ACL 2021 Findings. GitHub repo with paper list at https://github.com/styfeng/DataAug4NLP ; Talk at https://www.youtube.com/watch?v=kNBVesKUZCk&ab_channel=StevenFeng ; Podcast at https://www.youtube.com/watch?v=qmqyT_97Poc&ab_channel=GradientFlow and https://thedataexchange.media/data-augmentation-in-natural-language-processing

  14. arXiv:2104.06669  [pdf, other]

    cs.CL cs.AI

    NAREOR: The Narrative Reordering Problem

    Authors: Varun Gangal, Steven Y. Feng, Malihe Alikhani, Teruko Mitamura, Eduard Hovy

    Abstract: Many implicit inferences exist in text depending on how it is structured that can critically impact the text's interpretation and meaning. One such structural aspect present in text with chronology is the order of its presentation. For narratives or stories, this is known as the narrative order. Reordering a narrative can impact the temporal, causal, event-based, and other inferences readers draw…

    Submitted 27 March, 2022; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted to AAAI 2022; Code at https://github.com/vgtomahawk/NAREORCamReady

  15. arXiv:2102.01672  [pdf, other]

    cs.CL cs.AI cs.LG

    The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

    Authors: Sebastian Gehrmann, Tosin Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Khyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh D. Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Mihir Kale, Dhruv Kumar, Faisal Ladhak, et al. (31 additional authors not shown)

    Abstract: We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it…

    Submitted 1 April, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

  16. arXiv:2010.04098  [pdf, other]

    cs.CL cs.AI

    BERTering RAMS: What and How Much does BERT Already Know About Event Arguments? -- A Study on the RAMS Dataset

    Authors: Varun Gangal, Eduard Hovy

    Abstract: Using the attention map based probing framework from (Clark et al., 2019), we observe that, on the RAMS dataset (Ebner et al., 2020), BERT's attention heads have modest but well above-chance ability to spot event arguments sans any training or domain finetuning, varying from a low of 17.77% for Place to a high of 51.61% for Artifact. Next, we find that linear combinations of these heads, estimat…

    Submitted 9 October, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: Accepted for the BlackBoxNLP 2020 Workshop @EMNLP 2020; Pre-camera-ready copy

  17. arXiv:2010.01794  [pdf, other]

    cs.CL cs.AI cs.LG

    GenAug: Data Augmentation for Finetuning Text Generators

    Authors: Steven Y. Feng, Varun Gangal, Dongyeop Kang, Teruko Mitamura, Eduard Hovy

    Abstract: In this paper, we investigate data augmentation for text generation, which we call GenAug. Text generation and language modeling are important tasks within natural language processing, and are especially challenging for low-data regimes. We propose and evaluate various augmentation methods, including some that incorporate external knowledge, for finetuning GPT-2 on a subset of Yelp Reviews. We als…

    Submitted 10 October, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020 Deep Learning Inside Out (DeeLIO) Workshop; Code available at https://github.com/styfeng/GenAug

  18. arXiv:2004.12934  [pdf, other]

    cs.CL

    SCDE: Sentence Cloze Dataset with High Quality Distractors From Examinations

    Authors: Xiang Kong, Varun Gangal, Eduard Hovy

    Abstract: We introduce SCDE, a dataset to evaluate the performance of computational models through sentence prediction. SCDE is a human-created sentence cloze dataset, collected from public school English examinations. Our task requires a model to fill up multiple blanks in a passage from a shared candidate set with distractors designed by English teachers. Experimental results demonstrate that this task re…

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: ACL 2020

  19. arXiv:1912.12800  [pdf, other]

    cs.CL cs.AI

    Likelihood Ratios and Generative Classifiers for Unsupervised Out-of-Domain Detection In Task Oriented Dialog

    Authors: Varun Gangal, Abhinav Arora, Arash Einolghozati, Sonal Gupta

    Abstract: The task of identifying out-of-domain (OOD) input examples directly at test-time has seen renewed interest recently due to increased real world deployment of models. In this work, we focus on OOD detection for natural language sentence inputs to task-based dialog systems. Our findings are three-fold: First, we curate and release ROSTD (Real Out-of-Domain Sentences From Task-oriented Dialog) - a da…

    Submitted 29 December, 2019; originally announced December 2019.

    Comments: Accepted for AAAI-2020 Main Track

  20. arXiv:1909.00098  [pdf, other]

    cs.CL

    (Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Annotated Stylistic Language Dataset with Multiple Personas

    Authors: Dongyeop Kang, Varun Gangal, Eduard Hovy

    Abstract: Stylistic variation in text needs to be studied with different aspects including the writer's personal traits, interpersonal relations, rhetoric, and more. Despite recent attempts on computational modeling of the variation, the lack of parallel corpora of style language makes it difficult to systematically control the stylistic change as well as evaluate such models. We release PASTEL, the paralle…

    Submitted 30 August, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  21. arXiv:1707.08852  [pdf, other]

    cs.CL cs.AI cs.LG

    Detecting and Explaining Causes From Text For a Time Series Event

    Authors: Dongyeop Kang, Varun Gangal, Ang Lu, Zheng Chen, Eduard Hovy

    Abstract: Explaining underlying causes or effects about events is a challenging but valuable task. We define a novel problem of generating explanations of a time series event by (1) searching cause and effect relationships of the time series with textual data and (2) constructing a connecting chain between them to generate an explanation. To detect causal features from text, we propose a novel method based…

    Submitted 27 July, 2017; originally announced July 2017.

    Comments: Accepted at EMNLP 2017

  22. arXiv:1707.01176  [pdf, other]

    cs.CL

    CharManteau: Character Embedding Models For Portmanteau Creation

    Authors: Varun Gangal, Harsh Jhamtani, Graham Neubig, Eduard Hovy, Eric Nyberg

    Abstract: Portmanteaus are a word formation phenomenon where two words are combined to form a new word. We propose character-level neural sequence-to-sequence (S2S) methods for the task of portmanteau generation that are end-to-end-trainable, language independent, and do not explicitly use additional phonetic information. We propose a noisy-channel-style model, which allows for the incorporation of unsuperv…

    Submitted 24 July, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

    Comments: Accepted for publication in EMNLP 2017

  23. arXiv:1707.01161  [pdf, other]

    cs.CL

    Shakespearizing Modern Language Using Copy-Enriched Sequence-to-Sequence Models

    Authors: Harsh Jhamtani, Varun Gangal, Eduard Hovy, Eric Nyberg

    Abstract: Variations in writing styles are commonly used to adapt the content to a specific context, audience, or purpose. However, applying stylistic variations is still by and large a manual process, and there has been little effort towards automating it. In this paper we explore automated methods to transform text from modern English to Shakespearean English using an end to end trainable neural model w…

    Submitted 20 July, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

    Comments: Accepted at EMNLP 2017 Workshop on Stylistic Variation

  24. arXiv:1606.05065  [pdf, other]

    cs.SI physics.soc-ph

    HEMI: Hyperedge Majority Influence Maximization

    Authors: Varun Gangal, Balaraman Ravindran, Ramasuri Narayanam

    Abstract: In this work, we consider the problem of influence maximization on a hypergraph. We first extend the Independent Cascade (IC) model to hypergraphs, and prove that the traditional influence maximization problem remains submodular. We then present a variant of the influence maximization problem (HEMI) where one seeks to maximize the number of hyperedges, a majority of whose nodes are influenced. We…

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: 10 pages, Accepted for oral presentation at the Social Influence Analysis (SocInf) Workshop, IJCAI 2016