Showing 1–9 of 9 results for author: Hidey, C

  1. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  2. DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue

    Authors: William Held, Christopher Hidey, Fei Liu, Eric Zhu, Rahul Goel, Diyi Yang, Rushin Shah

    Abstract: Modern virtual assistants use internal semantic parsing engines to convert user utterances to actionable commands. However, prior work has demonstrated that semantic parsing is a difficult multilingual transfer task with low transfer efficiency compared to other tasks. In global markets such as India and Latin America, this is a critical issue as switching between languages is prevalent for biling…

    Submitted 26 May, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 9 Pages; ACL Main Conference 2023

  3. arXiv:2204.06748  [pdf, other]

    cs.CL cs.AI cs.LG

    Improving Top-K Decoding for Non-Autoregressive Semantic Parsing via Intent Conditioning

    Authors: Geunseob Oh, Rahul Goel, Chris Hidey, Shachi Paul, Aditya Gupta, Pararth Shah, Rushin Shah

    Abstract: Semantic parsing (SP) is a core component of modern virtual assistants like Google Assistant and Amazon Alexa. While sequence-to-sequence-based auto-regressive (AR) approaches are common for conversational semantic parsing, recent studies employ non-autoregressive (NAR) decoders and reduce inference latency while maintaining competitive parsing quality. However, a major drawback of NAR decoders is…

    Submitted 14 April, 2022; originally announced April 2022.

  4. arXiv:2204.04735  [pdf, other]

    cs.CL

    Reducing Model Jitter: Stable Re-training of Semantic Parsers in Production Environments

    Authors: Christopher Hidey, Fei Liu, Rahul Goel

    Abstract: Retraining modern deep learning systems can lead to variations in model performance even when trained using the same data and hyper-parameters by simply using different random seeds. We call this phenomenon model jitter. This issue is often exacerbated in production settings, where models are retrained on noisy data. In this work we tackle the problem of stable retraining with a focus on conversat…

    Submitted 23 September, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: SIGDIAL 2022 Best Paper

  5. arXiv:2103.06758  [pdf, other]

    cs.CL cs.AI

    ENTRUST: Argument Reframing with Language Models and Entailment

    Authors: Tuhin Chakrabarty, Christopher Hidey, Smaranda Muresan

    Abstract: Framing involves the positive or negative presentation of an argument or issue depending on the audience and goal of the speaker (Entman 1983). Differences in lexical framing, the focus of our work, can have large effects on people's opinions and beliefs. To make progress towards reframing arguments for positive effects, we create a dataset and method for this task. We use a lexical resource for "…

    Submitted 10 April, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: NAACL 2021

  6. arXiv:2004.14677  [pdf, other]

    cs.CL cs.AI

    AMPERSAND: Argument Mining for PERSuAsive oNline Discussions

    Authors: Tuhin Chakrabarty, Christopher Hidey, Smaranda Muresan, Kathy Mckeown, Alyssa Hwang

    Abstract: Argumentation is a type of discourse where speakers try to persuade their audience about the reasonableness of a claim by presenting supportive arguments. Most work in argument mining has focused on modeling arguments in monologues. We propose a computational model for argument mining in online persuasive discussion forums that brings together the micro-level (argument as product) and macro-level…

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: EMNLP 2019

  7. arXiv:2004.12864  [pdf, other]

    cs.CL

    DeSePtion: Dual Sequence Prediction and Adversarial Examples for Improved Fact-Checking

    Authors: Christopher Hidey, Tuhin Chakrabarty, Tariq Alhindi, Siddharth Varia, Kriste Krstovski, Mona Diab, Smaranda Muresan

    Abstract: The increased focus on misinformation has spurred development of data and systems for detecting the veracity of a claim as well as retrieving authoritative evidence. The Fact Extraction and VERification (FEVER) dataset provides such a resource for evaluating end-to-end fact-checking, requiring retrieval of evidence from Wikipedia to validate a veracity prediction. We show that current systems for…

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: ACL 2020

  8. arXiv:1905.07000  [pdf, other]

    cs.CL

    IMHO Fine-Tuning Improves Claim Detection

    Authors: Tuhin Chakrabarty, Christopher Hidey, Kathleen McKeown

    Abstract: Claims are the central component of an argument. Detecting claims across different domains or data sets can often be challenging due to their varying conceptualization. We propose to alleviate this problem by fine-tuning a language model using a Reddit corpus of 5.5 million opinionated claims. These claims are self-labeled by their authors using the internet acronyms IMO/IMHO (in my (humble) opini…

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: NAACL 2019

  9. arXiv:1708.03940  [pdf, ps, other]

    cs.CL cs.IR cs.LG

    Leveraging Sparse and Dense Feature Combinations for Sentiment Classification

    Authors: Tao Yu, Christopher Hidey, Owen Rambow, Kathleen McKeown

    Abstract: Neural networks are one of the most popular approaches for many natural language processing tasks such as sentiment analysis. They often outperform traditional machine learning models and achieve state-of-the-art results on most tasks. However, many existing deep learning models are complex, difficult to train and provide a limited improvement over simpler methods. We propose a simple, robust and…

    Submitted 13 August, 2017; originally announced August 2017.

    Comments: 4 pages